multi-label classification
This is a compressed package containing nine multi-label text classification data sets, including AAPD, CitySearch, Heritage, Laptop, Ohsumed, RCV1, Restaurant, Reuters, and Sentihood.
- Categories:
The dataset is the sampling dataset from EURLEX57k and built for multi-answer questioning task with EUROVOC. , Each legal document in the EURLEX57k dataset is assigned several labels from the European Vocabulary (EUROVOC), which maintains thousands of concepts such as "export industry" and "organic acid". Before building the data, the sample is chosen. A Z-scorebased online sample size calculator is used to determine the sample sizes. The given confidence level is 95%. A 5% margin of uncertainty is used. The computation results in a 381 out of 45,000 train sample size.
- Categories:
The concept of wellness, as proposed by Halbert L. Dunn, recognizes the importance of multiple dimensions, such as social and mental well-being, in maintaining overall health. Neglecting these dimensions can have long-term negative consequences on an individual's mental well-being. In the context of traditional in-person therapy sessions, efforts are made to manually identify underlying factors that contribute to mental disturbances, as these factors, if triggered, can potentially lead to severe mental health disorders.
- Categories: