Datasets
Standard Dataset
SOS-HL-1K
- Citation Author(s):
- Submitted by:
- Hongzhi Qi
- Last updated:
- Sun, 06/09/2024 - 05:32
- DOI:
- 10.21227/pyzc-8h56
- Data Format:
- License:
106 Views
- Categories:
- Keywords:
0 ratings - Please login to submit your rating.
Abstract
We sourced our data by crawling comments from the “Zoufan” blog within the Weibo social platform. Subsequently, a team of qualified psychologists were enlisted to annotate the data. In our study, strict data preprocessing measures were adopted to protect users’ privacy.
SOS-HL-1K (Suicide Risk Classification)
- Categories: High risk, Low risk
- Number of Samples:
- High risk: 601
- Low risk: 648
- Data Split:
- Training set: 999 samples
- Test set: 250 samples
- Average Number of Words per Post: 47.79
- Labels: Each post is labeled with either 'high risk' or 'low risk'.
Funding Agency:
National Natural Science Foundation of China
Grant Number:
72174152, 72304212 and 82071546