Datasets
Standard Dataset
depression reddit dataset
- Citation Author(s):
- Submitted by:
- Jaedong Oh
- Last updated:
- Fri, 03/24/2023 - 13:59
- DOI:
- 10.21227/0dfh-5a29
- Data Format:
- License:
957 Views
- Categories:
- Keywords:
0 ratings - Please login to submit your rating.
Abstract
We pre-processed and built posts and comments posted during 2010-2016 on the subreddit r/depression.
Instructions:
We construct reddit data using posts and comments posted during 2010-2016 on the subreddit r/depression. How we processed data is explained in readme.md file.
Dataset Files
- dataset1 is original post, comment, title and dataset2 is single sentence that Dataset1 divided into sentences Reddit.zip (147.78 MB)
- reddit data processor processor.py (6.32 kB)
- convert *.zst file to *.csv file data_extract.py (1.21 kB)
- create reddit depression dataset1 create_dataset1.py (2.47 kB)
- create reddit depression dataset2 create_dataset2.py (1.91 kB)
Documentation
Attachment | Size |
---|---|
it explains how we process reddit data collected from 'r/depression' | 262 bytes |
Comments
Good.
Hello. I am working on developing ML models for depression detection