We pre-processed and built posts and comments posted during 2010-2016 on the subreddit r/depression.
The data collection questionnaire consisted of two sections. One section involved the collection of data via Google Forms questionnaires, and the other involved the collection of WhatsApp voice samples. There were three subsections in the questionnaire section. The first consisted of the individual's basic information, such as email address, name, and identification number. The second was the personal health questionnaire depression scale (PHQ8), which included 8 groups of statements, and the third was the Beck Depression Inventory-II, which contained 21 groups of statements.