artificial intelligence; machine learning; natural language processing; sentiment analysis; named entity recognition; relation extraction

Using Python. we crawl a total of 18, 793 diabetes related Q&A between Jun. 1, 2016 and Sept. 1, 2020 on xywy.com, a famous Chinese Online Medical Community. Each data contains four parts of the question detail page: TitleProblem DescriptionUser ID and Question Time, and three parts of the doctor’s answer page: Doctor IDAnswer Content and Answer Time. After preprocessing such as cleaning and deduplication, we finally obtain 18,521 valid data.

Categories:
16 Views