Datasets
Standard Dataset
phy-10k
- Citation Author(s):
- Submitted by:
- Yuxin Ji
- Last updated:
- Tue, 11/12/2024 - 04:25
- DOI:
- 10.21227/k0cr-sj47
- License:
- Categories:
- Keywords:
Abstract
This dataset is crawled from educational websites, focusing on the subject of high school physics, covering multiple-choice, single-choice and fill-in-the-blank questions, totaling tens of thousands of high-quality Chinese test questions. Each question is equipped with detailed text descriptions and intuitive image descriptions, and stored in json format to ensure easy parsing and efficient utilization of the data. Each question contains key information such as label, question stem, question type, answer, knowledge point and image path. By accessing the “question_stem_img_dir” attribute, users can easily obtain the related image data. This dataset is suitable for educational technology research, intelligent question bank construction, knowledge point annotation and online learning platform development, etc. It is especially advantageous for multimodal educational resources research.
All the questions are saved in json format, and the question images are saved in the path named “image”. You can get the corresponding image by getting the “question_stem_img_dir” attribute of each question.
Here is an example.
{ "question_id": 1, "chapters_title": "运动的描述", "level1_knowledge_point": "质点:参考系和坐标系", "level2_knowledge_point": "机械运动", "questions_type": " 多选题", "question_stem_img_dir": "physics_1.png", "question_stem": "三个质点A.B.C均由N点沿不同路径运动至M点,运动轨迹如图6所示,三个质点同时从N点出发,同时达到M点,下列说法正确的是( )", "question_answer": "A,D", "question_option": [ "三个质点从N点到M点的平均速度相同", "三个质点任意时刻的速度方向都相同", "三个质点从N点出发到任意时刻的平均速度都相同", "三个质点从N点到M点的位移相同,路程不同" ] }
The specific meaning of each property is explained below:
{
“question_id": the number of the question
“chapters_title": the first-level knowledge point of the question
“level1_knowledge_point": the second-level knowledge point of the question
“level2_knowledge_point": the third-level knowledge point of the test question
“questions_type": options for the question
“question_stem_img_dir": the image corresponding to the test question
“question_stem": information about the question
“question_answer": the answer
“question_option": list of options for the choice question
}