A Dataset of Inertial Measurement Units for Handwritten English Alphabets: Leveraging Diversity in Indian Context

Citation Author(s):: Hari Prabhat Gupta ( IIT (BHU) Varanasi)

Tanima Dutta ( IIT (BHU) Varanasi)

Rahul Mishra ( IIT (BHU) Varanasi)

Garvit Banga ( IIT (BHU) Varanasi)

Shubham Pandey ( IIT (BHU) Varanasi)

Krishna Sharma ( IIT (BHU) Varanasi)

Himanshu Sahu ( IIT (BHU) Varanasi)
Submitted by:: HARI GUPTA
Last updated:: Tue, 07/04/2023 - 04:04
DOI:: 10.21227/av6q-jj17
Data Format:: *.avi; *.csv; *.txt; *.zip

820 views

Categories:

Keywords:

Artificial Intelligence; Dataset; Machine Learning

ACCESS DATASET CITE

Abstract

This report presents an end-to-end methodology for collecting datasets to recognize handwritten English alphabets in the Indian context by utilizing Inertial Measurement Units (IMUs) and leveraging the diversity present in the Indian writing style. The IMUs are utilized to capture the dynamic movement patterns associated with handwriting, enabling more accurate recognition of alphabets. The Indian context introduces various challenges due to the heterogeneity in writing styles across different regions and languages. By leveraging this diversity, the collected dataset and the collection system aim to achieve higher recognition accuracy. Some preliminary experimental results demonstrate the effectiveness of the dataset in accurately recognizing handwritten English alphabets in the Indian context. This research can be extended and contributes to the field of pattern recognition and offers valuable insights for developing improved systems for handwriting recognition, particularly in diverse linguistic and cultural contexts. Watch the data collections process on youtube.

Instructions:

During the writing process, a sensor was employed to record data. When the student was actively writing a character, the sensor registered a value of 1, along with other sensory measurements. On the other hand, when the student was not writing a character, the sensor recorded a value of 0, along with the corresponding sensory data. This allowed for the differentiation between active writing periods and non-writing periods. Throughout the dataset collection, the sensors recorded data from all 50 instances of each character being written by the students. This comprehensive data collection approach enabled the capture of multiple repetitions of each character, providing a rich dataset for analysis and modelling purposes. By combining information on the sensory values, timing, and the distinction between writing and non-writing periods, the collected data offers valuable insights into the writing behaviour and patterns exhibited by the students. The sensory dataset is stored in txt files, following the directory hierarchy as shown in Figure.

Datasets

Standard Dataset

A Dataset of Inertial Measurement Units for Handwritten English Alphabets: Leveraging Diversity in Indian Context

Abstract

Instructions:

Dataset Files

DOCUMENTATION

DATASET SCRIPTS

QUESTIONS?

More from this Author

Channel State Information Dataset for Multi-Human Activity Recognition in Indoor Environments

More like this Dataset

Weather Monitoring Station For Farms And Agriculture

Trilateration based on RSSI values in transmitters and receivers

The FLAME dataset: Aerial Imagery Pile burn detection using drones (UAVs)

Retinal Fundus Multi-disease Image Dataset (RFMiD)

Experimental database for detecting and diagnosing rotor broken bar in a three-phase induction motor.

Edge-IIoTset: A New Comprehensive Realistic Cyber Security Dataset of IoT and IIoT Applications: Centralized and Federated Learning