Datasets
Standard Dataset
Chronological POCSO Judgments Dataset: Pre-COVID Compilation (2012 - 2020)
- Citation Author(s):
- Submitted by:
- Vartul Shrivastava
- Last updated:
- Fri, 08/30/2024 - 01:57
- DOI:
- 10.21227/n3ce-hw35
- Data Format:
- License:
- Categories:
- Keywords:
Abstract
The Protection of Children from Sexual Offences (POCSO) Act was an important legislation that was enacted in India in 2012. It aims to safeguard children from sexual exploitation through various enforcement and legal redressal mechanisms. This dataset has been scraped from eCourts India Services using Python script which uses Selenium. We have mined apex and high courts’ judgements, which mentioned the POCSO Act and its respective sections. We have chronologically scraped POCSO judgements from 2012 to 2020 in the corpus. This dataset aims to facilitate a vested array of upcoming studies to understand legal trends within the premise of POCSO judgements using language processing techniques. Documentation within this dataset dates to the pre-COVID pandemic period, enabling legal enthusiasts to study judicial POCSO patterns before the onset of COVID in India.
The dataset has been organized hierarchically based on the chronological order. Each year is distinctively mentioned under which date-wise folders are elucidated with the directory name ‘DD-MM-YYYY-PCS’. Judgements are in TXT format while keeping the usual indentation intact.
Documentation
Attachment | Size |
---|---|
readme.md | 2.08 KB |
Comments
This dataset aims to provide bedrock for upcoming researches in natural language and legal domain.