Chronological POCSO Judgments Dataset: Pre-COVID Compilation (2012 - 2020)

Citation Author(s):
Vartul Shrivastava
Submitted by:
Vartul Shrivastava
Last updated:
Fri, 08/30/2024 - 01:57
DOI:
10.21227/n3ce-hw35
Data Format:
License:

Abstract 

The Protection of Children from Sexual Offences (POCSO) Act is an important piece of legislation enacted in India in 2012. It aims to safeguard children from sexual exploitation through various enforcement and legal redressal mechanisms. This dataset was scraped from eCourts India Services using a Python script built on Selenium. We mined apex court and high court judgements that mention the POCSO Act and its sections, collecting them in chronological order from 2012 to 2020. The dataset is intended to support a wide range of future studies that use language processing techniques to understand legal trends in POCSO judgements. All documents in the corpus date from the pre-COVID period, enabling legal enthusiasts to study judicial patterns in POCSO cases before the onset of COVID in India.

Instructions: 

The dataset is organized hierarchically in chronological order. Each year has its own folder, which contains date-wise folders named in the format ‘DD-MM-YYYY-PCS’. Judgements are stored in TXT format, with their original indentation preserved.
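The layout described above can be traversed with a short script. The following is a minimal sketch, not part of the dataset's own tooling: the function names `parse_folder_date` and `iter_judgments` are hypothetical, and it assumes that judgement files carry a lowercase `.txt` extension and sit directly inside the ‘DD-MM-YYYY-PCS’ folders. It does not assume any particular naming for the year folders, since it searches the whole tree for date folders.

```python
import re
from datetime import date, datetime
from pathlib import Path
from typing import Iterator, Optional, Tuple

# Date folders are named 'DD-MM-YYYY-PCS' according to the instructions.
_FOLDER_RE = re.compile(r"^(\d{2}-\d{2}-\d{4})-PCS$")


def parse_folder_date(name: str) -> Optional[date]:
    """Return the date encoded in a 'DD-MM-YYYY-PCS' folder name, or None."""
    match = _FOLDER_RE.match(name)
    if match is None:
        return None
    try:
        return datetime.strptime(match.group(1), "%d-%m-%Y").date()
    except ValueError:
        # Right shape but not a real calendar date, e.g. '31-02-2014-PCS'.
        return None


def iter_judgments(root) -> Iterator[Tuple[date, Path]]:
    """Yield (date, file path) pairs for every judgement, oldest first.

    Assumption: judgement files end in '.txt' and are stored directly
    inside the date folders.
    """
    found = []
    for folder in Path(root).rglob("*-PCS"):
        if not folder.is_dir():
            continue
        day = parse_folder_date(folder.name)
        if day is None:
            continue
        for txt_file in folder.glob("*.txt"):
            found.append((day, txt_file))
    # Sort by date first, then by file name for a stable order within a day.
    found.sort(key=lambda item: (item[0], item[1].name))
    yield from found
```

A typical use would be to loop over `iter_judgments("path/to/dataset")` and read each file with `path.read_text(encoding="utf-8", errors="replace")`. The encoding is also an assumption, so the `errors="replace"` argument guards against files that are not valid UTF-8.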

Comments

This dataset aims to provide a foundation for future research in natural language processing and the legal domain.

Submitted by Vartul Shrivastava on Wed, 08/28/2024 - 01:33

Documentation

Attachment: readme.md
Size: 2.08 KB