Skip to main content

Datasets

Standard Dataset

DAAPNet Dataset

Citation Author(s):
Yongchun Miao
Submitted by:
Yongchun Miao
Last updated:
DOI:
10.21227/wpmm-ke15
5 views
Categories:
Keywords:
No Ratings Yet

Abstract

This dataset contains 10,000 synthesized sequences (10 seconds each) of North Atlantic Right Whale vocalizations for acoustic event detection research. It features four vocalization types (upcalls, gunshots, screams, moancalls) with varying durations from 0.8-4.2 seconds. The data is stratified across four signal-to-noise ratio levels (-10 to 10 dB) and split into training (7,000), validation (1,500), and test (1,500) sets. Created using the Scaper library with controlled acoustic parameters, this benchmark enables evaluation of detection algorithms under realistic marine acoustic conditions with overlapping vocalizations.

Instructions:

This dataset contains 10,000 synthesized sequences (10 seconds each) of North Atlantic Right Whale vocalizations for acoustic event detection research. It features four vocalization types (upcalls, gunshots, screams, moancalls) with varying durations from 0.8-4.2 seconds. The data is stratified across four signal-to-noise ratio levels (-10 to 10 dB) and split into training (7,000), validation (1,500), and test (1,500) sets. Created using the Scaper library with controlled acoustic parameters, this benchmark enables evaluation of detection algorithms under realistic marine acoustic conditions with overlapping vocalizations.