Artificial Intelligence
Publicly available dataset weibo_senti_100k, which consists of Weibo comments, verify the validity of the model. We have assigned the label of 0 to negative semantics, 1 to neutral statements, and 2 to positive semantics in the dataset,comments data is divided into a training set, a test set and a validation set, distributed in a ratio of 3:1:1 to facilitate the training and evaluation of our machine learning model.
- Categories:
Experimental data were collected from Acheng pig farm in Harbin, Heilongjiang Province, China. Considering the common situation of mixed breeds and sizes of pigs in actual breeding, a total of 45 Landrace pigs, 25 Beijing Black pigs, and 16 Yellow pigs were selected and placed in normal breeding areas, as well as in sow and piglet breeding areas. Recordings were made using Hikvision DS-IPC-KH-LWT surveillance cameras in normal breeding areas, sow and piglet breeding areas.
- Categories:
create
We're excited to present a unique challenge aimed at advancing automated depression diagnosis. Traditional methods using written speech or self-reported measures often fall short in real-world scenarios. To address this, we've curated a dataset of authentic depression clinical interviews from a psychiatric hospital.
- Categories:
For the semantic segmentation to be effectively done, a labelled flood scene image dataset was created. This initiative was undertaken with official permission obtained from the BBC News Website and YouTube channel, providing a valuable dataset for our research. We were granted permission to use flood-related videos for research purposes, ensuring ethical and legal considerations. Specifically, videos were sourced from the BBC News YouTube channel. The obtained videos were then processed to extract image frames, resulting in a dataset comprising 10,854 images.
- Categories:
Accounting for the cost of software development projects involves multiple aspects, including human resources, hardware equipment, software tools, training, testing, maintenance, etc. Here are some steps and factors to consider, including labor costs: calculating labor costs such as salaries, benefits, bonuses, etc. for development team members. This may include developers, testers, project managers, designers, etc.
- Categories:
An IEEE 802.15.4 backscatter communication dataset for Radio Frequency (RF) fingerprinting purposes.
It includes I/Q samples of transmitted frames from six carrier emitters, including two USRP B210 devices (labeled as c#) and four CC2538 chips (labeled as cc#), alongside ten backscatter tags (identified as tag#). The carrier emitters generate an unmodulated carrier signal, while the backscatter tags employ QPSK modulation within the 2.4 GHz frequency band, adhering to the IEEE 802.15.4 protocol standards.
- Categories:
The Colour-Rendered Bosphorus Projections (CRBP) Face Dataset represents an innovative advancement in facial recognition and computer vision technologies. This dataset uniquely combines the precision of 3D face modelling with the detailed visual cues of 2D imagery, creating a multifaceted resource for various research activities. Derived from the acclaimed Bosphorus 3D Face Database, the CRBP dataset introduces colour-rendered projections to enrich the original dataset.
- Categories:
This work presents a new labeled dataset of videos with native and professional interpreters articulating words and expressions in Libras (Brazilian Sign Language). We used a methodology based on related studies, the support of the team of articulators, and the existing datasets in the literature.
- Categories:
NCBI: The NCBI dataset is a biomedical corpus containing 793 PubMed abstracts, each manually annotated to include disease mentions and their corresponding concepts, providing a high-quality gold standard for disease name recognition and normalization research.
- Categories:
This dataset comprises audio recordings of ultra-high-frequency ambient noise stored in the lossless waveform format (WAW). The recordings were sampled at a frequency sample rate of 2.048 MHz and then provided at a downsampled audio rate of 48 kHz for compatibility and practical usage. The total length of the dataset is 01:30:29, consisting of approximately 260 million data points. (2024-03-30)
- Categories: