Artificial Intelligence
The PermGuard dataset is a carefully crafted Android Malware dataset that maps Android permissions to exploitation techniques, providing valuable insights into how malware can exploit these permissions. It consists of 55,911 benign and 55,911 malware apps, creating a balanced dataset for analysis. APK files were sourced from AndroZoo, including applications scanned between January 1, 2019, and July 1, 2024. A novel construction method extracts Android permissions and links them to exploitation techniques, enabling a deeper understanding of permission misuse.
- Categories:
The SINEW 15 2023 Biomarker dataset was extracted from the sensor data collected by a longitudinal study called Sensors IN-home for Elder Wellbeing (SINEW).
- Categories:
In the captured image, a drone is seen in flight, displaying its advanced technological features and capabilities. The image highlights the drone's robust design and aerodynamic structure, which are essential for its diverse applications in research and development. Drones, also known as Unmanned Aerial Vehicles (UAVs), are increasingly being utilized in various fields due to their ability to collect data from hard-to-reach or hazardous areas.
- Categories:
Multimodal large language models (MLLMs) have shown remarkable progress in high-level semantic tasks such as visual question answering, image captioning, and emotion recognition. However, despite advancements, there remains a lack of standardized benchmarks for evaluating MLLMs performance in multi-object sentiment analysis, a key task in semantic understanding. To address this gap, we introduce MOSABench, a novel evaluation dataset designed specifically for multi-object sentiment analysis.
- Categories:
kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti kitti
- Categories:
These datasets originate from two separate power transformers, identified as Transformer 1 and Transformer 2. Each dataset is further categorized into two distinct time intervals for data collection. The first time interval, labeled as "m," represents a short-term sampling period with data recorded every 15 minutes, capturing detailed temporal fluctuations over shorter durations. The second time interval, labeled as "h," signifies a longer-term sampling period with data recorded at hourly intervals, providing an overview of broader trends and patterns.
- Categories:
The Human voice Natural Language from On-demand media (HENLO) dataset is a high-quality emotional speech dataset created to address the need for representative and realistic data in speech emotion recognition research. Unlike many existing datasets, which rely on simulated emotions performed by untrained speakers or directed participants, HENLO sources its data from professionally produced films and podcasts available on Media On-Demand (MOD).
- Categories:
In this paper we use Natural Language Processing techniques to improve different machine learning approaches (Support Vector Machines (SVM), Local SVM, Random Forests) to the problem of automatic keyphrases extraction from scientific papers. For the evaluation we propose a large and high-quality dataset: 2000 ACM papers from the Computer Science domain. We evaluate by comparison with expert-assigned keyphrases.
- Categories: