Artificial Intelligence
SUNBURST Attack Dataset for Network Attack Detection
Overview:
The SUNBURST dataset is a unique and valuable resource for researchers studying network intrusion detection and prevention. This dataset provides real-world network traffic data related to SUNBURST, a sophisticated supply chain attack that exploited the SolarWinds Orion software. It focuses on the behavioral characteristics of the SUNBURST malware, enabling the development and evaluation of security mechanisms.
- Categories:
Single-cell RNA sequencing (scRNA-seq) enables high-resolution analysis of cellular heterogeneity, but dropout events, where gene expression is undetected in individual cells, present a significant challenge. We propose \textbf{scMASKGAN}, which transforms matrix imputation into a pixel restoration task to improve the recovery of missing gene expression data. Specifically, we integrate masking, convolutional neural networks (CNNs), attention mechanisms, and residual networks (ResNets) to effectively address dropout events in scRNA-seq data.
- Categories:
Single-cell RNA sequencing (scRNA-seq) enables high-resolution analysis of cellular heterogeneity, but dropout events, where gene expression is undetected in individual cells, present a significant challenge. We propose \textbf{scMASKGAN}, which transforms matrix imputation into a pixel restoration task to improve the recovery of missing gene expression data. Specifically, we integrate masking, convolutional neural networks (CNNs), attention mechanisms, and residual networks (ResNets) to effectively address dropout events in scRNA-seq data.
- Categories:
Single-cell RNA sequencing (scRNA-seq) enables high-resolution analysis of cellular heterogeneity, but dropout events, where gene expression is undetected in individual cells, present a significant challenge. We propose \textbf{scMASKGAN}, which transforms matrix imputation into a pixel restoration task to improve the recovery of missing gene expression data. Specifically, we integrate masking, convolutional neural networks (CNNs), attention mechanisms, and residual networks (ResNets) to effectively address dropout events in scRNA-seq data.
- Categories:
The application of large language models (LLMs) in urban planning has gained momentum, with prior research demonstrating their value in participatory planning, process streamlining, and event forecasting. This study focuses on further enhancing urban planning through the integration of more comprehensive datasets. We introduce a newly developed instruction dataset that amalgamates crucial information from several prominent urban datasets, including highD, NGSIM, the Road Networks dataset, TLC Trip data, and the Urban Flow Prediction Survey dataset.
- Categories:
The Left Atrium2018 dataset was used in the 2018 Left Atrium Segmentation Challenge and has the following characteristics:
Data Content
Image Type: It consists of 154 three-dimensional gadolinium-enhanced magnetic resonance imaging (LGE-MRI) images, which is currently the largest cardiac LGE-MRI dataset in the world.
Label Information: It contains the relevant labels of the left atrium segmented by three medical experts.
Application Fields
- Categories:
The data presented is "Gray-Box Dynamic Model for Wave Glider Driven by a Hybrid of Deep Learning and Physics-Based Models". In this study, the data set utilized for training the surrogate model consists of a total of 5 columns. These columns are FX, FZ, t, velx, and velz. Specifically, FX and FZ represent the hydrodynamic forces acting on the X - axis and Z - axis of the glider respectively. The variable t stands for time, while velx and velz denote the speeds of the glider on the z - axis and x - axis.
- Categories:
The rapid growth of online shopping and e-commerce platforms has led to an explosion of product reviews. These reviews often contain valuable information about users’ opinions on various aspects of the products, including comparisons between different devices. Understanding comparative opinions from product reviews is crucial for manufacturers and consumers alike. Manufacturers can gain insights into the strengths and weaknesses of their products compared to competitors, while consumers can make more informed purchasing decisions based on these comparative insights.
- Categories:
e used a mixed dataset\cite{ye2023geneface}in our experiments, where part of the data was referenced from the publicly available dataset provided by GaussianTalking\cite{li2025talkinggaussian}, and additional data was collected by ourselves. Specifically, we selected four high-definition talking video clips from the publicly available dataset, including two male portraits, "Macron" and "Obama" and one female portrait, "May". These video clips are centered on the subject, with an average length of 6500 frames and a frame rate of 25 FPS.
- Categories:
The Tunnel Cable Fire dataset is derived experimentally, this dataset contains images of cable flames at different stages, different cable layers, and different wind speeds, with a special focus on computer vision tasks such as fire detection and segmentation. These images have been enhanced with mosaic data for a total of 1812 datasets, including single and double layer cable fire images in the case of no wind and wind speed of 2.7m/s.
- Categories: