Artificial Intelligence
This dataset contains 37 estrogen receptor immunohistochemistry (ER-IHC) whole slide images (WSIs) obtained from Universiti Malaya Medical Centre (UMMC), Malaysia. The WSI is scanned using 3DHistech Pannoramic DESK at 20x magnification with an approximate dimension of 80,000 pixels width and 200,000 pixels height per WSI.
- Categories:
The real system in our experiment comprises four production stations: Pick and place, assembly, muscle compressing and sorting. These modular stations are controlled by Siemens PLC. This is the data gathered from a real manufacturing system and its Digital Twin data when under the denial of service attacks.
- Categories:
In precision agriculture, detecting productive crop fields is an essential practice that allows the farmer to evaluate operating performance separately and compare different seed varieties, pesticides, and fertilizers. However, manually identifying productive fields is often time-consuming, costly, and subjective. Previous studies explore different methods to detect crop fields using advanced machine learning algorithms to support the specialists’ decisions, but they often lack good quality labeled data.
- Categories:
The morphological characteristics of skeletal muscles, such as fascicle orientation, fascicle length, and muscle thickness, contain valuable mechanical information that aids in understanding muscle contractility and excitation due to commands from the central nervous system. Ultrasound (US) imaging, a non-invasive measurement technique, has been employed in clinical research to provide visualized images that capture morphological characteristics. However, accurately and efficiently detecting the fascicle in US images is challenging.
- Categories:
As a hot research topic, there are many related datasets for occlusion detection. Due to the different scenarios and definitions of occlusion for different tasks, there are significant differences between different occlusion detection datasets, making existing datasets difficult to apply to the video shot occlusion detection task. To this end, we contribute the first large-scale video shot occlusion detection dataset, namely VSOD, which serves as a benchmark for evaluating the performance of shot occlusion detection methods.
- Categories:
This dataset is made for traditional, machine learning, and deep neural-network-based virtual sensor development and evaluation.
- Categories:
- Categories:
Forum-java is a log dataset that we collected in an open source java-based web forum system {https://github.com/Qbian61/forum-java.}. It is a Java-based forum platform developed by a technology company and widely used for social media and programming technique sharing it contains abundant and diverse functions, like posting articles, creating FAQs, etc., which can satisfy most of the requirements of users.
- Categories:
The risks to children of online predators in real time gaming environments have been an area of growing concern. Research towards the development of near real time capabilities has been the focus of most queries published in this area of study. In this paper, we present Protectbot, a comprehensive safety framework used to interact with users in online gaming chat rooms. Protectbot employs a variant of the GPT-2 model known as DialoGPT, a generative pre-trained transformer designed specifically for conversation.
- Categories:
This Named Entities dataset is implemented by employing the widely used Large Language Model (LLM), BERT, on the CORD-19 biomedical literature corpus. By fine-tuning the pre-trained BERT on the CORD-NER dataset, the model gains the ability to comprehend the context and semantics of biomedical named entities. The refined model is then utilized on the CORD-19 to extract more contextually relevant and updated named entities. However, fine-tuning large datasets with LLMs poses a challenge. To counter this, two distinct sampling methodologies are utilized.
- Categories: