Reliability

We develope a novel TCM hallucination detection dataset, Hallu-TCM, sine no prior work has attempted this task in TM. We selected 1,260 TCM exam questions including 16 TCM subjects, input them into GPT-4, and collected their feedback. In the first level, we utilize Qwen-Max interface to annotate feedback multiple times with the binary label. If Qwen-Max consistently provided the same label across annotations, we adopted that label. For contentious cases, we recruited higher-degree research students who can understand and solve complex questions, including three Ph.D.

Categories:
157 Views

We organized and collected two years' worth of complete fault work orders from a wind farm, and structured these work orders into a fault diagnosis event knowledge graph using the proposed algorithm. This graph includes fault modes, fault impacts, fault symptoms, inspection schemes, root cause identification, and maintenance strategies, covering all potential fault information and handling methods for wind turbines. This dataset records the head entity-relation-tail entity information in the form of triples using JSON format.

Categories:
646 Views

This study is utilized for submodule open-circuit fault detection uncertainty analysis of modular multilevel converters. The dataset consists of 8 uncertainty factors and 15 system variables under four operation scenarios. The 1000 sets of uncertainty factor samples are generated randomly as initial configuration of the system. The 15 system variables are obtained by 1000 Monte Carlo simulations. We found that there are 153 residual samples exceeded the threshold of 0.8, which indicated a high false alarm rate.

Categories:
960 Views

The REST (REpresentational State Transfer) paradigm has become essential for designing distributed applications that leverage the HTTP protocol, enabling efficient data exchange and the development of scalable architectures such as microservices. However, selecting an appropriate framework among the myriad available options, especially given the diversity of emerging execution environments, presents a significant challenge. Often, this decision neglects crucial factors such as performance and energy efficiency, favoring instead developer familiarity and popularity within the industry.

Categories:
373 Views

This dataset is shared for capacitor C and ESR estimation using convolution neural network. The dataset is collected in a experimental modular moultilevel converter, which includes the capacitor voltage at low and medium frequency band, and the arm current. Wavelet transform is used to transfer the time series data to images, which present the inherent data features to image patterns. In a degraded capacitor, the C decreases and the ESR increases, which result in different image patterns.

Categories:
204 Views

Putting together software that lets patients make their appointments and software that helps them book appointments is making healthcare centres more accessible and useful. These systems put all of the appointment bookings in one place, so patients can make appointments across many areas and specialties. Standardizing the planning process for healthcare workers can help them save time and money by streamlining their work and lowering the costs of running their businesses.

 

Categories:
213 Views

This data repository contains test data and corresponding test code for evaluating the performance of a machine learning model. The dataset includes 950 labeled samples across 7 different classes. The test code provides implementations of several common evaluation metrics, including accuracy, precision, recall, and F1-score. This resource is intended to facilitate the benchmarking and comparison of different machine learning algorithms on a standardized task.

Categories:
23 Views

Anomaly detection plays a crucial role in various domains, including but not limited to cybersecurity, space science, finance, and healthcare. However, the lack of standardized benchmark datasets hinders the comparative evaluation of anomaly detection algorithms. In this work, we address this gap by presenting a curated collection of preprocessed datasets for spacecraft anomalies sourced from multiple sources. These datasets cover a diverse range of anomalies and real-world scenarios for the spacecrafts.

Categories:
739 Views

The Partial Discharge - Localisation Dataset, abbreviated: PD-Loc Dataset is an extensive collection of acoustic data specifically curated for the advancement of Partial Discharge (PD) localisation techniques within electrical machinery. Developed using a precision-engineered 32-sensor acoustic array, this dataset encompasses a wide array of signals, including chirps, white Gaussian noise, and PD signals.

Categories:
668 Views

Pages