Machine Learning

By querying the open data of well-known scientific databases through representational state transfer (REST) interfaces, and then applying data management practices together with a dynamic topic modeling approach to the available metadata, this work achieves a feasible form of article set analysis and classification. It identifies the research trends of a given field at specific moments, as well as how those trends evolve over the years.


This dataset is used to train learners in our paper entitled: Hybrid Learning Aided Inactive Constraints Filtering Algorithm to Enhance AC OPF Solution Time



Industry faces serious challenges and fails without competitiveness. To tackle this problem, the work set out to bring greater efficiency to industrial processes in order to promote productivity, raise quality, and drive change. The solution developed included devices with non-invasive, easy-to-install sensors that count the items being transported on production lines.


This dataset contains coverage of four types of geospatial events from Indonesian online news portals: flood, traffic jam, earthquake, and fire. The corpus consists of 926 manually annotated, disambiguated, and event-extracted sentences that were filtered from 83 of the 645,679 documents in our earlier news corpus, based on four major geospatial events: flood, earthquake, fire, and accidents.



Power transmission system losses can typically represent from five to ten percent of the total generation, a quantity worth millions of dollars per year. The purpose of loss allocation in the context of pool dispatch is to assign to each individual generation and load the responsibility of paying for part of the system transmission losses. Since the system losses are non-separable, non-linear functions of the real power generation and loads, the allocation of transmission loss is a challenging and contentious issue in a fully deregulated system.


This repository introduces a novel dataset for the classification of Chronic Obstructive Pulmonary Disease (COPD) patients and Healthy Controls (HC). The Exasens dataset includes demographic information on four groups of saliva samples (COPD, HC, asthma, and infected) collected as part of a joint research project, Exasens, at the Research Center Borstel, BioMaterialBank Nord (Borstel, Germany).


Imagine you have just moved into your brand-new home and signed up with an energy provider. They tell you that, based on the information you provided, they will set up a direct debit of €50/month. However, at the end of the year, that prediction turns out to be inaccurate, and you end up paying a settlement amount of €300 or, if you are lucky, they give you some money back. Either way, you will probably be disappointed with your energy provider and might consider switching to another one. Predicting energy consumption is currently a key challenge for the energy industry as a whole.

Last Updated On: Tue, 07/20/2021 - 06:35

This is the dataset used in our journal paper, submitted to: IEEE Transactions on Dependable and Secure Computing, Special Issue on Explainable Artificial Intelligence for Cyber Threat Intelligence (XAI-CTI) Applications.

Submitted = 12-Jan-2021 (under review)
ID = TDSCSI-2021-01-0045

The source code is available at IEEE Code Ocean:

Hatma Suryotrisongko (2020) Botnet DGA [Source Code].


This is the raw MMG data.


The aircraft fuel distribution system has two primary functions: storing fuel and distributing fuel to the engines. These functions are performed in the refuelling and consumption phases, respectively. During refuelling, the fuel is first loaded into the Central Reservation Tank and then distributed to the Front and Rear Tanks. In the consumption phase, the two engines receive an adequate level of fuel from the appropriate tanks. For instance, the Port Engine (PE) receives fuel from the Front Tank and the Starboard Engine (SE) receives fuel from the Rear Tank.