Machine Learning

This dataset addresses the challenge of limited vocal recordings available in secondary datasets, particularly those that predominantly feature foreign accents and contexts. To enhance the accuracy of our solution tailored for Sri Lankans, we employed primary data-gathering methods.

The dataset comprises vocal recordings from a sample population of youth. Participants were instructed to read three specific sentences designed to capture a range of vocal tones:

Categories:
182 Views

The Facial Expression Dataset (Sri Lankan) is a culturally specific dataset created to enhance the accuracy of emotion recognition models in Sri Lankan contexts. Existing datasets, often based on foreign samples, fail to account for cultural differences in facial expressions, affecting model performance. This dataset bridges that gap, using high-quality data sourced from over 100 video clips of professional Sri Lankan actors to ensure expressive and clear facial imagery.

Categories:
444 Views

This is part of our external validation set, which contains 40 volunteers and about 80 hematological examination items. Among them, Cl, BHB, AG, RBP, HCO3, FT3, aTPO, CYSC, FT4, Folate, UA and aTG contribute more to the prediction. Because the data involves personal privacy and research confidentiality, it cannot be fully public. However, you can still make predictions by using our ML model and get a high accuracy on the external dataset.

Categories:
126 Views

The raw data from Artsy (from https://www.artsy.net/ ) were collected using the original Python scraping program. Artsy labeled the categories as follows: "Painting", "Work on Paper", "Sculpture", "Print", "Photography", and "Textile Arts". These categories were predicted by other explanatory variables. The rationale behind the selection of this task is the price levels differ greatly across the various categories.

Categories:
17 Views

The SINEW (Sensors in Home for Elderly Wellbeing) dataset consists of 15 high-level biomarker features, derived from raw sensor readings collected by in-home sensors used for predictive modeling research: SINEW Weekly Biomarker.

This dataset was collected for a study focused on the early detection of mild cognitive impairment, providing an opportunity for timely intervention before it progresses to Alzheimer's disease.

Categories:
121 Views

The SINEW (Sensors in Home for Elderly Wellbeing) dataset consists of 15 high-level biomarker features, derived from raw sensor readings collected by in-home sensors used for predictive modeling research: SINEW 15 - Monthly Biomarker.

This dataset was collected for a study focused on the early detection of mild cognitive impairment, providing an opportunity for timely intervention before it progresses to Alzheimer's disease.

Categories:
119 Views

Two thousand MTJ samples in total were included in the dataset for analysis; First, shuffle the dataset randomly to eliminate bias, then split it into ten equal folds of 200 samples each. For each iteration of cross-validation, use nine folds for training and one for testing, rotating the test fold across all ten groups so that every sample is tested once. 

Categories:
36 Views

This dataset contains LoRa physical layer signals collected from 60 LoRa devices and six SDRs (PLUTO-SDR, USRP B200 mini, USRP B210, USRP N210, RTL-SDR). It is intended for use by researchers in the development of a federated RFFI system, whereby the signals collected from different receivers and locations can be employed for evaluation purposes.

More details can be found at https://github.com/gxhen/federatedRFFI

Categories:
691 Views

This study presents a English-Luganda parallel corpus comprising over 2,000 sentence pairs, focused on financial decision-making and products. The dataset draws from diverse sources, including social media platforms (TikTok comments and Twitter posts from authoritative accounts like Bank of Uganda and Capital Markets Uganda), as well as fintech blogs (Chipper Cash and Xeno). The corpus covers a range of financial topics, including bonds, loans, and unit trust funds, providing a comprehensive resource for financial language processing in both English and Luganda.

Categories:
254 Views

Two-year price movements from 01/01/2014 to 01/01/2016 of 88 stocks are selected to target, coming from all the 8 stocks in the Conglomerates sector and the top 10 stocks in capital size in each of the other 8 sectors. The full list of 88 stocks and their companies selected from 9 sectors is available in StockTable, a facsimile of the paper appendix appendix_table_of_target_stocks.pdf.

Categories:
102 Views

Pages