Machine Learning

Cyberbullying is a growing problem on social media. This dataset supports cyberbullying detection in Bangla and consists of comments collected from YouTube, Facebook, Instagram, and TikTok. Each comment is labeled as one of two classes: bullying or non-bullying. The dataset includes a range of abusive and harmful texts alongside normal conversations. Researchers and developers can use it to train AI models that automatically identify cyberbullying in Bangla text, with the goal of building better tools to keep online spaces safe for Bangla-speaking users.

 

This dataset comprises a comprehensive collection of educational courses, each characterized by several key attributes: interests, title, description, category, level, past experience, and rating.

This dataset contains high-resolution solar and wind measurement data collected from the Feni region, Bangladesh, spanning from 2017 to 2019. Logged at a 1-minute interval, the dataset provides a comprehensive record of atmospheric and meteorological conditions, essential for renewable energy analysis, climatological studies, and resource assessment.

This dataset contains high-resolution wind measurement data collected from 22 channels at varying heights, providing valuable insights for wind energy assessment, atmospheric research, and meteorological studies. The dataset includes wind speed, wind direction, and environmental parameters measured at multiple altitudes ranging from 10m to 120m. Each channel records parameters such as average wind speed, standard deviation, minimum and maximum values, gust speed, and wind vane direction. Additionally, atmospheric parameters such as temperature, relative humidity, and pressure are included.

This meteorological data is provided by the Inner Mongolia Meteorological Bureau and includes data from three stations.

We present a dataset of histopathology images from oral squamous cell carcinoma (OSCC) patients treated at Sun Yat-sen Memorial Hospital between 2015 and 2022. Each case includes two tissue sections (core and boundary), with six images per patient captured at ×200, ×400, and ×1000 magnifications (2592×1944 pixels). Key histopathological features, such as cancer cells, nests, keratin pearls, nuclear atypia, and necrosis, are included. The study was approved by the Ethics Committee with a waiver of informed consent, and patient-level diagnosis and prognosis annotations were obtained from electronic records.

 

Artificial Intelligence (AI) has increasingly influenced modern society, most recently through significant advances in Large Language Models (LLMs). However, the high computational and storage demands of LLMs still limit their deployment in resource-constrained environments. Knowledge distillation addresses this challenge by training a smaller language model (the student) from a larger one (the teacher). Previous research has introduced several distillation methods for both generating training data and training the student model.
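
As a rough illustration of the student-teacher idea described above, the sketch below computes the classic soft-label distillation loss: the KL divergence between temperature-softened teacher and student output distributions. This is an assumption chosen for illustration only; the abstract does not say which distillation methods the dataset authors used, and the function names, the temperature value, and the example logits are all hypothetical.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by T^2.

    Hypothetical helper: a generic soft-label loss, not the method
    of any specific paper or dataset.
    """
    p = softmax(teacher_logits, temperature)  # teacher targets
    q = softmax(student_logits, temperature)  # student predictions
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return kl * temperature ** 2


if __name__ == "__main__":
    # Made-up logits for a three-class toy example.
    teacher = [4.0, 1.0, 0.5]
    student = [2.5, 1.5, 0.2]
    print(distillation_loss(student, teacher))
```

A higher temperature flattens both distributions, so the student also learns from the teacher's relative preferences among less likely outputs. In practice this term is usually combined with a standard loss on ground-truth labels.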

Shape completion remains a fundamental challenge in computer vision and image processing, particularly for tasks involving hand-drawn sketches and occluded objects. Traditional deep learning methods such as Generative Adversarial Networks (GANs) and Convolutional Neural Networks (CNNs) often suffer from high computational costs and poor generalization on sparse, abstract structures.
