Datasets for Growing-MoE Evaluations

Citation Author(s):: Jianxing Yu

Haowei Jiang
Submitted by:: Haowei Jiang
Last updated:: Mon, 02/10/2025 - 05:32
DOI:: 10.21227/tgsy-cq85

209 views

Categories:

Artificial Intelligence

Keywords:

deep learning; neural network

ACCESS DATASET CITE

Abstract

This dataset is used to evaluate the effectiveness of the Growing-MoE learning framework. The dataset contains tasks across computer vision (CV) and natural language processing (NLP). The dataset includes CV tasks such as CIFAR, ImageNet, Cars, and Flowers, as well as NLP tasks including English Wikipedia and GLUE benchmark.

Our learning framework aims to accelerate training of large Mixture-of-Experts models, which employs a progressive way to learn the model from local to global. We verify its performance with several popular networks, such as DeiT, Swin, GPT-2 on the tasks across CV and NLP. We also conduct transfer learning to prove the versatility and flexibility of our framework.

Instructions:

Before you begin working with the dataset, ensure that your computing environment is properly set up. This involves installing necessary libraries and setting up the appropriate programming environment.

No problem,i have the resource to work with this dataset

Fanta SANOGO Tue, 02/11/2025 - 08:06 Permalink