Datasets for Growing-MoE Evaluations

Citation Author(s):
Jianxing
Yu
Haowei
Jiang
Submitted by:
Haowei Jiang
Last updated:
Mon, 02/10/2025 - 00:32
DOI:
10.21227/tgsy-cq85
License:
108 Views
Categories:
Keywords:
0
0 ratings - Please login to submit your rating.

Abstract 

This dataset is used to evaluate the effectiveness of the Growing-MoE learning framework. The dataset contains tasks across computer vision (CV) and natural language processing (NLP). The dataset includes CV tasks such as CIFAR, ImageNet, Cars, and Flowers, as well as NLP tasks including English Wikipedia and GLUE benchmark.

Our learning framework aims to accelerate training of large Mixture-of-Experts models, which employs a progressive way to learn the model from local to global. We verify its performance with several popular networks, such as DeiT, Swin, GPT-2 on the tasks across CV and NLP. We also conduct transfer learning to prove the versatility and flexibility of our framework.

Instructions: 

Before you begin working with the dataset, ensure that your computing environment is properly set up. This involves installing necessary libraries and setting up the appropriate programming environment.

Comments

No problem,i have the resource to work with this dataset 

Submitted by Fanta SANOGO on Tue, 02/11/2025 - 03:06