Standard Dataset

Optimal control of HalfCheetah benchmark

Citation Author(s):
Shanwu Li (Harbin Institute of Technology)
Yongchao Yang (Eastern Institute of Technology, Ningbo)
Submitted by:
Yongchao Yang
DOI:
10.21227/2ak7-wr69

Abstract

Video demonstrations corresponding to the 100th time step in Figure 13, showing the HalfCheetah controlled by the random policy and by the policies learned with different methods. MDDPG(5) denotes the model-free counterpart with a 5-step TD target. FNN-Model-MDDPG(5) and ResNet-Model-MDDPG(5) denote the FNN-model-based scheme and our ResNet-model-based scheme, respectively, each with 5 dynamics models.
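
For readers unfamiliar with the term, the following is a minimal sketch of a generic n-step TD target (n = 5 in the labels above), i.e. the discounted sum of n rewards plus a discounted bootstrap value. It is not taken from the authors' code: the function name, the argument names, and the example discount factor are assumptions made for illustration, and the exact target used by MDDPG(5) may differ.

```python
def n_step_td_target(rewards, bootstrap_value, gamma=0.99):
    """Generic n-step TD target (hypothetical helper, not the authors' code).

    Computes sum_{k=0}^{n-1} gamma**k * rewards[k] + gamma**n * bootstrap_value,
    where n = len(rewards) and bootstrap_value stands in for the critic's
    estimate at the state reached after n steps.
    """
    target = bootstrap_value
    # Fold the rewards in from the last step back to the first.
    for r in reversed(rewards):
        target = r + gamma * target
    return target


# Example with n = 5 rewards; the values and gamma = 0.5 are arbitrary.
rewards = [1.0, 1.0, 1.0, 1.0, 1.0]
print(n_step_td_target(rewards, bootstrap_value=10.0, gamma=0.5))  # 2.25
```

With an empty reward list the function returns the bootstrap value unchanged, which corresponds to n = 0.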

Instructions:

None

Funding Agency
National Natural Science Foundation of China
Grant Number
52478315