Yongchao Yang

First Name: Yongchao
Last Name: Yang

Dataset Entries from this Author

Video demonstrations corresponding to the 100th time step in Figure 13, showing the HalfCheetah controlled by the random policy and by the policies learned with different methods. MDDPG(5) denotes the model-free counterpart with a 5-step TD target. FNN-Model-MDDPG(5) and ResNet-Model-MDDPG(5) denote the FNN-model-based scheme and our ResNet-model-based scheme with 5 dynamics models, respectively.
