The simulation code for the paper:
"AoI-Aware Resource Allocation for Platoon-Based C-V2X Networks via Multi-Agent Multi-Task Reinforcement Learning"
The overall architecture of the proposed MARL framework is shown in the figure.
Modified MADDPG: This algorithm trains two critics (different from legacy MADDPG) with the following functionalities: