Tags: #rl

May 1, 2025

RL Practice (2): DQN and Improvements

From Q-tables to deep reinforcement learning

8 min read
- rl
May 1, 2025

RL Practice (3): Policy Gradient + Actor-Critic

Policy distribution (Softmax / Gaussian) design, return accumulation, and parallel sampling.

8 min read
- rl
May 1, 2025

RL Practice (4): Continuous Control (DDPG/TD3/SAC)

Actor and critic inputs and outputs, the replay buffer, exploration noise, and the key differences between each algorithm's update step.

7 min read
- rl