Xiaohei's Blog
Seek & Ponder
Page 2 - Showing 1 of 16 posts
May 1, 2025
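Reinforcement Learning Algorithms in Practice (4): Continuous Control (DDPG / TD3 / SAC)
Actor/Critic inputs and outputs, the replay buffer, exploration noise, and the key differences in each algorithm's update step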
11 min read
rl