Chen Yulin's Blog

Posted 2026-02-05 | Updated 2026-03-23 | Review | 11 minutes read (About 1682 words)

Posted 2025-11-22 | Updated 2026-03-23 | Review | 2 minutes read (About 233 words)

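The paper uses an **RSSM (Recurrent State Space Model)**: an encoder turns the environment observation and the action into a latent state, the model predicts future latent states, and the reward is then predicted from the latent state.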
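To make the data flow concrete, here is a minimal sketch of those three steps (encode, predict the next latent state, predict reward) in plain Python. It is not the paper's implementation: `TinyRSSM`, `imagine`, the layer sizes, and the Gaussian latent are all assumptions made for illustration, a single tanh layer stands in for the recurrent cell, and the weights are random and untrained.

```python
import math
import random


def init_linear(rng, n_out, n_in, scale=0.1):
    """Random weights and zero bias for an affine layer (untrained)."""
    weights = [[rng.gauss(0.0, scale) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


def linear(weights, bias, x):
    """Affine map y = W x + b on plain Python lists."""
    return [sum(w * xi for w, xi in zip(row, x)) + b for row, b in zip(weights, bias)]


class TinyRSSM:
    """Hypothetical toy RSSM: deterministic state h plus stochastic latent z."""

    def __init__(self, obs_dim, act_dim, det_dim=8, stoch_dim=4, seed=0):
        self.rng = random.Random(seed)
        self.det_dim, self.stoch_dim = det_dim, stoch_dim
        # Recurrent update: (h, z, action) -> next h
        self.recurrent = init_linear(self.rng, det_dim, det_dim + stoch_dim + act_dim)
        # Prior: next h -> (mean, log_std) of z, used when no observation is available
        self.prior = init_linear(self.rng, 2 * stoch_dim, det_dim)
        # Posterior (the encoder path): (next h, obs) -> (mean, log_std) of z
        self.posterior = init_linear(self.rng, 2 * stoch_dim, det_dim + obs_dim)
        # Reward head: (h, z) -> scalar reward
        self.reward_head = init_linear(self.rng, 1, det_dim + stoch_dim)

    def initial_state(self):
        return [0.0] * self.det_dim, [0.0] * self.stoch_dim

    def _sample(self, stats):
        # Split into mean and log standard deviation, then draw a Gaussian sample.
        mean, log_std = stats[:self.stoch_dim], stats[self.stoch_dim:]
        return [m + math.exp(s) * self.rng.gauss(0.0, 1.0) for m, s in zip(mean, log_std)]

    def step(self, h, z, action, obs=None):
        """Advance one step; use the posterior if obs is given, else the prior."""
        h_next = [math.tanh(v) for v in linear(*self.recurrent, h + z + action)]
        if obs is None:
            stats = linear(*self.prior, h_next)
        else:
            stats = linear(*self.posterior, h_next + obs)
        return h_next, self._sample(stats)

    def reward(self, h, z):
        return linear(*self.reward_head, h + z)[0]


def imagine(model, h, z, actions):
    """Roll the latent state forward without observations and predict rewards."""
    rewards = []
    for action in actions:
        h, z = model.step(h, z, action)
        rewards.append(model.reward(h, z))
    return rewards


if __name__ == "__main__":
    model = TinyRSSM(obs_dim=3, act_dim=2)
    h, z = model.initial_state()
    # One observed step (encoder path), then five imagined steps (prior path).
    h, z = model.step(h, z, action=[0.0, 1.0], obs=[0.5, -0.2, 0.1])
    print(imagine(model, h, z, [[1.0, 0.0]] * 5))
```

The point of the split is that `step` with `obs=None` never touches observations, so rewards for a candidate action sequence can be predicted entirely in latent space.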

Posted 2024-12-31 | Updated 2026-03-23 | Note | 7 minutes read (About 1016 words)

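Imitation learning (IL) is an alternative to traditional hand-programming as a way to give robots autonomous capabilities.
IL lets a machine learn the desired behavior from demonstrations (a human expert demonstrating the behavior), which removes the need for explicit programming or task-specific reward functions.
IL falls into two main categories: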