성장중 •͈ᴗ•͈
[RL] 간단하게 정리한 On-policy, Off-policy, Online, Offline Reinforcement Learning