1.Introduction
1. Types of Machine Learning(ML) 2. Sequential decision making Each time step t: Agent takes an action at Environment updates with new state and emits observation, ot and reward, rt Agent receives ot and rt 3. History ht = (a1,o1,r1,,,,at,ot,rt) 4. World state Agent state와 다름. 실제 세계 5. Agent state $s_{t} = f(h_{t}) = (a_{1}, o_{1}, r_{1}, ,..
2023.05.01