Content pfp
Content
@
0 reply
0 recast
0 reaction

𝚐π”ͺ𝟾𝚑𝚑𝟾 pfp
𝚐π”ͺ𝟾𝚑𝚑𝟾
@gm8xx8
the new agent, Ξ”-IRIS, uses a model-based RL approach with a world model that encodes changes between time steps and predicts future changes. sets new performance records on the Crafter benchmark and trains significantly faster than previous models. i’ll leave this here… https://arxiv.org/abs/2406.19320
3 replies
1 recast
7 reactions

Emily πŸŽ©πŸƒπŸ– pfp
Emily πŸŽ©πŸƒπŸ–
@mahla
πŸ€ŒπŸ–x33
0 reply
0 recast
0 reaction