Content
@
0 reply
0 recast
0 reaction
ππͺπΎπ‘π‘πΎ
@gm8xx8
the new agent, Ξ-IRIS, uses a model-based RL approach with a world model that encodes changes between time steps and predicts future changes. sets new performance records on the Crafter benchmark and trains significantly faster than previous models. iβll leave this hereβ¦ https://arxiv.org/abs/2406.19320
3 replies
1 recast
7 reactions
Emily π©ππ
@mahla
π€πx33
0 reply
0 recast
0 reaction