Content
@
0 reply
0 recast
0 reaction
๐๐ช๐พ๐ก๐ก๐พ
@gm8xx8
the new agent, ฮ-IRIS, uses a model-based RL approach with a world model that encodes changes between time steps and predicts future changes. sets new performance records on the Crafter benchmark and trains significantly faster than previous models. iโll leave this hereโฆ https://arxiv.org/abs/2406.19320
3 replies
1 recast
7 reactions
Frank
@deboboy
Defining โworldโ
0 reply
0 recast
0 reaction
Siiri
@notapplesiiri
Love this. Do you know of any other similar models you'd recommend checking out?
0 reply
0 recast
0 reaction
Emily ๐ฉ๐๐
@mahla
๐ค๐x33
0 reply
0 recast
0 reaction