appliedml42
@appliedml42
Reversible Vision Transformers https://arxiv.org/abs/2302.04869v1 https://i.imgur.com/XJlssKp.png
1 reply
0 recast
0 reaction
appliedml42
@appliedml42
A memory efficient architecture design for visual recognition which decouples the GPU memory requirement from the depth of the model. We benchmark extensively across both model sizes and tasks and show up to 15.5x reduced memory footprint at roughly identical model complexity, parameters and accuracy.
1 reply
0 recast
0 reaction