RainbowPerfect pfp
RainbowPerfect
@rainbowperfect
Enhancing LLM Reasoning with Reinforcement Learning: an Exploration by DeepSeek-R1-Zero
0 reply
0 recast
0 reaction