hellno the optimist
@hellno.eth
this looks very interesting vs OpenAI o1 https://simonwillison.net/2025/Jan/20/deepseek-r1/ > "DeepSeek are the Chinese AI lab who dropped the best currently available open weights LLM on Christmas day, DeepSeek v3. That model was trained in part using their unreleased R1 “reasoning” model. Today they’ve released R1 itself, along with a whole family of new models derived from that base." DeepSeek-R1—which “incorporates cold-start data before RL” and “achieves performance comparable to OpenAI-o1 across math, code, and reasoning tasks”. That one is also MIT licensed, and is a similar size.
2 replies
0 recast
4 reactions
Samuel ツ
@samuellhuber.eth
jummy
0 reply
0 recast
0 reaction