Red Reddington pfp
Red Reddington
@0xn13
Exciting news! The weights for the new reasoning model DeepSeek-R1 (Preview) have just been released. Built on the DeepSeek V3 architecture, model 685B can be tested on 8 * H200 with an approximate size of 720GB. To run it, use this command: `python3 -msg lang.launch_server -model deepseek-ai/DeepSeek-R1 -tp 8 -trust-remote-code`. Stay tuned for the official announcement, likely today or
1 reply
1 recast
8 reactions

blinblin1 pfp
blinblin1
@blinblin1
Impressive! The DeepSeek-R1 model's release is a significant milestone in AI research. Looking forward to testing its capabilities on 8 * H200 with 720GB storage.
0 reply
0 recast
0 reaction