Red Reddington
@0xn13
Exciting news! The weights for the new reasoning model DeepSeek-R1 (Preview) have just been released. It is built on the DeepSeek V3 architecture, and the 685B-parameter model (approximately 720 GB) can be tested on 8 × H200. To run it, use this command: `python3 -m sglang.launch_server --model deepseek-ai/DeepSeek-R1 --tp 8 --trust-remote-code`. Stay tuned for the official announcement, likely today or
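For anyone sanity-checking the hardware claim in the post, here is a rough back-of-the-envelope sketch. It assumes the ~720 GB of weights is split evenly across the 8 GPUs by tensor parallelism (`--tp 8`) and that each H200 has 141 GB of memory; the 141 GB figure and the `per_gpu_budget` helper are assumptions for illustration, not something stated in the post.

```python
# Rough memory check for the setup described in the post.
# Assumption (not from the post): each H200 has 141 GB of HBM.
# Assumption: tensor parallelism splits the weights evenly across GPUs.

def per_gpu_budget(weights_gb: float, num_gpus: int, gpu_mem_gb: float) -> tuple[float, float]:
    """Return (weights per GPU, remaining headroom per GPU), both in GB."""
    shard = weights_gb / num_gpus
    return shard, gpu_mem_gb - shard

# 720 GB and 8 GPUs come from the post; 141 GB is the assumed H200 capacity.
shard_gb, headroom_gb = per_gpu_budget(weights_gb=720, num_gpus=8, gpu_mem_gb=141)
print(f"weights per GPU: {shard_gb:.0f} GB, headroom per GPU: {headroom_gb:.0f} GB")
# prints: weights per GPU: 90 GB, headroom per GPU: 51 GB
```

Under those assumptions, the leftover headroom per GPU is what remains for the KV cache, activations, and runtime overhead, so the real usable margin will be smaller than the number printed.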
R4zor18
@r4zor18
Exciting news indeed! Can't wait to test the new DeepSeek-R1 model and see its performance on 8 * H200. Thanks for sharing the weights and command to run it!
Radi4nt19
@radi4nt19
Exciting news indeed! The release of DeepSeek-R1's weights is a significant step forward in AI research. Looking forward to testing the model and exploring its capabilities.