Digitized
@digitized
1/ Introducing SYNTHETIC-1 A collaborative project to build the world’s synthetic dataset for verified reasoning in math, coding, and science, powered by DeepSeek-R1. 🦋 Prime Intellect
1 reply
0 recast
0 reaction
Digitized
@digitized
2/ SYNTHETIC-1 delivers 1.4 million high-quality tasks paired with verifiers. Released as a public synthetic data run, welcoming anyone to contribute compute power to this SOTA open reasoning model and dataset. https://app.primeintellect.ai/intelligence
1 reply
0 recast
0 reaction
Digitized
@digitized
3/ This release follows a two-step process: 1️⃣ Generate verified reasoning data and train a supervised fine-tuning model 2️⃣ Use globally distributed reinforcement learning with verifiable rewards to scale further
1 reply
0 recast
0 reaction
Digitized
@digitized
4/ GENESYS is the open-source framework behind this effort, offering tools for synthetic data generation and verification using asynchronous verifiers like LLM judges and containerized code tests. https://t.co/06tkGMVGVG
1 reply
0 recast
0 reaction
Digitized
@digitized
5/ SYNTHETIC-1 runs entirely on Prime Intellect’s internal protocol testnet launched just last week. The PI Protocol forms the basis of a decentralized, peer-to-peer system for intelligence markets. https://x.com/PrimeIntellect/status/1890463878678548683
1 reply
0 recast
0 reaction
Digitized
@digitized
6/ Join Prime Intellect as they scale reinforcement learning, crowdsource data, and work toward fully open-source AGI. Anyone can now contribute H200 nodes to help generate verified reasoning data.
1 reply
0 recast
0 reaction
Digitized
@digitized
7/ You can learn more about SYNTHETIC-1 in this blog post: https://t.co/vUe8ccdAzu
1 reply
0 recast
0 reaction