Fahim In Tech
@fahimintech
1/ 🎙️ A tiny two-person team out of Korea just dropped Dia, a state-of-the-art open-source speech model with 1.6 billion parameters. It's like if ElevenLabs and ChatGPT had a hyper-expressive, emotionally intelligent baby. Let's break it down 🧵👇

2/ What makes Dia wild is how real it sounds. It reads a line like “I’m fine (laughs)” and actually puts the laugh in there. Plus, it supports zero-shot voice cloning: you just give it a voice clip and boom, it talks like that person. Straight outta sci-fi.