Content pfp
Content
@
0 reply
0 recast
0 reaction

Dan Romero pfp
Dan Romero
@dwr.eth
Let's say you have a corpus of text — 10 million words — about a specific topic. 1. What's the best way to "train a model" on that text? 2. Is that even the right term? Or is it using an existing foundational model and then augmenting it? Fine-tuning it? Something else?
18 replies
2 recasts
117 reactions

Lucas Lejeune pfp
Lucas Lejeune
@lucaslejeune
Fine tuning a pre existing model would be the best way I believe. Probably with python, or there's a webui called oogabooga which lets you do just that
0 reply
0 recast
0 reaction