Lucas Lejeune on Warpcast

Content pfp

0 reply

0 recast

0 reaction

Dan Romero pfp

Let's say you have a corpus of text — 10 million words — about a specific topic. 1. What's the best way to "train a model" on that text? 2. Is that even the right term? Or is it using an existing foundational model and then augmenting it? Fine-tuning it? Something else?

15 replies

6 recasts

61 reactions

Lucas Lejeune pfp

Fine tuning a pre existing model would be the best way I believe. Probably with python, or there's a webui called oogabooga which lets you do just that

0 reply

0 recast

0 reaction