Dan Romero
@dwr.eth
Let's say you have a corpus of text (10 million words) about a specific topic.

1. What's the best way to "train a model" on that text?
2. Is that even the right term? Or is it using an existing foundational model and then augmenting it? Fine-tuning it? Something else?
22 replies
7 recasts
77 reactions

Ruby🎩🔵🐹
@ruby1998
For a corpus of 10 million words, the main options are fine-tuning a pre-existing language model or training a new model from scratch. Which one fits best depends on the specific task and the data available.
0 reply
0 recast
0 reaction
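
Whichever route is taken (fine-tuning an existing model or augmenting it with the corpus at query time), a common first step is to cut the raw text into pieces small enough for a model to process. The following is a minimal sketch of that step only, not a training recipe. The function name `chunk_words` and the defaults of 512 words per chunk with a 64-word overlap are illustrative assumptions, not values taken from the thread.

```python
def chunk_words(text, chunk_size=512, overlap=64):
    """Split text into overlapping windows of whitespace-separated words.

    chunk_size and overlap are illustrative defaults (assumptions);
    suitable values depend on the model and the task.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be >= 0 and smaller than chunk_size")

    words = text.split()
    step = chunk_size - overlap  # how far each new window advances
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        # Stop once a window has reached the end of the text, so the
        # final chunk is not followed by a redundant shorter one.
        if start + chunk_size >= len(words):
            break
    return chunks


if __name__ == "__main__":
    sample = " ".join(str(i) for i in range(10))
    for chunk in chunk_words(sample, chunk_size=4, overlap=1):
        print(chunk)
```

With these defaults, a 10-million-word corpus would yield roughly 22,000 chunks. The overlap means a sentence that straddles a chunk boundary still appears whole in at least one chunk, at the cost of some duplicated text.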