Brenner pfp
Brenner
@brenner.eth
Starting Andrej Karpathy's Zero to Hero today. Will be posting daily updates on progress and thoughts https://www.youtube.com/playlist?list=PLAqhIrjkxbuWI23v9cThsA9GvCAUhRvKZ
2 replies
4 recasts
18 reactions

Brenner pfp
Brenner
@brenner.eth
ok made it through the first video today Neural Nets at their core are not that complicated: given inputs, and goal outputs, continuously tweak the weights of the inputs until we reach the goal outputs Everything else on top of that is efficiency optimization, increasing the number of inputs, and operationalizing
1 reply
0 recast
0 reaction

Brenner pfp
Brenner
@brenner.eth
video 2: makemore part 1, done make more names from existing names starting with using bigrams (predicting the next letter based on the last letter) two ways to get the weights: - by counting them from the training data (intuitive, works for bigrams) - start with random weights, use gradient decent / a neural net to train (loop) to minimize "loss" (less intuitive imo, works for more input data) both ways get to the same weights! lots of matrix math. gotta be rotating some shapes in your head. I wish there were 3D visualizations of this
1 reply
0 recast
0 reaction