AI technology Neural Networks - Zero to Hero
Let's build GPT: from scratch, in code, spelled out
by Andrej Karpathy https://www.youtube.com/watch?v=kCc8FmEb1nY&t=1s
- GPT --> Generatively Pre-trained Transformer
- Training on a small toy dataset (tiny Shakespeare)
- Build nanoGPT from scratch
- Character-level language model
- Characters --> integers
- Very simple, results in long sequences
Tokenize text
- Convert raw text as a string --> some sequence of integers according to some vocabulary of possible elements
- Example tokenizers