The MLP from Tutorial 01 is stateless. Feed it 'c' and it predicts /k/. Feed it 'c' again after reading "ch" and it still predicts /k/. It has no memory of what came before. That's a real problem. In ...
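To make the statelessness concrete, here is a minimal sketch. The rule table and function name are hypothetical toy stand-ins, not the tutorial's actual model: the point is only that a pure function of the current character returns the same phoneme no matter what came before.

```python
# A stateless character-to-phoneme predictor: output depends only on
# the current input. Nothing is stored between calls.
# (Hypothetical toy rule table, not real G2P data.)
RULES = {"c": "/k/", "h": "/h/", "a": "/æ/", "t": "/t/"}

def predict(char: str) -> str:
    """Map one character to a phoneme, with no memory of prior inputs."""
    return RULES.get(char, "/?/")

print(predict("c"))                        # -> /k/
phonemes = [predict(ch) for ch in "ch"]    # "read" the digraph, one char at a time
print(predict("c"))                        # -> still /k/: it can't know it's inside "ch"
```

A model with state would carry something forward between those calls, so the second 'c' could be interpreted differently; that is exactly what the recurrent architectures in the later tutorials add.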
The Transformer has more moving parts than the MLP or LSTM. You're not just wiring layers together — you're wiring them together with attention, and attention has several subtle details that make it ...
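For orientation before wiring anything together, here is the core operation in isolation: a minimal NumPy sketch of single-head scaled dot-product attention. This is an assumption-laden illustration, not the tutorial's implementation, and it deliberately omits the subtle details (masking, multiple heads, learned projections) discussed here.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Minimal single-head attention: softmax(Q K^T / sqrt(d_k)) V."""
    d_k = K.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)               # (seq_q, seq_k) similarities
    scores -= scores.max(axis=-1, keepdims=True)  # stabilize the softmax
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V                            # each output mixes all values

# Toy data: three positions, four-dimensional embeddings.
rng = np.random.default_rng(0)
x = rng.normal(size=(3, 4))
out = scaled_dot_product_attention(x, x, x)       # self-attention: Q = K = V = x
print(out.shape)                                  # (3, 4)
```

Every output row is a weighted average over all input positions, which is how attention lets each position see the whole sequence at once rather than one step at a time.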