Posts

the philosophy of agent frameworks

May 2, 2026

Every agent framework is just a disagreement about how much to trust the model. Here's how to actually think about the landscape.

inference engineering

May 1, 2026

Your GPU is mostly idle during text generation. The entire inference stack exists to fix that.

the annotated microgpt

Apr 2, 2026

Karpathy's nanoGPT, annotated line by line. (I call my version microGPT.)

what happens when you press 'submit' on chatgpt

Mar 26, 2026

From raw internet text to a streaming response in your browser - pretraining, alignment, inference, and everything in between.

from residual connections to attention residuals

Mar 18, 2026

The residual connection solved deep learning in 2015. Ten years later, Moonshot AI noticed it's been sabotaging deep networks the whole time. The fix is 30 lines of PyTorch.

building pytorch from scratch

Mar 11, 2026

Building PyTorch from scratch in ~1300 lines - reverse-mode autograd, conv2d, optimizers. Trains MNIST. No C++, no CUDA, just closures and NumPy.

five questions I'd rather ask

Feb 13, 2026

Five questions I'd rather ask than 'derive Kadane's algorithm in 30 minutes pretending you've never seen it.'

building rag for my website

Jan 27, 2026

My blogs are too long so I built a RAG over them. Chunking, FAISS, reranking, streaming via Groq - and an eval pipeline because vibes aren't metrics.

all the math you need for ai

Jan 21, 2026

Every derivative, gradient, and matrix operation you need to understand deep learning - with code alongside every concept.

numpy & pytorch for dummies

Jan 7, 2026

Ground-up NumPy and PyTorch - we build MNIST classifiers two ways to see how the pieces fit together.

how to read a paper (deepseekv3)

Jan 6, 2026

Taking DeepSeekV3 apart - MLA, MoE routing, multi-token prediction, and how I read a 53-page paper without losing my mind.

scaling laws

Jan 6, 2026

Maybe intelligence is just a lot of compute applied to a lot of data having a lot of parameters. OpenAI wrote a paper on this.

understanding object detection

Nov 21, 2025

The full YOLO story from v1 to v10 - grid system, loss function, NMS, mAP, and a PyTorch implementation.

i gave claude a spotify remote

Oct 25, 2025

I built an MCP server so Claude can control my Spotify. Play, pause, queue songs by mood - all through natural language.

hello world of neural networks

Oct 25, 2025

Neural networks from scratch with actual math - then we build a digit classifier with no frameworks, just NumPy.

how is python written

Apr 27, 2025

Forking CPython, modifying the grammar, adding a 'nuke' statement, and other things I found digging through Python's C source.

things i like

Jan 1, 2025

A curated list of things I like - books, music, tools, and more.