Posts
the philosophy of agent frameworks
May 2, 2026Every agent framework is just a disagreement about how much to trust the model. Here's how to actually think about the landscape.
inference engineering
May 1, 2026Your GPU is mostly idle during text generation. The entire inference stack exists to fix that.
the annotated microgpt
Apr 2, 2026Karpathy's nanoGPT, annotated line by line. (I call my version microGPT.)
what happens when you press 'submit' on chatgpt
Mar 26, 2026From raw internet text to a streaming response in your browser - pretraining, alignment, inference, and everything in between.
from residual connections to attention residuals
Mar 18, 2026The residual connection solved deep learning in 2015. Ten years later, Moonshot AI noticed it's been sabotaging deep networks the whole time. The fix is 30 lines of PyTorch.
building pytorch from scratch
Mar 11, 2026Building PyTorch from scratch in ~1300 lines - reverse-mode autograd, conv2d, optimizers. Trains MNIST. No C++, no CUDA, just closures and NumPy.
five questions I'd rather ask
Feb 13, 2026Five questions I'd rather ask than 'derive Kadane's algorithm in 30 minutes pretending you've never seen it.'
building rag for my website
Jan 27, 2026My blogs are too long so I built a RAG over them. Chunking, FAISS, reranking, streaming via Groq - and an eval pipeline because vibes aren't metrics.
all the math you need for ai
Jan 21, 2026Every derivative, gradient, and matrix operation you need to understand deep learning - with code alongside every concept.
numpy & pytorch for dummies
Jan 7, 2026Ground-up NumPy and PyTorch - we build MNIST classifiers two ways to see how the pieces fit together.
how to read a paper (deepseekv3)
Jan 6, 2026Taking DeepSeekV3 apart - MLA, MoE routing, multi-token prediction, and how I read a 53-page paper without losing my mind.
scaling laws
Jan 6, 2026Maybe intelligence is just a lot of compute applied to a lot of data having a lot of parameters. OpenAI wrote a paper on this.
understanding object detection
Nov 21, 2025The full YOLO story from v1 to v10 - grid system, loss function, NMS, mAP, and a PyTorch implementation.
i gave claude a spotify remote
Oct 25, 2025I built an MCP server so Claude can control my Spotify. Play, pause, queue songs by mood - all through natural language.
hello world of neural networks
Oct 25, 2025Neural networks from scratch with actual math - then we build a digit classifier with no frameworks, just NumPy.
how is python written
Apr 27, 2025Forking CPython, modifying the grammar, adding a 'nuke' statement, and other things I found digging through Python's C source.
things i like
Jan 1, 2025A curated list of things I like - books, music, tools, and more.