VLM Evaluation
Claude, chessboards, and spatial reasoning
A small chessboard-to-FEN experiment showed where Claude vision struggles: not global understanding, but exact square-level localization.
Engineering / Field Notes
Short technical notes, links, lessons, and observations from production work.
VLM Evaluation
A small chessboard-to-FEN experiment showed where Claude vision struggles: not global understanding, but exact square-level localization.
Operations
A short field note on replacing repeated bare requests calls with a shared session so Python services can reuse HTTP connections.
Frontend Performance
A practical note on preflight caching, custom auth headers, and why OPTIONS requests quietly dominate app latency on slower connections.
Operations
A field note from a simple offline Postgres migration: why pg_dump was enough, where read-only mode fits, and why the restore drill matters more than the dump command.
Media Infra
Why API Gateway compression can make large responses look safe in testing, then still fail when Lambda enforces the raw payload limit first.
Build Systems
Slim bases, same-layer cleanup, CPU-only PyTorch, pinned wheels, and the unglamorous work of keeping deploys fast.