VLM Evaluation
Claude, chessboards, and spatial reasoning
A small chessboard-to-FEN experiment showed where Claude vision struggles: not global understanding, but exact square-level localization.
Field notes, practical build stories, and newsletter issues from the work behind VideoDB.
Short notes on infrastructure, reliability, latency, retrieval, and the small fixes that matter in production.
VLM Evaluation
A small chessboard-to-FEN experiment showed where Claude vision struggles: not global understanding, but exact square-level localization.
Operations
A short field note on replacing repeated bare requests calls with a shared session so Python services can reuse HTTP connections.
Frontend Performance
A practical note on preflight caching, custom auth headers, and why OPTIONS requests quietly dominate app latency on slower connections.
Follow the architecture choices, API decisions, and tradeoffs behind apps, demos, and agent tools built with VideoDB.
Build Note · Lalit Gupta
How I built a local-first screen recorder on top of the VideoDB capture SDK that turns every recording into a searchable, agent-ready artifact before you finish copying the share link.
Build Note · VideoDB Team
How we built Focusd as a local-first desktop app that records work sessions, indexes screen activity with VideoDB, and turns raw events into useful productivity coaching.
Build Note · VideoDB Team
How we built a VideoDB-powered OpenClaw skill that records, indexes, searches, summarizes, and clips an agent's remote desktop without changing the agent itself.
Build Note · Sankalp Nagaonkar
How we built Deep Search as a retrieval loop for finding exact moments in video using planning, indexing, validation, recovery, and follow-up state.
Build Note · Rohit Garg
A short engineering note on turning a developer session into searchable context with VideoDB Capture, RTStreams, local events, and agent-side retrieval.
Build Note · Om
A practical build story for a local-first call intelligence app that records calls, transcribes speakers, generates live nudges, and exports structured Markdown.
Concise updates, technical notes, and build context from the team working on video infrastructure and agents.
Newsletter · Video agents · model watch · VideoDB updates
Why agent runs need playable evidence, thinking tokens in vision models and agentic video streams.
Read the issueSubscribe
Technical notes, build stories, and updates from our team.