Read How to Evaluate Multimodal VLMs for Your Video Use Case#001 · Research note · May 15, 2026
Sankalp Nagaonkar
A practical workflow for evaluating video VLM setups with VideoDB and Langfuse, from task definition and dataset design to tracing, scoring, and deployment decisions.
Read Do Thought Streams Matter? Evaluating Reasoning in Gemini Vision-Language Models for Video Scene Understanding#002 · arXiv preprint arXiv:2604.11177 · 2026/4/13
Shivam Sharma, Sankalp Nagaonkar, Ashish Choithani, Ashutosh Trivedi
Benchmarks how internal reasoning traces affect video scene understanding in Gemini models, including where quality gains plateau and how tight budgets increase compression-step hallucination.
Read Benchmarking Vision-Language Models on Optical Character Recognition in Dynamic Video Environments#003 · arXiv preprint arXiv:2502.06445 · 2025/2/10
Sankalp Nagaonkar, Augustya Sharma, Ashish Choithani, Ashutosh Trivedi
Introduces an open-source benchmark for evaluating VLMs on OCR tasks in dynamic video environments across 1,477 manually annotated frames.
Read Modeling the Spread and Control of COVID-19#004 · Systems 9(3), 53 · 2021/7/13
Ashutosh Trivedi, Nanda Kishore Sreenivas, Shrisha Rao
Presents an agent-based extension of epidemic modeling for confined spaces, comparing black-box and glass-box views while simulating lockdowns, social distancing, hygiene, quarantine, and hospitalization policies.
Read Agent-Based Modeling of Emergency Evacuations Considering Human Panic Behavior#005 · IEEE Transactions on Computational Social Systems 5(1), 277–288 · 2018/1/17
Ashutosh Trivedi, Shrisha Rao
Models emergency evacuations with agents whose panic behavior is shaped by psychological and physical factors, then uses simulations to identify bottlenecks and compare evacuation strategies.
Read Solving Logical Puzzles with Natural Language Processing#006 · PyCon India talk · 2015
Ashutosh Trivedi
A practical talk that starts from rule-based NLP features and WordNet, then motivates distributed word representations and deep learning for solving logical puzzles from natural language.
Read Chitrakāvya#007 · Research project · Ongoing
Ashutosh Trivedi
Sanskrit picture-poems rebuilt as computational artifacts, connecting sloka, geometry, knight's tours, palindromes, and early algorithmic thinking.
Read Emergence#008 · Research project · Ongoing
Ashutosh Trivedi
Notes on multi-agent systems, collective intelligence, alignment, swarm behavior, and computational models of consciousness.