top of page


Measuring the Unmeasurable: A Benchmarker's Guide to Agentic AI
For decades, AI benchmarks lived in comfortable isolation. A model answered a question, we checked the answer, we assigned a score. Agentic AI broke that contract. When a model can browse the web, write and execute code, call external APIs, and chain its own decisions across hundreds of steps, a single accuracy number tells you almost nothing about whether the system is actually trustworthy. Evaluating an agent is less like grading an exam and more like auditing a junior empl

Rajeev Gadgil
May 186 min read
Â
Â
Â


The Evolution of Software Performance with Agentic AI
Discover how agentic AI is flipping the performance engineering paradigm. Learn how to evolve from a "loop tuner" to a "constraints governor" with our checklist for AI-proof SLOs.

Rajeev Gadgil
Jan 122 min read
Â
Â
Â
bottom of page

