
< session />
Tue, April 21 at 12:10 PM - 12:40 PM GMT+5:30DeepTech OpsTech Architecture
AI agents do not fail because they cannot reason. They fail because the cost of their behaviour is not designed upfront. What begins as a simple agentic workflow with a planner, tools, and retrieval quickly grows into a system of chained calls, recursive loops, repeated memory access, and cascading model invocations. Each step may appear inexpensive, but at scale the combined cost becomes significant. For enterprise systems, this unpredictability is a critical risk.
This session examines the unit economics of agentic AI systems in production. Drawing from real deployments, it identifies where costs accumulate, including orchestration overhead from uncontrolled reasoning loops, excessive tool calls due to poorly defined boundaries, context window growth from unmanaged memory, and escalation from multi-agent coordination. The session also presents practical approaches to address these challenges, including agent loop budgeting, tiered model routing for sub-tasks, caching strategies for tool outputs, and observability patterns that make spend predictable.
What You Will Learn
Who Should Attend
< speaker_info />
Anannya Roy is a Generative AI Architect and Developer Advocate at Amazon Web Services, where she builds production-grade LLM and agentic AI systems for enterprise use. With 8+ years of experience across software engineering, data science, and AI, she specializes in multi-agent architectures, RAG pipelines, and scalable AI systems.
She previously led award-winning GenAI platforms at Capgemini, delivering measurable impact across industries. Anannya is an active speaker and workshop leader, helping developers move from experimentation to production, with a strong focus on reliability, observability, and responsible AI.