Your Test Suite Can’t Catch a Hallucination: Talk on AI in Production | Rashi Agrawal | TestMu 2025
In this session at TestMu Conference 2025, Rashi Agrawal, Head of AI Engineering - Senior Manager at GoodLeap, takes on one of the biggest challenges in AI: hallucinations in AI systems. Learn why traditional testing fails to catch these elusive errors and how to build better evaluation frameworks that keep AI models reliable and resilient.
Rashi explains the importance of shifting from testing logic to testing outcomes and introduces tools and techniques to handle AI failures proactively. This session is a must-watch for anyone working with AI in production.
Traditional tests are insufficient for AI systems.
Unit tests can't reliably catch hallucinations in AI models.
Tests should focus on outcomes such as business impact and user trust, not just logic.
Observability and continuous evaluation are essential for AI reliability.
AI testing is a team effort; collaboration between QA, engineering, and domain experts is key to success.
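To make the shift from testing logic to testing outcomes concrete, here is a minimal Python sketch of an outcome-style check. It is not taken from the talk: the function names, the sample cases, and the simple rule it applies (flag any number in an answer that does not appear in the source text) are all assumptions chosen for illustration, and a real evaluation framework would use far richer checks.

```python
import re

# Hypothetical rule for illustration: a number in the answer that never
# appears in the source context is treated as a possible hallucination.
NUMBER = re.compile(r"\d+(?:\.\d+)?")


def unsupported_numbers(answer: str, context: str) -> set[str]:
    """Return numeric tokens found in the answer but not in the context."""
    return set(NUMBER.findall(answer)) - set(NUMBER.findall(context))


def pass_rate(cases: list[dict]) -> float:
    """Fraction of cases whose answer contains no unsupported numbers."""
    passed = sum(
        1 for case in cases
        if not unsupported_numbers(case["answer"], case["context"])
    )
    return passed / len(cases)


# Made-up evaluation cases; in practice these would be sampled model outputs.
cases = [
    {
        "context": "Refunds are available within 30 days of purchase.",
        "answer": "You can request a refund within 30 days.",
    },
    {
        "context": "Refunds are available within 30 days of purchase.",
        "answer": "You can request a refund within 90 days.",
    },
]

print(unsupported_numbers(cases[1]["answer"], cases[1]["context"]))
print(f"pass rate: {pass_rate(cases):.0%}")
```

Unlike a unit test, which asserts one exact output, a check like this scores many sampled responses and yields a rate that can be tracked over time, which is one simple way to approach the continuous evaluation mentioned above.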
Testμ
Testμ (TestMu) Conference is LambdaTest’s annual flagship event, one of the world’s largest virtual software testing conferences dedicated to decoding the future of testing and development. Built by the community, for the community, it’s a space where you’re at the center, connecting, learning, and leading together. From deep-dive sessions on emerging trends in engineering, testing, and DevOps, to hands-on workshops and inspiring culture-driven talks, every experience is designed to keep you at the heart of the conversation.