Keynote: How Software Testing Can Increase Agent Autonomy | Luis Héctor Chávez | TestMu 2025
In this keynote session, Luis Héctor Chávez, Chief Technology Officer at Replit, explores the evolving relationship between AI agents, code generation, and software testing. As AI systems increasingly write and improve code on their own, Luis explains why testing remains critical: not only to catch mistakes, but also to increase the autonomy of the agents themselves.
Drawing on real-world insights and a case study of Replit’s flagship AI coding agent, Luis demonstrates how testing acts as a feedback loop that lets agents go further without human intervention. He also presents new progress in automated end-to-end and multimodal testing, showing how Replit is tackling the challenges of visual web applications and improving agent reliability.
Topics covered:
What AI agents are, how they work, and the role of the “agent loop”.
Why testing is essential for scaling agent autonomy and reliability.
Real-world benchmarks (e.g., SWE-bench) showing where agents succeed and fail today.
Replit’s case study: integrating automated testing directly into AI-driven app creation.
How multimodal and end-to-end testing can make agents 3x more autonomous.
Testμ
Testμ (TestMu) is more than just a virtual conference: it is an immersive experience designed by the community, for the community. The 3-day gathering unites testers, developers, community leaders, industry experts, and ecosystem partners from the testing and QA field under a single virtual roof. 😀