Keynote: How Software Testing Can Increase Agent Autonomy | Luis Héctor Chávez | TestMu 2025

About the talk

In this keynote session, Luis Héctor Chávez, Chief Technology Officer at Replit, explores the evolving relationship between AI agents, code generation, and software testing. As AI systems increasingly write and improve code on their own, Luis explains why testing remains critical - not just to catch mistakes, but to actually increase the autonomy of AI agents themselves.

Through real-world insights and a case study of Replit’s flagship AI coding agent, Luis demonstrates how testing serves as a powerful feedback loop that allows agents to go further without human intervention. He also unveils new progress in automated end-to-end and multimodal testing, showing how Replit is tackling the challenges of visual web applications and improving agent reliability.

Key Takeaways:

What AI agents are, how they work, and the role of the “agent loop”.

Why testing is essential for scaling agent autonomy and reliability.

Real-world benchmarks (e.g., SWE-bench) showing where agents succeed and fail today.

Replit’s case study: integrating automated testing directly into AI-driven app creation.

How multimodal and end-to-end testing can make agents 3x more autonomous.
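The idea of tests acting as a feedback loop for an agent can be illustrated with a small sketch. This is not Replit's implementation, which the summary does not describe; the names `run_agent_loop`, `toy_propose`, and `toy_run_tests` are hypothetical, and the "model" here is a hard-coded stand-in. It only shows the general shape: propose code, run tests, feed failures back, and stop when the tests pass or the iteration budget runs out.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class LoopResult:
    code: str
    passed: bool
    iterations: int
    feedback_log: List[str] = field(default_factory=list)


def run_agent_loop(
    propose: Callable[[str, List[str]], str],
    run_tests: Callable[[str], List[str]],
    task: str,
    max_iterations: int = 5,
) -> LoopResult:
    """Repeat propose -> test -> feed failures back, within a fixed budget."""
    feedback: List[str] = []
    log: List[str] = []
    code = ""
    for i in range(1, max_iterations + 1):
        code = propose(task, feedback)   # hypothetical model call
        failures = run_tests(code)       # empty list means all tests passed
        if not failures:
            return LoopResult(code, True, i, log)
        feedback = failures              # failures become the next prompt's context
        log.extend(failures)
    # Budget exhausted: this is where a human would have to step in.
    return LoopResult(code, False, max_iterations, log)


def toy_propose(task: str, feedback: List[str]) -> str:
    # Stand-in for a model: the first attempt is wrong, and it "fixes"
    # the bug once it has seen any test failure.
    if feedback:
        return "def add(a, b):\n    return a + b\n"
    return "def add(a, b):\n    return a - b\n"


def toy_run_tests(code: str) -> List[str]:
    # Execute the candidate code in an isolated namespace and check one case.
    namespace: dict = {}
    exec(code, namespace)
    failures: List[str] = []
    if namespace["add"](2, 3) != 5:
        failures.append("add(2, 3) should be 5")
    return failures


result = run_agent_loop(toy_propose, toy_run_tests, "write add(a, b)")
print(result.passed, result.iterations)
```

In this toy run the first proposal fails its test, the failure message is passed back, and the second proposal passes. Without the test step the loop would have no signal to continue on its own, which is the sense in which testing supports autonomy.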

Testμ

Testμ (TestMu) is more than just a virtual conference. It is an immersive experience designed by the community, for the community: a 3-day gathering that brings together testers, developers, community leaders, industry experts, and ecosystem partners in the testing and QA field under a single virtual roof. 😀

More Videos from TestMu 2025

Opening Note by Joe Colantonio

Keynote: Air Fryers, Automation, and AI

Intent Over Scripts: Modernizing Software Testing with AI

CI and the Great Flakiness Adventure

AI for Accessibility: Empowering Inclusive Digital Experiences

Panel Discussion: AI and Community: Shifting Roles, Rising Impact

Ask Me Anything: Future-Proof Your Career: AI, Testing & Path Ahead

2025: The Agentic Shift – Are We Reasoning, Or Just Retrieving Smarter?

Accelerating Success: How to Optimize Value Delivery with DORA

Rapid Threat-to-Test for Agents

How to Build Enterprise-Grade AI Agents Using Robust Evaluation

Ship Code. Without Writing It.

Exploratory Testing with AI

AI-Powered Debugging & Browser Automation with Playwright MCP

Network Control for End-to-End Web Testing

Keynote: Zero-UI Engineering: Architecting Systems for Agent Experience (AX)

So you think a new tool will help? Here’s an idea-t to think about…

AI, Automation & DevEx: Fueling High-Velocity Engineering

Fast Doesn’t Mean Fragile: Delivering AI-Powered Software at Scale

How to Test LLM Agents

The Great Reckoning: How AI is Exposing the Existential Crisis of Software Testing

Your Test Suite Can’t Catch a Hallucination: Real Talk on AI in Production

Event Driven Architecture: Love Triangle in Integration Testing

When Life Gives You Lemons… Are You Counting Them or Making Lemonade?

AI-Driven Quality Engineering Practices

Transforming Retail with Quality Engineering for Seamless Digital Experiences

Role of Quality Engineering in Shaping the Future of Financial Services

Opening Note Day 2

Build Your Testing Sidekick: Custom Tools with Model Context Protocol

Reactive Browser Testing with WebDriver BiDi

How Software Testing can Increase Agent Autonomy

What can go wrong with AI in testing?

Code It Forward: Making Your Mark in Open Source

Testing the Untestable: Agent to Agent Testing

Testing Early, Testing Right - Balancing Early Testing with Real-World Reliability

The Enterprise AI Playbook: Strategies for Scaling AI in Quality Engineering

Advanced Playwright with AI

Generative to Agentic to Quantum - The Evolution of AI

Test Data Key to Effective Test Coverage

QE Strategic Shift: What's Changing with AI, Automation, and Speed?

Building AI Fluency: Leading Teams Through the Learning Curve

The Practical Automation Playbook

Building a Handwriting Recognition System for the New York Times Crossword

Agentic Cloud: Using Agents to Build Tomorrow’s Cloud

QA to QE: Scaling Quality with Ownership, Code, and Culture

Automated Test Data Portal for Financial Services

QA in the Age of AI: Enhancing Agent Reliability Through Evaluation-Driven Development

Ensuring quality testing in an AI-driven world

AI-Driven Strategies for Scalable & Resilient Performance Engineering

Day 3 Opening Note

Mastering Appium 3: Architecture, Gestures & Beyond

From Zero to MCP: Automating Test Environments for DevOps & QA

AI & GenAI in Quality Engineering: Crawl, Walk, Run

Embracing Agentic AI: From Autonomous Goals to Enterprise Guarantees

Oops, AI Did It Again: How to Get AI to Stop Being Weird and Actually Be Useful

Should We Let AI Take Over Test Automation Completely?

From Hours to Minutes: Run Thousands of CI Tests in Just Minute

Evaluating RAG Applications: From Retrieval to Response Quality

Stop Breaking Your Teams: Seeing the Whole Instead of Pieces

Surviving and Thriving with AI in QA

The Quality Leadership Shift: From Compulsiveness to Cautiousness

Full Court Quality: Lacing Up for Speed, Stability & Style

Navigating Mobile App Testing and App Store Rejection: From Review to Release

Randomized testing: Gotta Catch ‘Em All

Balancing release & sprint delivery speed with thorough testing

Building for AI at Scale: Infrastructure, Integrity, and Innovation

Trusting the Machine: Building Confidence in AI-Driven Testing Decisions

Observability - Holistic Quality across Software Systems

From SDLC to ADLC: The Enterprise Agent Development Lifecycle

Agentic Testing: Your Skilled Human Tester

Evolution of Quality Engineering in Financial Services

Closing Note Day 3