
Launched on December 19th, 2024
How do you validate an AI agent that could reply in unpredictable ways?
My team and I have released Agentic Flow Testing, an open-source framework where one AI agent autonomously tests another through natural-language conversations.
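To make the idea concrete, here is a minimal sketch of such a simulated conversation loop. The function and parameter names are hypothetical, not LangWatch's actual API: a tester agent plays the user, the agent under test replies, and a judge scores the transcript against the scenario's success criteria.

```python
# Minimal sketch of the agent-tests-agent idea (hypothetical names, not the
# LangWatch API): a "tester" LLM plays the user, the agent under test replies,
# and a judge checks each exchange against the scenario's success criteria.

def simulate_conversation(tester, agent_under_test, judge, scenario, max_turns=5):
    transcript = []
    message = scenario["opening_message"]
    for _ in range(max_turns):
        reply = agent_under_test(message)                   # the app being validated
        transcript.append({"user": message, "agent": reply})
        verdict = judge(transcript, scenario["criteria"])   # pass/fail + reasoning
        if verdict["done"]:
            return verdict
        message = tester(transcript, scenario["goal"])      # next user-style probe
    return judge(transcript, scenario["criteria"])
```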
Every day I speak with AI teams building LLM-powered applications, and something is changing.
I see a new role quietly forming:
The AI Quality Lead as the owner of quality.
LangWatch provides an easy-to-use, open-source platform to improve and iterate on your current LLM pipelines, as well as mitigate risks such as jailbreaking, sensitive data leaks, and hallucinations.
Awesome to see more products launch in this field. One of the biggest challenges for companies creating AI solutions is having the right platform in place as a starting point. Can't wait to explore this product even more for my own team!
I highly recommend LangWatch to anyone looking to elevate their AI-generated content with precision and effectiveness.
We've used LangWatch for output monitoring and evaluation of our RAG application, and I can't recommend it enough. We find value in everything from iterative evaluation with tools like DSPy and RAGAS to production features like jailbreak detection and document & topic tracking, all with a great dashboard and UI. The team has built a great product, and they're very responsive and helpful.
Helped me personally with my AI project. No more AI black box - powering decisions with insights. It helps mitigate safety risks and shows exactly where the bot is hallucinating, which increases quality. It also makes safeguarding against malicious practices like jailbreaking possible. All in all, a wonderful tool for anyone working with LLMs.