Manouk Draisma

25d ago

Use an Agent to test your Agent

How do you validate an AI agent that could reply in unpredictable ways?

My team and I have released Agentic Flow Testing, an open-source framework in which one AI agent autonomously tests another through natural-language conversations.
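To make the idea concrete, here is a minimal sketch of one agent testing another through a conversation. It is not the Agentic Flow Testing API: `agent_under_test`, `TesterAgent`, the script, and the success phrase are all hypothetical names invented for illustration, and both roles are rule-based stubs where a real setup would presumably call an LLM.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple

# Hypothetical type alias: an agent maps an incoming message to a reply.
Agent = Callable[[str], str]


def agent_under_test(message: str) -> str:
    """Toy support agent (assumption: stands in for a real LLM-backed agent)."""
    text = message.lower()
    if "refund" in text:
        return "I can help with that. What is your order number?"
    if "order" in text:
        return "Thanks, your refund has been started."
    return "Sorry, could you rephrase that?"


@dataclass
class TesterAgent:
    """Scripted tester (assumption: a real tester would generate turns with an LLM)."""
    script: List[str]
    success_phrase: str
    transcript: List[Tuple[str, str]] = field(default_factory=list)

    def run(self, target: Agent) -> bool:
        """Play each scripted user turn, record the replies, judge the last one."""
        reply = ""
        for user_turn in self.script:
            reply = target(user_turn)
            self.transcript.append((user_turn, reply))
        return self.success_phrase in reply.lower()


tester = TesterAgent(
    script=["Hi, I want a refund", "My order number is 1234"],
    success_phrase="refund has been started",
)
passed = tester.run(agent_under_test)

for user_turn, reply in tester.transcript:
    print(f"user:  {user_turn}")
    print(f"agent: {reply}")
print("PASS" if passed else "FAIL")
```

The tester judges the conversation by its outcome instead of matching exact strings turn by turn, which is one way to cope with replies that vary from run to run.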

Manouk Draisma

29d ago

Is there an AI Quality Lead in your Dev/AI team?

Every day I speak with AI teams building LLM-powered applications, and something is changing.

I see a new role quietly forming:

The AI Quality Lead, as the quality owner.

Samar Ali

1yr ago

LangWatch - Understand, measure and improve your LLMs

LangWatch provides an easy-to-use, open-source platform to improve and iterate on your current LLM pipelines, and to mitigate risks such as jailbreaking, sensitive data leaks, and hallucinations.

Manouk Draisma

5mo ago

LangWatch Optimization Studio - Evaluate & optimize your LLM performance with DSPy

LangWatch is the ultimate platform for LLM performance monitoring and optimization. Streamline pipelines, analyze metrics, evaluate prompts, and ensure quality. Powered by DSPy, we help AI developers ship 10x faster with confidence. Create an account for free.