2d ago
How do you validate an AI agent that could reply in unpredictable ways?
My team and I have released Agentic Flow Testing—an open-source framework where one AI agent autonomously tests another through natural language conversations.
6d ago
Every day I speak with AI teams building with LLM-powered applications and something is changing.
I see a new role is quietly forming:
The AI Quality lead as the quality owner.
4mo ago
12mo ago