Athina helps developers monitor and evaluate their LLMs applications in production.
Get complete visibility into your RAG pipeline and use our 40+ preset eval metrics to detect hallucinations and measure performance of your AI.
Athina is a complete AI development platform that enables AI teams to build, test, and monitor LLM-powered applications.
Teams can collaborate on prompts, flows and datasets, run experiments, compare / measure LLM outputs and monitor in production.
Hey PH,
I'm Shiv, a co-founder of Athina AI.
Building GenAI applications is a lot of hard work. Most teams are able to build prototypes quickly, but end up spending months getting their AI to work well in production.
A big part of the problem is a lack of good tooling. We've seen countless teams using scripts, python notebooks, spreadsheets, and even google docs to work with their prompts and datasets.
But these tools are not designed for building production-ready AI. 😥
Which is why many teams including unicorns like Perplexity, Meesho, and Doximity are choosing to use Athina... 🙌
We've spent the last 1.5 years building a comprehensive platform for teams building production-grade AI features.
Athina is an end-to-end LLM development platform that teams can use across the AI development lifecycle:
✅ Prompt Management: Manage, iterate, test and version control prompts
✅ Flows: Build powerful multi-step AI workflows in a notebook
✅ Datasets: Work with your datasets in a powerful spreadsheet-like UI
✅ Evaluation: Run evals on uploaded datasets, in CI / CD, or automatically on traces
✅ Experiments: Run experiments to compare outputs from different prompts / models
✅ Annotation: Have humans annotate your datasets for RLHF or human evaluation.
✅ Observability: Log traces, run evals continuously in production and monitor for regressions
And here's the best part – Athina is designed for both non-technical AND technical users to collaborate.
So your entire team can work together to build AI features without being bottlenecked on engineers to build flows or run experiments. No-code + SDK! 🔥
-----
🟠 We're backed by Y Combinator and have been working with some of the best AI teams in the world over the last year.
We're really excited to finally bring this platform to Product Hunt for everyone!
🌐 Website: https://athina.ai
🗓️ If you're interested in learning more, or seeing a demo, please feel free to get in touch with us: https://cal.com/shiv-athina/30min
@shivsak Athina looks like a solid solution for streamlining AI development. The full stack from prompt management to observability is impressive. How does the platform handle version control for datasets and experiments? Looking forward to learning more
Looks insanely cool, might even be able to use this for a current project we're running - I have to say, big fan of the name too - That was what initially caught my eye.
Athina AI
Muku.ai
Troops
OpenFunnel(YC F24)
Athina AI
Athina AI