Athina AI

Monitor LLMs and automatically detect hallucinations in prod

4.8 • 5 reviews • 1.1K followers

Athina helps developers monitor and evaluate their LLM applications in production. Get complete visibility into your RAG pipeline and use our 40+ preset eval metrics to detect hallucinations and measure the performance of your AI.
This is the 2nd launch from Athina AI.
Athina

Build, test and monitor AI apps and agents
Athina was ranked #2 of the day for December 5th, 2024
Athina is a complete AI development platform that enables AI teams to build, test, and monitor LLM-powered applications. Teams can collaborate on prompts, flows, and datasets, run experiments, compare and measure LLM outputs, and monitor in production.
Free Options

Shiv Sakhuja
Hey PH, I'm Shiv, a co-founder of Athina AI.

Building GenAI applications is a lot of hard work. Most teams are able to build prototypes quickly, but end up spending months getting their AI to work well in production.

A big part of the problem is a lack of good tooling. We've seen countless teams using scripts, Python notebooks, spreadsheets, and even Google Docs to work with their prompts and datasets. But these tools are not designed for building production-ready AI. 😥

Which is why many teams including unicorns like Perplexity, Meesho, and Doximity are choosing to use Athina... 🙌

We've spent the last 1.5 years building a comprehensive platform for teams building production-grade AI features.

Athina is an end-to-end LLM development platform that teams can use across the AI development lifecycle:

✅ Prompt Management: Manage, iterate, test and version control prompts
✅ Flows: Build powerful multi-step AI workflows in a notebook
✅ Datasets: Work with your datasets in a powerful spreadsheet-like UI
✅ Evaluation: Run evals on uploaded datasets, in CI / CD, or automatically on traces
✅ Experiments: Run experiments to compare outputs from different prompts / models
✅ Annotation: Have humans annotate your datasets for RLHF or human evaluation
✅ Observability: Log traces, run evals continuously in production and monitor for regressions

And here's the best part: Athina is designed for both non-technical AND technical users to collaborate. So your entire team can work together to build AI features without being bottlenecked on engineers to build flows or run experiments. No-code + SDK! 🔥

-----

🟠 We're backed by Y Combinator and have been working with some of the best AI teams in the world over the last year. We're really excited to finally bring this platform to Product Hunt for everyone!

🌐 Website: https://athina.ai
🗓️ If you're interested in learning more, or seeing a demo, please feel free to get in touch with us: https://cal.com/shiv-athina/30min
Tony Tong
@shivsak Athina looks like a solid solution for streamlining AI development. The full stack from prompt management to observability is impressive. How does the platform handle version control for datasets and experiments? Looking forward to learning more.
Greg Ratner
@shivsak looks really powerful! Can't wait to try it. Congrats on all the progress!! 🚀
Aditya Lahiri
The LLM platform every product team needs - congratulations guys!!!
Sam @CRANQ
Looks insanely cool, might even be able to use this for a current project we're running - I have to say, big fan of the name too - That was what initially caught my eye.
Himanshu Bamoria
Glad to hear that, @cranqnow! Let's connect to discuss this further. Checked out Cranq, and it looks pretty useful for us too!