Product Hunt logo dark
  • Launches
    Coming soon
    Upcoming launches to watch
    Launch archive
    Most-loved launches by the community
    Launch Guide
    Checklists and pro tips for launching
  • Products
  • News
    Newsletter
    The best of Product Hunt, every day
    Stories
    Tech news, interviews, and tips from makers
    Changelog
    New Product Hunt features and releases
  • Forums
    Forums
    Ask questions, find support, and connect
    Streaks
    The most active community members
    Events
    Meet others online and in-person
  • Advertise
Subscribe
Sign in
Subscribe
Sign in

Selene by Atla

Frontier models to evaluate generative AI

141 followers

Frontier models to evaluate generative AI

141 followers

Visit website
LLMs
Find and fix AI mistakes at scale, and build more reliable GenAI applications. Use our LLM-as-a-Judge to test and evaluate prompts and model versions.
  • Overview
  • Launches1
  • Reviews
  • Alternatives
  • Team
  • Awards
  • More
Company Info
atla-ai.com/api
Selene by Atla Info
Y Combinator
Launched in 2025View 1 launch
Forum
p/selene-1
  • Blog
  • •
  • Newsletter
  • •
  • Questions
  • •
  • Forums
  • •
  • Product Categories
  • •
  • Apps
  • •
  • About
  • •
  • FAQ
  • •
  • Terms
  • •
  • Privacy and Cookies
  • •
  • X.com
  • •
  • Facebook
  • •
  • Instagram
  • •
  • LinkedIn
  • •
  • YouTube
  • •
  • Advertise
© 2025 Product Hunt
SocialLinkedInX

Similar Products

ChatGPT by OpenAI
ChatGPT by OpenAI
Get answers. Find inspiration. Be more productive.
4.8(1.1K reviews)
AILLMs
OpenAI
OpenAI
APIs and tools for building AI products
4.9(656 reviews)
LLMsAI Chatbots
Claude by Anthropic
Claude by Anthropic
A family of foundational AI models
4.9(581 reviews)
LLMsAI Chatbots
Gemini
Gemini
Google's answer to GPT-4
4.8(136 reviews)
LLMsAI Chatbots
Hugging Face
Hugging Face
The AI community building the future.
4.9(79 reviews)
LLMsAI Infrastructure Tools
View more
Selene 1 gallery image
Selene 1 gallery image
Free Options
Launch tags:
API•Developer Tools•Artificial Intelligence
Launch Team / Built With
Garry TanYoung Sun ParkMathias Leys
AWS
React
Cursor

What do you think? …

Maurice Burger
Maurice Burger

Selene by Atla

Maker
📌

Hey Product Hunt! Maurice here, CEO and co-founder of Atla. 


At Atla, we’re a team of researchers and engineers dedicated to training models and building tools that monitor AI performance. 


If you’re building with AI, you know that good evals are critical to ensuring your AI apps perform as intended.

Turns out, getting accurate evals that assess what matters for your use case is challenging. Human evaluations don’t scale and general-purpose LLMs are inconsistent evaluators. We’ve also heard that default eval metrics aren’t precise enough for most use cases, and prompt engineering custom evals from scratch is a lot of work. 

🌖 Our solution

  • Selene 1: a LLM Judge trained specifically for evals. Selene outperforms all frontier models (OpenAI’s o-series, Claude 3.5 Sonnet, DeepSeek R1, etc.) across 11 benchmarks for scoring, classifying, and pairwise comparisons.

  • Alignment Platform: a tool that helps users automatically generate, test, and refine custom evaluation metrics with just a description of their task, little-to-no prompt engineering required.


🛠️ Who is it for?
Builders of GenAI apps who need accurate and customizable evals—whether you’re fine-tuning LLMs, comparing outputs, or monitoring performance in production. Evaluate your GenAI products with Selene and ship with confidence.

You can start with our API for free. Our Alignment Platform is available for all users.

We’d love your feedback in the comments! What challenges have you faced with evals?

Report
5mo ago
Maurice Burger
Maurice Burger

Selene by Atla

Maker

@masump Hey Masum! Selene won't adapt itself out of the box, but we've built the alignment platform to make it easy to continually align your LLM judge to changing requirements.

Report
5mo ago
Jonas Urbonas
Jonas Urbonas
Fable Wizard

Fable Wizard

Keeping AI performance in check is no small task, and having an evaluator specifically trained for this sounds like a game-changer! How does Selene handle nuanced tasks where context is key—does it adapt based on different use cases?

Report
5mo ago
Maurice Burger
Maurice Burger

Selene by Atla

Maker

@jonurbonas Hey Jonas! Yes indeed. We trained Selene to be easily customizable. It excels at following evaluation criteria and score rubrics closely, and responds well to fine-grained steering. For instance, developers using LLM Judges frequently encounter the problem of evals getting saturated, i.e. model responses receiving high scores too frequently, making the eval less useful. In such situations, one might want to “make it harsher” such that fewer responses receive high scores.

you can read more here: https://www.atla-ai.com/post/selene-1

Report
5mo ago
Kyle
Kyle

Selene by Atla

Maker

@jonurbonas Thanks for the support! We built the alignment platform to make it super straightforward to adapt Selene to different use cases. Just describe your use case in natural language and the platform will auto-generate eval prompts to assess your AI app.

To your point, we trained Selene to be steerable to custom evals. For example, you might want to “make it harsher” so fewer responses receive high scores. Alternatively, you might want to “flip the scores” so that the eval gives high scores to failures rather than successes. Graph 👈 Here's a graph from our testing on benchmarks that shows this

Report
5mo ago
Maria-Cristina Muntean
Maria-Cristina Muntean

This is actually super interesting, and I'll check it out!

Report
5mo ago
Young Sun Park
Young Sun Park

Selene by Atla

Maker

@mia_k1 Thank you! Let us know if you have any questions

Report
5mo ago
Intercom
Intercom — Startups get 90% off Intercom + 1 year of Fin AI Agent free
Startups get 90% off Intercom + 1 year of Fin AI Agent free
Promoted

Do you use Selene by Atla?

Reviews
Helpful
Review Selene by Atla?Be the first to review Selene by Atla