Featherless AI
p/featherless-llm
Run every πŸ¦™ model & more from πŸ€— huggingface. Serverless
Wes George
Featherless AI β€” Run every πŸ¦™ AI model & more from πŸ€— huggingface
Featured
40
β€’
Featherless is a platform to use the very latest open source AI models from Hugging Face. With hundreds of new models daily, you need dedicated tools to keep with the hype. No matter your use-case, find and use the state of the art AI model with Featherless.
Replies
Eugene Cheah
Hello Product Hunters! πŸͺΆ I'm super excited to launch Featherless.AI today! A platform that allows quick access to all the top πŸ¦™ models you see in Hugging Face πŸ€— today. From the 8B to the merged 11B's and Qwen-2 72B's I know it's daunting for folks to download and set up all these large models on GPUs to try them, one at a time. Also renting GPUs can be incredibly expensive as it's typically several dollars per hour. On the flip side, popular providers may not have the various finetune, or even weird model sizes that you would love to try. That's why we built featherless AI. To eventually download and provide access to **all** of hugging face public models. Making all the various open-source AI models more accessible. Starting with a simple $10 or $25 monthly plan, with unlimited usage (within the concurrency limit). So anyone can use it personally, across any model, without worrying about token pricing in their day-to-day usage. Also: as the team who also helps build open-source foundation models (hey RWKV folks!), we fully understand the concerns various folks have over data security and privacy. As such, featherless.ai has no logging of any of your message prompts and completion. Why? Because we are not interested in stealing your data to train our models. So use it any way you want to, dun let someone else tell you how you should use your AI models. πŸͺ„ The magic behind it? At the heart of it, is a custom-built inference infrastructure, built by the team here from Recursal.AI : which was built to be able to dynamically hot-swap models in sub-seconds. Allowing us to rapidly autoscale and dynamically adjust our infrastructure based on what models are popular. Once we have the models downloaded into our cluster. This allows us to provide more models, where previous providers have been limited in ensuring every model hosted has a dedicated GPU for it. πŸ’¬ In summary πŸƒβ€β™‚οΈ Run any of over 450+ huggingface models πŸ› οΈ OpenAI compatible API, use your existing tools or client πŸ“ˆ Unlimited usage (within concurrent usage) πŸ¦™ Starting at $10/month for <15B models πŸ¦… To $25/month for 72B models 🎁 Special for PH: Signup with a subscription, and add referral `hello+producthunt@featherless.ai` for $10 off your next month bill Feel free to ask me anything here on the product hunt launch! And give it a try with a free trial, which allows you to chat with the models (up to a limited amount of messages) at https://featherless.ai
Eugene Cheah
@manisha_hr_ Thanks! Its OpenAI API - so if your devs were already using OpenAI styled API for AI, they should be able to try this instantly (as its the same API)
Eugene Cheah
@amit44 Thanks!
Tony Han
That's crazy that you offer this for $10-20; read about how you are swapping out models in almost real time, that's super cool! Thank you for making open source AI world a better place. As someone without the knowledge on how to deploy LLMs, this is a cool way to test out all kinds of OSAI quickly! Congrats @eugene_cheah !
Eugene Cheah
@tonyhanded Thanks! Hope you find a model you like for your use case, or just to play with =)
Evan Stites-Clayton
How does featherless tie into the novel models that you developed with recursal? Is it similar to OpenRouter? What are some of the top reasons to switch over to Featherless vs using other API's?
Wes George
@the_esc Expect to see the Eagle and RWKV models on featherless shortly - we targeted the Llama-3 based models first since it's more widely known and has a very rich and varied set of fine-tunes, but we definitely believe that the future belongs to RWKV :) We're certainly similar to OpenRouter in providing an enormous range of models servelessly. We aren't listing other providers and we will likely complement OpenRouter by acting as a provider for sufficiently-popular-but-still-niche models. The main reason to use featherless is to experiment and use a wider variety of fine-tunes than anywhere else.
Eugene Cheah
@the_esc PS: RWKV models are now online
Shi Ling
Awesome. It's nice to be able to quickly try out and preview different llama models without having to deploy them on my own servers. Been meaning to the different variety of roleplay models for my dnd sessions. Are you planning to support other open-source models besides llama?
Eugene Cheah
@taishiling - im glad you like being able to play with all the models. Yes, we are currently downloading more models which would be coming online over the next few days. We currently support RWKV and LLaMA based models. We plan to introduce Mistral MoE's next (to be confirmed), followed by potentially larger models. The main reason we started with llama, is because it was the largest pool of all the popular models our initial users and community wanted to use. But the goal remains: ALL huggingface models. One major group at a time
griimnak
Platform is very quick and fluid. Good UI, I think this has great potential. Update the main site with api pricing info, thanks!
Eugene Cheah
@griimnak @jonathanleung To both : the plans include API access!
Richard Cheng
Hey Team Featherless.AI, Congratulations on the launch of Featherless.AI! πŸŽ‰ This is a game-changer for anyone looking to access top models on Hugging Face without the hassle and high costs of setting up and renting GPUs. By offering a wide range of models with simple and affordable pricing plans, you’re making cutting-edge AI technology accessible to everyone. The focus on data security and privacy is commendable, and the custom-built infrastructure to dynamically hot-swap models is truly innovative. This ensures users can explore a vast array of models seamlessly. Excited to see how Featherless.AI transforms the AI landscape!
Hakim Zerhouni
How does Featherless ensure the integration of these models remains user-friendly? It sounds like an amazing platform! The ability to access the latest open-source AI models from Hugging Face is also fantastic.
Eugene Cheah
@hakz Thanks! We use OpenAI compatible API, so any tools that integrates and work with OpenAI can be used with featherless.ai
Muzzammil
Loved this man :)
Eugene Cheah
@humblemuzzu Thank You =)
Hossein Yazdi
This is very handy, great one Eugene!
Eugene Cheah
@hosseinyazdi - Thank you πŸ™
Andreas Sohns
Congrats on the launch! Featherless AI offers great access to Hugging Face models at a fair price Excited to see what it can do!
Eugene Cheah
@andreas_sohns Thanks! Next step: More models! After that: To make it easy for anyone to finetune their own models, for their use case. And use it =)
Kyrylo Silin
Hi Eugene, Featherless.AI sounds like an invaluable tool for accessing and experimenting with various LLMs. How do you manage the computational resources to support such a wide range of models? Also, are there plans to support custom model training or fine-tuning on the platform? Congrats!
Eugene Cheah
@kyrylosilin Our infrastructure auto scale according to current workload, with optimizations specific to our infra provider. We also adjust our setup for the best price to performance GPUs, to lower overall cost (aka no H100s), even for the larger models. In overall, this lets us keep costs to the minimum (no wasted GPUs), according to the user workloads. And yes, we do plan to support additional features like finetuning, and the option to switch to token based pricing, for larger scale commercial users in the future. But for now, this is aimed at individuals who would love to play with all the models
Inoticed your pricing details did not mention anything about API access. Perhaps you could update it so non PH users can see it too :) Congrats on the launch!
Eugene Cheah
@charlestehio Thats a good point, will make it more obvious! Thanks!
William Bowen
Great product, very useful to be able to use and change llama models instantly, without having to set up and deploy a server myself!!
Eugene Cheah
@william_bowen4 Exactly! Useful when you want to test many models!
Kehui Guo
Congrats on launching Featherless AI! It’s incredible to see a platform that simplifies access to the latest AI models from Hugging Face.
Nathan Wilce
Congrats on the launch: I was wondering how this is feasible with the cost of gpus and hosting this many models?
Eugene Cheah
@nathan_wilce Thanks! All the models are pre-downloaded to our clusters, and are on standby. The GPUs spin's up and swap these models on the fly under seconds. This was made possible with our custom built inference systems we built and optimized on. Allowing us to keep cost low, and auto scale to actual usage. PS: 450+ models takes up over 9TB of storage space, we are downloading 100+ more which will be coming online soon. (it takes time haha)
Julien Chaumond
wow this is very cool! @eugene_cheah
Eugene Cheah
@julien_c Thank you πŸ™ : now we just need to scale usage, then servers, then models - until we catch them all from huggingface πŸ˜‰
Julien Chaumond
@eugene_cheah let's do it! BTW https://recursal.ai/ looks quite cool too
Eugene Cheah
@julien_c Thanks! They are actually the same backend / infra πŸ˜‰ As we scale featherless.ai, we do plan to integrate it back to recursal, with the more "complicated features" like automated finetuning, etc. So that recursal will be providing offerings beyond RWKV.
Tim Pouw
Interesting. Shared it with our dev lead!
Eugene Cheah
@tim__pouw Thanks, hope you can find a useful model for your use case, or for fun!