
Launched on March 19th, 2025
Launched on October 16th, 2024
Predibase has released the first Reinforcement Fine-Tuning platform, promising a groundbreaking approach to customizing LLMs using reinforcement learning. Use RFT to train open-source LLMs that outperform GPT-4, even when labeled data is limited.
All 25 fine-tuned models… 📈 Outperform GPT-4, GPT-3.5-turbo, and mistral-7b-instruct for specific tasks ⚡️ Are cost-effectively served from a single GPU through LoRAX 💰 Were trained for less than $8 each on average
The Predibase Inference Engine, powered by LoRA eXchange, Turbo LoRA, and seamless GPU autoscaling, serves fine-tuned SLMs at speeds 3-4 times faster than traditional methods and confidently handles enterprise workloads of 100s of requests per second.