Groq® - Hyperfast LLM inference running on custom-built LPUs

by Chris Messina (Top Hunter)

The LPU™ Inference Engine (LPU stands for Language Processing Unit™) is a new type of end-to-end processing unit system that delivers extremely fast inference, on the order of ~500 tokens/second.
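To put ~500 tokens/second in perspective, a 1,000-token response streams in about two seconds. Below is a minimal sketch of how one might measure end-to-end throughput against Groq's OpenAI-compatible chat completions endpoint; the model name is a placeholder assumption, and wall-clock timing includes network and queueing overhead, so it will understate the raw decode rate.

```python
# A minimal sketch of measuring end-to-end throughput against Groq's
# OpenAI-compatible chat completions endpoint. The model name is a
# placeholder assumption; substitute any model your account exposes.
import os
import time

import requests

API_URL = "https://api.groq.com/openai/v1/chat/completions"
API_KEY = os.environ["GROQ_API_KEY"]  # assumes your key is set in the environment
MODEL = "mixtral-8x7b-32768"          # assumption: pick any available model

payload = {
    "model": MODEL,
    "messages": [
        {"role": "user", "content": "Explain what an LPU is in one paragraph."}
    ],
}

start = time.perf_counter()
resp = requests.post(
    API_URL,
    headers={"Authorization": f"Bearer {API_KEY}"},
    json=payload,
    timeout=60,
)
resp.raise_for_status()
elapsed = time.perf_counter() - start

# OpenAI-compatible responses report completion token counts in `usage`.
tokens = resp.json()["usage"]["completion_tokens"]

# Wall-clock throughput includes network and queueing time, so it will
# understate the raw decode rate that the ~500 tokens/s figure refers to.
print(f"{tokens} tokens in {elapsed:.2f}s ≈ {tokens / elapsed:.0f} tokens/s")
```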

Replies

Johan Steneros
It is fast, that is for sure. Where can I get more information about the chips and hardware? Is there a GPU cloud service?
Avi Basnet
This seems extremely interesting. I'm curious: what have you seen to be the biggest use case for this LLM?
Peter Schout
Congratulations! The speed/accuracy is incredible; no wonder NVDA took a dip 😯
Cuong Vu
Groq is a promising product, and I believe your detailed insights could attract even more supporters, helping people better understand its value.
Ian Nance
Man, that IS fast... Already loving it :)
Borja Soler
This will be incredible for the future of LLMs and all the products benefiting from them. Super excited about all the new things to come.
ROHIT GUPTA
Amazing...
Kien Nguyen
Congrats on the launch! Do you have any plans to support custom training?
Ivan Somov
Good luck! I am really excited about this hardware stuff for LLMs!
Abhilash Chowdhary
Going to give this a try, team Groq®. Looks interesting.
Sourabh
Don't know why it's ranked so low as of now; the speed is awesome. It does what it says.
Congrats, team Groq®, on your launch.
Junior Perassoli
It looks very promising. How can I find information on how to use the APIs?
Aris Nakos
Wow, you guys are innovating. Congratulations! I tested it out and was blown away.
Daniel Rödler
Wow, love it. We rely heavily on LLMs, and the slowness of our agents is a constant annoyance. A 14x speed-up would be a real game changer. Can't wait to see LPUs in action and at scale. Keep going!
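For context on why that matters: agents typically chain many sequential model calls, so per-call latency multiplies across a run. A back-of-the-envelope sketch with purely illustrative numbers:

```python
# Back-of-the-envelope illustration only: how a per-call speed-up compounds
# across a sequential agent pipeline. Every number here is an assumption.
calls_per_task = 10        # assumed sequential LLM calls in one agent run
latency_per_call_s = 4.0   # assumed seconds per call on a conventional stack
speedup = 14               # the 14x figure mentioned in the comment above

before = calls_per_task * latency_per_call_s
after = before / speedup
print(f"{before:.0f}s per agent task -> {after:.1f}s at 14x")  # 40s -> 2.9s
```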
Mona Dey
This is a helpful post. Thanks!