Calculate GPU memory requirements and max concurrent requests for self-hosted LLM inference. Supports Llama, Qwen, DeepSeek, Mistral, and more. Plan your AI infrastructure efficiently.
Built to simplify planning for self-hosted AI deployments.
Unlike other AI infrastructure tools, SelfHostLLM lets you precisely estimate GPU requirements and concurrency for Llama, Qwen, DeepSeek, Mistral, and more using custom config.
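The estimate such a calculator performs boils down to two numbers: the VRAM the model weights occupy, and the KV-cache memory each in-flight request consumes. Below is a minimal sketch of that arithmetic; it is not SelfHostLLM's actual implementation, and the defaults (layer count, hidden size, overhead) are illustrative assumptions. Real models using grouped-query attention have smaller KV caches than this full-hidden-dimension estimate.

```python
def estimate_llm_memory(
    num_params_b,        # model size in billions of parameters
    quant_bits=16,       # bits per weight (16 = FP16; 8 or 4 = quantized)
    num_layers=32,       # transformer layers (assumed; e.g. Llama-3-8B has 32)
    hidden_dim=4096,     # hidden dimension (assumed; GQA models cache less)
    context_len=8192,    # tokens of context budgeted per request
    kv_bits=16,          # bits per KV-cache element
    gpu_vram_gb=24.0,    # total VRAM (e.g. a single RTX 4090)
    overhead_gb=2.0,     # rough allowance for CUDA context and activations
):
    """Rough VRAM estimate for transformer inference.

    Weights:            params * bytes_per_weight
    KV cache / request: 2 (K and V) * layers * hidden_dim
                        * context_len * bytes_per_element
    """
    weights_gb = num_params_b * 1e9 * (quant_bits / 8) / 1e9
    kv_per_request_gb = (
        2 * num_layers * hidden_dim * context_len * (kv_bits / 8)
    ) / 1e9
    free_gb = gpu_vram_gb - overhead_gb - weights_gb
    max_concurrent = max(0, int(free_gb // kv_per_request_gb))
    return weights_gb, kv_per_request_gb, max_concurrent
```

Under these assumptions, an 8B-parameter model in FP16 needs about 16 GB for weights alone, leaving room for roughly one full-context request on a 24 GB card; quantizing to 4 bits or shortening the context budget raises concurrency substantially.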
Replies
But now I want to see Apple silicon added to the mix!
Update: Now there's a Mac version too!
Agnes AI
Love how SelfHostLLM lets you actually estimate GPU needs for different LLMs—no more guessing and overbuying. Super smart idea, really impressed!
Very cool calculator, looking forward to checking this out.
SelfHostLLM
Hi all, I'm the creator of SelfHostLLM.org.
You can read more about why I created it here:
https://www.linkedin.com/posts/e...
AltPage.ai
No way, this is exactly what I needed! Figuring out GPU memory for LLMs has always been such a headache—super smart to automate it. Any plans to support multi-GPU setups?
CoSupport AI
Super useful — sizing GPU memory and concurrency upfront saves a ton of headaches. Love that it works with different models.
SelfHostLLM
Here is the Mac version: https://selfhostllm.org/mac/
@erans Thank you sooooo much 🙏🏻. There’s way too much misleading information out there on the internet, even for an AI like Perplexity, making it hard to get the right info. This is a real time-saver! I can now focus on using my private LLM instead of spending days trying to make it work with all the parameters to set and understand. I really hope your site will eventually offer even more useful information and tips. But for now, this is absolutely perfect for me 👌🏻.