All activity
Zac Zuo
Moonlight is a set of open-source 3B/16B MoE LLMs from Moonshot AI, trained with the Muon optimizer for ~2x compute efficiency compared to AdamW. Pretrained, instruct-tuned, and intermediate checkpoints are available.
Moonlight
Efficient, Open-Source LLMs from Moonshot AI
Zac Zuo
Mercury, from Inception Labs, is the first commercial diffusion LLM. Up to 10x faster than autoregressive models, with comparable or better quality on coding tasks.
Mercury
The First Commercial-Scale Diffusion LLM
Zac Zuo
DeepGEMM, from DeepSeek, is an open-source library for highly optimized FP8 GEMM kernels on Hopper GPUs. Clean codebase (~300 LOC), JIT-compiled, no heavy dependencies.
DeepGEMM
Unlock Maximum FP8 Performance on Hopper GPUs
Zac Zuo
Magma, the flagship project from Microsoft Research, is the first-ever foundation model for multimodal AI agents, designed to handle complex interactions across both virtual and real environments.
Magma
Foundation Model for Multimodal AI Agents
Zac Zuo
Gemini Code Assist for Individuals provides free, AI-powered coding assistance with a large context window, directly in your VS Code or JetBrains IDE.
Gemini Code Assist
Code faster and smarter for free
Zac Zuo
QwQ-Max from Qwen is a powerful new LLM excelling in reasoning, math, coding, and agent tasks. Features a "thinking mode" for complex problems. Open-source coming soon!
QwQ-Max
New LLM by Alibaba excelling in reasoning w/ "thinking mode"
Zac Zuo
DeepEP, from DeepSeek, is the open-source communication library powering the DeepSeek-V3 MoE model. Optimized for Hopper GPUs, NVLink, and RDMA.
DeepEP
Powering DeepSeek-V3's MoE Performance
Zac Zuo
left a comment
Good one! The Name is Winning :)
LFG 2.0
Discover places, inspire travel