Introducing Stable Virtual Camera, currently in research preview. This multi-view diffusion model transforms 2D images into immersive 3D videos with realistic depth and perspective—without complex reconstruction or scene-specific optimization.

Hi everyone!

Stability AI has a research preview out – Stable Virtual Camera – that turns a single photo (or a few) into a 3D video with impressive camera control!

You provide one or more images of a scene, and then specify how you want the camera to move (using presets like "zoom" or "orbit," or even defining a custom path). The model then generates a 3D-consistent video following that trajectory.

It's built on diffusion models, but it's not just generating pixels. It's creating a coherent 3D representation, which allows for smooth, long videos (up to 1000 frames!). They've achieved state-of-the-art results on novel view synthesis benchmarks.

The code is open-source, though the model itself is under a non-commercial license for now. While it works well in many cases, they do note that it can struggle with humans, animals, and dynamic textures like water in its current version. Very honest of them.

Give it a try – upload an image here and see what it can do.

many of our favorite LoRAs are SD

I use it mainly for the erase endpoint to remove text or object from any thumbnail. Super accurate and easy

The Stable Diffusion XL version is very cost-effective. Although it lacks semantic understanding and is not very stable, its excellent image performance and rich ecosystem of creators make it very suitable for content tools. Among AI comic creation tools, SDXL is a very suitable model. We also considered Midjourney, but from the perspective of cost-effectiveness, SDXL is more suitable for us.

many of our favorite LoRAs are SD

I use it mainly for the erase endpoint to remove text or object from any thumbnail. Super accurate and easy

Stable Diffusion

Open models in every modality, for everyone, everywhere.

Open models in every modality, for everyone, everywhere.

Stable Diffusion

Open models in every modality, for everyone, everywhere.

Open models in every modality, for everyone, everywhere.

Stable Diffusion Launches

Do you use Stable Diffusion?

Stable Diffusion Launches

Do you use Stable Diffusion?

Stable Virtual Camera