Tiny Dream Stable Diffusion C++ Library

A header-only Stable Diffusion inference library in C++ built for local CPU execution, smaller memory use, and straightforward embedding in existing applications.


Version 2.197 (Release Notes ↗)

Tiny Dream is a header-only Stable Diffusion implementation* written in C++ with a primary focus on CPU efficiency and a smaller runtime footprint. Tiny Dream runs reasonably fast on typical consumer hardware, requiring roughly 1.7 GB to 5.5 GB of RAM depending on the selected output settings. It does not require an NVIDIA GPU and is designed for embedding into larger codebases through a compact C++ API.

* Tiny Dream still relies on a tensor backend. In this release that backend is ncnn, with a move to ggml planned on the roadmap.
Note: Current support targets Stable Diffusion 1.x with the CLIP transformer for token embedding, standard 512x512 output, and optional upscaling to 2048x2048.

Example output generated with the Tiny Dream C++ API using this prompt:
"pyramid, desert, palm trees, river, sun, (landscape), (high quality)"


Illustration courtesy of Binxu Wang ↗, Ph.D candidate, Harvard University.

Stable Diffusion ↗ is a powerful, open-source text-to-image generation model publicly launched by Stability.ai ↗ in August 2022. It is designed to produce images that match input text prompts. Stable Diffusion relies on the latent diffusion model ↗ architecture, a variant of the diffusion model that maps to the latent space using a fixed Markov chain ↗. Diffusion models aim to learn the latent structure of a dataset by modeling how data points diffuse through the latent space ↗. This latent space simplifies the relationships between data points, representing them in a lower-dimensional space where analysis and understanding of their connections are more efficient and intuitive.

PixLab | Symisc Systems open-source ML and DL projects include:
Tiny Dream, ASCII Art & SOD

Features

Low Runtime Memory Footprint for Stable Diffusion Inference
- Tiny Dream focuses on memory efficiency through ONNX quantization ↗, helping reduce runtime overhead during image generation.
- As of this release, 5.5 GB of RAM is enough for Stable Diffusion inference with high-resolution 2048x2048 output.
- Further savings (1.7 GB to 4 GB) are possible with standard 512x512 output.
No OpenCV Dependency
- You do not need to link against OpenCV or another large image-processing library.
- Only stb_image_write.h from the excellent stb ↗ single-header, public domain C library is required for saving images to disk.
Easy to Embed in Existing Codebases
- Drop tinydream.hpp and stb_image_write.h into your source tree alongside the pre-trained assets.
- The library exposes a single C++ class named tinyDream with just 8 public methods.
- No output to stdout/stderr. Supply your own log callback and route generated messages to the terminal, a file, or a network socket.

Reasonably fast on Intel/AMD CPUs

With TBB threading, Intel MKL*, and AVX vectorization.

CPU	Inference time (slowest ~ fastest)	Clock (base ~ turbo)
Intel i9-9990XE	5.07s ~ 2.98s	4.0GHz ~ 5.0GHz
Intel i7-13700	10.09s ~ 8.16s	2.8GHz ~ 3.9GHz
Ryzen 9-5900X	5.38s ~ 4.02s	3.7GHz ~ 4.8GHz

Slowest to fastest inference time (single seed) expressed in seconds with clock speed (base and turbo boost) under various background loads for a standard 512x512, non-upscaled PNG image. PixLab maintains a private fork of NCNN heavily optimized for x86, so reproduced timings may vary on your machine.

Support Real-ESRGAN ↗, A Super Resolution Network Upscaler
- Generate high-resolution 2048x2048 output with this neural network. This extra step is CPU intensive and takes a few seconds to complete.
Full Support for Negative Words
- Supply an extra set of words listing what you do not want to see generated, such as gore or NSFW ↗ content, alongside your positive prompt.
Full Support for Words Priority
- Instruct the model to give higher priority to words (keywords) surrounded by parentheses ().
- Square brackets [], on the other hand, reduce a word's importance.
Support for Output Metadata
- Link meta information to your output images, such as a copyright notice, comments, or any other metadata you want attached to the image.
Support for Stable Diffusion Runtime Parameters
- Adjust seed resizing: a way to generate the same image at a slightly different resolution.
- Control how much the image generation process follows the text prompt using guidance scale.
- Adjust sampling steps during Stable Diffusion Inference.

Getting Started

Integrating Tiny Dream into an existing codebase is straightforward. The steps below cover the minimum setup without forcing you through unnecessary build work.

1. Download The Tiny-Dream Source Code

Download the latest public release of Tiny Dream, and extract the package into a directory of your choice. In most cases, that means placing it inside or alongside your existing source tree.
Refer to the download section below to get a copy of the Tiny Dream source code as well as the Pre-Trained Models & Assets.

2. Embedding Tiny-Dream

The Tiny Dream source code comprises only two header files: tinydream.hpp and stb_image_write.h.
All you have to do is drop these two C/C++ header files into your source tree and instantiate a new tinyDream object, as shown in the C++ example below:

#include <cstdlib>
#include <iostream>
#include <string>

#include "tinydream.hpp"

// Log consumer callback: prints each message generated during inference.
// The signature follows the one documented for tinyDream::setLogCallback().
static void logCallback(const char* zLogMsg, int /* msgLen */, void* /* pUserData */)
{
    std::cout << zLogMsg << std::endl;
}

int main(int argc, char *argv[])
{
    tinyDream td; // stack allocated tinyDream object

    // Display the library current inference engine, version number, and copyright notice
    std::cout << tinyDream::about() << std::endl;

    // At least a positive prompt must be supplied via command line
    if (argc < 2) {
        std::cout << "Example of Prompts:" << std::endl;
        // Example of built-in Positive/Negative Prompts
        auto prompts = tinyDream::promptExample();
        std::cout << "\tPositive Prompt: " << prompts.first << std::endl;
        std::cout << "\tNegative Prompt: " << prompts.second << std::endl;
        return -1;
    }

    // Command line layout assumed by this example (default values are illustrative):
    //   argv[1] positive prompt, argv[2] negative prompt, argv[3] seedMax, argv[4] step
    const std::string positivePrompt = argv[1];
    const std::string negativePrompt = (argc > 2) ? argv[2] : "";
    const int seedMax = (argc > 3) ? std::atoi(argv[3]) : 2;
    const int step = (argc > 4) ? std::atoi(argv[4]) : 30;

    // Register a log handler callback responsible for
    // consuming log messages generated during inference.
    td.setLogCallback(logCallback, nullptr);

    // Optionally, set the assets path if the pre-trained models
    // are not extracted in the same directory as your executable.
    // Remove or comment out this call if they are.
    td.setAssetsPath("/path/to/tinydream/assets");

    // Optionally, set a prefix of your choice for each freshly generated image name
    td.setImageOutputPrefix("tinydream-");

    // Optionally, set the directory where you want
    // the generated images to be stored
    td.setImageOutputPath("/home/photos/");

    /*
     * Finally, run Stable Diffusion inference.
     *
     * The log consumer callback registered previously should shortly receive
     * all generated log messages (including errors if any) during inference.
     *
     * Refer to the official documentation at: https://pixlab.io/tiny-dream#tiny-dream-method
     * for the expected parameters the tinyDream::dream() method takes.
     */
    for (int seed = 1; seed < seedMax; seed++) {
        std::string outputImagePath;

        const bool ok = td.dream(
            positivePrompt,
            negativePrompt,
            outputImagePath,
            true, /* Set to false if you want 512x512 pixels output instead of 2048x2048 output */
            seed,
            step
        );

        if (ok) {
            std::cout << "Output Image location: " << outputImagePath << std::endl; // comment this out if too intrusive
        }
    }
    return 0;
}

At this point, Tiny Dream is integrated. To build an executable for the boilerplate application ↗, refer to the next section.

3. Building Tiny-Dream

Building Tiny Dream from source ↗ requires a modern C++17 compiler such as GCC 7 or later, Clang, or Microsoft Visual Studio (MSVC).
You also need to link against the default tensor backend to generate the executable.
As of this release, NCNN ↗ is the default tensor library. On the roadmap, we plan to replace ncnn with a lighter backend such as SOD or GGML while keeping CPU efficiency as the main priority.
An example of generating a highly optimized executable without relying on an external build system is shown below:

git clone https://github.com/symisc/tiny-dream.git
cd tiny-dream
g++ -o tinydream boilerplate.cpp -funsafe-math-optimizations -Ofast -flto=auto -funroll-all-loops -pipe -march=native -std=c++17 -Wall -Wextra `pkg-config --cflags --libs ncnn` -lstdc++ -pthread -Wl -flto -fopt-info-vec-optimized
./tinydream "pyramid, desert, palm trees, river, (landscape), (high quality)"

Repeat the steps above in your terminal to generate an optimized build for your target platform.
Alternatively, you can use a build system such as CMake ↗. The Tiny Dream repository ↗ already includes the necessary CMake template to build from source.

4. Get the Pre-Trained Models & Assets

Once your executable is built, the Tiny Dream pre-trained models and assets must be accessible to it.
The Tiny Dream assets comprise all pre-trained models (over 2 GB as of this release) required by the tinyDream::dream() method to run Stable Diffusion inference (image generation).
You can download the pre-trained models from the Download section below.
Once downloaded, extract the assets ZIP archive in a directory of your choice (usually the directory where your executable is located), and set the full path via tinyDream::setAssetsPath() or from the Tiny Dream constructor.

5. C++ Usage Example

The C++ gist ↗ below shows a typical integration pattern for Tiny Dream in an existing C++ codebase:

The code above should be straightforward for most C++ developers to follow. A new tinyDream object is allocated on the stack and initialized on line 83 of the gist.
The library's inference engine, version number, and copyright notice are shown on line 86 of the gist via a call to the tinyDream::about() static method.
At least one argument is required: the positive prompt, which consists of comma-separated words describing what you would like to see generated. If no arguments are provided, example positive and negative prompts are shown on line 93 via a call to the tinyDream::promptExample() static method, and the program exits immediately.
Optionally, a log consumer callback is registered on line 101 via tinyDream::setLogCallback(). Inference can take time depending on available resources, so it makes sense to log progress to the terminal or a text file for better visibility.
The pre-trained models path is set on line 106 via tinyDream::setAssetsPath(). The Tiny Dream assets comprise all pre-trained models (over 2 GB as of this release) required by the tinyDream::dream() method to run Stable Diffusion inference (image generation). You can download the pre-trained assets from the Download section below. You can also set the path directly from the constructor without calling this method.
If your Tiny Dream assets are located in the same directory as your executable, there is no need to specify a path. The default constructor argument already handles that case.
The optional seed and step parameters are collected if available. The seed initializes the generation, which helps produce reproducible images, and the step count sets the number of sampling steps.
Finally, Stable Diffusion inference takes place on line 158 of the gist via a single call to the tinyDream::dream() method. On successful execution, the generated image path is copied to the outputImagePath parameter, and the supplied log callback reports any errors that occur during the process.

6. Continue with The C++ API Reference Guide

The Tiny Dream C++ interface documents every public method exposed by the class. Once you understand the basic runtime flow, that section should serve as your reference guide.

Downloads

Tiny Dream Source Code
Release 1.7.5 ↗

This ZIP archive contains all C++ source code for Tiny Dream combined into a single header file for easier integration on your existing code base. You may refer to the Getting Started section above for a step-by-step integration guide.

Download

Tiny Dream Pre-Trained
Models & Assets
2GB of Assets

This ZIP archive contains all pre-trained models and assets (about 2 GB) required for Stable Diffusion inference. Once downloaded, extract the assets ZIP archive in a directory of your choice (usually the directory where your executable is located), and set the path via the Tiny Dream constructor or from the tinyDream::setAssetsPath() method.

Download

Roadmap

Highest Priority: Move the tensor backend from NCNN to a lighter option such as SOD or GGML, with CPU performance as the priority.
Highest Priority: Enable SVG and other editable outputs (potentially PSD) instead of static PNG files.
Hot: Provide a cross-platform GUI for Tiny Dream, implemented with Dear ImGui.
Hot: Develop a WebAssembly port once the new tensor backend (SOD or GGML) is available on WASM.
Low: Create an Android proof-of-concept showcase APK.

Licensing

Tiny Dream is released under the GNU Affero General Public License

Tiny Dream is dual-licensed open-source software. The complete source code of the library and related utilities is available on GitHub ↗.
Tiny Dream is released under the GNU Affero General Public License (AGPLv3) ↗.
The AGPLv3 ↗ license allows you to use Tiny Dream at no cost, provided that if you use the library in a host application, the complete source code for that application is made available and freely redistributable under reasonable conditions.
If you want to distribute a commercial application without releasing it under the AGPLv3 ↗ or a compatible open-source license, you must purchase a non-exclusive commercial Tiny Dream license.
By purchasing a commercial license, you are no longer required to release your application's source code. Please contact [email protected] to place an order or for additional information about the licensing options.

C++ API Reference

This section defines the C++ interface to Tiny Dream. For a guided introduction, start with the Getting Started section above. As of this release, the library exposes a single class with eight public methods, making it practical to integrate into existing C++ projects. The methods are documented below.

tinyDream Public Methods:

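tinyDream::tinyDream() - Constructor
tinyDream::dream() - Stable Diffusion Inference
tinyDream::setLogCallback()
tinyDream::setAssetsPath()
tinyDream::setImageOutputPath()
tinyDream::setImageOutputPrefix()
tinyDream::promptExample()
tinyDream::about()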

Syntax

tinyDream::tinyDream(const std::string& assetsPath = "./assets/");

Description

Constructor for the tinyDream class.

Instantiate a new tinyDream object ready for Stable Diffusion Inference. The constructor takes an optional argument assetsPath which specifies the path to the pre-trained models required by the tinyDream::dream() method in order to accept prompts and generate images that match the input criteria.
Once the object is instantiated, you can register a log callback, set the assets path, the image storage directory, the name prefix, and so on, and finally call tinyDream::dream() for inference.
You can download the pre-trained assets from the Download section above. Once downloaded, extract the assets ZIP archive in a directory of your choice (usually the directory where your executable is located), and optionally set the path via the constructor or tinyDream::setAssetsPath().

Parameters

const std::string& assetsPath

Optional Argument: Full path to the location of the pre-trained models, which can be downloaded from the Download section above. If this parameter is omitted, then the current path where the executable is located is assumed (recommended case).

Return Value

None

Syntax

void tinyDream::setLogCallback(std::function<void(const char* /* zLogMsg */, int /* msg length */, void* /* pCookie */)> xLogHandler, void* pUserData);

Description

Consume log messages via an external log handler callback. The main task of the supplied callback is to consume log messages generated during Stable Diffusion inference. Inference may take time depending on the available resources, so it makes sense to log progress to the terminal or a text file.

Parameters

std::function<> xLogHandler

Log consumer callback. The supplied callback must have the following signature:

void(const char *zLogMsg, int msgLen, void *pUserData)

The supplied callback must accept three arguments:

The first argument is a pointer to a null terminated string holding the generated log message.
The second argument is the size in bytes of the generated log message.
The last argument is the pUserData opaque pointer forwarded verbatim to your callback whenever a log message is generated.

Depending on the load and inference parameters, the log consumer callback may be called dozens or even hundreds of times, so make sure your callback does not block and runs as quickly as possible.

void *pUserData

Optional Argument: Opaque user pointer forwarded unchanged to the log callback whenever a message is emitted.

Return Value

None

Syntax

void tinyDream::setAssetsPath(const std::string& assetsPath);

Description

Set the pre-trained models path. The Tiny Dream assets comprise all pre-trained models (over 2 GB as of this release) required by the tinyDream::dream() method for Stable Diffusion inference (image generation).

You can download the pre-trained assets from the Download section above. Once downloaded, extract the assets ZIP archive in a directory of your choice (usually the directory where your executable is located), and set the full path via this method or from the Tiny Dream constructor. You do not need to call this method if the assets are located on the same directory as your executable.

Parameters

const std::string& assetsPath

Full or relative path to the location of the pre-trained models, which can be downloaded from the Download section above.

Return Value

None

Syntax

void tinyDream::setImageOutputPath(const std::string& outputPath = "./");

Description

Set a directory of your choice where you want your output images to be stored. The default path is the current directory where the executable resides.

Parameters

const std::string& outputPath

Full or relative path to the desired location where you want the generated images to be stored.

Return Value

None

Syntax

void tinyDream::setImageOutputPrefix(const std::string& outputImgPrefix = "tinydream-");

Description

Set a prefix of your choice for the names of images yet to be generated. Output image names must be unique for each inference step. The assigned prefix, a random string, the current step, and the seed are combined to generate this unique name, and the file is stored in the directory of your choice.

Parameters

const std::string& outputImgPrefix

Prefix to assign to each future image to be generated. The default prefix is set to tinydream-.

Return Value

None

Syntax

bool tinyDream::dream(const std::string& positivePrompt, const std::string& negativePrompt,
std::string& outputImgPath, bool upScale = true, int seed = 42, int step = 30);

Description

Stable Diffusion inference: generate high-definition output images that match the input criteria.

Prompts (positive or negative) in this implementation are keywords separated by commas, where each keyword describes something you would like to see (or not see) generated.

Prior to calling this method, Pre-trained Models & Assets must be accessible to your executable. You can download the Pre-trained models from the Download section above. Once downloaded, extract the assets ZIP archive in a directory of your choice (usually the directory where your executable is located), and optionally set the full path (if located outside the current executable directory) via tinyDream::setAssetsPath() or from the Tiny Dream constructor.

Depending on the step and seed parameters, it may make sense to call this method more than once (for example, dozens of times) to achieve the desired result. In that case, we recommend installing a log consumer callback via tinyDream::setLogCallback() to capture log messages generated during inference and get a detailed overview of what's going on under the hood.

Stable Diffusion is based on the Latent Diffusion architecture, introduced in the High-Resolution Image Synthesis with Latent Diffusion Models ↗ paper. There are three main building blocks in Stable Diffusion:

Text Encoder: The text-encoder is responsible for converting the input prompt into an embedding space (latent vector).
Diffusion Model & Image Encoder: Diffusion models are trained to denoise random Gaussian noise (64x64 latent image patch) step by step, to get to a sample of interest.
Image Decoder: Finally, the decoder is responsible for converting the latent patches into 512x512 pixels output.

Parameters

const std::string& positivePrompt

Describe something you'd like to see generated using words separated by commas. High priority or meta instructions (e.g. image quality) must be surrounded by parentheses.

Example: the following prompts will generate a high quality, landscape picture of a pyramid surrounded by palm trees and a river in the middle of the desert:
"pyramid, desert, palm trees, river, sun, (landscape), (high quality)".

const std::string& negativePrompt

Optional string (pass an empty string if you have nothing to exclude): an extra set of keywords listing what you do not want to see generated. Examples of such keywords are:
"blood, mutilation, gore, genitals, nudity, destruction".

std::string& outputImgPath

On successful inference, the path to the generated image location will be copied to the supplied string object.

bool upScale

Optional Boolean, defaults to true. When true, the inference output is a high-resolution 2048x2048 image; when false, it is the standard, medium-resolution 512x512 Stable Diffusion output. This extra step is compute intensive, takes a few seconds to complete, and is powered by Real-ESRGAN, a Super Resolution Network Upscaler.

int seed

Optional integer, defaults to 42. Seed in Stable Diffusion is a number used to initialize the generation. Controlling the seed can help you generate reproducible images, experiment with other parameters, or prompt variations.

int step

Optional integer, defaults to 30. Sets the number of inference (sampling) steps in Stable Diffusion. More steps generally improve quality, but only up to a point: around 30 sampling steps (the default value) are usually enough to achieve high-quality images. Using more may produce a slightly different picture, but not necessarily a better one. Because the process is iterative, every additional step also adds generation time, so in most cases the extra wait is not worth it.

Return Value

Boolean true is returned on successful inference; false is returned otherwise. In that case, the log messages captured by your log consumer callback should give you insight into what went wrong during image generation.

Syntax

 static std::pair<std::string /*Positive Prompt */, std::string /* Negative Prompt*/> tinyDream::promptExample();

Description

Return a hard-coded prompt example template that can be passed to the tinyDream::dream() method. This static method is mainly useful for familiarizing developers with the library's prompt inputs.

Parameters

None

Return Value

This static method never fails and always returns a standard template std::pair ↗ object holding the positive prompt in the first field of the std::pair, while the negative prompt is stored in the second field of the pair object.

Syntax

static const char * tinyDream::about();

Description

Return the current tensor engine, copyright notice, library identification and version number.

Parameters

None

Return Value

This static method never fails and always returns a pointer to a null-terminated string holding the copyright notice and current inference engine.