Tannic is an extensible C++ tensor library built around a host–device execution model. Unlike monolithic frameworks, it provides only a minimal set of built‑in operators, focusing on a flexible architecture where new operations, data types, and backends can be added easily. This approach keeps the library lightweight while enabling adaptation to a wide range of computational needs.
This library is designed to serve as the foundational core for a neural network inference framework, but is equally suited to other domains such as classical ML or physics simulations—all without requiring Python.
You can find examples of neural network inference built with the Tannic framework here:
- CNN Server Example – demonstrates serving convolutional neural network models.
- ViT Server Example – demonstrates serving Vision Transformer models.
- LLaMa3 Text Completion Server Example – demonstrates serving a transformer server for text completion tasks.
📖 Full documentation: API Reference
Note: Class documentation was removed from headers since it was slowing down development, but will be added again in the future.
Below is a minimal example demonstrating tensor creation, initialization, basic indexing, and arithmetic operations with Tannic:
```cpp
#include <iostream>
#include <tannic.hpp>

using namespace tannic;

int main() {
    Tensor X(float16, {2,2});   // X.initialize(Device()); // for CUDA support
    X[0, range{0,-1}] = 1;
    X[1,0] = 3;
    X[1,1] = 4;

    Tensor Y(float16, {1,2});   // Y.initialize(Device()); // for CUDA support
    Y[0,0] = 4;
    Y[0,1] = 6;

    // Assign expressions dynamically, like in Python;
    // broadcasting and type promotion are supported.
    Y = log(X) + Y * Y - exp(X) + 3 * matmul(X, Y.transpose());

    std::cout << Y;
}
```

It will output:
```
Tensor([[43.2812, 63.2812]
, [105, 90.75]], dtype=float16, shape=(2, 2))
```
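For GPU execution, the same workflow can target the device. The sketch below assumes only the `initialize(Device())` calls hinted in the comments of the example above; treat it as illustrative rather than a definitive reference for the device API.

```cpp
#include <iostream>
#include <tannic.hpp>

using namespace tannic;

int main() {
    Tensor X(float16, {2,2});
    X.initialize(Device());   // place X on the CUDA device, as hinted in the comments above
    X[0, range{0,-1}] = 1;
    X[1,0] = 3;
    X[1,1] = 4;

    Tensor Y(float16, {1,2});
    Y.initialize(Device());   // device tensors use the same expression syntax as host tensors
    Y[0,0] = 4;
    Y[0,1] = 6;

    Y = log(X) + Y * Y - exp(X) + 3 * matmul(X, Y.transpose());
    std::cout << Y;           // same result as the host example above
}
```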
Equivalent PyTorch code for comparison:
```python
import torch

X = torch.zeros((2, 2), dtype=torch.float16)
X[0, 0:] = 1
X[1, 0] = 3
X[1, 1] = 4

Y = torch.zeros((1, 2), dtype=torch.float16)
Y[0, 0] = 4
Y[0, 1] = 6

Y = torch.log(X) + Y * Y - torch.exp(X) + 3 * torch.matmul(X, Y.t())
print(Y)
```

Giving:
```
tensor([[ 43.2812, 63.2812],
        [105.0000, 90.7500]], dtype=torch.float16)
```
Note: Tannic is currently in an early development stage. It is functional but not fully optimized, and some features may still have bugs. The C backend API, used to extend the library, is under active development and may change significantly. The public API described in the documentation is mostly stable, with only minor breaking changes expected as the library evolves.
While the library is currently written in C++23, C++26 is shaping up to be a monumental release, too significant to ignore. At some point, it may become a requirement for Tannic.
- Dynamic typing: Flexible tensor data types with runtime type specification, enabling features like easy tensor serialization and deserialization.
- Broadcasting: NumPy‑style automatic shape expansion in arithmetic operations, enabling intuitive and efficient tensor computations across dimensions.
- Indexing and slicing: Intuitive multi-dimensional tensor access and manipulation (see the sketch after this list).
- Host–device execution model: Unified support for CPU and CUDA-enabled GPU computation within the same codebase. While the device backend is currently developed in CUDA, the design is not tied to it and can support other backends in the future.
- Minimal core operators: Only essential built-in math operations, keeping the library lightweight and extensible.
- Optional cuBLAS and cuTensor support: May be added soon to accelerate tensor computations.
- Autograd: Not necessary for inference, so it will be added later once the runtime API is optimized and mature.
- Graph mode: A constexpr graph mode will be added to the library, possibly with the arrival of C++26.
- Quantization support: The library will support the dtypes needed to build quantized neural networks such as BitNet.
- Additional backends: Expansion beyond CUDA to other device backends is planned. The host–device computational model can also be used with other hardware vendors.
- Multi-GPU support: I currently have neither the expertise nor the resources to add multi-GPU support, but the whole library was built with it in mind, so it won't be a breaking change when added.
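As a rough illustration of how dynamic typing, broadcasting, and slicing fit together, here is a sketch that reuses only constructs from the example at the top of this README; the tensor names and shapes are arbitrary, and the dtype is simply held in a runtime variable.

```cpp
#include <iostream>
#include <tannic.hpp>

using namespace tannic;

int main() {
    // Dynamic typing: the dtype is an ordinary runtime value, so it could just
    // as well come from configuration or from a serialized tensor header.
    auto dtype = float16;

    Tensor A(dtype, {2,3});   // 2x3 matrix
    Tensor b(dtype, {1,3});   // row vector; its leading dimension of 1 broadcasts against A

    // Indexing and slicing: fill whole rows with range{0,-1}.
    A[0, range{0,-1}] = 1;
    A[1, range{0,-1}] = 2;

    b[0,0] = 1;
    b[0,1] = 2;
    b[0,2] = 3;

    // Broadcasting: b is expanded across A's rows, as in NumPy.
    A = A + b;
    std::cout << A;
}
```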
- C++23 compiler: A compiler with C++23 support is required to build and run Tannic.
- OpenBLAS (optional): If installed on your system, OpenBLAS will accelerate matrix multiplication.
- CUDA Toolkit (optional): CUDA 12+ is required only if you want GPU support. If not installed, Tannic will build and run with CPU-only support.
Clone the repository:
```bash
git clone https://github.com/entropy-flux/Tannic.git
```

Use this for development — includes extra checks, assertions, and debug symbols for easier troubleshooting.
```bash
mkdir build && cd build
cmake .. -DCMAKE_BUILD_TYPE=Debug
make -j$(nproc)
ctest --output-on-failure
```

Use this for deployment or benchmarking — builds with full compiler optimizations and without debug checks.
```bash
mkdir build && cd build
cmake .. -DCMAKE_BUILD_TYPE=Release
make -j$(nproc)
```

Run the example provided in the main.cpp file:
```bash
set -e
cmake -S . -B build -DTANNIC_BUILD_MAIN=ON -DCMAKE_BUILD_TYPE=Debug
cmake --build build --parallel "$(nproc)"
./build/main
```

To use Tannic in your own project, include the main header:

```cpp
#include <tannic.hpp>
```

CUDA support is enabled by default if a compatible CUDA toolkit (12+) is detected during configuration. If no CUDA installation is found, Tannic will automatically fall back to a CPU‑only build. You can explicitly disable CUDA with:

```bash
cmake .. -DTANNIC_ENABLE_CUDA=OFF
```
Tannic is licensed under the Apache License, Version 2.0. See the LICENSE file for details.
Contributions to Tannic are welcome! If you'd like to report bugs, request features, or submit pull requests, please follow these guidelines:
- Fork the repository and create a new branch for your feature or bugfix.
- Open a pull request describing your changes and their purpose.
Tannic is licensed under the Apache License 2.0, a permissive open-source license that allows you to use, modify, and distribute the code freely—even in commercial projects. The only thing I ask in return is proper credit to the project and its contributors.
Recognition helps the project grow and motivates continued development.
By contributing, you agree that your contributions will also be licensed under Apache 2.0 and that proper attribution is appreciated.