uff_custom_plugin

Adding A Custom Layer To Your TensorFlow Network In TensorRT In Python

Table Of Contents

Description

This sample, uff_custom_plugin, demonstrates how to use plugins written in C++ with the TensorRT Python bindings and UFF Parser. This sample uses the MNIST dataset.

How does this sample work?

This sample implements a clip layer (as a CUDA kernel), wraps the implementation in a TensorRT plugin (with a corresponding plugin creator) and then generates a shared library module containing its code. The user then dynamically loads this library in Python, which causes the plugin to be registered in TensorRT's PluginRegistry and makes it available to the UFF parser.

This sample includes: plugin/ This directory contains files for the Clip layer plugin.

clipKernel.cu A CUDA kernel that clips input.

clipKernel.h The header exposing the CUDA kernel to C++ code.

customClipPlugin.cpp A custom TensorRT plugin implementation, which uses the CUDA kernel internally.

customClipPlugin.h The ClipPlugin headers.

lenet5.py This script trains an MNIST network.

model.py This script converts the MNIST tensorflow model to UFF that replaces ReLU6 activation with the clip plugin.

sample.py This script transforms the trained model into UFF (delegating ReLU6 activations to ClipPlugin instances) and runs inference in TensorRT.

requirements.txt This file specifies all the Python packages required to run this Python sample.

Generate the UFF model

If running this sample in a test container, launch NVIDIA tf1 (Tensorflow 1.x) container in a separate terminal for generating the UFF model.
```
docker run --rm -it --gpus all -v `pwd`:/workspace nvcr.io/nvidia/tensorflow:21.03-tf1-py3 /bin/bash
```
Alternatively, install Tensorflow 1.15 pip3 install tensorflow>=1.15.5,<2.0

NOTE

On PowerPC systems, you will need to manually install TensorFlow using IBM's PowerAI.
On Jetson boards, you will need to manually install TensorFlow by following the documentation for Xavier or TX2.

Install the UFF toolkit and graph surgeon depending on your TensorRT installation method, or from PyPI:

pip3 install --no-cache-dir --extra-index-url https://pypi.ngc.nvidia.com uff
pip3 install --no-cache-dir --extra-index-url https://pypi.ngc.nvidia.com graphsurgeon

Run the sample to train the model, covert to UFF and save the model. Also save the test data:
```
mkdir -p models
python3 lenet5.py
python3 model.py
```

Prerequisites

For specific software versions, see the TensorRT Installation Guide.

Switch back to test container (if applicable) and install the dependencies for Python.
```
python3 -m pip install -r requirements.txt
```
Install CMake.
(For Windows builds) Visual Studio 2017 Community or Enterprise edition

Running the sample

Build the plugin and its corresponding Python bindings.

On Linux, run:

mkdir build && pushd build
cmake .. && make -j
popd

NOTE: If any of the dependencies are not installed in their default locations, you can manually specify them. For example:

cmake .. -DCMAKE_CUDA_COMPILER=/usr/local/cuda-x.x/bin/nvcc # (Or adding /path/to/nvcc into $PATH)
         -DCUDA_INC_DIR=/usr/local/cuda-x.x/include/  # (Or adding /path/to/cuda/include into $CPLUS_INCLUDE_PATH)
         -DTRT_LIB=/path/to/tensorrt/lib/
         -DTRT_INCLUDE=/path/to/tensorrt/include/

On Windows, run the following in Powershell, replacing paths appropriately:

mkdir build; pushd build
cmake .. -G "Visual Studio 15 Win64" /
   -DTRT_LIB=C:\path\to\tensorrt\lib /
   -DTRT_INCLUDE=C:\path\to\tensorrt\lib /
   -DCUDA_INC_DIR="C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v<CUDA_VERSION>\include"
# NOTE: msbuild is usually located under C:\Program Files (x86)\Microsoft Visual Studio\2017\<EDITION>\MSBuild\<VERSION>\Bin
#   You should add this path to your PATH environment variable.
msbuild ALL_BUILD.vcxproj
popd

`cmake ..` displays a complete list of configurable variables. If a variable is set to `VARIABLE_NAME-NOTFOUND`, then you’ll need to specify it manually or set the variable it is derived from correctly.

Run inference using TensorRT with the custom clip plugin implementation:

python3 sample.py

Verify that the sample ran successfully. If the sample runs successfully you should see a match between the test case and the prediction.

=== Testing ===
Loading Test Case: 3
Prediction: 3

Additional resources

The following resources provide a deeper understanding about getting started with TensorRT using Python:

Model

LeNet model

Dataset

MNIST dataset

Documentation

License

For terms and conditions for use, reproduction, and distribution, see the TensorRT Software License Agreement documentation.

Changelog

September 2021 Updated with instructions for building the plugin on Windows.

March 2019 This README.md file was recreated, updated and reviewed.

Known issues

There are no known issues in this sample.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Adding A Custom Layer To Your TensorFlow Network In TensorRT In Python

Description

How does this sample work?

Generate the UFF model

Prerequisites

Running the sample

Additional resources

License

Changelog

Known issues

Name		Name	Last commit message	Last commit date
parent directory ..
plugin		plugin
CMakeLists.txt		CMakeLists.txt
README.md		README.md
__init__.py		__init__.py
lenet5.py		lenet5.py
model.py		model.py
requirements.txt		requirements.txt
sample.py		sample.py

FilesExpand file tree

uff_custom_plugin

Directory actions

More options

Directory actions

More options

Latest commit

History

uff_custom_plugin

Folders and files

parent directory

README.md

Adding A Custom Layer To Your TensorFlow Network In TensorRT In Python

Description

How does this sample work?

Generate the UFF model

Prerequisites

Running the sample

Additional resources

License

Changelog

Known issues