examples

Examples

This directory contains self-documented cloud integrations and demos to illustrate examples of DeepSparse usage.

Open a Pull Request to contribute your own examples.

Example	Description
AWS Marketplace	How to launch a DeepSparse integrated instance via the AWS Marketplace.
AWS Sagemaker Integration	How to deploy a DeepSparse inference server on SageMaker.
AWS Serverless Integration	How to deploy a DeepSparse pipeline for batch or real-time inference on select serverless services.
Azure VM	How to launch a DeepSparse image in an Azure virtual machine.
DigitalOcean Marketplace	How to launch a DeepSparse integrated instance via the DigitalOcean Marketplace.
GCP Marketplace	How to launch a DeepSparse integrated instance on the Google Cloud Marketplace.
Google Cloud Run	How to deploy the DeepSparse Server on Cloud Run.
Google Kubernetes Engine	How to deploy the DeepSparse Server on GKE.

Example	Description
Benchmark and ONNX Model Correctness	Comparing predictions and benchmark performance between DeepSparse Engine and ONNXRuntime.
Benchmark UI	How to deploy a gradio UI for benchmarking SparseZoo models on a DigitalOcean instance or local machine.
ChatGPT Cheat Sheet	User guide for prompting ChatGPT segmented by use-case.
Hugging Face Transformers	Serving, benchmarking, and running NLP models from Hugging Face.
YOLOv3 and YOLOv5	Serving, benchmarking, and running annotation inferences with YOLOv3 and YOLOv5 models.
Image Classification	How to use image classification models from SparseZoo to perform inference and benchmarking with the DeepSparse Engine.
Object Detection	How to use object detection models from SparseZoo to perform inference and benchmarking with the DeepSparse Engine.
Instance Segmentation	How to use an optimized YOLACT model and the DeepSparse Engine to perform real-time instance segmentation.
SparseServer.UI	A Streamlit app for deploying the DeepSparse Server to compare the latency and accuracy of sparse BERT models.
Twitter Sentiment Analysis	Example of scraping, processing, and classifying Twitter data using the DeepSparse Engine for 10x faster performance on CPUs.
Flask Model Server	Simple model server and client example, showing how to use the DeepSparse Engine as an inference backend for a real-time inference server.