This directory contains self-documented cloud integrations and demos to illustrate examples of DeepSparse usage.
Open a Pull Request to contribute your own examples.
| Example | Description |
|---|---|
| AWS Marketplace | How to launch a DeepSparse integrated instance via the AWS Marketplace. |
| AWS Sagemaker Integration | How to deploy a DeepSparse inference server on SageMaker. |
| AWS Serverless Integration | How to deploy a DeepSparse pipeline for batch or real-time inference on select serverless services. |
| Azure VM | How to launch a DeepSparse image in an Azure virtual machine. |
| DigitalOcean Marketplace | How to launch a DeepSparse integrated instance via the DigitalOcean Marketplace. |
| GCP Marketplace | How to launch a DeepSparse integrated instance on the Google Cloud Marketplace. |
| Google Cloud Run | How to deploy the DeepSparse Server on Cloud Run. |
| Google Kubernetes Engine | How to deploy the DeepSparse Server on GKE. |
| Example | Description |
|---|---|
| Benchmark and ONNX Model Correctness | Comparing predictions and benchmark performance between DeepSparse Engine and ONNXRuntime. |
| Benchmark UI | How to deploy a gradio UI for benchmarking SparseZoo models on a DigitalOcean instance or local machine. |
| ChatGPT Cheat Sheet | User guide for prompting ChatGPT segmented by use-case. |
| Hugging Face Transformers | Serving, benchmarking, and running NLP models from Hugging Face. |
| YOLOv3 and YOLOv5 | Serving, benchmarking, and running annotation inferences with YOLOv3 and YOLOv5 models. |
| Image Classification | How to use image classification models from SparseZoo to perform inference and benchmarking with the DeepSparse Engine. |
| Object Detection | How to use object detection models from SparseZoo to perform inference and benchmarking with the DeepSparse Engine. |
| Instance Segmentation | How to use an optimized YOLACT model and the DeepSparse Engine to perform real-time instance segmentation. |
| SparseServer.UI | A Streamlit app for deploying the DeepSparse Server to compare the latency and accuracy of sparse BERT models. |
| Twitter Sentiment Analysis | Example of scraping, processing, and classifying Twitter data using the DeepSparse Engine for 10x faster performance on CPUs. |
| Flask Model Server | Simple model server and client example, showing how to use the DeepSparse Engine as an inference backend for a real-time inference server. |