DeepSpeech is an open source, embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers. It uses a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for speech-to-text and can be downloaded, along with other important inference material, from the DeepSpeech releases page by following the instructions in the usage docs.

Features

  • Uses a model trained by machine learning techniques
  • Based on Baidu's Deep Speech research paper
  • Uses Google's TensorFlow to make the implementation easier
  • A pre-trained English model is available for use
  • The model and other important inference material can be downloaded from the DeepSpeech releases page
  • Runs in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers
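
As a rough illustration of using the pre-trained English model for speech-to-text, the sketch below assumes the `deepspeech` Python package and NumPy are installed, and that a model file (and optionally a scorer file) has already been downloaded from the releases page. The helper names `read_pcm16_mono` and `transcribe` are hypothetical, chosen for this example; only `Model`, `enableExternalScorer`, `sampleRate`, and `stt` come from the DeepSpeech Python bindings. It also assumes the input is a mono, 16-bit WAV file at the model's sample rate.

```python
import wave


def read_pcm16_mono(path, expected_rate=16000):
    """Return the raw 16-bit mono PCM bytes of a WAV file.

    Hypothetical helper: raises ValueError if the file is not mono,
    not 16-bit, or not sampled at expected_rate.
    """
    with wave.open(path, "rb") as wav:
        if wav.getnchannels() != 1:
            raise ValueError("expected mono audio")
        if wav.getsampwidth() != 2:
            raise ValueError("expected 16-bit samples")
        if wav.getframerate() != expected_rate:
            raise ValueError(f"expected {expected_rate} Hz audio")
        return wav.readframes(wav.getnframes())


def transcribe(model_path, wav_path, scorer_path=None):
    """Transcribe one WAV file with a downloaded DeepSpeech model (sketch)."""
    # Imported here so the WAV helper above can be used without these packages.
    import numpy as np
    from deepspeech import Model

    model = Model(model_path)
    if scorer_path is not None:
        # The external scorer (language model) is optional.
        model.enableExternalScorer(scorer_path)

    # Check the audio against the sample rate the model reports.
    pcm = read_pcm16_mono(wav_path, expected_rate=model.sampleRate())
    audio = np.frombuffer(pcm, dtype=np.int16)
    return model.stt(audio)
```

Calling `transcribe("model.pbmm", "audio.wav", "model.scorer")` would return the recognized text as a string. The file names here are placeholders for whatever was downloaded from the releases page.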

License

Mozilla Public License 2.0 (MPL 2.0)
