{"id":6932,"date":"2019-12-22T11:04:20","date_gmt":"2019-12-22T11:04:20","guid":{"rendered":"https:\/\/gtechbooster.com\/?p=6932"},"modified":"2023-04-01T01:36:47","modified_gmt":"2023-04-01T01:36:47","slug":"deepspeech-gets-smaller","status":"publish","type":"post","link":"https:\/\/gtechbooster.com\/deepspeech-gets-smaller\/","title":{"rendered":"DeepSpeech Gets Smaller"},"content":{"rendered":"\n<p>Mozilla DeepSpeech has been updated with support for TensorFlow Lite,\n resulting in a smaller package size and faster performance on some \nplatforms.<\/p>\n\n\n\n<p>DeepSpeech is a deep learning-based automatic speech recognition (ASR) engine with a simple API, developed by Mozilla. The speech recognition technology and trained models in DeepSpeech are openly available to developers. Mozilla also provides pre-trained English models.<\/p>\n\n\n\n<p>The latest release, v0.6, comes with support for TensorFlow \nLite, the version of TensorFlow that\u2019s optimized for mobile and embedded\n devices. This has reduced the DeepSpeech package size from 98 MB to 3.7\n MB, and cut the English model size from 188 MB to 47 MB. 
The developers\n achieved these reductions using post-training quantization, a technique that \ncompresses model weights after training is done.<\/p>\n\n\n\n<p>While TensorFlow Lite is designed for mobile and embedded devices, \nthe DeepSpeech team found it also made DeepSpeech faster on desktop \nplatforms, so they&#8217;ve made it available on Windows, macOS, and Linux as \nwell as Raspberry Pi and Android. DeepSpeech v0.6 with TensorFlow Lite \nruns faster than real time on a single core of a Raspberry Pi 4. It also\n uses 22 times less memory.<\/p>\n\n\n\n<p>The move to TensorFlow 1.14 means the developers have been able to \nmake use of the CuDNN RNN APIs for their training code. This change has \nmade training around twice as fast, which means faster \nexperimentation and better models. Support for online feature \naugmentation has also been added.<\/p>\n\n\n\n<p>Along with the performance improvements, the new decoder exposes \ntiming and confidence metadata, providing new possibilities for \napplications. The API has been extended to return more than just \nthe textual transcript: you also get timing \nmetadata for each character in the transcript, and a \nper-sentence confidence value.<\/p>\n\n\n\n<p>DeepSpeech v0.6 now offers packages for Windows, with .NET, Python, JavaScript, and C bindings. 
The .NET package is available in the NuGet Gallery, and you can install it directly from Visual Studio.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">More Information<\/h2>\n\n\n\n<ul class=\"wp-block-list\"><li><a href=\"https:\/\/github.com\/mozilla\/DeepSpeech\/releases\/v0.6.0\">DeepSpeech on GitHub<\/a><\/li><li><a rel=\"noreferrer noopener\" href=\"https:\/\/www.nuget.org\/packages\/DeepSpeech\" target=\"_blank\">DeepSpeech in the NuGet Gallery<\/a><\/li><li><a href=\"https:\/\/hacks.mozilla.org\/2019\/12\/deepspeech-0-6-mozillas-speech-to-text-engine\/\">DeepSpeech 0.6: Mozilla\u2019s Speech-to-Text Engine Gets Fast, Lean, and Ubiquitous<\/a><\/li><\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Mozilla DeepSpeech has been updated with support for TensorFlow Lite, resulting in a smaller package size and faster performance on some platforms. DeepSpeech is a deep learning-based automatic speech recognition (ASR) engine with a simple API developed by Mozilla. The speech recognition technology and trained models in DeepSpeech are openly available to developers. 
Mozilla also [&hellip;]<\/p>\n","protected":false},"author":7,"featured_media":6936,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1915],"tags":[247,569,6,1134],"class_list":["post-6932","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ndocs","tag-deep-learning","tag-mozilla","tag-programming","tag-speech-recognition"],"blocksy_meta":{"styles_descriptor":{"styles":{"desktop":"","tablet":"","mobile":""},"google_fonts":[],"version":6}},"_links":{"self":[{"href":"https:\/\/gtechbooster.com\/api-json\/wp\/v2\/posts\/6932","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/gtechbooster.com\/api-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/gtechbooster.com\/api-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/gtechbooster.com\/api-json\/wp\/v2\/users\/7"}],"replies":[{"embeddable":true,"href":"https:\/\/gtechbooster.com\/api-json\/wp\/v2\/comments?post=6932"}],"version-history":[{"count":0,"href":"https:\/\/gtechbooster.com\/api-json\/wp\/v2\/posts\/6932\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/gtechbooster.com\/api-json\/wp\/v2\/media\/6936"}],"wp:attachment":[{"href":"https:\/\/gtechbooster.com\/api-json\/wp\/v2\/media?parent=6932"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/gtechbooster.com\/api-json\/wp\/v2\/categories?post=6932"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/gtechbooster.com\/api-json\/wp\/v2\/tags?post=6932"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}