Is there any decent speech recognition software for Linux?












45















The short version of the question: I am looking for a speech recognition software that runs on Linux and has decent accuracy and usability. Any license and price is fine. It should not be restricted to voice commands, as I want to be able to dictate text.





More details:



I have unsatisfyingly tried the following:




  • CMU Sphinx

  • CVoiceControl

  • Ears

  • Julius


  • Kaldi (e.g., Kaldi GStreamer server)


  • IBM ViaVoice (used to run on Linux but was discontinued years ago)

  • NICO ANN Toolkit

  • OpenMindSpeech

  • RWTH ASR

  • shout


  • silvius (built on the Kaldi speech recognition toolkit)

  • Simon Listens

  • ViaVoice / Xvoice


  • Wine + Dragon NaturallySpeaking + NatLink + dragonfly + damselfly


All the above-mentioned native Linux solutions have both poor accuracy and usability (or some don't allow free-text dictation but only voice commands). By poor accuracy, I mean an accuracy significantly below the one the speech recognition software I mentioned below for other platforms have. As for Wine + Dragon NaturallySpeaking, in my experience it keeps crashing, and I don't seem to be the only one to have such issues unfortunately.



On Microsoft Windows I use Dragon NaturallySpeaking, on Apple Mac OS X I use Apple Dictation and DragonDictate, on Android I use Google speech recognition, and on iOS I use the built-in Apple speech recognition.



Baidu Research released yesterday the code for its speech recognition library using Connectionist Temporal Classification implemented with Torch. Benchmarks from Gigaom are encouraging as shown in the screenshot below, but I am not aware of any good wrapper around to make it usable without quite some coding (and a large training data set):



enter image description here



There exist some very alpha open-source projects:





  • https://github.com/mozilla/DeepSpeech (part of Mozilla's Vaani project: http://vaani.io (mirror))

  • https://github.com/pannous/tensorflow-speech-recognition

  • Vox, a system to control a Linux system using Dragon NaturallySpeaking: https://github.com/Franck-Dernoncourt/vox_linux + https://github.com/Franck-Dernoncourt/vox_windows

  • https://github.com/facebookresearch/wav2letter

  • https://github.com/espnet/espnet


  • http://github.com/tensorflow/lingvo (to be released by Google, mentioned at Interspeech 2018)


I am also aware of this attempt at tracking states of the arts and recent results (bibliography) on speech recognition. as well as this benchmark of existing speech recognition APIs.





I am aware of Aenea, which allows speech recognition via Dragonfly on one computer to send events to another, but it has some latency cost:



enter image description here



I am also aware of these two talks exploring Linux option for speech recognition:





  • 2016 - The Eleventh HOPE: Coding by Voice with Open Source Speech Recognition (David Williams-King)


  • 2014 - Pycon: Using Python to Code by Voice (Tavis Rudd)










share|improve this question




















  • 2





    Some detail about what you found "unsatisfying" might advance your otherwise interesting but rather general posting topic. For example: what specifically did you find unsatisfying about the "Wine + Dragon NaturallySpeaking" combination? (how did it fail to replicate your Windows experience?)

    – Theophrastus
    Jan 18 '16 at 18:20








  • 1





    @Theophrastus Basically all native Linux solutions have both poor accuracy and usability. By poor accuracy, I mean an accuracy significantly below the one the speech recognition software I mentioned for other platforms have. As for Wine + Dragon NaturallySpeaking, in my experience it keeps crashing, and I don't seem to be the only one to have such issues unfortunately (appdb.winehq.org/…)

    – Franck Dernoncourt
    Jan 18 '16 at 18:24








  • 1





    I haven't tried these, but in case someone finds it useful: github.com/Uberi/speech_recognition and jasperproject.github.io and github.com/benoitfragit/google2ubuntu

    – Hatshepsut
    Jan 6 '17 at 18:18













  • @Hatshepsut thanks for the pointers.

    – Franck Dernoncourt
    Jan 6 '17 at 18:23
















45















The short version of the question: I am looking for a speech recognition software that runs on Linux and has decent accuracy and usability. Any license and price is fine. It should not be restricted to voice commands, as I want to be able to dictate text.





More details:



I have unsatisfyingly tried the following:




  • CMU Sphinx

  • CVoiceControl

  • Ears

  • Julius


  • Kaldi (e.g., Kaldi GStreamer server)


  • IBM ViaVoice (used to run on Linux but was discontinued years ago)

  • NICO ANN Toolkit

  • OpenMindSpeech

  • RWTH ASR

  • shout


  • silvius (built on the Kaldi speech recognition toolkit)

  • Simon Listens

  • ViaVoice / Xvoice


  • Wine + Dragon NaturallySpeaking + NatLink + dragonfly + damselfly


All the above-mentioned native Linux solutions have both poor accuracy and usability (or some don't allow free-text dictation but only voice commands). By poor accuracy, I mean an accuracy significantly below the one the speech recognition software I mentioned below for other platforms have. As for Wine + Dragon NaturallySpeaking, in my experience it keeps crashing, and I don't seem to be the only one to have such issues unfortunately.



On Microsoft Windows I use Dragon NaturallySpeaking, on Apple Mac OS X I use Apple Dictation and DragonDictate, on Android I use Google speech recognition, and on iOS I use the built-in Apple speech recognition.



Baidu Research released yesterday the code for its speech recognition library using Connectionist Temporal Classification implemented with Torch. Benchmarks from Gigaom are encouraging as shown in the screenshot below, but I am not aware of any good wrapper around to make it usable without quite some coding (and a large training data set):



enter image description here



There exist some very alpha open-source projects:





  • https://github.com/mozilla/DeepSpeech (part of Mozilla's Vaani project: http://vaani.io (mirror))

  • https://github.com/pannous/tensorflow-speech-recognition

  • Vox, a system to control a Linux system using Dragon NaturallySpeaking: https://github.com/Franck-Dernoncourt/vox_linux + https://github.com/Franck-Dernoncourt/vox_windows

  • https://github.com/facebookresearch/wav2letter

  • https://github.com/espnet/espnet


  • http://github.com/tensorflow/lingvo (to be released by Google, mentioned at Interspeech 2018)


I am also aware of this attempt at tracking states of the arts and recent results (bibliography) on speech recognition. as well as this benchmark of existing speech recognition APIs.





I am aware of Aenea, which allows speech recognition via Dragonfly on one computer to send events to another, but it has some latency cost:



enter image description here



I am also aware of these two talks exploring Linux option for speech recognition:





  • 2016 - The Eleventh HOPE: Coding by Voice with Open Source Speech Recognition (David Williams-King)


  • 2014 - Pycon: Using Python to Code by Voice (Tavis Rudd)










share|improve this question




















  • 2





    Some detail about what you found "unsatisfying" might advance your otherwise interesting but rather general posting topic. For example: what specifically did you find unsatisfying about the "Wine + Dragon NaturallySpeaking" combination? (how did it fail to replicate your Windows experience?)

    – Theophrastus
    Jan 18 '16 at 18:20








  • 1





    @Theophrastus Basically all native Linux solutions have both poor accuracy and usability. By poor accuracy, I mean an accuracy significantly below the one the speech recognition software I mentioned for other platforms have. As for Wine + Dragon NaturallySpeaking, in my experience it keeps crashing, and I don't seem to be the only one to have such issues unfortunately (appdb.winehq.org/…)

    – Franck Dernoncourt
    Jan 18 '16 at 18:24








  • 1





    I haven't tried these, but in case someone finds it useful: github.com/Uberi/speech_recognition and jasperproject.github.io and github.com/benoitfragit/google2ubuntu

    – Hatshepsut
    Jan 6 '17 at 18:18













  • @Hatshepsut thanks for the pointers.

    – Franck Dernoncourt
    Jan 6 '17 at 18:23














45












45








45


24






The short version of the question: I am looking for a speech recognition software that runs on Linux and has decent accuracy and usability. Any license and price is fine. It should not be restricted to voice commands, as I want to be able to dictate text.





More details:



I have unsatisfyingly tried the following:




  • CMU Sphinx

  • CVoiceControl

  • Ears

  • Julius


  • Kaldi (e.g., Kaldi GStreamer server)


  • IBM ViaVoice (used to run on Linux but was discontinued years ago)

  • NICO ANN Toolkit

  • OpenMindSpeech

  • RWTH ASR

  • shout


  • silvius (built on the Kaldi speech recognition toolkit)

  • Simon Listens

  • ViaVoice / Xvoice


  • Wine + Dragon NaturallySpeaking + NatLink + dragonfly + damselfly


All the above-mentioned native Linux solutions have both poor accuracy and usability (or some don't allow free-text dictation but only voice commands). By poor accuracy, I mean an accuracy significantly below the one the speech recognition software I mentioned below for other platforms have. As for Wine + Dragon NaturallySpeaking, in my experience it keeps crashing, and I don't seem to be the only one to have such issues unfortunately.



On Microsoft Windows I use Dragon NaturallySpeaking, on Apple Mac OS X I use Apple Dictation and DragonDictate, on Android I use Google speech recognition, and on iOS I use the built-in Apple speech recognition.



Baidu Research released yesterday the code for its speech recognition library using Connectionist Temporal Classification implemented with Torch. Benchmarks from Gigaom are encouraging as shown in the screenshot below, but I am not aware of any good wrapper around to make it usable without quite some coding (and a large training data set):



enter image description here



There exist some very alpha open-source projects:





  • https://github.com/mozilla/DeepSpeech (part of Mozilla's Vaani project: http://vaani.io (mirror))

  • https://github.com/pannous/tensorflow-speech-recognition

  • Vox, a system to control a Linux system using Dragon NaturallySpeaking: https://github.com/Franck-Dernoncourt/vox_linux + https://github.com/Franck-Dernoncourt/vox_windows

  • https://github.com/facebookresearch/wav2letter

  • https://github.com/espnet/espnet


  • http://github.com/tensorflow/lingvo (to be released by Google, mentioned at Interspeech 2018)


I am also aware of this attempt at tracking states of the arts and recent results (bibliography) on speech recognition. as well as this benchmark of existing speech recognition APIs.





I am aware of Aenea, which allows speech recognition via Dragonfly on one computer to send events to another, but it has some latency cost:



enter image description here



I am also aware of these two talks exploring Linux option for speech recognition:





  • 2016 - The Eleventh HOPE: Coding by Voice with Open Source Speech Recognition (David Williams-King)


  • 2014 - Pycon: Using Python to Code by Voice (Tavis Rudd)










share|improve this question
















The short version of the question: I am looking for a speech recognition software that runs on Linux and has decent accuracy and usability. Any license and price is fine. It should not be restricted to voice commands, as I want to be able to dictate text.





More details:



I have unsatisfyingly tried the following:




  • CMU Sphinx

  • CVoiceControl

  • Ears

  • Julius


  • Kaldi (e.g., Kaldi GStreamer server)


  • IBM ViaVoice (used to run on Linux but was discontinued years ago)

  • NICO ANN Toolkit

  • OpenMindSpeech

  • RWTH ASR

  • shout


  • silvius (built on the Kaldi speech recognition toolkit)

  • Simon Listens

  • ViaVoice / Xvoice


  • Wine + Dragon NaturallySpeaking + NatLink + dragonfly + damselfly


All the above-mentioned native Linux solutions have both poor accuracy and usability (or some don't allow free-text dictation but only voice commands). By poor accuracy, I mean an accuracy significantly below the one the speech recognition software I mentioned below for other platforms have. As for Wine + Dragon NaturallySpeaking, in my experience it keeps crashing, and I don't seem to be the only one to have such issues unfortunately.



On Microsoft Windows I use Dragon NaturallySpeaking, on Apple Mac OS X I use Apple Dictation and DragonDictate, on Android I use Google speech recognition, and on iOS I use the built-in Apple speech recognition.



Baidu Research released yesterday the code for its speech recognition library using Connectionist Temporal Classification implemented with Torch. Benchmarks from Gigaom are encouraging as shown in the screenshot below, but I am not aware of any good wrapper around to make it usable without quite some coding (and a large training data set):



enter image description here



There exist some very alpha open-source projects:





  • https://github.com/mozilla/DeepSpeech (part of Mozilla's Vaani project: http://vaani.io (mirror))

  • https://github.com/pannous/tensorflow-speech-recognition

  • Vox, a system to control a Linux system using Dragon NaturallySpeaking: https://github.com/Franck-Dernoncourt/vox_linux + https://github.com/Franck-Dernoncourt/vox_windows

  • https://github.com/facebookresearch/wav2letter

  • https://github.com/espnet/espnet


  • http://github.com/tensorflow/lingvo (to be released by Google, mentioned at Interspeech 2018)


I am also aware of this attempt at tracking states of the arts and recent results (bibliography) on speech recognition. as well as this benchmark of existing speech recognition APIs.





I am aware of Aenea, which allows speech recognition via Dragonfly on one computer to send events to another, but it has some latency cost:



enter image description here



I am also aware of these two talks exploring Linux option for speech recognition:





  • 2016 - The Eleventh HOPE: Coding by Voice with Open Source Speech Recognition (David Williams-King)


  • 2014 - Pycon: Using Python to Code by Voice (Tavis Rudd)







software-rec speech-recognition






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Sep 10 '18 at 11:59







Franck Dernoncourt

















asked Jan 18 '16 at 18:04









Franck DernoncourtFranck Dernoncourt

1,33972552




1,33972552








  • 2





    Some detail about what you found "unsatisfying" might advance your otherwise interesting but rather general posting topic. For example: what specifically did you find unsatisfying about the "Wine + Dragon NaturallySpeaking" combination? (how did it fail to replicate your Windows experience?)

    – Theophrastus
    Jan 18 '16 at 18:20








  • 1





    @Theophrastus Basically all native Linux solutions have both poor accuracy and usability. By poor accuracy, I mean an accuracy significantly below the one the speech recognition software I mentioned for other platforms have. As for Wine + Dragon NaturallySpeaking, in my experience it keeps crashing, and I don't seem to be the only one to have such issues unfortunately (appdb.winehq.org/…)

    – Franck Dernoncourt
    Jan 18 '16 at 18:24








  • 1





    I haven't tried these, but in case someone finds it useful: github.com/Uberi/speech_recognition and jasperproject.github.io and github.com/benoitfragit/google2ubuntu

    – Hatshepsut
    Jan 6 '17 at 18:18













  • @Hatshepsut thanks for the pointers.

    – Franck Dernoncourt
    Jan 6 '17 at 18:23














  • 2





    Some detail about what you found "unsatisfying" might advance your otherwise interesting but rather general posting topic. For example: what specifically did you find unsatisfying about the "Wine + Dragon NaturallySpeaking" combination? (how did it fail to replicate your Windows experience?)

    – Theophrastus
    Jan 18 '16 at 18:20








  • 1





    @Theophrastus Basically all native Linux solutions have both poor accuracy and usability. By poor accuracy, I mean an accuracy significantly below the one the speech recognition software I mentioned for other platforms have. As for Wine + Dragon NaturallySpeaking, in my experience it keeps crashing, and I don't seem to be the only one to have such issues unfortunately (appdb.winehq.org/…)

    – Franck Dernoncourt
    Jan 18 '16 at 18:24








  • 1





    I haven't tried these, but in case someone finds it useful: github.com/Uberi/speech_recognition and jasperproject.github.io and github.com/benoitfragit/google2ubuntu

    – Hatshepsut
    Jan 6 '17 at 18:18













  • @Hatshepsut thanks for the pointers.

    – Franck Dernoncourt
    Jan 6 '17 at 18:23








2




2





Some detail about what you found "unsatisfying" might advance your otherwise interesting but rather general posting topic. For example: what specifically did you find unsatisfying about the "Wine + Dragon NaturallySpeaking" combination? (how did it fail to replicate your Windows experience?)

– Theophrastus
Jan 18 '16 at 18:20







Some detail about what you found "unsatisfying" might advance your otherwise interesting but rather general posting topic. For example: what specifically did you find unsatisfying about the "Wine + Dragon NaturallySpeaking" combination? (how did it fail to replicate your Windows experience?)

– Theophrastus
Jan 18 '16 at 18:20






1




1





@Theophrastus Basically all native Linux solutions have both poor accuracy and usability. By poor accuracy, I mean an accuracy significantly below the one the speech recognition software I mentioned for other platforms have. As for Wine + Dragon NaturallySpeaking, in my experience it keeps crashing, and I don't seem to be the only one to have such issues unfortunately (appdb.winehq.org/…)

– Franck Dernoncourt
Jan 18 '16 at 18:24







@Theophrastus Basically all native Linux solutions have both poor accuracy and usability. By poor accuracy, I mean an accuracy significantly below the one the speech recognition software I mentioned for other platforms have. As for Wine + Dragon NaturallySpeaking, in my experience it keeps crashing, and I don't seem to be the only one to have such issues unfortunately (appdb.winehq.org/…)

– Franck Dernoncourt
Jan 18 '16 at 18:24






1




1





I haven't tried these, but in case someone finds it useful: github.com/Uberi/speech_recognition and jasperproject.github.io and github.com/benoitfragit/google2ubuntu

– Hatshepsut
Jan 6 '17 at 18:18







I haven't tried these, but in case someone finds it useful: github.com/Uberi/speech_recognition and jasperproject.github.io and github.com/benoitfragit/google2ubuntu

– Hatshepsut
Jan 6 '17 at 18:18















@Hatshepsut thanks for the pointers.

– Franck Dernoncourt
Jan 6 '17 at 18:23





@Hatshepsut thanks for the pointers.

– Franck Dernoncourt
Jan 6 '17 at 18:23










6 Answers
6






active

oldest

votes


















11














Right now I'm experimenting with using KDE connect in combination with Google speech recognition on my android smartphone.



KDE connect allows you to use your android device as an input device for your Linux computer (there are also some other features). You need to install the KDE connect app from the Google play store on your smartphone/tablet and install both kdeconnect and indicator-kdeconnect on your Linux computer. For Ubuntu systems the install goes as follows:



sudo add-apt-repository ppa:vikoadi/ppa
sudo apt update
sudo apt install kdeconnect indicator-kdeconnect


The downside of this installation is that it installs a bunch of KDE packages that you don't need if you don't use the KDE desktop environment.



Once you pair your android device with your computer (they have to be on the same network) you can use the android keyboard and then click/press on the mic to use Google speech recognition. As you talk, text will start to appear where ever your cursor is active on your Linux computer.



As for the results, they are a bit mixed for me as I'm currently writing some technical astrophysics document and Google speech recognition is struggling with the jargon that you don't typically read. Also forget about it figuring out punctuation or proper capitalization.



enter image description here



enter image description here






share|improve this answer

































    4














    For now, only Voice notebook works in Linux.






    share|improve this answer





















    • 2





      Thanks, it only works in the Chrome browser though.

      – Franck Dernoncourt
      Oct 16 '16 at 17:36



















    3














    As one more Linuxer searching for a useful speech-to-text (dictation) program, I took a look into speechpad.pw:




    • it recognizes my mother tongue very well

    • it works fast and very reliable


    Downsides:




    • of course it is proprietary and closed software from Google

    • a Google service will listen to, process and supposedly store every word you speak

    • audio and text will be processed and obviously stored by Google

    • speechpad.pw requires a monthly / quaterly / yearly subscription fee

    • speechpad.pw only runs as an addon to Google Chrome browser - no other browser


    So, speechpad.pw is very proprietary and also closed source and also bound to Google which we all know as a sleepless meta data, personal information and personal contents collector.



    These downsides make it a no-go application for me though the speech recognition itself works very well - much better than anything else I have seen so far.






    share|improve this answer


























    • Thanks, yes significant downsides, especially that it only works in the Chrome browser.

      – Franck Dernoncourt
      Oct 28 '16 at 22:45






    • 1





      You could use Google Docs on Chrome and use their "Tools" » "Voices Typing ..." option. Probably exact same speech recognition software, but it's free. Then copy paste the results from your doc to wherever you need the text.

      – Alexis Wilke
      Nov 10 '17 at 20:19



















    2














    The Chrome App "VoiceNote II" (http://voicenote.in/) is working great on my Xubuntu 16.04 machine. No voice-training required, and set-up was simple. One search to find it, one click to install, one click to create a shortcut and to the Desktop bind it.






    share|improve this answer
























    • Thanks, works only in Google Chrome though

      – Franck Dernoncourt
      Aug 8 '17 at 14:37





















    0














    I would suggest using dragon on your phone or tablet, then emailing the text to yourself. Its a drag but it works and is very accurate. If you insist on using Linux for this, getting a second display will make life much easier to copy and past.



    I haven't tried this but you might be able to use or adapt the Python Bluetooth Chat program with dragon on your tablet/phone.
    There may also be remote-keyboard apps for mobile devices that may support dictation input.



    I shall experiment and try to get back to you with something more definitive.






    share|improve this answer































      -3














      You can use speech to text in Linux application This application use Google Speech Api and binary integration module for 32 or 64 bit Linux. You can see a short presentation of using speechpad.pw tools in Ubuntu






      share|improve this answer



















      • 1





        OP is looking for a speech-to-text engine. That's just a web-UI wrapper (and a bad one at that) around a STT engine.

        – Cerin
        Aug 8 '16 at 17:37











      Your Answer








      StackExchange.ready(function() {
      var channelOptions = {
      tags: "".split(" "),
      id: "106"
      };
      initTagRenderer("".split(" "), "".split(" "), channelOptions);

      StackExchange.using("externalEditor", function() {
      // Have to fire editor after snippets, if snippets enabled
      if (StackExchange.settings.snippets.snippetsEnabled) {
      StackExchange.using("snippets", function() {
      createEditor();
      });
      }
      else {
      createEditor();
      }
      });

      function createEditor() {
      StackExchange.prepareEditor({
      heartbeatType: 'answer',
      autoActivateHeartbeat: false,
      convertImagesToLinks: false,
      noModals: true,
      showLowRepImageUploadWarning: true,
      reputationToPostImages: null,
      bindNavPrevention: true,
      postfix: "",
      imageUploader: {
      brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
      contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
      allowUrls: true
      },
      onDemand: true,
      discardSelector: ".discard-answer"
      ,immediatelyShowMarkdownHelp:true
      });


      }
      });














      draft saved

      draft discarded


















      StackExchange.ready(
      function () {
      StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2funix.stackexchange.com%2fquestions%2f256138%2fis-there-any-decent-speech-recognition-software-for-linux%23new-answer', 'question_page');
      }
      );

      Post as a guest















      Required, but never shown

























      6 Answers
      6






      active

      oldest

      votes








      6 Answers
      6






      active

      oldest

      votes









      active

      oldest

      votes






      active

      oldest

      votes









      11














      Right now I'm experimenting with using KDE connect in combination with Google speech recognition on my android smartphone.



      KDE connect allows you to use your android device as an input device for your Linux computer (there are also some other features). You need to install the KDE connect app from the Google play store on your smartphone/tablet and install both kdeconnect and indicator-kdeconnect on your Linux computer. For Ubuntu systems the install goes as follows:



      sudo add-apt-repository ppa:vikoadi/ppa
      sudo apt update
      sudo apt install kdeconnect indicator-kdeconnect


      The downside of this installation is that it installs a bunch of KDE packages that you don't need if you don't use the KDE desktop environment.



      Once you pair your android device with your computer (they have to be on the same network) you can use the android keyboard and then click/press on the mic to use Google speech recognition. As you talk, text will start to appear where ever your cursor is active on your Linux computer.



      As for the results, they are a bit mixed for me as I'm currently writing some technical astrophysics document and Google speech recognition is struggling with the jargon that you don't typically read. Also forget about it figuring out punctuation or proper capitalization.



      enter image description here



      enter image description here






      share|improve this answer






























        11














        Right now I'm experimenting with using KDE connect in combination with Google speech recognition on my android smartphone.



        KDE connect allows you to use your android device as an input device for your Linux computer (there are also some other features). You need to install the KDE connect app from the Google play store on your smartphone/tablet and install both kdeconnect and indicator-kdeconnect on your Linux computer. For Ubuntu systems the install goes as follows:



        sudo add-apt-repository ppa:vikoadi/ppa
        sudo apt update
        sudo apt install kdeconnect indicator-kdeconnect


        The downside of this installation is that it installs a bunch of KDE packages that you don't need if you don't use the KDE desktop environment.



        Once you pair your android device with your computer (they have to be on the same network) you can use the android keyboard and then click/press on the mic to use Google speech recognition. As you talk, text will start to appear where ever your cursor is active on your Linux computer.



        As for the results, they are a bit mixed for me as I'm currently writing some technical astrophysics document and Google speech recognition is struggling with the jargon that you don't typically read. Also forget about it figuring out punctuation or proper capitalization.



        enter image description here



        enter image description here






        share|improve this answer




























          11












          11








          11







          Right now I'm experimenting with using KDE connect in combination with Google speech recognition on my android smartphone.



          KDE connect allows you to use your android device as an input device for your Linux computer (there are also some other features). You need to install the KDE connect app from the Google play store on your smartphone/tablet and install both kdeconnect and indicator-kdeconnect on your Linux computer. For Ubuntu systems the install goes as follows:



          sudo add-apt-repository ppa:vikoadi/ppa
          sudo apt update
          sudo apt install kdeconnect indicator-kdeconnect


          The downside of this installation is that it installs a bunch of KDE packages that you don't need if you don't use the KDE desktop environment.



          Once you pair your android device with your computer (they have to be on the same network) you can use the android keyboard and then click/press on the mic to use Google speech recognition. As you talk, text will start to appear where ever your cursor is active on your Linux computer.



          As for the results, they are a bit mixed for me as I'm currently writing some technical astrophysics document and Google speech recognition is struggling with the jargon that you don't typically read. Also forget about it figuring out punctuation or proper capitalization.



          enter image description here



          enter image description here






          share|improve this answer















          Right now I'm experimenting with using KDE connect in combination with Google speech recognition on my android smartphone.



          KDE connect allows you to use your android device as an input device for your Linux computer (there are also some other features). You need to install the KDE connect app from the Google play store on your smartphone/tablet and install both kdeconnect and indicator-kdeconnect on your Linux computer. For Ubuntu systems the install goes as follows:



          sudo add-apt-repository ppa:vikoadi/ppa
          sudo apt update
          sudo apt install kdeconnect indicator-kdeconnect


          The downside of this installation is that it installs a bunch of KDE packages that you don't need if you don't use the KDE desktop environment.



          Once you pair your android device with your computer (they have to be on the same network) you can use the android keyboard and then click/press on the mic to use Google speech recognition. As you talk, text will start to appear where ever your cursor is active on your Linux computer.



          As for the results, they are a bit mixed for me as I'm currently writing some technical astrophysics document and Google speech recognition is struggling with the jargon that you don't typically read. Also forget about it figuring out punctuation or proper capitalization.



          enter image description here



          enter image description here







          share|improve this answer














          share|improve this answer



          share|improve this answer








          edited Oct 6 '16 at 22:01









          Franck Dernoncourt

          1,33972552




          1,33972552










          answered Oct 6 '16 at 20:28









          shockburnershockburner

          21126




          21126

























              4














              For now, only Voice notebook works in Linux.






              share|improve this answer





















              • 2





                Thanks, it only works in the Chrome browser though.

                – Franck Dernoncourt
                Oct 16 '16 at 17:36
















              4














              For now, only Voice notebook works in Linux.






              share|improve this answer





















              • 2





                Thanks, it only works in the Chrome browser though.

                – Franck Dernoncourt
                Oct 16 '16 at 17:36














              4












              4








              4







              For now, only Voice notebook works in Linux.






              share|improve this answer















              For now, only Voice notebook works in Linux.







              share|improve this answer














              share|improve this answer



              share|improve this answer








              edited May 26 '17 at 10:07









              Jeff Schaller

              39.4k1054125




              39.4k1054125










              answered Oct 16 '16 at 17:30









              AlexeiAlexei

              411




              411








              • 2





                Thanks, it only works in the Chrome browser though.

                – Franck Dernoncourt
                Oct 16 '16 at 17:36














              • 2





                Thanks, it only works in the Chrome browser though.

                – Franck Dernoncourt
                Oct 16 '16 at 17:36








              2




              2





              Thanks, it only works in the Chrome browser though.

              – Franck Dernoncourt
              Oct 16 '16 at 17:36





              Thanks, it only works in the Chrome browser though.

              – Franck Dernoncourt
              Oct 16 '16 at 17:36











              3














              As one more Linuxer searching for a useful speech-to-text (dictation) program, I took a look into speechpad.pw:




              • it recognizes my mother tongue very well

              • it works fast and very reliable


              Downsides:




              • of course it is proprietary and closed software from Google

              • a Google service will listen to, process and supposedly store every word you speak

              • audio and text will be processed and obviously stored by Google

              • speechpad.pw requires a monthly / quaterly / yearly subscription fee

              • speechpad.pw only runs as an addon to Google Chrome browser - no other browser


              So, speechpad.pw is very proprietary and also closed source and also bound to Google which we all know as a sleepless meta data, personal information and personal contents collector.



              These downsides make it a no-go application for me though the speech recognition itself works very well - much better than anything else I have seen so far.






              share|improve this answer


























              • Thanks, yes significant downsides, especially that it only works in the Chrome browser.

                – Franck Dernoncourt
                Oct 28 '16 at 22:45






              • 1





                You could use Google Docs on Chrome and use their "Tools" » "Voices Typing ..." option. Probably exact same speech recognition software, but it's free. Then copy paste the results from your doc to wherever you need the text.

                – Alexis Wilke
                Nov 10 '17 at 20:19
















              3














              As one more Linuxer searching for a useful speech-to-text (dictation) program, I took a look into speechpad.pw:




              • it recognizes my mother tongue very well

              • it works fast and very reliable


              Downsides:




              • of course it is proprietary and closed software from Google

              • a Google service will listen to, process and supposedly store every word you speak

              • audio and text will be processed and obviously stored by Google

              • speechpad.pw requires a monthly / quaterly / yearly subscription fee

              • speechpad.pw only runs as an addon to Google Chrome browser - no other browser


              So, speechpad.pw is very proprietary and also closed source and also bound to Google which we all know as a sleepless meta data, personal information and personal contents collector.



              These downsides make it a no-go application for me though the speech recognition itself works very well - much better than anything else I have seen so far.






              share|improve this answer


























              • Thanks, yes significant downsides, especially that it only works in the Chrome browser.

                – Franck Dernoncourt
                Oct 28 '16 at 22:45






              • 1





                You could use Google Docs on Chrome and use their "Tools" » "Voices Typing ..." option. Probably exact same speech recognition software, but it's free. Then copy paste the results from your doc to wherever you need the text.

                – Alexis Wilke
                Nov 10 '17 at 20:19














              3












              3








              3







              As one more Linuxer searching for a useful speech-to-text (dictation) program, I took a look into speechpad.pw:




              • it recognizes my mother tongue very well

              • it works fast and very reliable


              Downsides:




              • of course it is proprietary and closed software from Google

              • a Google service will listen to, process and supposedly store every word you speak

              • audio and text will be processed and obviously stored by Google

              • speechpad.pw requires a monthly / quaterly / yearly subscription fee

              • speechpad.pw only runs as an addon to Google Chrome browser - no other browser


              So, speechpad.pw is very proprietary and also closed source and also bound to Google which we all know as a sleepless meta data, personal information and personal contents collector.



              These downsides make it a no-go application for me though the speech recognition itself works very well - much better than anything else I have seen so far.






              share|improve this answer















              As one more Linuxer searching for a useful speech-to-text (dictation) program, I took a look into speechpad.pw:




              • it recognizes my mother tongue very well

              • it works fast and very reliable


              Downsides:




              • of course it is proprietary and closed software from Google

              • a Google service will listen to, process and supposedly store every word you speak

              • audio and text will be processed and obviously stored by Google

              • speechpad.pw requires a monthly / quaterly / yearly subscription fee

              • speechpad.pw only runs as an addon to Google Chrome browser - no other browser


              So, speechpad.pw is very proprietary and also closed source and also bound to Google which we all know as a sleepless meta data, personal information and personal contents collector.



              These downsides make it a no-go application for me though the speech recognition itself works very well - much better than anything else I have seen so far.







              share|improve this answer














              share|improve this answer



              share|improve this answer








              edited Oct 29 '16 at 17:32

























              answered Oct 28 '16 at 22:29









              tootoo

              413




              413













              • Thanks, yes significant downsides, especially that it only works in the Chrome browser.

                – Franck Dernoncourt
                Oct 28 '16 at 22:45






              • 1





                You could use Google Docs on Chrome and use their "Tools" » "Voices Typing ..." option. Probably exact same speech recognition software, but it's free. Then copy paste the results from your doc to wherever you need the text.

                – Alexis Wilke
                Nov 10 '17 at 20:19



















              • Thanks, yes significant downsides, especially that it only works in the Chrome browser.

                – Franck Dernoncourt
                Oct 28 '16 at 22:45






              • 1





                You could use Google Docs on Chrome and use their "Tools" » "Voices Typing ..." option. Probably exact same speech recognition software, but it's free. Then copy paste the results from your doc to wherever you need the text.

                – Alexis Wilke
                Nov 10 '17 at 20:19

















              Thanks, yes significant downsides, especially that it only works in the Chrome browser.

              – Franck Dernoncourt
              Oct 28 '16 at 22:45





              Thanks, yes significant downsides, especially that it only works in the Chrome browser.

              – Franck Dernoncourt
              Oct 28 '16 at 22:45




              1




              1





              You could use Google Docs on Chrome and use their "Tools" » "Voices Typing ..." option. Probably exact same speech recognition software, but it's free. Then copy paste the results from your doc to wherever you need the text.

              – Alexis Wilke
              Nov 10 '17 at 20:19





              You could use Google Docs on Chrome and use their "Tools" » "Voices Typing ..." option. Probably exact same speech recognition software, but it's free. Then copy paste the results from your doc to wherever you need the text.

              – Alexis Wilke
              Nov 10 '17 at 20:19











              2














              The Chrome App "VoiceNote II" (http://voicenote.in/) is working great on my Xubuntu 16.04 machine. No voice-training required, and set-up was simple. One search to find it, one click to install, one click to create a shortcut and to the Desktop bind it.






              share|improve this answer
























              • Thanks, works only in Google Chrome though

                – Franck Dernoncourt
                Aug 8 '17 at 14:37


















              2














              The Chrome App "VoiceNote II" (http://voicenote.in/) is working great on my Xubuntu 16.04 machine. No voice-training required, and set-up was simple. One search to find it, one click to install, one click to create a shortcut and to the Desktop bind it.






              share|improve this answer
























              • Thanks, works only in Google Chrome though

                – Franck Dernoncourt
                Aug 8 '17 at 14:37
















              2












              2








              2







              The Chrome App "VoiceNote II" (http://voicenote.in/) is working great on my Xubuntu 16.04 machine. No voice-training required, and set-up was simple. One search to find it, one click to install, one click to create a shortcut and to the Desktop bind it.






              share|improve this answer













              The Chrome App "VoiceNote II" (http://voicenote.in/) is working great on my Xubuntu 16.04 machine. No voice-training required, and set-up was simple. One search to find it, one click to install, one click to create a shortcut and to the Desktop bind it.







              share|improve this answer












              share|improve this answer



              share|improve this answer










              answered Aug 8 '17 at 14:36









              Indy Tech FixIndy Tech Fix

              211




              211













              • Thanks, works only in Google Chrome though

                – Franck Dernoncourt
                Aug 8 '17 at 14:37





















              • Thanks, works only in Google Chrome though

                – Franck Dernoncourt
                Aug 8 '17 at 14:37



















              Thanks, works only in Google Chrome though

              – Franck Dernoncourt
              Aug 8 '17 at 14:37







              Thanks, works only in Google Chrome though

              – Franck Dernoncourt
              Aug 8 '17 at 14:37













              0














              I would suggest using dragon on your phone or tablet, then emailing the text to yourself. Its a drag but it works and is very accurate. If you insist on using Linux for this, getting a second display will make life much easier to copy and past.



              I haven't tried this but you might be able to use or adapt the Python Bluetooth Chat program with dragon on your tablet/phone.
              There may also be remote-keyboard apps for mobile devices that may support dictation input.



              I shall experiment and try to get back to you with something more definitive.






              share|improve this answer




























                0














                I would suggest using dragon on your phone or tablet, then emailing the text to yourself. Its a drag but it works and is very accurate. If you insist on using Linux for this, getting a second display will make life much easier to copy and past.



                I haven't tried this but you might be able to use or adapt the Python Bluetooth Chat program with dragon on your tablet/phone.
                There may also be remote-keyboard apps for mobile devices that may support dictation input.



                I shall experiment and try to get back to you with something more definitive.






                share|improve this answer


























                  0












                  0








                  0







                  I would suggest using dragon on your phone or tablet, then emailing the text to yourself. Its a drag but it works and is very accurate. If you insist on using Linux for this, getting a second display will make life much easier to copy and past.



                  I haven't tried this but you might be able to use or adapt the Python Bluetooth Chat program with dragon on your tablet/phone.
                  There may also be remote-keyboard apps for mobile devices that may support dictation input.



                  I shall experiment and try to get back to you with something more definitive.






                  share|improve this answer













                  I would suggest using dragon on your phone or tablet, then emailing the text to yourself. Its a drag but it works and is very accurate. If you insist on using Linux for this, getting a second display will make life much easier to copy and past.



                  I haven't tried this but you might be able to use or adapt the Python Bluetooth Chat program with dragon on your tablet/phone.
                  There may also be remote-keyboard apps for mobile devices that may support dictation input.



                  I shall experiment and try to get back to you with something more definitive.







                  share|improve this answer












                  share|improve this answer



                  share|improve this answer










                  answered Jan 31 '18 at 6:51









                  user273470user273470

                  1




                  1























                      -3














                      You can use speech to text in Linux application This application use Google Speech Api and binary integration module for 32 or 64 bit Linux. You can see a short presentation of using speechpad.pw tools in Ubuntu






                      share|improve this answer



















                      • 1





                        OP is looking for a speech-to-text engine. That's just a web-UI wrapper (and a bad one at that) around a STT engine.

                        – Cerin
                        Aug 8 '16 at 17:37
















                      -3














                      You can use speech to text in Linux application This application use Google Speech Api and binary integration module for 32 or 64 bit Linux. You can see a short presentation of using speechpad.pw tools in Ubuntu






                      share|improve this answer



















                      • 1





                        OP is looking for a speech-to-text engine. That's just a web-UI wrapper (and a bad one at that) around a STT engine.

                        – Cerin
                        Aug 8 '16 at 17:37














                      -3












                      -3








                      -3







                      You can use speech to text in Linux application This application use Google Speech Api and binary integration module for 32 or 64 bit Linux. You can see a short presentation of using speechpad.pw tools in Ubuntu






                      share|improve this answer













                      You can use speech to text in Linux application This application use Google Speech Api and binary integration module for 32 or 64 bit Linux. You can see a short presentation of using speechpad.pw tools in Ubuntu







                      share|improve this answer












                      share|improve this answer



                      share|improve this answer










                      answered Jul 6 '16 at 15:28









                      Pavel PopovPavel Popov

                      1




                      1








                      • 1





                        OP is looking for a speech-to-text engine. That's just a web-UI wrapper (and a bad one at that) around a STT engine.

                        – Cerin
                        Aug 8 '16 at 17:37














                      • 1





                        OP is looking for a speech-to-text engine. That's just a web-UI wrapper (and a bad one at that) around a STT engine.

                        – Cerin
                        Aug 8 '16 at 17:37








                      1




                      1





                      OP is looking for a speech-to-text engine. That's just a web-UI wrapper (and a bad one at that) around a STT engine.

                      – Cerin
                      Aug 8 '16 at 17:37





                      OP is looking for a speech-to-text engine. That's just a web-UI wrapper (and a bad one at that) around a STT engine.

                      – Cerin
                      Aug 8 '16 at 17:37


















                      draft saved

                      draft discarded




















































                      Thanks for contributing an answer to Unix & Linux Stack Exchange!


                      • Please be sure to answer the question. Provide details and share your research!

                      But avoid



                      • Asking for help, clarification, or responding to other answers.

                      • Making statements based on opinion; back them up with references or personal experience.


                      To learn more, see our tips on writing great answers.




                      draft saved


                      draft discarded














                      StackExchange.ready(
                      function () {
                      StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2funix.stackexchange.com%2fquestions%2f256138%2fis-there-any-decent-speech-recognition-software-for-linux%23new-answer', 'question_page');
                      }
                      );

                      Post as a guest















                      Required, but never shown





















































                      Required, but never shown














                      Required, but never shown












                      Required, but never shown







                      Required, but never shown

































                      Required, but never shown














                      Required, but never shown












                      Required, but never shown







                      Required, but never shown







                      Popular posts from this blog

                      Morgemoulin

                      Scott Moir

                      Souastre