Is there any decent speech recognition software for Linux?
The short version of the question: I am looking for a speech recognition software that runs on Linux and has decent accuracy and usability. Any license and price is fine. It should not be restricted to voice commands, as I want to be able to dictate text.
More details:
I have unsatisfyingly tried the following:
- CMU Sphinx
- CVoiceControl
- Ears
- Julius
Kaldi (e.g., Kaldi GStreamer server)
IBM ViaVoice (used to run on Linux but was discontinued years ago)- NICO ANN Toolkit
- OpenMindSpeech
- RWTH ASR
- shout
silvius (built on the Kaldi speech recognition toolkit)- Simon Listens
- ViaVoice / Xvoice
Wine + Dragon NaturallySpeaking + NatLink + dragonfly + damselfly
All the above-mentioned native Linux solutions have both poor accuracy and usability (or some don't allow free-text dictation but only voice commands). By poor accuracy, I mean an accuracy significantly below the one the speech recognition software I mentioned below for other platforms have. As for Wine + Dragon NaturallySpeaking, in my experience it keeps crashing, and I don't seem to be the only one to have such issues unfortunately.
On Microsoft Windows I use Dragon NaturallySpeaking, on Apple Mac OS X I use Apple Dictation and DragonDictate, on Android I use Google speech recognition, and on iOS I use the built-in Apple speech recognition.
Baidu Research released yesterday the code for its speech recognition library using Connectionist Temporal Classification implemented with Torch. Benchmarks from Gigaom are encouraging as shown in the screenshot below, but I am not aware of any good wrapper around to make it usable without quite some coding (and a large training data set):
There exist some very alpha open-source projects:
https://github.com/mozilla/DeepSpeech (part of Mozilla's Vaani project: http://vaani.io (mirror))- https://github.com/pannous/tensorflow-speech-recognition
- Vox, a system to control a Linux system using Dragon NaturallySpeaking: https://github.com/Franck-Dernoncourt/vox_linux + https://github.com/Franck-Dernoncourt/vox_windows
- https://github.com/facebookresearch/wav2letter
- https://github.com/espnet/espnet
http://github.com/tensorflow/lingvo (to be released by Google, mentioned at Interspeech 2018)
I am also aware of this attempt at tracking states of the arts and recent results (bibliography) on speech recognition. as well as this benchmark of existing speech recognition APIs.
I am aware of Aenea, which allows speech recognition via Dragonfly on one computer to send events to another, but it has some latency cost:
I am also aware of these two talks exploring Linux option for speech recognition:
2016 - The Eleventh HOPE: Coding by Voice with Open Source Speech Recognition (David Williams-King)
2014 - Pycon: Using Python to Code by Voice (Tavis Rudd)
software-rec speech-recognition
add a comment |
The short version of the question: I am looking for a speech recognition software that runs on Linux and has decent accuracy and usability. Any license and price is fine. It should not be restricted to voice commands, as I want to be able to dictate text.
More details:
I have unsatisfyingly tried the following:
- CMU Sphinx
- CVoiceControl
- Ears
- Julius
Kaldi (e.g., Kaldi GStreamer server)
IBM ViaVoice (used to run on Linux but was discontinued years ago)- NICO ANN Toolkit
- OpenMindSpeech
- RWTH ASR
- shout
silvius (built on the Kaldi speech recognition toolkit)- Simon Listens
- ViaVoice / Xvoice
Wine + Dragon NaturallySpeaking + NatLink + dragonfly + damselfly
All the above-mentioned native Linux solutions have both poor accuracy and usability (or some don't allow free-text dictation but only voice commands). By poor accuracy, I mean an accuracy significantly below the one the speech recognition software I mentioned below for other platforms have. As for Wine + Dragon NaturallySpeaking, in my experience it keeps crashing, and I don't seem to be the only one to have such issues unfortunately.
On Microsoft Windows I use Dragon NaturallySpeaking, on Apple Mac OS X I use Apple Dictation and DragonDictate, on Android I use Google speech recognition, and on iOS I use the built-in Apple speech recognition.
Baidu Research released yesterday the code for its speech recognition library using Connectionist Temporal Classification implemented with Torch. Benchmarks from Gigaom are encouraging as shown in the screenshot below, but I am not aware of any good wrapper around to make it usable without quite some coding (and a large training data set):
There exist some very alpha open-source projects:
https://github.com/mozilla/DeepSpeech (part of Mozilla's Vaani project: http://vaani.io (mirror))- https://github.com/pannous/tensorflow-speech-recognition
- Vox, a system to control a Linux system using Dragon NaturallySpeaking: https://github.com/Franck-Dernoncourt/vox_linux + https://github.com/Franck-Dernoncourt/vox_windows
- https://github.com/facebookresearch/wav2letter
- https://github.com/espnet/espnet
http://github.com/tensorflow/lingvo (to be released by Google, mentioned at Interspeech 2018)
I am also aware of this attempt at tracking states of the arts and recent results (bibliography) on speech recognition. as well as this benchmark of existing speech recognition APIs.
I am aware of Aenea, which allows speech recognition via Dragonfly on one computer to send events to another, but it has some latency cost:
I am also aware of these two talks exploring Linux option for speech recognition:
2016 - The Eleventh HOPE: Coding by Voice with Open Source Speech Recognition (David Williams-King)
2014 - Pycon: Using Python to Code by Voice (Tavis Rudd)
software-rec speech-recognition
2
Some detail about what you found "unsatisfying" might advance your otherwise interesting but rather general posting topic. For example: what specifically did you find unsatisfying about the "Wine + Dragon NaturallySpeaking" combination? (how did it fail to replicate your Windows experience?)
– Theophrastus
Jan 18 '16 at 18:20
1
@Theophrastus Basically all native Linux solutions have both poor accuracy and usability. By poor accuracy, I mean an accuracy significantly below the one the speech recognition software I mentioned for other platforms have. As for Wine + Dragon NaturallySpeaking, in my experience it keeps crashing, and I don't seem to be the only one to have such issues unfortunately (appdb.winehq.org/…)
– Franck Dernoncourt
Jan 18 '16 at 18:24
1
I haven't tried these, but in case someone finds it useful: github.com/Uberi/speech_recognition and jasperproject.github.io and github.com/benoitfragit/google2ubuntu
– Hatshepsut
Jan 6 '17 at 18:18
@Hatshepsut thanks for the pointers.
– Franck Dernoncourt
Jan 6 '17 at 18:23
add a comment |
The short version of the question: I am looking for a speech recognition software that runs on Linux and has decent accuracy and usability. Any license and price is fine. It should not be restricted to voice commands, as I want to be able to dictate text.
More details:
I have unsatisfyingly tried the following:
- CMU Sphinx
- CVoiceControl
- Ears
- Julius
Kaldi (e.g., Kaldi GStreamer server)
IBM ViaVoice (used to run on Linux but was discontinued years ago)- NICO ANN Toolkit
- OpenMindSpeech
- RWTH ASR
- shout
silvius (built on the Kaldi speech recognition toolkit)- Simon Listens
- ViaVoice / Xvoice
Wine + Dragon NaturallySpeaking + NatLink + dragonfly + damselfly
All the above-mentioned native Linux solutions have both poor accuracy and usability (or some don't allow free-text dictation but only voice commands). By poor accuracy, I mean an accuracy significantly below the one the speech recognition software I mentioned below for other platforms have. As for Wine + Dragon NaturallySpeaking, in my experience it keeps crashing, and I don't seem to be the only one to have such issues unfortunately.
On Microsoft Windows I use Dragon NaturallySpeaking, on Apple Mac OS X I use Apple Dictation and DragonDictate, on Android I use Google speech recognition, and on iOS I use the built-in Apple speech recognition.
Baidu Research released yesterday the code for its speech recognition library using Connectionist Temporal Classification implemented with Torch. Benchmarks from Gigaom are encouraging as shown in the screenshot below, but I am not aware of any good wrapper around to make it usable without quite some coding (and a large training data set):
There exist some very alpha open-source projects:
https://github.com/mozilla/DeepSpeech (part of Mozilla's Vaani project: http://vaani.io (mirror))- https://github.com/pannous/tensorflow-speech-recognition
- Vox, a system to control a Linux system using Dragon NaturallySpeaking: https://github.com/Franck-Dernoncourt/vox_linux + https://github.com/Franck-Dernoncourt/vox_windows
- https://github.com/facebookresearch/wav2letter
- https://github.com/espnet/espnet
http://github.com/tensorflow/lingvo (to be released by Google, mentioned at Interspeech 2018)
I am also aware of this attempt at tracking states of the arts and recent results (bibliography) on speech recognition. as well as this benchmark of existing speech recognition APIs.
I am aware of Aenea, which allows speech recognition via Dragonfly on one computer to send events to another, but it has some latency cost:
I am also aware of these two talks exploring Linux option for speech recognition:
2016 - The Eleventh HOPE: Coding by Voice with Open Source Speech Recognition (David Williams-King)
2014 - Pycon: Using Python to Code by Voice (Tavis Rudd)
software-rec speech-recognition
The short version of the question: I am looking for a speech recognition software that runs on Linux and has decent accuracy and usability. Any license and price is fine. It should not be restricted to voice commands, as I want to be able to dictate text.
More details:
I have unsatisfyingly tried the following:
- CMU Sphinx
- CVoiceControl
- Ears
- Julius
Kaldi (e.g., Kaldi GStreamer server)
IBM ViaVoice (used to run on Linux but was discontinued years ago)- NICO ANN Toolkit
- OpenMindSpeech
- RWTH ASR
- shout
silvius (built on the Kaldi speech recognition toolkit)- Simon Listens
- ViaVoice / Xvoice
Wine + Dragon NaturallySpeaking + NatLink + dragonfly + damselfly
All the above-mentioned native Linux solutions have both poor accuracy and usability (or some don't allow free-text dictation but only voice commands). By poor accuracy, I mean an accuracy significantly below the one the speech recognition software I mentioned below for other platforms have. As for Wine + Dragon NaturallySpeaking, in my experience it keeps crashing, and I don't seem to be the only one to have such issues unfortunately.
On Microsoft Windows I use Dragon NaturallySpeaking, on Apple Mac OS X I use Apple Dictation and DragonDictate, on Android I use Google speech recognition, and on iOS I use the built-in Apple speech recognition.
Baidu Research released yesterday the code for its speech recognition library using Connectionist Temporal Classification implemented with Torch. Benchmarks from Gigaom are encouraging as shown in the screenshot below, but I am not aware of any good wrapper around to make it usable without quite some coding (and a large training data set):
There exist some very alpha open-source projects:
https://github.com/mozilla/DeepSpeech (part of Mozilla's Vaani project: http://vaani.io (mirror))- https://github.com/pannous/tensorflow-speech-recognition
- Vox, a system to control a Linux system using Dragon NaturallySpeaking: https://github.com/Franck-Dernoncourt/vox_linux + https://github.com/Franck-Dernoncourt/vox_windows
- https://github.com/facebookresearch/wav2letter
- https://github.com/espnet/espnet
http://github.com/tensorflow/lingvo (to be released by Google, mentioned at Interspeech 2018)
I am also aware of this attempt at tracking states of the arts and recent results (bibliography) on speech recognition. as well as this benchmark of existing speech recognition APIs.
I am aware of Aenea, which allows speech recognition via Dragonfly on one computer to send events to another, but it has some latency cost:
I am also aware of these two talks exploring Linux option for speech recognition:
2016 - The Eleventh HOPE: Coding by Voice with Open Source Speech Recognition (David Williams-King)
2014 - Pycon: Using Python to Code by Voice (Tavis Rudd)
software-rec speech-recognition
software-rec speech-recognition
edited Sep 10 '18 at 11:59
Franck Dernoncourt
asked Jan 18 '16 at 18:04
Franck DernoncourtFranck Dernoncourt
1,33972552
1,33972552
2
Some detail about what you found "unsatisfying" might advance your otherwise interesting but rather general posting topic. For example: what specifically did you find unsatisfying about the "Wine + Dragon NaturallySpeaking" combination? (how did it fail to replicate your Windows experience?)
– Theophrastus
Jan 18 '16 at 18:20
1
@Theophrastus Basically all native Linux solutions have both poor accuracy and usability. By poor accuracy, I mean an accuracy significantly below the one the speech recognition software I mentioned for other platforms have. As for Wine + Dragon NaturallySpeaking, in my experience it keeps crashing, and I don't seem to be the only one to have such issues unfortunately (appdb.winehq.org/…)
– Franck Dernoncourt
Jan 18 '16 at 18:24
1
I haven't tried these, but in case someone finds it useful: github.com/Uberi/speech_recognition and jasperproject.github.io and github.com/benoitfragit/google2ubuntu
– Hatshepsut
Jan 6 '17 at 18:18
@Hatshepsut thanks for the pointers.
– Franck Dernoncourt
Jan 6 '17 at 18:23
add a comment |
2
Some detail about what you found "unsatisfying" might advance your otherwise interesting but rather general posting topic. For example: what specifically did you find unsatisfying about the "Wine + Dragon NaturallySpeaking" combination? (how did it fail to replicate your Windows experience?)
– Theophrastus
Jan 18 '16 at 18:20
1
@Theophrastus Basically all native Linux solutions have both poor accuracy and usability. By poor accuracy, I mean an accuracy significantly below the one the speech recognition software I mentioned for other platforms have. As for Wine + Dragon NaturallySpeaking, in my experience it keeps crashing, and I don't seem to be the only one to have such issues unfortunately (appdb.winehq.org/…)
– Franck Dernoncourt
Jan 18 '16 at 18:24
1
I haven't tried these, but in case someone finds it useful: github.com/Uberi/speech_recognition and jasperproject.github.io and github.com/benoitfragit/google2ubuntu
– Hatshepsut
Jan 6 '17 at 18:18
@Hatshepsut thanks for the pointers.
– Franck Dernoncourt
Jan 6 '17 at 18:23
2
2
Some detail about what you found "unsatisfying" might advance your otherwise interesting but rather general posting topic. For example: what specifically did you find unsatisfying about the "Wine + Dragon NaturallySpeaking" combination? (how did it fail to replicate your Windows experience?)
– Theophrastus
Jan 18 '16 at 18:20
Some detail about what you found "unsatisfying" might advance your otherwise interesting but rather general posting topic. For example: what specifically did you find unsatisfying about the "Wine + Dragon NaturallySpeaking" combination? (how did it fail to replicate your Windows experience?)
– Theophrastus
Jan 18 '16 at 18:20
1
1
@Theophrastus Basically all native Linux solutions have both poor accuracy and usability. By poor accuracy, I mean an accuracy significantly below the one the speech recognition software I mentioned for other platforms have. As for Wine + Dragon NaturallySpeaking, in my experience it keeps crashing, and I don't seem to be the only one to have such issues unfortunately (appdb.winehq.org/…)
– Franck Dernoncourt
Jan 18 '16 at 18:24
@Theophrastus Basically all native Linux solutions have both poor accuracy and usability. By poor accuracy, I mean an accuracy significantly below the one the speech recognition software I mentioned for other platforms have. As for Wine + Dragon NaturallySpeaking, in my experience it keeps crashing, and I don't seem to be the only one to have such issues unfortunately (appdb.winehq.org/…)
– Franck Dernoncourt
Jan 18 '16 at 18:24
1
1
I haven't tried these, but in case someone finds it useful: github.com/Uberi/speech_recognition and jasperproject.github.io and github.com/benoitfragit/google2ubuntu
– Hatshepsut
Jan 6 '17 at 18:18
I haven't tried these, but in case someone finds it useful: github.com/Uberi/speech_recognition and jasperproject.github.io and github.com/benoitfragit/google2ubuntu
– Hatshepsut
Jan 6 '17 at 18:18
@Hatshepsut thanks for the pointers.
– Franck Dernoncourt
Jan 6 '17 at 18:23
@Hatshepsut thanks for the pointers.
– Franck Dernoncourt
Jan 6 '17 at 18:23
add a comment |
6 Answers
6
active
oldest
votes
Right now I'm experimenting with using KDE connect in combination with Google speech recognition on my android smartphone.
KDE connect allows you to use your android device as an input device for your Linux computer (there are also some other features). You need to install the KDE connect app from the Google play store on your smartphone/tablet and install both kdeconnect and indicator-kdeconnect on your Linux computer. For Ubuntu systems the install goes as follows:
sudo add-apt-repository ppa:vikoadi/ppa
sudo apt update
sudo apt install kdeconnect indicator-kdeconnect
The downside of this installation is that it installs a bunch of KDE packages that you don't need if you don't use the KDE desktop environment.
Once you pair your android device with your computer (they have to be on the same network) you can use the android keyboard and then click/press on the mic to use Google speech recognition. As you talk, text will start to appear where ever your cursor is active on your Linux computer.
As for the results, they are a bit mixed for me as I'm currently writing some technical astrophysics document and Google speech recognition is struggling with the jargon that you don't typically read. Also forget about it figuring out punctuation or proper capitalization.
add a comment |
For now, only Voice notebook works in Linux.
2
Thanks, it only works in the Chrome browser though.
– Franck Dernoncourt
Oct 16 '16 at 17:36
add a comment |
As one more Linuxer searching for a useful speech-to-text (dictation) program, I took a look into speechpad.pw:
- it recognizes my mother tongue very well
- it works fast and very reliable
Downsides:
- of course it is proprietary and closed software from Google
- a Google service will listen to, process and supposedly store every word you speak
- audio and text will be processed and obviously stored by Google
- speechpad.pw requires a monthly / quaterly / yearly subscription fee
- speechpad.pw only runs as an addon to Google Chrome browser - no other browser
So, speechpad.pw is very proprietary and also closed source and also bound to Google which we all know as a sleepless meta data, personal information and personal contents collector.
These downsides make it a no-go application for me though the speech recognition itself works very well - much better than anything else I have seen so far.
Thanks, yes significant downsides, especially that it only works in the Chrome browser.
– Franck Dernoncourt
Oct 28 '16 at 22:45
1
You could use Google Docs on Chrome and use their "Tools" » "Voices Typing ..." option. Probably exact same speech recognition software, but it's free. Then copy paste the results from your doc to wherever you need the text.
– Alexis Wilke
Nov 10 '17 at 20:19
add a comment |
The Chrome App "VoiceNote II" (http://voicenote.in/) is working great on my Xubuntu 16.04 machine. No voice-training required, and set-up was simple. One search to find it, one click to install, one click to create a shortcut and to the Desktop bind it.
Thanks, works only in Google Chrome though
– Franck Dernoncourt
Aug 8 '17 at 14:37
add a comment |
I would suggest using dragon on your phone or tablet, then emailing the text to yourself. Its a drag but it works and is very accurate. If you insist on using Linux for this, getting a second display will make life much easier to copy and past.
I haven't tried this but you might be able to use or adapt the Python Bluetooth Chat program with dragon on your tablet/phone.
There may also be remote-keyboard apps for mobile devices that may support dictation input.
I shall experiment and try to get back to you with something more definitive.
add a comment |
You can use speech to text in Linux application This application use Google Speech Api and binary integration module for 32 or 64 bit Linux. You can see a short presentation of using speechpad.pw tools in Ubuntu
1
OP is looking for a speech-to-text engine. That's just a web-UI wrapper (and a bad one at that) around a STT engine.
– Cerin
Aug 8 '16 at 17:37
add a comment |
Your Answer
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "106"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2funix.stackexchange.com%2fquestions%2f256138%2fis-there-any-decent-speech-recognition-software-for-linux%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
6 Answers
6
active
oldest
votes
6 Answers
6
active
oldest
votes
active
oldest
votes
active
oldest
votes
Right now I'm experimenting with using KDE connect in combination with Google speech recognition on my android smartphone.
KDE connect allows you to use your android device as an input device for your Linux computer (there are also some other features). You need to install the KDE connect app from the Google play store on your smartphone/tablet and install both kdeconnect and indicator-kdeconnect on your Linux computer. For Ubuntu systems the install goes as follows:
sudo add-apt-repository ppa:vikoadi/ppa
sudo apt update
sudo apt install kdeconnect indicator-kdeconnect
The downside of this installation is that it installs a bunch of KDE packages that you don't need if you don't use the KDE desktop environment.
Once you pair your android device with your computer (they have to be on the same network) you can use the android keyboard and then click/press on the mic to use Google speech recognition. As you talk, text will start to appear where ever your cursor is active on your Linux computer.
As for the results, they are a bit mixed for me as I'm currently writing some technical astrophysics document and Google speech recognition is struggling with the jargon that you don't typically read. Also forget about it figuring out punctuation or proper capitalization.
add a comment |
Right now I'm experimenting with using KDE connect in combination with Google speech recognition on my android smartphone.
KDE connect allows you to use your android device as an input device for your Linux computer (there are also some other features). You need to install the KDE connect app from the Google play store on your smartphone/tablet and install both kdeconnect and indicator-kdeconnect on your Linux computer. For Ubuntu systems the install goes as follows:
sudo add-apt-repository ppa:vikoadi/ppa
sudo apt update
sudo apt install kdeconnect indicator-kdeconnect
The downside of this installation is that it installs a bunch of KDE packages that you don't need if you don't use the KDE desktop environment.
Once you pair your android device with your computer (they have to be on the same network) you can use the android keyboard and then click/press on the mic to use Google speech recognition. As you talk, text will start to appear where ever your cursor is active on your Linux computer.
As for the results, they are a bit mixed for me as I'm currently writing some technical astrophysics document and Google speech recognition is struggling with the jargon that you don't typically read. Also forget about it figuring out punctuation or proper capitalization.
add a comment |
Right now I'm experimenting with using KDE connect in combination with Google speech recognition on my android smartphone.
KDE connect allows you to use your android device as an input device for your Linux computer (there are also some other features). You need to install the KDE connect app from the Google play store on your smartphone/tablet and install both kdeconnect and indicator-kdeconnect on your Linux computer. For Ubuntu systems the install goes as follows:
sudo add-apt-repository ppa:vikoadi/ppa
sudo apt update
sudo apt install kdeconnect indicator-kdeconnect
The downside of this installation is that it installs a bunch of KDE packages that you don't need if you don't use the KDE desktop environment.
Once you pair your android device with your computer (they have to be on the same network) you can use the android keyboard and then click/press on the mic to use Google speech recognition. As you talk, text will start to appear where ever your cursor is active on your Linux computer.
As for the results, they are a bit mixed for me as I'm currently writing some technical astrophysics document and Google speech recognition is struggling with the jargon that you don't typically read. Also forget about it figuring out punctuation or proper capitalization.
Right now I'm experimenting with using KDE connect in combination with Google speech recognition on my android smartphone.
KDE connect allows you to use your android device as an input device for your Linux computer (there are also some other features). You need to install the KDE connect app from the Google play store on your smartphone/tablet and install both kdeconnect and indicator-kdeconnect on your Linux computer. For Ubuntu systems the install goes as follows:
sudo add-apt-repository ppa:vikoadi/ppa
sudo apt update
sudo apt install kdeconnect indicator-kdeconnect
The downside of this installation is that it installs a bunch of KDE packages that you don't need if you don't use the KDE desktop environment.
Once you pair your android device with your computer (they have to be on the same network) you can use the android keyboard and then click/press on the mic to use Google speech recognition. As you talk, text will start to appear where ever your cursor is active on your Linux computer.
As for the results, they are a bit mixed for me as I'm currently writing some technical astrophysics document and Google speech recognition is struggling with the jargon that you don't typically read. Also forget about it figuring out punctuation or proper capitalization.
edited Oct 6 '16 at 22:01
Franck Dernoncourt
1,33972552
1,33972552
answered Oct 6 '16 at 20:28
shockburnershockburner
21126
21126
add a comment |
add a comment |
For now, only Voice notebook works in Linux.
2
Thanks, it only works in the Chrome browser though.
– Franck Dernoncourt
Oct 16 '16 at 17:36
add a comment |
For now, only Voice notebook works in Linux.
2
Thanks, it only works in the Chrome browser though.
– Franck Dernoncourt
Oct 16 '16 at 17:36
add a comment |
For now, only Voice notebook works in Linux.
For now, only Voice notebook works in Linux.
edited May 26 '17 at 10:07
Jeff Schaller
39.4k1054125
39.4k1054125
answered Oct 16 '16 at 17:30
AlexeiAlexei
411
411
2
Thanks, it only works in the Chrome browser though.
– Franck Dernoncourt
Oct 16 '16 at 17:36
add a comment |
2
Thanks, it only works in the Chrome browser though.
– Franck Dernoncourt
Oct 16 '16 at 17:36
2
2
Thanks, it only works in the Chrome browser though.
– Franck Dernoncourt
Oct 16 '16 at 17:36
Thanks, it only works in the Chrome browser though.
– Franck Dernoncourt
Oct 16 '16 at 17:36
add a comment |
As one more Linuxer searching for a useful speech-to-text (dictation) program, I took a look into speechpad.pw:
- it recognizes my mother tongue very well
- it works fast and very reliable
Downsides:
- of course it is proprietary and closed software from Google
- a Google service will listen to, process and supposedly store every word you speak
- audio and text will be processed and obviously stored by Google
- speechpad.pw requires a monthly / quaterly / yearly subscription fee
- speechpad.pw only runs as an addon to Google Chrome browser - no other browser
So, speechpad.pw is very proprietary and also closed source and also bound to Google which we all know as a sleepless meta data, personal information and personal contents collector.
These downsides make it a no-go application for me though the speech recognition itself works very well - much better than anything else I have seen so far.
Thanks, yes significant downsides, especially that it only works in the Chrome browser.
– Franck Dernoncourt
Oct 28 '16 at 22:45
1
You could use Google Docs on Chrome and use their "Tools" » "Voices Typing ..." option. Probably exact same speech recognition software, but it's free. Then copy paste the results from your doc to wherever you need the text.
– Alexis Wilke
Nov 10 '17 at 20:19
add a comment |
As one more Linuxer searching for a useful speech-to-text (dictation) program, I took a look into speechpad.pw:
- it recognizes my mother tongue very well
- it works fast and very reliable
Downsides:
- of course it is proprietary and closed software from Google
- a Google service will listen to, process and supposedly store every word you speak
- audio and text will be processed and obviously stored by Google
- speechpad.pw requires a monthly / quaterly / yearly subscription fee
- speechpad.pw only runs as an addon to Google Chrome browser - no other browser
So, speechpad.pw is very proprietary and also closed source and also bound to Google which we all know as a sleepless meta data, personal information and personal contents collector.
These downsides make it a no-go application for me though the speech recognition itself works very well - much better than anything else I have seen so far.
Thanks, yes significant downsides, especially that it only works in the Chrome browser.
– Franck Dernoncourt
Oct 28 '16 at 22:45
1
You could use Google Docs on Chrome and use their "Tools" » "Voices Typing ..." option. Probably exact same speech recognition software, but it's free. Then copy paste the results from your doc to wherever you need the text.
– Alexis Wilke
Nov 10 '17 at 20:19
add a comment |
As one more Linuxer searching for a useful speech-to-text (dictation) program, I took a look into speechpad.pw:
- it recognizes my mother tongue very well
- it works fast and very reliable
Downsides:
- of course it is proprietary and closed software from Google
- a Google service will listen to, process and supposedly store every word you speak
- audio and text will be processed and obviously stored by Google
- speechpad.pw requires a monthly / quaterly / yearly subscription fee
- speechpad.pw only runs as an addon to Google Chrome browser - no other browser
So, speechpad.pw is very proprietary and also closed source and also bound to Google which we all know as a sleepless meta data, personal information and personal contents collector.
These downsides make it a no-go application for me though the speech recognition itself works very well - much better than anything else I have seen so far.
As one more Linuxer searching for a useful speech-to-text (dictation) program, I took a look into speechpad.pw:
- it recognizes my mother tongue very well
- it works fast and very reliable
Downsides:
- of course it is proprietary and closed software from Google
- a Google service will listen to, process and supposedly store every word you speak
- audio and text will be processed and obviously stored by Google
- speechpad.pw requires a monthly / quaterly / yearly subscription fee
- speechpad.pw only runs as an addon to Google Chrome browser - no other browser
So, speechpad.pw is very proprietary and also closed source and also bound to Google which we all know as a sleepless meta data, personal information and personal contents collector.
These downsides make it a no-go application for me though the speech recognition itself works very well - much better than anything else I have seen so far.
edited Oct 29 '16 at 17:32
answered Oct 28 '16 at 22:29
tootoo
413
413
Thanks, yes significant downsides, especially that it only works in the Chrome browser.
– Franck Dernoncourt
Oct 28 '16 at 22:45
1
You could use Google Docs on Chrome and use their "Tools" » "Voices Typing ..." option. Probably exact same speech recognition software, but it's free. Then copy paste the results from your doc to wherever you need the text.
– Alexis Wilke
Nov 10 '17 at 20:19
add a comment |
Thanks, yes significant downsides, especially that it only works in the Chrome browser.
– Franck Dernoncourt
Oct 28 '16 at 22:45
1
You could use Google Docs on Chrome and use their "Tools" » "Voices Typing ..." option. Probably exact same speech recognition software, but it's free. Then copy paste the results from your doc to wherever you need the text.
– Alexis Wilke
Nov 10 '17 at 20:19
Thanks, yes significant downsides, especially that it only works in the Chrome browser.
– Franck Dernoncourt
Oct 28 '16 at 22:45
Thanks, yes significant downsides, especially that it only works in the Chrome browser.
– Franck Dernoncourt
Oct 28 '16 at 22:45
1
1
You could use Google Docs on Chrome and use their "Tools" » "Voices Typing ..." option. Probably exact same speech recognition software, but it's free. Then copy paste the results from your doc to wherever you need the text.
– Alexis Wilke
Nov 10 '17 at 20:19
You could use Google Docs on Chrome and use their "Tools" » "Voices Typing ..." option. Probably exact same speech recognition software, but it's free. Then copy paste the results from your doc to wherever you need the text.
– Alexis Wilke
Nov 10 '17 at 20:19
add a comment |
The Chrome App "VoiceNote II" (http://voicenote.in/) is working great on my Xubuntu 16.04 machine. No voice-training required, and set-up was simple. One search to find it, one click to install, one click to create a shortcut and to the Desktop bind it.
Thanks, works only in Google Chrome though
– Franck Dernoncourt
Aug 8 '17 at 14:37
add a comment |
The Chrome App "VoiceNote II" (http://voicenote.in/) is working great on my Xubuntu 16.04 machine. No voice-training required, and set-up was simple. One search to find it, one click to install, one click to create a shortcut and to the Desktop bind it.
Thanks, works only in Google Chrome though
– Franck Dernoncourt
Aug 8 '17 at 14:37
add a comment |
The Chrome App "VoiceNote II" (http://voicenote.in/) is working great on my Xubuntu 16.04 machine. No voice-training required, and set-up was simple. One search to find it, one click to install, one click to create a shortcut and to the Desktop bind it.
The Chrome App "VoiceNote II" (http://voicenote.in/) is working great on my Xubuntu 16.04 machine. No voice-training required, and set-up was simple. One search to find it, one click to install, one click to create a shortcut and to the Desktop bind it.
answered Aug 8 '17 at 14:36
Indy Tech FixIndy Tech Fix
211
211
Thanks, works only in Google Chrome though
– Franck Dernoncourt
Aug 8 '17 at 14:37
add a comment |
Thanks, works only in Google Chrome though
– Franck Dernoncourt
Aug 8 '17 at 14:37
Thanks, works only in Google Chrome though
– Franck Dernoncourt
Aug 8 '17 at 14:37
Thanks, works only in Google Chrome though
– Franck Dernoncourt
Aug 8 '17 at 14:37
add a comment |
I would suggest using dragon on your phone or tablet, then emailing the text to yourself. Its a drag but it works and is very accurate. If you insist on using Linux for this, getting a second display will make life much easier to copy and past.
I haven't tried this but you might be able to use or adapt the Python Bluetooth Chat program with dragon on your tablet/phone.
There may also be remote-keyboard apps for mobile devices that may support dictation input.
I shall experiment and try to get back to you with something more definitive.
add a comment |
I would suggest using dragon on your phone or tablet, then emailing the text to yourself. Its a drag but it works and is very accurate. If you insist on using Linux for this, getting a second display will make life much easier to copy and past.
I haven't tried this but you might be able to use or adapt the Python Bluetooth Chat program with dragon on your tablet/phone.
There may also be remote-keyboard apps for mobile devices that may support dictation input.
I shall experiment and try to get back to you with something more definitive.
add a comment |
I would suggest using dragon on your phone or tablet, then emailing the text to yourself. Its a drag but it works and is very accurate. If you insist on using Linux for this, getting a second display will make life much easier to copy and past.
I haven't tried this but you might be able to use or adapt the Python Bluetooth Chat program with dragon on your tablet/phone.
There may also be remote-keyboard apps for mobile devices that may support dictation input.
I shall experiment and try to get back to you with something more definitive.
I would suggest using dragon on your phone or tablet, then emailing the text to yourself. Its a drag but it works and is very accurate. If you insist on using Linux for this, getting a second display will make life much easier to copy and past.
I haven't tried this but you might be able to use or adapt the Python Bluetooth Chat program with dragon on your tablet/phone.
There may also be remote-keyboard apps for mobile devices that may support dictation input.
I shall experiment and try to get back to you with something more definitive.
answered Jan 31 '18 at 6:51
user273470user273470
1
1
add a comment |
add a comment |
You can use speech to text in Linux application This application use Google Speech Api and binary integration module for 32 or 64 bit Linux. You can see a short presentation of using speechpad.pw tools in Ubuntu
1
OP is looking for a speech-to-text engine. That's just a web-UI wrapper (and a bad one at that) around a STT engine.
– Cerin
Aug 8 '16 at 17:37
add a comment |
You can use speech to text in Linux application This application use Google Speech Api and binary integration module for 32 or 64 bit Linux. You can see a short presentation of using speechpad.pw tools in Ubuntu
1
OP is looking for a speech-to-text engine. That's just a web-UI wrapper (and a bad one at that) around a STT engine.
– Cerin
Aug 8 '16 at 17:37
add a comment |
You can use speech to text in Linux application This application use Google Speech Api and binary integration module for 32 or 64 bit Linux. You can see a short presentation of using speechpad.pw tools in Ubuntu
You can use speech to text in Linux application This application use Google Speech Api and binary integration module for 32 or 64 bit Linux. You can see a short presentation of using speechpad.pw tools in Ubuntu
answered Jul 6 '16 at 15:28
Pavel PopovPavel Popov
1
1
1
OP is looking for a speech-to-text engine. That's just a web-UI wrapper (and a bad one at that) around a STT engine.
– Cerin
Aug 8 '16 at 17:37
add a comment |
1
OP is looking for a speech-to-text engine. That's just a web-UI wrapper (and a bad one at that) around a STT engine.
– Cerin
Aug 8 '16 at 17:37
1
1
OP is looking for a speech-to-text engine. That's just a web-UI wrapper (and a bad one at that) around a STT engine.
– Cerin
Aug 8 '16 at 17:37
OP is looking for a speech-to-text engine. That's just a web-UI wrapper (and a bad one at that) around a STT engine.
– Cerin
Aug 8 '16 at 17:37
add a comment |
Thanks for contributing an answer to Unix & Linux Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2funix.stackexchange.com%2fquestions%2f256138%2fis-there-any-decent-speech-recognition-software-for-linux%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
2
Some detail about what you found "unsatisfying" might advance your otherwise interesting but rather general posting topic. For example: what specifically did you find unsatisfying about the "Wine + Dragon NaturallySpeaking" combination? (how did it fail to replicate your Windows experience?)
– Theophrastus
Jan 18 '16 at 18:20
1
@Theophrastus Basically all native Linux solutions have both poor accuracy and usability. By poor accuracy, I mean an accuracy significantly below the one the speech recognition software I mentioned for other platforms have. As for Wine + Dragon NaturallySpeaking, in my experience it keeps crashing, and I don't seem to be the only one to have such issues unfortunately (appdb.winehq.org/…)
– Franck Dernoncourt
Jan 18 '16 at 18:24
1
I haven't tried these, but in case someone finds it useful: github.com/Uberi/speech_recognition and jasperproject.github.io and github.com/benoitfragit/google2ubuntu
– Hatshepsut
Jan 6 '17 at 18:18
@Hatshepsut thanks for the pointers.
– Franck Dernoncourt
Jan 6 '17 at 18:23