Questions tagged [voice-recognition]

Voice Recognition means identification of the person talking and is frequently misapplied to mean "Speech Recognition" - identification of what is being said.

1
vote
1answer
61 views

Why doesn't Unity voice recognition work for single letters?

I am trying to create a voice recognition game in unity. What i don't understand is when i use word such as "left" or "forward", it easily detects it. But when it comes to just using a letter, it ...
0
votes
0answers
32 views

Arduino serial pin is not receiving data

I am trying to run a Geeetech Voice Recognition Module to recognize my commands and change the color of Neopixels with them. The problem is, that my Arduino never receives the output from the voice ...
1
vote
0answers
60 views

Why isn't responsivevoice.org working with my Javascript?

I'm trying to add a Hindi Male accent to my website which currently has a US Male accent. I used a website called responsivevoice.org which gave me a unique link that I needed to put before the ...
1
vote
1answer
22 views

How to change default US Male voice to UK female or something else

const btn = document.querySelector('.talk'); const content = document.querySelector('.content'); const greetings = [ 'If you are good im good to 😉', 'Im doin alright', 'Im tired 😴'...
-1
votes
0answers
37 views

Is it possible to listen to voice in application like OK Google does? [closed]

I need to create application with voice control. Is it possible to add some service that's going to listen for speech and looking for keywords all the time without any other actions of user? like you ...
0
votes
2answers
23 views

Android how to open a menu / voice recognition

Android Studio here. I am trying to open the menu (the three dot one in the right upper corner) but without clicking it. I am using a voice recognition commands. I already tried to call it in many ...
0
votes
0answers
11 views

How do you use Voice Control in a Tizen wearable web app?

I'm attempting to follow this official guide; https://developer.tizen.org/ko/development/guides/web-application/text-input-and-voice/voice-control but I'm finding that a lot of the information is ...
0
votes
0answers
22 views

How to recognize specific word offline in Android?

I need to recognize one word similar to "hello" from file (as offline) in android. I used Google Speech Recognition, but it requires internet connection and it doesn't support read from audio file. (...
1
vote
1answer
11 views

Object has no attribute error using library pyttsx3

I have written a demo project for voice recognition bot. But I am facing some error which shows that the object has no attribute. I have attached the code below def speak(audio): print('Computer: ...
-1
votes
0answers
6 views

I need an interface that use VoiceIt API in Node.js

VoicIt is an API freely available . I want to integrate the said API with Node.Js for this I need an interface that interact and use the services of the API. can anyone pls help me.
1
vote
0answers
30 views

How to make C# print what you have said after a question

I want to make a program that when I say "what is my name" it will respond with "I don't know, what is it?" then it listens to what you are saying and example if I say "Jhon Doe" then it will say "...
0
votes
0answers
14 views

Google Glass Enterprise Edition 2.0 “Ok Glass” voice Command is not working

I am trying to use the Voice command with my application for Google Glass Enterprise Edition 2.0, but the Glass is even not detecting the Ok Glass Default Voice Trigger Command. I am wondering if I ...
1
vote
1answer
39 views

Google Speech To Text API: Enable Word Confidence Not Found

I'm not able to add the Word level confidence to my alternative results, can someone please assist? I tried reading through the following page: https://cloud.google.com/speech-to-text/docs/word-...
1
vote
0answers
31 views

How to transform an audioblob in wav file using Reactjs or Javascript?

I am working on VUI interface with Reactjs frontend. I got a BLOB file that I can play but I want to convert it to .WAV file using REACT or Javascript to send it to my server. I tried lot of things, ...
0
votes
1answer
28 views

Is it possible to implement “OK Google” like functionality using Ionic

I am trying to build an app like Alex or Google Home, suppose a user says "Hey MyApp", mic should be opened or a function associated with the button should be invoked automatically I have tried API....
1
vote
2answers
33 views

Determine fundamental frequency of voice recordings

I am using the command line tool aubiopitch to analyze voice recordings. My goal is to determine the fundamental frequency of the voice recorded. I know, of course, that the frequency varies – that's ...
0
votes
1answer
70 views

Solutions for voice recognition in AngularJS [duplicate]

I am attempting to find a solution for voice recognition for an AngularJS app I am building for Android and Electron. I have found a solution for Android in ng-speech-recognition but haven't yet ...
1
vote
0answers
25 views

SFSpeechRecognizer multiple languages

I am building a search that supports voice recognition and transforms speech to text so I am using SFSpeechRecognizer. But the problem is that I need to support multiple languages at the same time ...
0
votes
0answers
8 views

Framework voice recognition

Can anyone give me examples of voice recognition framework? Background voice authoring. I want to use voice with app in background to execute some functions
0
votes
0answers
33 views

How to fix 'No recognizer is installed' in Visual Studio 2019

I created console app (.Net Framework) to translate audio into text and display the text on the console but when I try to run the program throws an error: System.PlatformNotSupportedException: 'No ...
1
vote
1answer
44 views

noise reduction using python regarding other people's sound as noise

I want to use python to dispose of an Audio file which can recognize only my voice. For example, I speak to a raspberry pi car about "forward". It will go straight but other people who speak "forward" ...
0
votes
0answers
24 views

process.nextTick is not a function at new MicrophoneStream (microphone-stream.js:114) at recognizeMicrophone (recognize-microphone.js:107)

I am getting token in frontend from the backend .I am getting the error at var stream = recognisationMicrophone() .From there it is going in catch section. i have tried https://github.com/watson-...
0
votes
0answers
19 views

How to search something in google using voice in c#

I have downloaded some most used keywords and phrases in google and put them in a text file separated from the recognition text file and response text file. I separated the file so that I can use it ...
0
votes
0answers
26 views

how to execute voice google search

I can't search in google using my voice. I disabled search by default and can only be enabled by saying the word "search". I separated the words and phrases to be searched for to another file so that ...
1
vote
1answer
32 views

How to open a door using a phrase with speech recognition and python

i'm having some trouble with a code using the google api speech recognition. That's what i need: The "door" must open when i said the right phrase,but i'm a beginner python coder, so, i don't have ...
1
vote
0answers
27 views

Restrict Speech Recognition to one language only

I have one query About Android Speech Recognition. I want to restrict a speech recognizer to recognize only given/selected language. For example, if I select "en" as preferred language, speech ...
0
votes
1answer
40 views

webrtc vad for finding start of (possibly short) utterance

We'd like to know when in an audio file an utterance starts. The utterance can be a whole sentence or quite short, e.g. a single word. There may be some background noise (breathing, creaking, fans etc....
1
vote
1answer
26 views

Got an error during UBM speaker-adaptation with sidekit

I've already trained a UBM model and now I'm trying to implement the speaker-adaptation when I got following error. Exception: show enroll/something.wav is not in the HDF5 file I got two files "...
1
vote
0answers
36 views

Got an error during speaker-adaptation with sidekit

I'm trying to implement the speaker-adaptation of UBM using sidekit when I got following error. Exception: show enroll/something.wav is not in the HDF5 file I got two files "enroll" and "test" ...
0
votes
0answers
35 views

Using microphone, listen for sound effect

I am working on a test system for a device. I want to make sure that a specific sound effect is played from the device under test. To verify this I plan to use a small computer running Linux. The ...
0
votes
1answer
60 views

Google Assistant Custom Commands | Hey Google, Open My App's page

I want to open my android app's specific page with voice control in google assistant. I read some article "app actions" can help when it released. But Vodafone has already doing this feature. In ...
0
votes
0answers
12 views

How to handle libjingle_peerconnection with SFSpeechRecognizer for Subtitle

I am using libjingle_peerconnection for Voice Call and Video Call. Now I want to display subtitle for voice call and Video call using SFSpeechRecognizer but both libjingle_peerconnection and ...
0
votes
0answers
80 views

How to activate a button's function or start a new activity through voice interaction api using specific words?

i am willing to start a new activity when i say "ok Google" in my app accompanied with a specific word which will open an activity in the application.
0
votes
0answers
17 views

More labels for one activity in android manifest for voice Interactions

I'm able to open my activity from "OK Google" using: <activity android:name=".activities.MainActivity" android:label="Pizza"> ... But how can I start the same activity performing different ...
-2
votes
1answer
71 views

Can I classify ivectors with neural networks for language recognition?

I'm doing a language recognizer, I had planned to classify my i-vectors with neural networks, but I've read a lot of papers and they always use other methods like SVM or PLDA, can someone explain to ...
2
votes
0answers
70 views

Android - voice recognition and saving audio file - not working on some devices

I have working solution for voice recognition and saving the audio file, but the code works only on some devices. I have tried running this code on few devices. It seems that this solution works on ...
1
vote
0answers
39 views

KewordRecognizer in Unity: MissingMethodException

I am trying to experiment with a voice recognition program in unity. I am using UnityEngine.Windows.Speech.KeywordRecognizer. However, whenever I run the program, I receive the following error: ...
0
votes
0answers
27 views

How to obtain Universal Background model in sidekit for language recognition

I want to obtain ivectors in sidekit, so I have like 1000 audios , firstly I obtained my mfccs with: frontend.features.mfcc(input_sig, lowfreq=100, maxfreq=8000, nlinfilt=0, nlogfilt=24, nwin=0.025, ...
0
votes
0answers
20 views

Offline voice recognition for an unpopular language [duplicate]

I'm interested in getting voice recognition to work for a foreign language that Dragon Naturally Speaking (DNS) doesn't support (specifically, Khmer). Actually if anyone has any thoughts on how this ...
0
votes
1answer
29 views

How to import code from private GitHub repo into snips?

I understand that the normal way to use complex action code in Snips is to place the code in GitHub and pull it from there via Action Type "GitHub". Is there any way to access a private GitHub repo ...
0
votes
0answers
19 views

Is there any way to select 1 in n svm models which is the best to predict a feature?

I'm trying to solve a problem of keyword detection. In that, i have a library that contain 50 words to predict. I'm using MFCC to extract the feature of word and got 13*13 dimension vector feature. ...
9
votes
1answer
332 views

Voice recognition for alQuran Arabic

How can we compare two audio files, or voice recorder files, according to Al-Quran. Al-Quran has special pronunciation compared to Arabic pronunciation. Is it possible to do the comparison between ...
0
votes
0answers
12 views

Web Speech API formatting numbers

I'm trying to use the Web Speech API (https://developer.mozilla.org/en-US/docs/Web/API/Web_Speech_API) to listen numbers. But when i'm saying something like a document, the API try to format like a ...
0
votes
0answers
72 views

The “network not conected” when using Voice Recognition on both emulator and real device, despite manifest

I am trying to get Voice Recognition work with Android Studio. I am using the tutorial from this website. I am aware of some typical problems with Voice Recognition, so I double checked that: My ...
1
vote
0answers
196 views

How to trigger streaming voice recognition with Google Cloud API and Python

First of all I am quite new to both python and this website, so please bear with me. I have successfully performed voice recognition from a live microphone input using the example code available at ...
0
votes
0answers
16 views

PocketSphinx how to create android app to make calls with voice recognition offline

I need help to create an android application with voice recognition that will allow me to make offline calls. For that, I want to use pocketsphinx.
0
votes
0answers
76 views

Speech-to-text voice differentiation with Microsoft Speech API?

I would like to know if Microsoft Speech API on Python supports multiple voices differentiation. I saw the beta of SDK Speaker Recognition, but I don't think it works just for differentiation(more ...
1
vote
0answers
34 views

Google Voice Typing Offline Functionality

While using the google voice-to-text function in my Android-Device offline, the commands New Line and New Paragraph do not work. It just writes "new line". If I have an internet connection, it works ...
0
votes
0answers
18 views

Java record audio only when someone is speaking

I'm building a voice assistant. I have a working audio recorder that I can stop and start easily. I just want to be able to detect when the user is actually speaking (it isn't silent) so that I only ...
0
votes
0answers
32 views

ChromeVox, input range gets wrong step value. It jumps to max value

I am trying to make an input type="range" to work correctly with ChromeVox. Here the issue: if I click the right arrow key to move forward, it jumps directly to the max value that in my example is ...