A generic interface for speech transcription.
#include <yarp/dev/ISpeechTranscription.h>
Public Member Functions
virtual ~ISpeechTranscription ()
virtual yarp::dev::ReturnValue setLanguage (const std::string &language="auto")=0
    Sets the language for speech transcription.
virtual yarp::dev::ReturnValue getLanguage (std::string &language)=0
    Gets the current language set for speech transcription.
virtual yarp::dev::ReturnValue transcribe (const yarp::sig::Sound &sound, std::string &transcription, double &score)=0
    Performs the speech transcription.
Definition at line 20 of file ISpeechTranscription.h.
virtual ~ISpeechTranscription () [virtual, default]
virtual yarp::dev::ReturnValue getLanguage (std::string &language) [pure virtual]
Gets the current language set for speech transcription.
language | the returned string (code) representing the current speech language (e.g. ita, eng...). The corresponding setter, setLanguage, defaults to "auto". |
Implemented in WhisperSpeechTranscription, GoogleSpeechTranscription, FakePythonSpeechTranscription, FakeSpeechTranscription, and SpeechTranscription_nwc_yarp.
virtual yarp::dev::ReturnValue setLanguage (const std::string &language="auto") [pure virtual]
Sets the language for speech transcription.
language | a string (code) representing the speech language (e.g. ita, eng...). Default value is "auto". |
Implemented in WhisperSpeechTranscription, FakePythonSpeechTranscription, FakeSpeechTranscription, GoogleSpeechTranscription, and SpeechTranscription_nwc_yarp.
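The setter/getter pair documented above can be exercised as in the following minimal sketch. So that it compiles without YARP, it uses stand-in types in a hypothetical `sketch` namespace: `sketch::ReturnValue` (a plain success flag) takes the place of `yarp::dev::ReturnValue`, `ILanguage` mirrors only the two language methods, and `InMemoryTranscriber` is an invented class, not one of the implementations listed here. Rejecting an empty language code is an assumption of the sketch, not documented behaviour.

```cpp
#include <cassert>
#include <string>

namespace sketch {

// Stand-in for yarp::dev::ReturnValue (assumption: only a success flag).
struct ReturnValue {
    bool ok;
    explicit operator bool() const { return ok; }
};

// Reduced mirror of the language-related part of ISpeechTranscription.
class ILanguage {
public:
    virtual ~ILanguage() = default;
    virtual ReturnValue setLanguage(const std::string& language = "auto") = 0;
    virtual ReturnValue getLanguage(std::string& language) = 0;
};

// Hypothetical implementation that only stores the language code.
class InMemoryTranscriber : public ILanguage {
public:
    ReturnValue setLanguage(const std::string& language = "auto") override {
        // Assumption of this sketch: an empty code is rejected.
        if (language.empty()) { return ReturnValue{false}; }
        m_language = language;
        return ReturnValue{true};
    }

    ReturnValue getLanguage(std::string& language) override {
        // Invariant: setLanguage never stores an empty code.
        assert(!m_language.empty());
        language = m_language;
        return ReturnValue{true};
    }

private:
    std::string m_language = "auto";
};

// Sets a language and reads it back. Returns the code reported by the
// getter, or an empty string if either call fails.
inline std::string roundTripLanguage(ILanguage& dev, const std::string& code) {
    if (!dev.setLanguage(code)) { return {}; }
    std::string current;
    if (!dev.getLanguage(current)) { return {}; }
    return current;
}

} // namespace sketch
```

With a real device, the same calling pattern would apply to an `ISpeechTranscription` pointer; checking each returned value before reading the output string is the point of the helper.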
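virtual yarp::dev::ReturnValue transcribe (const yarp::sig::Sound &sound, std::string &transcription, double &score) [pure virtual]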
Performs the speech transcription.
sound | the audio data to transcribe |
transcription | the returned transcription (it may be empty) |
score | the returned score/confidence value, in the range 0 to 1.0. Some implementations may not provide it. |
Implemented in WhisperSpeechTranscription, GoogleSpeechTranscription, FakePythonSpeechTranscription, FakeSpeechTranscription, and SpeechTranscription_nwc_yarp.
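A caller typically has to handle three outcomes of `transcribe`: a failed call, a successful call with an empty transcription, and a successful call with text and a score. The sketch below illustrates one way to do that. It is self-contained and does not use YARP: `sketch::ReturnValue` and `sketch::Sound` are simplified stand-ins for `yarp::dev::ReturnValue` and `yarp::sig::Sound`, and `FixedTextTranscriber` and `transcribeIfConfident` are hypothetical names introduced for illustration only.

```cpp
#include <cassert>
#include <string>
#include <vector>

namespace sketch {

// Stand-in for yarp::dev::ReturnValue (assumption: only a success flag).
struct ReturnValue {
    bool ok;
    explicit operator bool() const { return ok; }
};

// Stand-in for yarp::sig::Sound (assumption: a bare list of mono samples).
struct Sound {
    std::vector<short> samples;
};

// Reduced mirror of the transcribe() part of ISpeechTranscription.
class ITranscribe {
public:
    virtual ~ITranscribe() = default;
    virtual ReturnValue transcribe(const Sound& sound,
                                   std::string& transcription,
                                   double& score) = 0;
};

// Hypothetical fake device: fixed text for non-empty audio, empty text otherwise.
class FixedTextTranscriber : public ITranscribe {
public:
    ReturnValue transcribe(const Sound& sound,
                           std::string& transcription,
                           double& score) override {
        transcription.clear();
        score = 0.0;
        if (sound.samples.empty()) {
            // Nothing to transcribe: success with an empty transcription.
            return ReturnValue{true};
        }
        transcription = "hello world";
        score = 0.9;
        return ReturnValue{true};
    }
};

// Writes to `text` only if the call succeeds, the transcription is non-empty,
// and the score reaches `minScore`. Returns whether `text` was written.
inline bool transcribeIfConfident(ITranscribe& dev, const Sound& sound,
                                  double minScore, std::string& text) {
    assert(minScore >= 0.0 && minScore <= 1.0);
    std::string candidate;
    double score = 0.0;
    if (!dev.transcribe(sound, candidate, score)) { return false; }
    if (candidate.empty() || score < minScore) { return false; }
    text = candidate;
    return true;
}

} // namespace sketch
```

Because the score may not be provided by every implementation, thresholding on it as done here is only meaningful for devices known to fill it in; for others, a threshold of 0.0 reduces the helper to a check for success and non-empty text.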