This browser is no longer supported.
Upgrade to Microsoft Edge to take advantage of the latest features, security updates, and technical support.
What activity happens during the pre-processing stage of speech recognition?
The audio is converted to .wmv format.
Background noise is added to the audio signal.
Feature vectors are extracted from the audio waveform for modeling.
What are phonemes?
Artifacts that are removed from the signal as part of the clean-up process.
The smallest unit of sound in speech.
AI models that generate audio.
Why is it important to generate prosody in speech synthesis?
Prosody maximizes the volume of the audio output.
Prosody translates the speech to the language of the listener.
Prosody ensures natural pronunciation and speech cadence.
You must answer all questions before checking your work.
Was this page helpful?
Need help with this topic?
Want to try using Ask Learn to clarify or guide you through this topic?