Module assessment

1.

What activity happens during the pre-processing stage of speech recognition?

The audio is converted to .wmv format.

Background noise is added to the audio signal.

Feature vectors are extracted from the audio waveform for modeling.

2.

What are phonemes?

Artifacts that are removed from the signal as part of the clean-up process.

The smallest unit of sound in speech.

AI models that generate audio.

3.

Why is it important to generate prosody in speech synthesis?

Prosody maximizes the volume of the audio output.

Prosody translates the speech to the language of the listener.

Prosody ensures natural pronunciation and speech cadence.

Check your knowledge