
com.microsoft.cognitiveservices.speech

Classes

AudioDataStream

Represents an audio data stream used for operating on audio data as a stream.

AutoDetectSourceLanguageConfig

Represents the auto-detect source language configuration used for specifying the possible source language candidates. Note: close() must be called in order to release underlying resources held by the object.

AutoDetectSourceLanguageResult

Represents the result of auto-detecting source languages. Added in version 1.8.0.

CancellationDetails

Contains detailed information about why a result was canceled.

ClassLanguageModel

Represents a ClassLanguageModel.

Connection

Connection is a proxy class for managing the connection to the speech service of the specified Recognizer.

ConnectionEventArgs

Defines payload for connection events like Connected/Disconnected.

ConnectionMessage

ConnectionMessage represents implementation-specific messages sent to and received from the speech service.

ConnectionMessageEventArgs

Defines payload for Connection's MessageReceived events.

Diagnostics

Native logging and other diagnostics.

EmbeddedSpeechConfig

Class that defines embedded (offline) speech configuration.

Grammar

Represents a generic grammar used to improve speech recognition accuracy.

GrammarList

Allows adding multiple grammars to a SpeechRecognizer to improve the accuracy of speech recognition.

HybridSpeechConfig

Class that defines hybrid (cloud and embedded) configurations for speech recognition and speech synthesis.

KeywordRecognitionEventArgs

Defines the content of keyword recognizing/recognized events.

KeywordRecognitionModel

Represents a keyword recognition model for recognizing when the user says a keyword to initiate further speech recognition.

KeywordRecognitionResult

Defines the result of keyword recognition.

KeywordRecognizer

Performs keyword recognition on the speech input.

NoMatchDetails

Contains detailed information for NoMatch recognition results.

PhraseListGrammar

Allows additions of new phrases to improve speech recognition.

PronunciationAssessmentConfig

Represents pronunciation assessment configuration.

PronunciationAssessmentResult

Represents the result of pronunciation assessment.

PropertyCollection

Represents a collection of properties and their values.

RecognitionEventArgs

Defines payload for recognition events like Speech Start/End Detected.

RecognitionResult

Contains detailed information about the result of a recognition operation.

Recognizer

Defines the base class Recognizer which mainly contains common event handlers.

SessionEventArgs

Defines payload for SessionStarted/Stopped events.

SourceLanguageConfig

Represents source language configuration used for specifying recognition source language.

SpeechConfig

Speech configuration.

SpeechRecognitionCanceledEventArgs

Defines payload of speech recognition canceled events.

SpeechRecognitionEventArgs

Defines the contents of speech recognizing/recognized events.

SpeechRecognitionModel

Contains detailed speech recognition model information.

SpeechRecognitionResult

Defines the result of speech recognition.

SpeechRecognizer

Performs speech recognition from microphone, file, or other audio input streams, and gets transcribed text as result.

SpeechSynthesisBookmarkEventArgs

Defines contents of speech synthesis bookmark event.

SpeechSynthesisCancellationDetails

Contains detailed information about why a speech synthesis was canceled.

SpeechSynthesisEventArgs

Defines contents of speech synthesis related event.

SpeechSynthesisResult

Contains detailed information about the result of a speech synthesis operation.

SpeechSynthesisVisemeEventArgs

Defines contents of speech synthesis viseme event.

SpeechSynthesisWordBoundaryEventArgs

Defines contents of speech synthesis word boundary event.

SpeechSynthesizer

Performs speech synthesis to speaker, file, or other audio output streams, and gets synthesized audio as result.

SpeechTranslationModel

Contains detailed speech translation model information.

SynthesisVoicesResult

Contains detailed information about the retrieved synthesis voices list.

VoiceInfo

Contains detailed information about a synthesis voice.
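
The note on AutoDetectSourceLanguageConfig says that close() must be called to release underlying resources. A minimal sketch of one way to guarantee that call in Java is a try-with-resources block. The example below uses a hypothetical stand-in class, `StandInConfig`, rather than a real SDK type, so it runs without the Speech SDK; it assumes the SDK object you use can be treated as an `AutoCloseable` resource, which should be confirmed on that class's own reference page.

```java
import java.util.ArrayList;
import java.util.List;

public class CloseSketch {
    // Hypothetical stand-in for an SDK object that holds native resources.
    // It is NOT part of com.microsoft.cognitiveservices.speech.
    static final class StandInConfig implements AutoCloseable {
        private final List<String> log;
        private boolean closed = false;

        StandInConfig(List<String> log) {
            this.log = log;
            log.add("open");
        }

        void use() {
            if (closed) {
                throw new IllegalStateException("already closed");
            }
            log.add("use");
        }

        // Idempotent close: releases the (simulated) resource exactly once.
        @Override
        public void close() {
            if (!closed) {
                closed = true;
                log.add("close");
            }
        }
    }

    // Opens the stand-in, uses it, and relies on try-with-resources to
    // call close() even if use() throws.
    static List<String> run() {
        List<String> log = new ArrayList<>();
        try (StandInConfig config = new StandInConfig(log)) {
            config.use();
        }
        return log;
    }

    public static void main(String[] args) {
        System.out.println(run()); // prints [open, use, close]
    }
}
```

If the object cannot be scoped to a single block, the same guarantee can be approximated by calling close() explicitly in a `finally` clause.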

Enums

CancellationErrorCode

Defines the error code when CancellationReason is Error.

CancellationReason

Defines the possible reasons a recognition result might be canceled.

NoMatchReason

Defines the possible reasons a recognition result might not be recognized.

OutputFormat

Defines speech recognizer output formats.

ProfanityOption

Defines the profanity option for the response result.

PronunciationAssessmentGradingSystem

Defines the point system for pronunciation score calibration; default value is FivePoint.

PronunciationAssessmentGranularity

Defines the pronunciation evaluation granularity; default value is Phoneme.

PropertyId

Defines property ids.

ResultReason

Defines the possible reasons a recognition result might be generated.

ServicePropertyChannel

Defines channels used to send service properties.

SpeechSynthesisBoundaryType

Defines the boundary type of speech synthesis boundary event.

SpeechSynthesisOutputFormat

Defines the possible speech synthesis output audio format.

StreamStatus

Defines the possible status of audio data stream.

SynthesisVoiceGender

Defines the synthesis voice gender.

SynthesisVoiceStatus

Defines the status of synthesis voice.

SynthesisVoiceType

Defines the synthesis voice type.