OpenAI
OpenAITTSVoice
module-attribute
OpenAITTSVoice = Literal[
"alloy",
"ash",
"ballad",
"echo",
"coral",
"fable",
"onyx",
"nova",
"sage",
"shimmer",
"verse",
]
OpenAI's text-to-speech available voices.
OpenAISpeechSynthesizer
Bases: BaseModel
Speech synthesizer using OpenAI's API.
model
class-attribute
instance-attribute
model: Literal["gpt-4o-mini-tts", "tts-1", "tts-1-hd"] = (
"gpt-4o-mini-tts"
)
Model to use for speech synthesis.
voice
class-attribute
instance-attribute
voice: OpenAITTSVoice = 'alloy'
Voice to use for speech synthesis.
speed
class-attribute
instance-attribute
Speed of speech synthesis.
timeout
class-attribute
instance-attribute
timeout: PositiveInt = 120
Timeout for speech synthesis in seconds.
instructions
class-attribute
instance-attribute
instructions: str | None = None
Instructions passed to the model. Valid only when the model is from the GPT-4o family or higher.
silence_threshold
class-attribute
instance-attribute
silence_threshold: float | None = None
Silence threshold for the audio asset.
silence_duration
class-attribute
instance-attribute
silence_duration: float | None = None
Silence duration for the audio asset.
synthesize
synthesize(
texts: Sequence[str],
*,
audio_params: AudioAssetParams | None = None,
**kwargs: Any
) -> list[AudioAsset]
Synthesize speech from texts using OpenAI's API.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
|
Sequence[str]
|
Texts to synthesize. |
required |
|
AudioAssetParams | None
|
Parameters for the audio asset. |
None
|
|
Any
|
Additional parameters for the OpenAI API. |
{}
|
Returns:
Type | Description |
---|---|
list[AudioAsset]
|
List of audio assets. |