I am trying to reduce the gap between two sentence groups in Text to Speech in Azure console. The gap between sentences can be reduced by specifying sentenceboundarysilence
i.e. sentenceboundarysilence="5ms"
, but I don’t know how to customize silence between sentence groups(as shown in below image). Basically, I want to specify the duration between the first sentence group (ending with "best game ever") and the second sentence group (starting with "Reason number one")
2
Answers
Digging through SSML docs the main ways to customize pauses is with
SentenaceBoundary
, break, andsilence
. Abreak
can be added anywhere in the text. Silence can only be used at the end of text. If you want to increase the pause between two groups of text, add abreak
at the end of the last sentence in a group of sentences. For Azure speech studio you can add a break by putting a time within square brackets like this:Here are some docs on adding breaks/silence to SSML: https://learn.microsoft.com/en-us/azure/cognitive-services/speech-service/speech-synthesis-markup-structure#add-silence
I don’t believe there is an option to reduce the pause between two sentence groups without creating some really messy SSML (you could set the time for sentence boundary to your minimum pause time, then add breaks at the start of each sentence to increase the time where needed.
This is a SSML Text for the Azure api that works perfectly.
The break tag must be used inside a voice tag.
The time is defined in Milliseconds 1000 = 1 seconds