Can Character encoding for ph value be specified? I wonder if the value has to be HTML Hex entity encoding?
<speak version="1.0" xmlns="https://www.w3.org/2001/10/synthesis" xml:lang="en-US">
<voice name="en-US-Jessa24kRUS">
<phoneme alphabet="ipa" ph="təmei̥ɾou̥"> tomato </phoneme>
</voice>
</speak>
Thank you.
⚠ Do not edit this section. It is required for docs.microsoft.com ➟ GitHub issue linking.
@Jun711
Thanks for the feedback! We are currently investigating and will update you shortly.
@Jun711 Hi Jun, in this situation the example is showing as the entity escape versions of the IPA characters. Because many platform/browser/text editor combinations do not correctly cut and paste Unicode text. Normally, one would directly use the UTF-8 representation of these symbols: "təmei̥ɾou̥"
Please refer to the doc for entire SSML doc https://www.w3.org/TR/speech-synthesis/#S3.1.10
@YutongTie-MSFT Thank you for the update. I am going to test with different phonetic characters using UTF-8 representation of phonetic characters and will report back.
@Jun711 We will now proceed to close this thread. If there are further questions regarding this matter, please respond here and @YutongTie-MSFT and we will gladly continue the discussion.
@YutongTie-MSFT
Thanks. It is working well.
I just noticed that it doesn't work when I use : (u003A) instead of ː (u02D0).
Is : (u003A) not supported?
For example,
a <phoneme alphabet='ipa' ph='ʃi:p'>sheep</phoneme>