Azure-docs: Phoneme ph attribute value character encoding

Created on 11 Sep 2019  ·  5Comments  ·  Source: MicrosoftDocs/azure-docs

Can Character encoding for ph value be specified? I wonder if the value has to be HTML Hex entity encoding?

<speak version="1.0" xmlns="https://www.w3.org/2001/10/synthesis" xml:lang="en-US">
    <voice  name="en-US-Jessa24kRUS">
        <phoneme alphabet="ipa" ph="təmei̥ɾou̥"> tomato </phoneme>
    </voice>
</speak>

Thank you.


Document Details

Do not edit this section. It is required for docs.microsoft.com ➟ GitHub issue linking.

Pri2 cognitive-servicesvc cxp product-question speech-servicsubsvc triaged

All 5 comments

@Jun711
Thanks for the feedback! We are currently investigating and will update you shortly.

@Jun711 Hi Jun, in this situation the example is showing as the entity escape versions of the IPA characters. Because many platform/browser/text editor combinations do not correctly cut and paste Unicode text. Normally, one would directly use the UTF-8 representation of these symbols: "təmei̥ɾou̥"

Please refer to the doc for entire SSML doc https://www.w3.org/TR/speech-synthesis/#S3.1.10

@YutongTie-MSFT Thank you for the update. I am going to test with different phonetic characters using UTF-8 representation of phonetic characters and will report back.

@Jun711 We will now proceed to close this thread. If there are further questions regarding this matter, please respond here and @YutongTie-MSFT and we will gladly continue the discussion.

@YutongTie-MSFT
Thanks. It is working well.

I just noticed that it doesn't work when I use : (u003A) instead of ː (u02D0).
Is : (u003A) not supported?

For example,

a <phoneme alphabet='ipa' ph='ʃi:p'>sheep</phoneme>
Was this page helpful?
0 / 5 - 0 ratings