MandarinIPAOperator — IPA for Cantonese¶
cjklib.reading.operator.MandarinIPAOperator
is an implementation
of a transcription of Standard Mandarin into the
International Phonetic Alphabet (IPA).
Specifics¶
Features:
- Tones can be marked either with tone numbers (1-4), tone contour numbers (e.g. 214), IPA tone bar characters or IPA diacritics,
- support for low third tone (1/2 third tone) with tone contour 21,
- four levels of the neutral tone for varying stress depending on the preceding syllable and
- splitting of syllables into onset and rhyme using method
getOnsetRhyme()
.
Tones¶
Tones in IPA can be expressed using different schemes. The following schemes are implemented here:
- Numbers, regular tone numbers from 1 to 5 for first tone to fifth (qingsheng),
- ChaoDigits, numbers displaying the levels of tone contours, e.g. 214 for the regular third tone,
- IPAToneBar, IPA modifying tone bar characters, e.g. ɕi˨˩˦,
- Diacritics, diacritical marks and finally
- None, no support for tone marks
Unlike other operators for Mandarin, distinction is made for six different
tonal occurrences. The third tone is affected by tone sandhi and basically
two different tone contours exist. Therefore
getTonalEntity()
and
splitEntityTone()
work with string representations as tones defined in
TONES
.
Same behaviour as found in other operators for Mandarin can be
achieved by simply using the first character of the given string:
>>> from cjklib.reading import operator
>>> ipaOp = operator.MandarinIPAOperator(toneMarkType='ipaToneBar')
>>> syllable, toneName = ipaOp.splitEntityTone(u'mən˧˥')
>>> tone = int(toneName[0])
The implemented schemes render tone information differently. Mapping might lose information so a full back-transformation can not be guaranteed.
Source¶
- Yuen Ren Chao: A Grammar of Spoken Chinese. University of California Press, Berkeley, 1968, ISBN 0-520-00219-9.