1. Introduction

Acoustic Analysis of Monophthongs in Tibetan of Yushu Dialect

Lingzhen Li

Yonghong Li

0 0 Northwest Minzu University, China National Information Technology Research Institute , Lanzhou , China

107 113

Based on experimental phonetics, this paper further reveals the acoustic characteristics of monophthongs of Tibetan in Yushu dialect with the help of Adobe Audition 3.0, Praat and other speech analysis software. Firstly, the spectrogram is analyzed and the vowel acoustic parameters are extracted; Secondly, the formant pattern diagram and acoustic vowel diagram of Yushu dialect are drawn by using the values of F1, F2 and F3, which clearly reflect the acoustic characteristics and spatial distribution position of Yushu dialect monophthongs and the relationship between F1, F2, F3 and vowel acoustic characteristics. It is concluded that the lower the tongue position is, the larger value of F1 will be, and vice versa, the smaller value of F1 will be; The more anterior the tongue is, the larger value of F2 is; on the contrary, the smaller value of F2 is; The round lip effect can reduce the F2 value.

1 Yushu dialect monophthong experiment phonetics acoustic analysis

1. Introduction

Located in the southwest of Qinghai province, Yushu region has jurisdiction over six counties: Yushu, Chengduo, Baoqian, Zaduo, Zhiduo and Qumalai. With Tibetan as the main language, it is located at the junction of Wei Zang, Kang and Anduo dialect areas. The overall phonetic appearance presents a transitional feature. Yushu dialect is traditionally classified as Kang dialect [ 1 ]. Huang Bufan [ 2 ] believes that Yushu dialect has the nature of intermediary dialect or dialect chain due to the influence of the three dialects. Yushu dialect can be regarded as a dialect juxtaposed with the three dialects. Its phonetic features are: the initials system is greatly simplified, and the plosives, affricates and fricative initials have the opposition of unvoiced (aspirated / non aspirated) and voiced [ 3 ]; Tones were initially produced to make up for the confusion caused by the disappearance of many phonemes [ 3 ]; Rich vowels, with 2 to 5 compound vowels [ 4 ].

A monophthong is a vowel with the same tongue position, lip shape and opening degree. It can exist alone in a syllable without other vowels. At present, the research on the vowels of Yushu dialect mainly includes: Huang Bufan's The phonetic characteristics and historical evolution law of Yushu Tibetan language [ 2 ] thinks that the diversity of the vowel evolution of Yushu dialect is more prominent; Dengzhen Wengmu's study on the phonology of Tibetan Yushu dialect[ 4 ] mentioned that Yushu dialect has more simple vowels than other Tibetan Languages, and has 2 to 5 compound vowels; Sangta's phonological study of Tibetan Yushu dialect [ 5 ]shows that most simple vowels in Yushu dialect are the result of the loss and weakening of ancient Tibetan finals.

In this experiment, Yushu is taken as the investigation point. According to the listening and discrimination results, eight monophthongs of Yushu dialect are determined, which are: a、i、e、o、 u 、 ʊ 、 y 、 ə. In this paper, the eight monophthongs are described from the three-dimensional spectrogram; Secondly, extracted the acoustic parameters and drawn formant patterns and acoustic vowel diagrams. By exploring the linguistic value of Yushu Tibetan dialect, we hope to provide some reference for the study of more single dialect and lay a certain foundation for phonological description. Phonological description is an important work of language formal description, so this work is also the most basic work of speech synthesis and recognition.

2. Experimental method 2.1. Experimental materials

The pronunciation vocabulary used in this experiment was selected from the Tibetan dialect questionnaire [ 6 ], and the pronunciation partners selected 557 commonly used words in the oral language from monosyllabic words. See Table 1 below for examples of pronunciation materials.

2.2. Pronunciation partner

The pronunciation partner is a female college student (22 years old) from Northwest University for Nationalities who has clear enunciation, and can speak authentic Yushu dialect without being affected by other dialects. In order to ensure the accuracy of the signal, the partner is required to be familiar with the materials and read each word twice while signal acquisition.

2.3. Voice signal acquisition

The recording was conducted in the professional recording room of Northwest University for Nationalities, with good sealing and sound insulation. Recording equipment includes notebook computer, microphone ecm-44b Lavalier microphone, eurorack ub1204fx-pro mixer, blaster X-Fi surround5.1pro external sound card, etc.; The recording software is Adobe Audition3.0, which adopts single channel recording, with sampling accuracy of 16 bits and sampling frequency of 22050hz. It can complete the recording work with high efficiency and quality, control the recording process, monitor the changes of technical indicators such as speech speed, energy and signal-to-noise ratio, and observe the voice state of the speaker. The recording samples are stored in (*.wav) format.

2.4. Experimental data processing and analysis

After the original speech is preprocessed with Adobe Audition3.0, Matlab is used to cut it into speech files corresponding to a single speech and a name, and Praat speech analysis software is used to mark the voice. When marking, syllables are marked on the first layer, and initials and finals are marked on the second layer, as shown in Fig.1, Praat speech analysis software was used to extract and analyze all acoustic parameters in this study.

3. Analysis of experimental results 3.1. Spectrogram analysis

Vowel is the most important component of voice, which is mainly reflected as formant in acoustics. The formant is the resonant frequency of the sound cavity, which is generally expressed in F, and the corresponding number is used to represent the number of formants. For vowels, F1 and F2 are closely related to the height of the vowel tongue position, the front and back of the tongue position, and the round spread of the lip shape. Therefore, the values of F1 and F2 will be taken as an important basis for describing the acoustic characteristics of vowels in phonetics. Next, select the representative sounds of eight vowels, draw a three-dimensional spectrogram, and show the acoustic characteristics of each category of vowels by analyzing the spectrogram. process, while F3 shows an upward trend, and then reaches a stable trend. Comparing the two diagrams, the high frequency energy of /a/ is still very strong, and F1, F2 and F3 are relatively higher. Comparing the two spectrograms, F1 and F2 of the former are higher. Combined with the difference of the size opening of mouth, it is verified that F1 is related to the size opening of mouth (tongue position). The larger the opening, the larger F1.

From the spectrogram in Fig.6, F1 of the vowel /e/ is about 400hz, F2 is far away from F1, about 2200hz, F2 is close to F3, the frequency energy is strong, and the distribution is relatively uniform. Compared with /i/, F2 and F3 are lower. Influenced by the front-end initials, F2 and F3 initially point to the low frequency, then rise rapidly and transition to the stable stage. The lowest end in the figure is the energy of fundamental frequency. Fig.7 is " knife " language spectrogram, F1 of vowel /ə/ is relatively high. Influenced by the previous initials, the initial value of F2 is large, and then it drops rapidly, which is very close to F3, about 1500hz. F4 and F5 have high values and relatively small energy.

3.2. Vowel formant pattern of Yushu dialect

Drawing different vowel formants into a formant pattern diagram is conducive to observing the formant corresponding pattern between vowels, and can more vividly see the location and relationship of each vowel formant. After extracting the acoustic parameters of vowels in voice samples and averaging them, the frequencies of the first three formants F1, F2 and F3 of the eight monophthongs are obtained respectively, with vowel as the abscissa and the frequencies as the ordinate, and draw the formant pattern spectrogram of Yushu dialect, as shown in Fig.10:

From the formant pattern of Yushu dialect, we can clearly see that each monophthong has its own formant distribution characteristics. It mainly shows that F1 and F2 are different in value and relative distance. According to the above Fig.10, F1 and F2 of /i/ are the largest, followed by /y/, the distance between /a/ is the smallest, followed by /ə/.

F1 values from small to large are: /i/</y/</ ʊ/</ u/</e/</o/</ ə/</ a/； F2 values in descending order are: /i/</e/</y/</a/</ ʊ/</ə/</ o/</u/.

It can be found that F1 and F2 values roughly form an inverse relationship. However, there are exceptions. For example, for the two monophthongs /e/ and /y/, the F1 value of /e/ is greater than /y/, but the F2 value is also greater than /y/, and they do not form a strict inverse proportional relationship. Considering that F2 is also related to the round spread of lip shape, that is, the round lip effect can reduce the F2 value, because the round lip effect and the back position of tongue can make the front resonant cavity larger when pronouncing.

3.3. Acoustic vowel diagram of Yushu dialect

The acoustic vowel diagram is different from the traditional vowel tongue bitmap. It is obtained according to the objective values of F1 and F2. At the same time, F1 is the vertical coordinate and F2 is the horizontal coordinate. The coordinate origin is set in the upper right corner, making its relative position roughly the same as that of the traditional vowel tongue bitmap. Jos (1948) [ 8 ] believes that although the formant frequencies of the same vowel uttered by different people are different, the relative positions of each vowel on the acoustic vowel map are stable. The position of each vowel in the Fig.11 is obtained by averaging the formant frequencies of all samples of each vowel.

4. Summary

The eight monophthongs of Yushu dialect are: a、i、e、o、u、ʊ、y、ə. /a/ is the central back low unrounded vowel, /i/ is the front high unrounded vowel, /u/ is the rear high rounded vowel, /y/ is the front high rounded vowel, /o/ is the rear medium high rounded vowel, /e/ is the rear medium high unrounded vowel, /ʊ/ is the middle high rounded lip vowel behind the center. /ə/ is the second half of the high unrounded lip vowel. The distribution of formants was consistent: the higher the tongue position was, the smaller the F1 value was; the lower the tongue position was, the larger the F1 value was; The more anterior the tongue is, the greater the F2 value is. The more posterior the tongue is, the smaller the F2 value is; In the same case, the round lip effect can reduce the value of F2.

The tone of Yushu dialect is very special. Through the analysis and research of its pronunciation, it can supplement the blank of the other three major Tibetan dialects and the world tone language. At the same time, the acoustic analysis of Yushu dialect using experimental phonetics can promote the development of phonetic information and visualization. It is hoped that with the development of science and technology, computer technology and digital signal analysis technology can be more and more applied in phonetics, and promote the further development of phonetics to fill the shortcomings of traditional phonetics.

5. Acknowledgements

This work was financially supported by NSFC grant fund (No.11964034) and Research and innovation Projects (No.2021CXZX-674). 6. References

[1]

Peng

Jin , Tibetan Jianzhi, 2nd. ed., Ethnic Publishing House , 1983 .

[2]

Bufan

Huang , Suonan jiangcai, Minghui Zhang, Phonetic characteristics and historical evolution of Yushu Tibetan language , Chinese Tibetology, ( 1994 ) (2) 24 .

[3] Anseraga , A survey of Tibetan Yushu dialect (Labu) phonology, Tibet studies , ( 2018 ) (1) 7 .

[4]

Dengzhen

Wengmu , Phonological study of Yushu dialect in Tibetan, Henan science and technology, ( 2015 ) (22) 1 .

[5] Sangta , Phonological study of Yushu dialect in Tibetan, Master's thesis , Northwest University for Nationalities, 2012 .

[6]

Jiangping

Kong , Tibetan dialect questionnaire, 2nd. ed., Commercial Press, 2011 .

[7]

Yasheng

Jin , Ruishan Zhang, A study on the unit sound acoustics of Dongxiang language , Northwest ethnic studies , ( 2010 ) (4)10 .

[8] Joos , M. Acoustic Phonetics ,Language, 2nd. ed., No.24 , ( suppl .2).

[9]

Gesang

Jumian , Gesang Yangjing, Introduction to Tibetan dialect, 2nd. ed., Ethnic Publishing House , 2002 .

[10] Jiangping

Kong

, Basic course of experimental phonetics , 2nd. ed., Peking University Press, 2015 .