<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Plugin for automatisation of phonetic-phonological analysis and obtaining analytical feedback for Spanish learners / Plugin para la automatización del análisis fonético-fonológico y la obtención de retroalimentación analítica para estudiantes de español</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Tamara</forename><surname>Couto-Fernández</surname></persName>
							<email>tamara.cfernandez@udc.es</email>
						</author>
						<author>
							<persName><forename type="first">Albina</forename><surname>Sarymsakova</surname></persName>
							<email>albina.sarymsakova@udc.es</email>
							<affiliation key="aff1">
								<orgName type="department">Faculty of Philology</orgName>
								<orgName type="institution">University of A Coruña</orgName>
								<address>
									<addrLine>Campus da Zapateira, A Coruña</addrLine>
									<postCode>15008</postCode>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Nelly</forename><surname>Condori-Fernández</surname></persName>
							<email>n.condori.fernandez@udc.es</email>
						</author>
						<author>
							<persName><forename type="first">Patricia</forename><surname>Martín-Rodilla</surname></persName>
							<email>patricia.martin.rodilla@udc.es</email>
							<affiliation key="aff0">
								<orgName type="department">Faculty of Computer Science</orgName>
								<orgName type="institution" key="instit1">University of A Coruña</orgName>
								<address>
									<addrLine>Camiño do Lagar de Castro 6, A Coruña</addrLine>
									<postCode>15008</postCode>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Plugin for automatisation of phonetic-phonological analysis and obtaining analytical feedback for Spanish learners / Plugin para la automatización del análisis fonético-fonológico y la obtención de retroalimentación analítica para estudiantes de español</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">8FD57ECB38903954FF7F9D5DDBEF03C3</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-25T08:51+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Praat</term>
					<term>intonation analysis</term>
					<term>ICT</term>
					<term>Python</term>
					<term>0000-0002-1044-3871 (N. Condori-Fernández)</term>
					<term>0000-0002-1540-883X (P. Martín-Rodilla)</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>In this article we present the Plugin for phonetic-phonological analysis in Spanish (PAFe). It consists of a series of scripts (code written in the Python programming language) that implement three different algorithms for comparing the intonation of an ELE (Spanish as a foreign language) student with that of a native speaker of Spanish. These algorithms enable three types of analysis: global, tonal tendency and intersyllabic. In addition, PAFe has a database that keeps a history of different types of data (user profiles, pronunciation exercises and audios) and a graphical interface that adds reports on pronunciation evolution to Praat, a tool for acoustic analysis. PAFe is a software solution that adds new functionalities to Praat and makes it possible (i) to perform a comparative analysis between the intonational patterns of an ELE student and a native speaker, and (ii) to report the evolution of the acquisition of such patterns in Spanish thanks to the history of stored data. In this way, automated feedback is provided to both students and teachers.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>The present work belongs to the area of natural language processing, specifically to the comparative-contrastive analysis of intonation for didactic purposes, which our original tool PAFe supports. Some tools already exist, such as the proposal by Oplustil and Toledo <ref type="bibr" target="#b10">[11]</ref> or the study by Strik, Truong, de Wet and Cucchiarini <ref type="bibr" target="#b7">[8]</ref>, which offer phonetic-phonological similarity results or detect pronunciation errors. Nonetheless, no tool provides both facilities at the same time, nor does any allow the evolution of the students to be monitored.</p><p>For this reason, we have decided to develop a system that complements language teaching and, in particular, can be used in remote or hybrid modalities.</p><p>Our tool performs an instant comparative analysis of a student's pronunciation, taking the speech of a native speaker as a reference, and tracks the student's evolution through the data stored in the history.</p><p>Several technologies have been used to support the development of our plugin, such as Praat, Python and PostgreSQL.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Methodology</head><p>We base our design on the following essential principles of intonation analysis:</p><p>1. We annotate the syllables of each speech act in a Praat TextGrid (Boersma and Weenink <ref type="bibr" target="#b0">[1]</ref>) and identify the pitch values of all vowels in the syllables (voiced consonants are measured as well), using the Praat script developed by Mateo Ruiz <ref type="bibr" target="#b8">[9,</ref><ref type="bibr" target="#b9">10]</ref>, which extracts the absolute values in Hz, relativises them and draws the standardised melody graph.</p><p>2. We separate relevant frequency differences between tonal segments from irrelevant ones; according to Cantero Serena <ref type="bibr" target="#b1">[2,</ref><ref type="bibr" target="#b2">3]</ref> and Font-Rotchés and Cantero Serena <ref type="bibr" target="#b5">[6,</ref><ref type="bibr" target="#b6">7]</ref>, a difference of less than 10% between segments is considered imperceptible.</p><p>Once we have obtained the relevant data from the intonation analysis, we move on to the PAFe architecture.</p><p>Our project develops an extension to Praat, an existing desktop application for acoustic speech analysis. We therefore start from an existing architecture to which a new module (PAFe) is coupled (Figure <ref type="figure" target="#fig_0">1</ref>), consisting of Praat scripts, Python code and a PostgreSQL database. Praat scripting allows command-line calls to other systems, as described by Pop and Altar <ref type="bibr" target="#b4">[5]</ref>, which makes it possible to extend the application with languages and technologies external to Praat. The new module communicates with the original system through new Praat scripts that are associated with the application's menu items (see Figure <ref type="figure" target="#fig_1">2</ref>), from which these files are executed. In some cases, the new module dispenses with calls to Praat and generates information windows directly from Python code files. Python acts as the intermediary between Praat and the data managed in the database.</p><p>Our tool employs natural language processing and audio processing techniques, taking recordings of the voices of native speakers and students as its main source. Praat allows us to extract quantitative prosodic information from the audios.</p><p>The comparison algorithms implemented by the tool then provide comparative prosodic information about a native speaker's audio and a student's audio, which serves as feedback in Spanish language learning. These algorithms are an original contribution of the tool, since no algorithmic proposal of this type existed for Spanish until now.</p><p>We have developed the PAFe plugin following an iterative and incremental methodology based on agile practices and the Scrum framework, as described by Schwaber and Sutherland <ref type="bibr" target="#b11">[12]</ref>. In the following, we describe the development of our tool.</p></div>
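<div xmlns="http://www.tei-c.org/ns/1.0"><p>The two principles of intonation analysis listed above can be illustrated with a minimal Python sketch. It is not the Praat script by Mateo Ruiz nor the PAFe code: the function names, the starting value of 100 for the standardised curve and the sample pitch values are assumptions made for illustration. The sketch expresses each pitch value as a percentage change with respect to the previous one, rebuilds a speaker-independent curve from those changes, and flags the changes that reach the 10% perceptibility threshold.</p><p><![CDATA[
```python
# Illustrative sketch only: function names and sample values are assumptions,
# not part of PAFe or of the Praat scripts cited in the text.

def relativise(pitches_hz):
    """Percentage change of each pitch value with respect to the previous one."""
    return [(cur - prev) / prev * 100.0
            for prev, cur in zip(pitches_hz, pitches_hz[1:])]


def standardise(pitches_hz, start=100.0):
    """Rebuild the melody from an assumed start value using only relative changes."""
    curve = [start]
    for pct in relativise(pitches_hz):
        curve.append(curve[-1] * (1.0 + pct / 100.0))
    return curve


def relevant_changes(pitches_hz, threshold_pct=10.0):
    """Flag changes of at least 10%; smaller ones are treated as imperceptible."""
    return [abs(pct) >= threshold_pct for pct in relativise(pitches_hz)]


if __name__ == "__main__":
    sample = [200.0, 210.0, 252.0, 126.0]  # hypothetical vowel pitches in Hz
    print(relativise(sample))
    print(standardise(sample))
    print(relevant_changes(sample))
```
]]></p><p>Because the standardised curve depends only on relative changes, two speakers with different vocal registers but the same melody would yield the same curve, which is what makes a native/student comparison meaningful.</p></div>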
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Solution: PAFe Plugin</head><p>In its final version, our PAFe tool allows comparative analysis by providing similarity results and intonation graphs based on pitch values (tone frequency in Hz) and tonal tendency in each defined segment, as well as visualisation of a student's progress over time. We highlight the following operations made possible by our plugin:</p><p>1. The application allows the creation of different profiles to facilitate the management of the data uploaded by users. a) First, the teacher is registered. b) A student is then assigned to the previously registered teacher. This step avoids confusion if more than one user shares the same computer. c) Finally, the profile of a native Spanish speaker is registered to upload the data that will serve as a reference for the programme.</p><p>2. PAFe enables the management of WAV and TextGrid files<ref type="foot" target="#foot_0">3</ref>: our programme supports both storage and deletion of audio files and annotations.</p><p>3. It also offers different types of acoustic analysis (global analysis, tonal tendency analysis and intersyllabic analysis). The algorithm that performs the global analysis divides the previously saved audios of learners and native speakers of Spanish into about 1000 intervals (discarding silences) to obtain very precise comparative values. However, this type of analysis does not provide feedback about possible deviations in tone; it provides only generic data on the percentage similarity between the native speaker's and the learner's audio. For the tonal tendency analysis, the programme works with .TextGrid annotations and the previously saved .WAV audio files. In this case, the utterances are divided into words and, to obtain the similarity locally, the programme indicates whether the pitch of each word has been reproduced correctly and, if not, the percentage of deviation; the percentage of pitch similarity and the average difference between the two audios are also obtained. Finally, the intersyllabic acoustic analysis compares, syllable by syllable, the tone realisation of a learner with that of a native speaker; for each syllable, the difference with respect to the reference audio is indicated, and the percentage of tone similarity and the average difference between the two audios are obtained. This last type of analysis shows both the similarity and the difference between the reference audio and the learner's audio more accurately.</p><p>The evolution of our students' results can be viewed through the history option. Figure <ref type="figure" target="#fig_3">3</ref> shows a diagram that describes the behaviour of our plugin, presenting its functionalities and their interrelation, as well as the actors that interact with the application and their restrictions.</p></div>
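<div xmlns="http://www.tei-c.org/ns/1.0"><p>The global analysis described above can be sketched as follows. This is a hedged illustration, not the PAFe algorithm: the text only states that the audios are divided into about 1000 intervals with silences discarded and that a percentage of similarity is returned. The representation of silences as None or 0, the normalisation by the mean pitch, the linear interpolation and the similarity formula are all assumptions, as are the function names.</p><p><![CDATA[
```python
# Illustrative sketch only: the normalisation and the similarity formula are
# assumptions; the source does not specify how PAFe computes them.

def drop_silences(contour):
    """Keep only voiced samples (unvoiced frames are None or 0 in this sketch)."""
    return [f for f in contour if f is not None and f > 0]


def resample(values, n):
    """Linearly interpolate `values` onto n equally spaced points (n >= 2)."""
    if len(values) == 1:
        return [values[0]] * n
    step = (len(values) - 1) / (n - 1)
    out = []
    for i in range(n):
        pos = i * step
        lo = min(int(pos), len(values) - 1)
        hi = min(lo + 1, len(values) - 1)
        frac = pos - lo
        out.append(values[lo] * (1.0 - frac) + values[hi] * frac)
    return out


def normalise(values):
    """Divide by the mean so that speakers with different registers are comparable."""
    mean = sum(values) / len(values)
    return [v / mean for v in values]


def global_similarity(native, student, n_intervals=1000):
    """Percentage similarity of two pitch contours over n_intervals points."""
    ref = resample(normalise(drop_silences(native)), n_intervals)
    stu = resample(normalise(drop_silences(student)), n_intervals)
    mean_diff = sum(abs(s - r) / r for r, s in zip(ref, stu)) / n_intervals
    return max(0.0, 100.0 * (1.0 - mean_diff))


if __name__ == "__main__":
    native = [None, 100.0, 200.0, 0, 300.0]   # hypothetical contour with silences
    student = [200.0, 400.0, 600.0]           # same melody, higher register
    print(global_similarity(native, student))
```
]]></p><p>As in the description above, such a measure yields a single generic figure and cannot say where in the utterance the learner deviates; the sketch also assumes that each contour contains at least one voiced sample.</p></div>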
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Illustrative example of intersyllabic analysis</head><p>In this section, we show how one type of comparative analysis is carried out. To perform the intersyllabic analysis, it is necessary to fill in a form (Figure <ref type="figure">4</ref>) with the data that characterise the audio of the learner we want to compare.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Figure 4: Form for conducting an intersyllabic analysis</head><p>The audios of that student that meet these properties are then filtered, and a window is displayed with a drop-down menu for selecting the audio to be analysed. Once the audio is selected, the corresponding .TextGrid file is selected in the same way.</p><p>Each type of analysis returns different results. For the intersyllabic analysis, we show a similarity result per syllable and the average percentage difference (Figure <ref type="figure" target="#fig_4">5</ref>). Finally, we obtain a graph with the tonal differentiation curves in each syllable for each audio (Figure <ref type="figure" target="#fig_5">6</ref>).</p></div>
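<div xmlns="http://www.tei-c.org/ns/1.0"><p>The kind of result returned by the intersyllabic analysis (a value per syllable, a percentage of similarity and an average difference) can be approximated with the following Python sketch. It is only an illustration under stated assumptions: the data layout (one pitch value in Hz per syllable and speaker), the function names, the sample values and the reuse of the 10% threshold as a tolerance are hypothetical and are not taken from the PAFe implementation.</p><p><![CDATA[
```python
# Illustrative sketch only: data layout, names and the 10% tolerance are
# assumptions, not PAFe's actual implementation.

def movements(pitches_hz):
    """Percentage pitch movement into each syllable (0 for the first one)."""
    moves = [0.0]
    for prev, cur in zip(pitches_hz, pitches_hz[1:]):
        moves.append((cur - prev) / prev * 100.0)
    return moves


def intersyllabic_report(syllables, native_hz, student_hz, tolerance_pct=10.0):
    """Return per-syllable rows, percentage similarity and average difference."""
    if not (len(syllables) == len(native_hz) == len(student_hz)):
        raise ValueError("syllables and pitch lists must have the same length")
    rows = []
    for syl, ref, stu in zip(syllables, movements(native_hz), movements(student_hz)):
        deviation = stu - ref  # in percentage points
        rows.append({"syllable": syl,
                     "deviation_pct": deviation,
                     "ok": abs(deviation) < tolerance_pct})
    similarity = 100.0 * sum(r["ok"] for r in rows) / len(rows)
    average_diff = sum(abs(r["deviation_pct"]) for r in rows) / len(rows)
    return rows, similarity, average_diff


if __name__ == "__main__":
    syllables = ["bue", "nos", "dí", "as"]
    native = [200.0, 220.0, 220.0, 110.0]   # hypothetical reference pitches
    student = [100.0, 110.0, 132.0, 66.0]   # hypothetical learner pitches
    rows, similarity, average_diff = intersyllabic_report(syllables, native, student)
    for row in rows:
        print(row)
    print(similarity, average_diff)
```
]]></p><p>Comparing pitch movements rather than absolute frequencies keeps the report independent of each speaker's register, and the per-syllable rows indicate where a learner's melody departs from the reference.</p></div>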
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Conclusions</head><p>In conclusion, we highlight the following key issues that we have addressed in this paper:</p><p>1. The PAFe tool allows different types of comparative-contrastive analysis of the intonation (global, tonal tendency and intersyllabic) of ELE learners and native speakers of Spanish. Among them, we consider the intersyllabic analysis the most accurate, since the tonal differences appear syllable by syllable and show the tonal deviations of the students, and the global analysis the most efficient in terms of response time, since it does not require TextGrids to be uploaded and the segmentation is automated, as shown by the empirical data in Couto Fernández <ref type="bibr" target="#b3">[4]</ref>.</p><p>2. The application has several functions: apart from performing the intonational analysis, it stores the audios, the .TextGrid files and the results of the analyses (the history) of each utterance according to the profile of the speaker (student or native speaker of Spanish).</p><p>3. PAFe has been developed to achieve the following didactic objectives: to facilitate the work of teachers in identifying and correcting intonation deviations (we carried out an empirical analysis with teachers of Spanish as a foreign language in which we measured their degree of satisfaction with PAFe, with positive results, as reported in Couto Fernández <ref type="bibr" target="#b3">[4]</ref>); to store the results of the analyses for future improvement; and to serve as a self-evaluation and self-correction tool for ELE students, given that the tool itself allows them to upload .WAV and .TextGrid files, run the analyses and obtain the results without constant help from teachers. As a future line of research, we highlight the need to measure this degree of feedback to students empirically.</p><p>As far as we know, it is the only existing solution, either within or outside Praat, that allows this type of analysis and offers feedback to students of Spanish. As feedback and self-evaluation, our tool offers the percentage of similarity and difference of pitch values so that students can correct their pronunciation. As future work, we also plan to improve the graphical environment of the plugin and to open it to the student, as an end user, through the web.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Overall architecture of PAFe</figDesc><graphic coords="2,72.00,496.56,218.50,108.50" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2 :</head><label>2</label><figDesc>Example of User Interface visualising the new functionalities added in Praat</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: Example of User Interface visualising the new functionalities added in Praat</figDesc><graphic coords="2,304.80,426.17,212.50,290.20" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: Use case diagram (PAFe functionalities and main actors)</figDesc><graphic coords="3,304.70,287.03,230.20,233.97" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Figure 5 :</head><label>5</label><figDesc>Figure 5: Intersyllabic analysis information</figDesc><graphic coords="4,72.00,379.14,218.00,62.50" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_5"><head>Figure 6 :</head><label>6</label><figDesc>Figure 6: Graph showing the tonal curves of each audio for each syllable (the X-axis represents the syllable division of an utterance and the Y-axis the pitch values).</figDesc><graphic coords="4,72.00,469.00,220.50,202.00" type="bitmap" /></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_0">Annotation file whose labels segment the associated audio</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<title level="m" type="main">Praat: doing phonetics by computer</title>
		<author>
			<persName><forename type="first">P</forename><surname>Boersma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Weenink</surname></persName>
		</author>
		<ptr target="http://www.praat.org/" />
		<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">J</forename><surname>Cantero Serena</surname></persName>
		</author>
		<title level="m">Teoría y análisis de la entonación</title>
				<imprint>
			<date type="published" when="2002">2002</date>
			<biblScope unit="volume">54</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Análisis prosódico del habla: más allá de la melodía</title>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">J</forename><surname>Cantero Serena</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Comunicación Social: Lingüística, Medios Masivos, Arte, Etnología, Folclor y otras ciencias afines</title>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="page" from="485" to="498" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<title level="m" type="main">Una herramienta de análisis del habla de audio para proporcionar retroalimentación automática a los estudiantes en la pronunciación en español</title>
		<author>
			<persName><forename type="first">T</forename><surname>Couto Fernández</surname></persName>
		</author>
		<imprint>
			<publisher>UDC. A Coruña</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Designing an MVC Model for Rapid Web Application Development</title>
		<author>
			<persName><forename type="first">Dragos-Paul</forename><surname>Pop</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Adam</forename><surname>Altar</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.proeng.2014.03.106</idno>
	</analytic>
	<monogr>
		<title level="j">Procedia Engineering</title>
		<imprint>
			<biblScope unit="volume">69</biblScope>
			<biblScope unit="page" from="1172" to="1179" />
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">La melodía del habla: acento, ritmo y entonación</title>
		<author>
			<persName><forename type="first">D</forename><surname>Font-Rotchés</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">J</forename><surname>Cantero Serena</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Eufonía: didáctica de la música</title>
		<imprint>
			<date type="published" when="2008">2008</date>
			<biblScope unit="page" from="19" to="39" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Melodic Analysis of Speech Method applied to Spanish and Catalan</title>
		<author>
			<persName><forename type="first">D</forename><surname>Font-Rotchés</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">J</forename><surname>Cantero Serena</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Phonica</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="page" from="33" to="47" />
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Comparing different approaches for automatic pronunciation error detection</title>
		<author>
			<persName><forename type="first">H</forename><surname>Strik</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Truong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>de Wet</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Cucchiarini</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.specom.2009.05.007</idno>
	</analytic>
	<monogr>
		<title level="j">Speech Communication</title>
		<imprint>
			<biblScope unit="volume">51</biblScope>
			<biblScope unit="page" from="845" to="852" />
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Protocolo para la extracción de los datos tonales y curva estándar en análisis melódico del habla</title>
		<author>
			<persName><forename type="first">M</forename><surname>Mateo Ruiz</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Phonica</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="page" from="49" to="90" />
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Scripts en Praat para la extracción de datos tonales y curva estándar</title>
		<author>
			<persName><forename type="first">M</forename><surname>Mateo Ruiz</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Phonica</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="page" from="91" to="111" />
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Uso de una herramienta didáctica para la práctica de la entonación en hablantes no nativos de español</title>
		<author>
			<persName><forename type="first">P</forename><surname>Oplustil</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Toledo</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Sintagma: Revista de lingüística</title>
		<imprint>
			<biblScope unit="volume">31</biblScope>
			<biblScope unit="page" from="37" to="50" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<author>
			<persName><forename type="first">K</forename><surname>Schwaber</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Sutherland</surname></persName>
		</author>
		<title level="m">La guía definitiva de scrum: Las reglas del juego</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
