<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">YorCALL: Improving and sustaining Yoruba Language through a practical Iterative learning Approach</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Zainab</forename><forename type="middle">O</forename><surname>Abdulkareem</surname></persName>
							<email>abdulkareemzainab01@gmail.com</email>
							<affiliation key="aff0">
								<orgName type="institution">Alhikmah University</orgName>
								<address>
									<settlement>Ilorin</settlement>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Efiong</forename><forename type="middle">E</forename><surname>Edet</surname></persName>
							<affiliation key="aff1">
								<orgName type="institution">University of Ibadan</orgName>
							</affiliation>
						</author>
						<title level="a" type="main">YorCALL: Improving and sustaining Yoruba Language through a practical Iterative learning Approach</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">6A197C40D1DDC05FEE39FF49CD656E33</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T00:54+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Tonal</term>
					<term>Speech</term>
					<term>Dataset</term>
					<term>Iterative</term>
					<term>Translate</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The number of people who speak Yoruba language fluently and write it correctly with appropriate tonal signs is declining. This could be attributed to the adoption of English Language (a colonial language) by parent as their children first language and the neglect of Yoruba from most Nigeria Educational system curricula. To this end, the aim of this work was to improve user knowledge of tonal sign assigning, correct word pronunciation and thus Yoruba language literacy by implementing a Computer Assisted Language Learning System that translates Yoruba text to speech, and allows users to check-up the meaning of words and take test on Yoruba language literacy. We propose a practical iterative learning approach that factors in the basic features and requirements that will ensure the optimal realization of benefits of the system to the user.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>CCS Concepts</head><p>• Computing methodologies ➝Artificial intelligence ➝Natural language processing ➝Machine translation Keywords</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">INTRODUCTION</head><p>Yoruba tribe makes up about 35 million people in total with majority constituent from Nigeria (about 21% of Nigeria's population), 1.2 million in Benin, 0.4 million in Ghana, 0.1 million in Togo )0.1 million in Ivory Coast, 0.2 million in Europe and 0.2 million in North America <ref type="bibr">[15]</ref>. Yoruba language is a tonal language consisting of seven vowel sounds exclusive of nasal vowels and eighteen consonant sounds, making up 25 alphabets and 3 tonal signs (three level tones: high, low and mid (the default tone) to distinguish between words with the same spelling but different pronunciation and meaning. Every Yoruba syllable must have at least one tone. There is a continued decline in the number of "Yorubas" that can speak Yoruba language fluently and write it correctly with appropriate tonal signs. This is as a result of adoption of English Language (a colonial language) by parent as their children first language and the neglect of Yoruba from most Nigeria Educational system curricula. To prevent extinction of Nigeria's indigenous languages there have being persistent calls by various esteemed scholars and organisation to revive Nigeria indigenous languages, by adopting Indigenous languages as Children's first language and making Indigenous languages a compulsory course to be taken by students in all level of education. However, there is need for more efforts that will adopt practical approach to ensure that the language remains intact. To this end, Computer Assisted Language Learning (CALL) for Yoruba is developed in this project to help speakers of Yoruba language to speak correctly and fluently by applying the appropriate tone on words they pronounce. CALL is perceived as an approach to language teaching and learning in which the computer is used as an aid to the presentation, reinforcement and assessment of material to be learned, usually including a substantial interactive element <ref type="bibr">[4]</ref>.</p><p>The computer is used to ameliorate users' knowledge of particular Language/languages. CALL has being continually adopted as medium of learning new languages or improving on old ones. Over the years CALL is used to incite language learners, provide comfortable access to learning material, testing of users acquisition level and examining the relevance of call to Learning and Disseminating System (Checking if it has accomplish its functions and purpose).</p><p>The CALL system provided information which is expected to be responded to by system users. The system in turn process input and gives appropriate justification of input structure and meaning.</p><p>To improve Flexibility of Computer Assisted Language Learning Interaction between system and users have being adopted, text, static and moving images and audio are used for interactive Computer Assisted Learning Language System Environment.</p><p>By adopting a practical iterative learning approach, this work focuses on improving user knowledge of Yoruba tonal sign assigning, correct word pronunciation. It also improves Yoruba language literacy by implementing a Computer Assisted Language Learning System that translates Yoruba text to speech, and allows users to check-up the meaning of words and undertake test on Yoruba language literacy.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">BACKGROUND</head><p>Previous studies have focused on CALL by identifying three historical phases of CALL and classified them according to underlying pedagogical and methodological approaches <ref type="bibr">[10]</ref>. The participation of the potential users of the system in CALL development cannot be sidelined, which is "the active involvement of end-users, as non professional developers, in a software development life cycle <ref type="bibr">[5]</ref>. The contribution of language learners tutoring system improves the effectiveness of the system design. The CALL system developers should be focused on getting right requirements and intelligibly transforming it to a standardised design.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">RELATED WORKS</head><p>An Intelligent CALL System for Arabic Learners was designed for Primary Schools and Arabic language learners. The system employed Natural Language Processing for learning the language and provides learning materials for the users, on which they are expected to take test on. The instructor is able to stipulate conditions to determine the test questions specialisation. The response given by users is analysed using morphological analyser, syntax analyser and semantic analyser. The morphological analyser breaks down response of the users to smallest component of the language. The syntax analyser checks for the structural correctness of user's response and form syntactic categories. The Semantic analyser checks for rule based approach to generate appropriate response to result given by the system users to promote effective learning. [8] RU_CALL, a standalone system is aimed to provide electronic language acquisition domain for learning and improving knowledge of users with at least the basic/passive noesis of Runyakitara language. The system centred on nominal morphology, morphological analyser was used to develop exercises for learning. The system focused on nouns and made use of natural language processing to create extensive lesson materials for prospective users of the system. Runyakitara Computer Assisted Language was created as a screening tool for users, to test the users syntactic and semantic knowledge of the language, serve as access tool for learners and give relevant activity for learners. The system accomplished its intended goal with large percentage of users showing continual interest in</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>RU_CALL. [14]</head><p>There is Computer Assisted Language Learning system which consists of comprehensive wordbook consisting of Root language (English) to Object language (Yoruba) <ref type="bibr">[1]</ref>. The Language Assistance System identifies words separated by space which is converted to source text by lexical transfer and attaching appropriate part of speech. The acceptable structure was achieved by matching twenty-eight corresponding English Noun Phrase rules with corresponding Yoruba arrangement of the rule. Data fed into the system (in English) is pre-processed and translated to corresponding data in Yoruba language. The input is checked for its correctness using existing information in the database. A system with about 90 percent exactitude was produced and regarded as auspicious and satisfactory.</p><p>The principles of Natural Language Processing and Digital Signal Processing was adopted to develop a CALL system <ref type="bibr">[13]</ref>. Natural Language Processing does the breakdown of sentences into syllables, which is the smallest unit of the sentence (vowel, consonant: nasal and non-nasal) by emphasizing on tones of the syllables. Digital Signal Processing consists of speech processing and sound processing. Speech processing checks for syllables corresponding to input block of text and combine together to form strings and then optimizing them. The sound processing (Speech signal) process the sound and make the pronunciation sound available. A web based Computer Assistance Language Learning on two languages was built, to translate from Yoruba phrases to English Language and contra wise <ref type="bibr">[14]</ref>. Poly-layer framework and Hidden Markov Model were employed. Using mathematical and word principle, Yoruba and English words were grouped into two exclusive sets. The set theory was used to determine acceptable structure, axiom of extension, for any two subsets from source and target languages to be equal there must be like components. The law of probability, Bayes' theorem, and Statistical design theory were used to increase the CALL system accuracy and produce result.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Yoruba Phonemics</head><p>Yoruba language is a tonal language consisting of seven oral vowel sounds, five nasal vowels and eighteen consonant sounds and 3 tonal signs (three level tones: high, low and mid (the default tone) to distinguish between words with the same spelling but different pronunciation and meaning. Every Yoruba syllable must have at least one tone. <ref type="bibr">b, d, f, g, gb, j, k, l, m, n, p, r, s, ṣ, t, u, w, y</ref> Vowel Sounds Oral vowel: a e ẹ i o ọ u Nasal vowel: an ẹn in ọn un</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Consonant Sounds</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">SYSTEM MODEL</head><p>The framework of Yoruba Computer Assisted Language Learning (YorCALL), as shown in figure <ref type="figure" target="#fig_0">1</ref>, presents the different components and their interactions. Sectionalized into three, the framework can be viewed as: The YorCALL interactive interface provides the user with an interface to enter any Yoruba text with the use of on-Screen keyboard or the provided buttons with Yoruba alphabets and sign. The YorCALL Database mainly consist all the Yoruba syllable sounds as recorded by a Yoruba Language expert from Oyo town. The Learn &amp; Dictionary modules provide the users with basic knowledge on the meaning of most common Yoruba words such as cardinal and ordinal numbers from one to ten and days of the week and months of the year as well as English words listed in alphabetical order from A-Z with information about them. It consists of 1000 English words and their equivalent in Yoruba language. Users are able to hear the corresponding pronunciation of the words. The Palindrome module responds to indicate whether any Yoruba word as entered by the user is/is not a palindrome.</p><formula xml:id="formula_0"></formula></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Class Module</head><p>The CLASS module provides principles of pronouncing words in Yoruba language and also provides guide on the principle of tonal signs and applications. It allows the user to write down words, check for validity and pronounce the word for the user. As depicted in figure <ref type="figure" target="#fig_1">2</ref>, YorCALL system's processes include: a) breaking words into syllable b) fetching the audio equivalent of syllable c) stacking and concatenating sounds to produce a word sound Users are can type in words for pronunciation by using the onscreen keyboard provided or use any keyboard that allows user to type Yoruba fonts. The words are broken down to syllable(s) and equivalent audio sound of the syllable is fetched and concatenated. After which a sound player box pops up and users are able to play the sound.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.1">Tokenizer</head><p>A syllable in Yoruba can either be a vowel combination of vowel and consonant, or nasal sound. During the process of breaking words down to syllables, the tonal sounds assigned by the user is taken into consideration. Each word input by the user is scanned through, vowel letters are the main unit is used for tokenisation. The tokenization process is broadly divided in three phases, namely: i. accept word ii. scan from the left, check for vowel sounds ii. anywhere there is a vowel sound break, and then add to syllable set until end of the word.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Figure 3: Flow Diagram of the tokenization process</head><p>For Illustration, if b is to be combined with the vowel sounds; it will produce the following syllables as in table 1:  </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.2">Token/Syllable Fetcher</head><p>Each of the vowel sounds was combined with each consonant and nasal sound to generate all possible syllable. These sounds to each syllable were recorded using audio recorder by a Yoruba language speaking expert. The recorded sounds were subsequently trimmed to eliminate noise and saved in the YorCALL DB with the file name they represent. The Token/syllable fetchers searches through for the coreesponding audio sound for each token for stacking and concatenation.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.3">Syllable to Speech Processor</head><p>After successful process of tokenisation and equivalent audio sound of the syllables are fetched from the audio dataset, they are stacked in the order of their entry (from left to right) and concatenated, ready to be played by the sound player box. Users are able to view the breakdown of words pronunced to syllable and play the sound equivalent as often time as possible as shown in Figure <ref type="figure">6</ref>.</p><p>Where SS represents syllable sound</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">System Class Diagram</head><p>YorCALL has five classes (Class, Palindrome, Dictionary, Learn, Games) which are subclasses of the Home. The Sound Player is a sub class of the Class with a modal dependency. As shown in Figure <ref type="figure" target="#fig_3">4</ref>, each class has at least member methods with data member ranging from zero to six. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">PERFORMANCE MEASURE OF THE SYSTEM</head><p>Three category of users (amateur speakers, average Yoruba speakers, above average speakers) yielded the following summarized responses as observed in Table <ref type="table" target="#tab_1">2</ref>:   <ref type="table">and 9</ref>, reveal, at both extremes, about 75% acceptability and usability of the system but only 20% voted for it to be used by the visually challenged persons. Based on the feedback in X7,X8, X10, about 80% agreed that the naturalness of the system produced sound was like that of Yoruba natural speaker. 70% of the respondents agreed that the system can be used to not only improve the language tonal sign assigning but also the language literacy.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">CONCLUSION</head><p>The study has presented a CALL system for Yoruba language with the main objective of providing a digital learning environment that introduce Yoruba language to new learner or improve amateur's knowledge, especially in the area of correct word pronunciation and right sign assigning. Using personal computer, the system enables the user to learn at his/her own time and place. Future work is ongoing to expand the number of words in the dictionary and improve on present functionalities of the system such as development of game whereby the system pronounce Yoruba words randomly and the user provides the textual equivalence, considering the tone, wherby the user input is validated. In conclusion we believe the system can be further scaled to include other threatened languages and taught in various levels of Institutions of learning.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: YorCALL Architectural Framework</figDesc><graphic coords="2,307.80,306.53,242.41,113.25" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: Model of the Class Module</figDesc><graphic coords="3,63.60,54.00,223.11,118.70" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head></head><label></label><figDesc>ọn un n ẹ n n ọ n n àn ẹ`n ìn ọ n ùn ba be bẹ bi bo bọ bu b b bẹ b b bọ b bà bè bẹ`bì bọ bò bù ban bẹn bin bọn bun b n bẹ n b n bọ n b n bàn bẹ`n bìn bọ n bùn ⁞ ⁞ ⁞</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Figure 4 :</head><label>4</label><figDesc>Figure 4: YorCALL Class Diagram</figDesc><graphic coords="3,322.65,357.57,244.77,211.65" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Figure 9 :</head><label>9</label><figDesc>Figure 7: YorCALL Performance Measure by Amateur Yoruba Speakers</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 : Some Yoruba Syllables</head><label>1</label><figDesc></figDesc><table><row><cell>Consonant</cell><cell>Vowel Sounds</cell><cell>Syllables Produced</cell></row><row><cell>Alphabet</cell><cell></cell><cell></cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2 : Summary of Responses to Administered Questionnaire (Source: [13])</head><label>2</label><figDesc></figDesc><table><row><cell></cell><cell>Weight</cell><cell></cell><cell></cell><cell></cell><cell>5</cell><cell>4</cell><cell>3</cell><cell>2</cell><cell>1</cell></row><row><cell></cell><cell>Questions</cell><cell></cell><cell></cell><cell></cell><cell>VH</cell><cell>H</cell><cell>F</cell><cell>L</cell><cell>VL</cell></row><row><cell>X4</cell><cell cols="3">How would you rate the performance of the</cell><cell></cell><cell></cell><cell>19 30</cell></row><row><cell></cell><cell cols="3">developed YorCALL</cell><cell></cell><cell></cell></row><row><cell></cell><cell>system?</cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>X5</cell><cell cols="4">How would you rate the use of this application?</cell><cell>6</cell><cell>26 16</cell><cell>1</cell></row><row><cell>X6</cell><cell cols="4">How would you rate the voice quality produced by</cell><cell>6</cell><cell>26 13</cell><cell>6</cell></row><row><cell></cell><cell cols="2">the developed</cell><cell></cell><cell></cell><cell></cell></row><row><cell></cell><cell>system?</cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>X7</cell><cell cols="4">How would you rate the sign assignment of the Yoruba</cell><cell>27</cell><cell>13 7</cell><cell>12</cell></row><row><cell></cell><cell cols="4">alphabets display on the</cell><cell></cell></row><row><cell></cell><cell>screen?</cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>X8</cell><cell cols="4">What is your rating of the naturalness of the output</cell><cell>9</cell><cell>19 14</cell><cell>7</cell></row><row><cell></cell><cell>from</cell><cell>the</cell><cell cols="2">developed</cell><cell></cell></row><row><cell></cell><cell cols="2">YorCALL?</cell><cell></cell><cell></cell><cell></cell></row><row><cell>X9</cell><cell cols="4">What is your overall performance assessment of</cell><cell>6</cell><cell>27 13</cell><cell>3</cell></row><row><cell></cell><cell cols="3">the developed YorCALL?</cell><cell></cell><cell></cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell>SA</cell><cell>A</cell><cell>N</cell><cell>D</cell><cell>SD</cell></row><row><cell>X10</cell><cell cols="4">Would you agree that the</cell><cell>10</cell><cell>24</cell><cell>12</cell></row><row><cell></cell><cell cols="4">sound produced by the</cell><cell></cell></row><row><cell></cell><cell cols="4">developed YORCALL was</cell><cell></cell></row><row><cell></cell><cell cols="4">close to Natural human</cell><cell></cell></row><row><cell></cell><cell>voice?</cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>X11</cell><cell cols="4">Would you agree that the</cell><cell>17</cell><cell>23</cell><cell>9</cell></row><row><cell></cell><cell cols="4">response time (i.e. time</cell><cell></cell></row><row><cell></cell><cell>duration</cell><cell>for</cell><cell cols="2">producing</cell><cell></cell></row><row><cell></cell><cell>sound)</cell><cell>was</cell><cell>of</cell><cell>no</cell><cell></cell></row><row><cell></cell><cell cols="2">significance?</cell><cell></cell><cell></cell><cell></cell></row><row><cell>X12</cell><cell cols="3">Would you agree that the</cell><cell></cell><cell>16</cell><cell>24</cell><cell>9</cell></row><row><cell></cell><cell cols="4">developed YorCALL system</cell><cell></cell></row><row><cell></cell><cell cols="4">is reliable to convert Yoruba</cell><cell></cell></row><row><cell></cell><cell cols="2">text to spoken</cell><cell></cell><cell></cell><cell></cell></row><row><cell></cell><cell cols="2">expression?</cell><cell></cell><cell></cell><cell></cell></row><row><cell>X13</cell><cell cols="3">Would you agree that the</cell><cell></cell><cell>15</cell><cell>14</cell><cell>7</cell></row><row><cell></cell><cell cols="4">developed YorCALL can</cell><cell></cell></row><row><cell></cell><cell cols="4">serve as a teaching aid for</cell><cell></cell></row><row><cell></cell><cell>Yoruba</cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell></cell><cell>language?</cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>X14</cell><cell cols="3">Would you agree that the</cell><cell></cell><cell></cell></row><row><cell></cell><cell cols="4">developed system will be of</cell><cell></cell></row><row><cell></cell><cell cols="4">help to a visually challenged</cell><cell></cell></row><row><cell></cell><cell>person?</cell><cell></cell><cell></cell><cell></cell><cell></cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_0">CoRI'16, Sept 7-9, 2016, Ibadan, Nigeria.   </note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7.">ACKNOWLEDGEMENT</head><p>We appreciate all members of Intelligent System group (ISG) for their insightful contributions towards this work.</p></div>
			</div>

			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="8.">REFERENCES</head></div>			</div>
			<div type="references">

				<listBibl/>
			</div>
		</back>
	</text>
</TEI>
