=Paper=
{{Paper
|id=Vol-1347/paper11
|storemode=property
|title=Modeling lexical effects in language production: where have we gone wrong?
|pdfUrl=https://ceur-ws.org/Vol-1347/paper11.pdf
|volume=Vol-1347
|dblpUrl=https://dblp.org/rec/conf/networds/ZhaoM15
}}
==Modeling lexical effects in language production: where have we gone wrong?==
Modeling Lexical Effects in Language Production: Where Have We Gone Wrong? Ting Zhao Victoria A. Murphy Department of Education Department of Education University of Oxford University of Oxford ting.zhao@education.ox.ac.uk victoria.murphy@education.ox.ac.uk 1 Introduction 2 Lexical characteristics that contribute to the speed of spoken production Words have their own conceptual representations, semantic properties, and physical forms. These This study considers three lexical layers (i.e. lexical characteristics not only set words apart as Meaning, Form, and Usage), each of which is a distinct item in the lexical repertoire but also underpinned by its own manifest indicators. The provide valuable insight into the processes and lexical variables under examination all have been mechanisms of language production. found to significantly influence the speed of lex- Over the past decades there has been a large ical processing, as will be briefly reviewed below. body of research examining how word meaning, Meaning. (1) Word concreteness (WC): A form, and usage directly affect the speed of main difference between concrete and abstract monolingual speakers’ production (e.g. Alario et words lies in the existence of sensorimotor at- al., 2004; Barry, Morrison, & Ellis, 1997; Bates tributes of the former. A number of studies have et al., 2003; Bonin, Chalard, Méot, & Fayol, revealed that concrete words exhibit preferential 2002). Of note, almost all these studies have processing relative to abstract words (e.g. De failed to accommodate the fact that word usage, Groot, 1992; Jin, 1990; Schwanenflugel & Akin, given it is a behavioral outcome (Zevin & 1994). (2) Word typicality (WT): The degree of a Seidenberg, 2002, 2004), likely mediates the re- lexical item’s typicality depends upon how many lationship between meaning/form and spoken attributes that it shares with other members of the production. Moreover, lexical characteristics same category. Typical items are usually pro- have been predominantly examined as discrete cessed more accurately and faster relative to variables in the literature, but in fact, some of atypical items in a range of tasks (e.g. Bjorklund them may correspond to the same layer of lan- & Thompson, 1983; Jerger & Damian, 2005; guage production or the same aspect of lexical Southgate & Meints, 2000). (3) Semantic neigh- knowledge. Additionally, little work has been borhood density (SND): Words with high SND done on children’s emerging bilingual lexical are characterized by having a great deal of se- representations, especially those learning an L2 mantic neighbors and low semantic distance, within input-limited contexts, possibly due to the whereas low-SND words typically have few se- fact that this population has only recently begun mantic neighbors and high semantic distance. to receive focused attention in the research field. The superiority of high SND over low SND In order to delineate the exact manner in words for processing has been observed in lexi- which lexical effects come into play, the present cal decision, word naming, and semantic catego- study used structural equation modeling to per- rization (e.g. Buchanan, Westbury, & Burgess, form a simultaneous test of the complex relation- 2001; Siakaluk, Buchanan, & Westbury, 2003; ships among a variety of lexical variables and to Yates, Locker, & Simpson, 2003). (4) Number of assess their direct, indirect and total effects on related senses (NoRS): Many words are polyse- L2 lexical processing efficiency. Furthermore, mous in terms of having several different but attempts were also made to estimate and then to related senses. Compared to monosemous words, compare three types of hypothesized models, in polysemous words exhibit preferential pro- which the lexical relationships were specified cessing in a variety of tasks (e.g. Beretta, differently with respect to spoken production in Fiorentino, & Poeppel, 2005; Klepousniotou & L2 learners. Baum, 2007; Lichacz, Herdman, Lefevre, & Baird, 1999). Copyright © by the paper’s authors. Copying permitted for private and academic purposes. In Vito Pirrelli, Claudia Marzi, Marcello Ferro (eds.): Word Structure and Word Usage. Proceedings of the NetWordS Final Conference, Pisa, March 30-April 1, 2015, published at http://ceur-ws.org 51 Form. Word length can be measured ortho- orded using the Audacity software, and then graphically (i.e. NoL: number of letters) or pho- manually calculated for analysis. nologically (i.e. NoP: number of phonemes and Norms of lexical variables. The values of WC, NoS: number of syllables). The presence of WT, and SWF were rated by the participants on length effects has been reported in several previ- Likert scales. The values of other lexical varia- ous studies (e.g. Alario et al., 2004; Cuetos, Ellis, bles were obtained from psycholinguistics data- & Alvarez, 1999; De Groot, Borgwaldt, Bos, & bases such as the Irvine Phonotactic Online Dic- Van den Eijnden, 2002) although the predictive tionary (Vaden, Halpin, & Hickok, 2009) and the power of each specific measure varies across Wordmine2 (Durda & Buchanan, 2006). research contexts possibly due to their examina- tion of different languages (Bates et al., 2003). 3.2 Analytical strategies Usage. Usage is represented by subjective Structural equation modeling (SEM), which word frequency (SWF) and /or age of acquisition combines path analysis, confirmatory factor (AoA), both of which have been observed to sig- analysis, and analysis of structural models, was nificantly affect the speed of spoken production used to estimate the goodness-of-fit of three in such a way that individuals take less effort to types of hypothesized models. This analytical access high-frequency and early-acquired words strategy, as an extension of multiple regression, relative to low-frequency and late-acquired ones enables researchers to estimate not only the di- (e.g. Balota, Cortese, Sergent-Marshall, Spieler, rect effects but also indirect effects that one vari- & Yap, 2004; Barry et al., 1997; Morrison, Ellis, able has upon another. Moreover, SEM can be & Quinlan, 1992). AoA effects interact with fre- used to measure the proportion of variance ex- quency effects in such a way that the former is plained by the models proposed in the present partly dependent on the latter (Brysbaert & study so as to hold general implications for the Ghyselinck, 2006). lexical processing system as a whole, although it should be acknowledged that this type of analy- 3 Methodology and analytical strategies sis might lack a specific focus on certain varia- 3.1 Methodology bles through purposeful manipulation of experi- mental materials. Additionally, latent variables Participants. Thirty-nine 5th grade children (aged are formed to manifest different dimensions that 10-11 years) and 94 undergraduates (aged 17-20 are underpinned by their own indicators. In so years) were recruited. All had Chinese as their doing, the present study moves away from the native language and English as their second. The examination of each lexical variable to that of child sample had been learning English as a for- specified constructs and structural relations be- eign language for about 2.5 years, and the adult tween constructs, thus a better understanding of sample for approximately 10 years. the nature of lexical characteristics can be gained Stimuli. The experiment consisted of two at a more macro level. blocks of stimulus words and one block of filler Conducting SEM typically involves six steps words. Each block had 35 (in the child group) / (Kline, 2011): model specification, model identi- 66 (in the adult group) valid trials. The stimuli fication, select good measures, model estimation, were selected from ten semantic categories in model evaluation and modification, and inter- almost equal numbers. They were all presented preting and reporting results. Moreover, as rec- in the same format over the course of the exper- ommended by Kline (2011), SEM was conducted iments. in two steps in the present study, that is, the Procedures. The participants were tested indi- measurement models were validated in terms of vidually in a quiet room. They performed picture convergent validity, discriminant validity, and naming in L2 (English) and then L1 (Chinese)- reliability before the structural models proceeded to-L2 (English) translation. As a stimulus ap- to be estimated. One last thing to note is that the peared on the screen, the participants were asked data entered for analysis were lexical items. The to produce the L2 word as rapidly and accurately stimulus size in the adult group was considered as possible. The SuperLab software (Cedrus sufficiently large for performing SEM analysis. Corporation, 2007) generated stimulus presenta- In order to reduce the complexity of the hypothe- tions. Response latencies (RLs), defined as the sized model specifying children’s L2 lexical pro- duration between the presentation of a stimulus cessing, composite variables rather than latent and the initiation of a vocal response, were rec- variables were constructed to decrease the num- 52 ber of stimulus words required for this type of cies. Similar results held for adults’ L1-to-L2 analysis. translation (see Appendix C for details). Three competing models were hypothesized and estimated to determine which one best fitted the data. The first model concerns only the direct relationship between the lexical variables, and picture naming and translation latencies. The second model identifies word usage as a media- tor and examines the indirect effects of meaning and form variables on the recorded RLs. The third model considers both direct and indirect effects of word meaning and form on the out- come variable. To illustrate, an example of these three types of hypothesized models that specify Model 1 the possible relationships between lexical varia- bles and the speed of adults’ picture naming is presented in Appendix A. The goodness of model fit was estimated ac- cording to six types of indices, including model 𝝌𝟐 , CFI, RMSEA, AGFI, GFI, and NFI. A rule of thumb is that an RMSEA below .08 indicates reasonable fit, and values greater than .90 for the CFI, AGFI, GFI, and NFI suggest close approx- Model 2 imate fit. SEM was run using IBM SPSS AMOS v.20. It should be noted that, before performing SEM analysis, the whole RL data set was screened for incorrect and omitted responses, outliers (low cut-off: below 350ms, high cut-off: 3 SDs), and those participants and stimulus items with an exceptionally high error rate. As conven- tionally done, RLs were then averaged to gener- ate a summary score for each lexical item, and Model 3 these values were entered into final SEM analy- sis. Figure 1: SEM results: Picture naming in adults 4 Results As regards children’s picture naming, the re- sults presented in Appendix B shows that Model The model-fit indices of the three models under 3 reached a better model fit than Models 1 and 2. examination across two types of productive tasks Moreover, Figure 2 indicates that Model 3 (38%) in both populations are presented in Appendix B. explained more variance in naming speed than Comparatively, the child and adult data could Model 1 (36%) and Model 2 (24%). In addition, best be modeled by the third model where word word usage, as represented by age of acquisition, meaning and form not only make direct but also together with word typicality were found to sig- indirect contribution to the RLs. nificantly and directly predict the naming speed Take picture naming in adults as an example in Model 3. Similar results were observed with (see SEM results in Appendix B and Figure 1), it children’s L1-to-L2 translation (see Appendix C is clear that Model 3 achieved a much better for details). model fit than Model 1, and Model 3 explained more variance in naming latencies (59%) than Model 1 (45%) and Model 2 (51%). Additionally, among all the lexical variables included in Model 3, only word usage was found to make a signifi- cant and direct contribution to the naming laten- Model 1 53 whether or not the same results still hold for oth- er L2 learner types, particularly those whose L1s are not Sino-Tibetan languages, as well as for monolingual speakers needs to be further inves- tigated. Importantly, examining these issues would allow us to gain a better understanding of Model 2 the nature of lexical characteristics by addressing the issue of whether lexical effects are language- dependent or universal across languages. Second, not all the variance can be explained the included lexical variables, partly due to the fact that it seems implausible to cover every possible fea- ture of a lexical item because of theoretical and Model 3 practical considerations. Third, given the use of a non-experimental design, it would be difficult to Figure 2: Path analysis results: Picture naming in make unequivocal explanations of causality children among the variables of interest. To conclude, the model that considers both di- Taken together, these results indicate that rect and indirect effects of meaning and form on word usage does not exist independently of other L2 lexical processing efficiency may be superior lexical variables but rather mediates the impact to those that do not. As also observed in our of meaning and form on L2 children’s and adults’ study, word usage does play a mediating role in productive performance. In comparison, the indi- lexical processing, in part reflecting that ‘only in rect effects of meaning and form on L2 lexical the stream of thought and life do words have processing efficiency were found to be more no- meanings’ (Wittgenstein, 1967, p.31). ticeable with adults relative to with children. References 5 Discussion and conclusions Alario, F-Xavier, Ferrand, L., Laganaro, M., The present study uses SEM as a methodological New, B., Frauenfelder, U.H., & Segui, J. improvement to investigate the relationships be- (2004). Predictors of picture naming speed. tween a range of lexical variables and L2 lexical Behavior Research Methods, 36(1): 140-155. processing efficiency in both children and adults. Balota, D.A., Cortese, Michael J, Sergent- A comparison of the three different types of Marshall, Susan D, Spieler, Daniel H, & Yap, models indicates that word meaning and form MelvinJ. (2004). Visual word recognition of makes not only direct but also indirect contribu- single-syllable words. Journal of tion to the speed of L2 lexical processing, and Experimental Psychology: General, 133: 283- word usage likely mediates the extent to which 316. meaning and form influence the processing out- Barry, C., Morrison, C.M., & Ellis, A.W. (1997). come. Furthermore, a comparison between chil- Naming the Snodgrass and Vanderwart dren and adults suggests that the importance of pictures: Effects of age of acquisition, word usage tends to increase with age. frequency, and name agreement. The A note of caution thus should be raised when Quarterly Journal of Experimental interpreting the results of previous studies where Psychology, 50(3): 560-585. the mediating effects of word usage have not Bates, Elizabeth, D’Amico, Simona, Jacobsen, been adequately addressed. Accordingly, future Thomas, Székely, Anna, Andonova, Elena, research modeling lexical effects would be well Devescovi, Antonella, . . . Pléh, Csaba. (2003). advised to consider the indirect effect that word Timed picture naming in seven languages. meaning and form have on L2 learners’ produc- Psychonomic Bulletin & Review, 10(2): 344- tive performance via usage. 380. Although this study provides new insights into Beretta, A., Fiorentino, R., & Poeppel, D. (2005). how lexical variables are related to each other, The effects of homonymy and polysemy on there are several limitations that should be lexical access: an MEG study. Cognitive acknowledged. First, since this research focuses Brain Research, 24(1): 57-65. only on L2 learners within input-limited contexts, 54 Bjorklund, David F, & Thompson, B.E. (1983). Kline, Rex B. (2011). Principles and Practice of Category typicality effects in children’s Structural Equation Modeling (3rd ed.). New memory performance: Qualitative and York: Guilford Press. quantitative differences in the processing of Lichacz, F.M., Herdman, C.M., Lefevre, J.A., & category information. Journal of Baird, B. (1999). Polysemy effects in word Experimental Child Psychology, 35(2): 329- naming. Canadian Journal of Experimental 344. Psychology, 53(2): 189-193. Bonin, P., Chalard, M., Méot, A., & Fayol, M. Morrison, C.M., Ellis, A.W., & Quinlan, P.T. (2002). The determinants of spoken and (1992). Age of acquisition, not word written picture naming latencies. British frequency, affects object naming, not object Journal of Psychology, 93(1): 89-114. recognition. Memory & Cognition, 20(6): Brysbaert, Marc, & Ghyselinck, Mandy. (2006). 705-714. The effect of age of acquisition: Partly Schwanenflugel, P.J., & Akin, Carolyn E. (1994). frequency related, partly frequency Developmental trends in lexical decisions for independent. Visual Cognition, 13(7-8): 992- abstract and concrete words. Reading 1011. Research Quarterly, 29(3): 251-264. Buchanan, L., Westbury, C., & Burgess, C. Siakaluk, P.D., Buchanan, L., & Westbury, C. (2001). Characterizing semantic space: (2003). The effect of semantic distance in neighborhood effects in word recognition. yes/no and go/no-go semantic categorization Psychonomic Bulletin & Review, 8 (3): 531- tasks. Memory & Cognition, 31(1): 100-113. 544. Southgate, V., & Meints, K. (2000). Typicality, Cedrus Corporation. (2007). SuperLab 4.5. San naming, and category membership in young Pedro, CA. children. Cognitive Linguistics, 11(1/2): 5-16. Cuetos, Fernando, Ellis, A.W., & Alvarez, B. Vaden, KI, Halpin, HR, & Hickok, GS. (2009). (1999). Naming times for the Snodgrass and Irvine Phonotactic Online Dictionary, Version Vanderwart pictures in Spanish. Behavior 2.0. Retrieved January 30, 2013 from Research Methods, 31(4): 650-658. http://www.iphod.com/search/V2ListWords.html. De Groot, A.M.B. (1992). Determinants of word Yates, Mark, Locker, Lawrence, & Simpson, translation. Journal of Experimental Greg B. (2003). Semantic and phonological Psychology: Learning, Memory, and influences on the processing of words and Cognition, 18(5): 1001-1018. pseudohomophones. Memory & Cognition, De Groot, A.M.B., Borgwaldt, Susanne, Bos, 31(6): 856-866. Mieke, & Van den Eijnden, Ellen. (2002). Zevin, J.D., & Seidenberg, M.S. (2002). Age of Lexical decision and word naming in acquisition effects in word reading and other bilinguals: Language effects and task effects. tasks. Journal of Memory and Language, Journal of Memory and Language, 47(1): 91- 47(1): 1-29. 124. Zevin, J.D., & Seidenberg, M.S. (2004). Age-of- Durda, K., & Buchanan, L. . (2006). acquisition effects in reading aloud: Tests of WordMine2. Retrieved December 1, 2012. cumulative frequency and frequency from http://www.wordmine2.org trajectory. Memory & Cognition, 32(1): 31-38. Jerger, S., & Damian, M.F. (2005). What’s in a name? Typicality and relatedness effects in children. Journal of Experimental Child Psychology, 92(1): 46-75. Jin, Y.S. (1990). Effects of concreteness on cross-language priming in lexical decisions. Perceptual and Motor Skills, 70(3): 1139- 1154. Klepousniotou, E., & Baum, S.R. (2007). Disambiguating the ambiguity advantage effect in word recognition: An advantage for polysemous but not homonymous words. Journal of Neurolinguistics, 20(1): 1-24. 55 Appendix A. An example of the hypothesized models Adult picture naming Model 1 Model 2 Model 3 Appendix B. Fit indices for the hypothesized models 𝜒 ! (𝑝) df CFI RMSEA AGFI GFI NFI Picture Model 1 173.83 (.00) 27 .81 .17 .74 .87 .79 naming Model 2 52.16 (.00) 27 .97 .07 .90 .95 .94 Adults Model 3 45.96 (.00) 22 .97 .08 .90 .96 .94 Chinese- Model 1 169.46 (.00) 27 .81 .16 .75 .88 .79 English Model 2 45.69 (.01) 27 .98 .06 .91 .96 .94 translation Model 3 41.87 (.01) 22 .97 .07 .90 .96 .95 Picture Model 1 28.17 (.00) 11 .85 .27 .82 .93 .79 naming Model 2 28.52 (.00) 11 .88 .12 .83 .94 .79 Chil- Model 3 5.67 (.46) 6 1.00 .00 .93 .99 .96 dren Chinese- Model 1 28.17 (.00) 11 .81 .12 .82 .93 .74 English Model 2 23.09 (.02) 11 .86 .10 .87 .95 .79 translation Model 3 5.67 (.46) 6 1.00 .00 .93 .99 .95 Appendix C. SEM results of the hypothesized models Adults: Picture naming Model 1 Model 2 Model 3 56 L1-to-L2 translation Model 1 Model 2 Model 3 Children: Picture naming Model 1 Model 2 Model 3 L1-to-L2 translation Model 1 Model 2 Model 3 57