<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Adjective-Noun Placement Variation in French Twitter Corpora</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Joan</forename><surname>Palmiter Bajorek</surname></persName>
							<email>jpbajorek@ucdavis.edu</email>
							<affiliation key="aff0">
								<orgName type="institution">University of California</orgName>
								<address>
									<addrLine>Davis One Shields Ave</addrLine>
									<postCode>95616</postCode>
									<settlement>Davis</settlement>
									<region>CA</region>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Adjective-Noun Placement Variation in French Twitter Corpora</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">97801DB41D70DAC40E295865E8898D6A</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T12:39+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>French written and spoken corpora reveal that variation of adjective-noun placement occurs more often within written corpora than in spoken ones <ref type="bibr" target="#b12">(Lightfoot, 1979;</ref><ref type="bibr" target="#b16">Prévost, 2009;</ref><ref type="bibr" target="#b23">Thuilier, 2013)</ref>. I examine adjective lemma variation of placement, adjective categories, frequency, and syllable length as a means of assessing current Twitter trends of adjective placement. In this preliminary study, a cross-section of 30 adjective lemmas within a corpus of approximately 6,000 French tweets demonstrates that parallels exist between the usage of adjectives on Twitter and norms of spoken French.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>This article examines French adjective-noun placement in the determiner phrase within written, spoken, and Twitter corpora. Placement of French adjectives in the determiner phrase has long been controversial and misconstrued within the literature <ref type="bibr" target="#b1">(Benzitoun, 2014;</ref><ref type="bibr" target="#b8">Grevisse &amp; Goosse, 2011)</ref>. It is generally accepted that " <ref type="bibr">[i]</ref>n Romance languages such as French and Spanish, postnominal adjectives are the rule rather than the exception" <ref type="bibr" target="#b0">(Alexiadou, Haegeman, &amp; Stavrou, 2007)</ref>.</p><p>However, contemporary written and spoken corpora indicate that adjective placement has greater flexibility and ambiguity than has previously been understood. Statistical analysis of corpora indicate that adjective placement is more restrictive in spoken usage than written usage by a large margin <ref type="bibr" target="#b23">(Thuilier, 2013)</ref>. This research investigates adjective placement in French tweets through the examination of two French Twitter timelines of users Le Monde and Cyprien. It is hypothesized that French tweets will display higher levels of adjective placement restriction than Thuilier's written and spoken corpora findings. This may be attributed to Twitter's character limit and the curt, spoken-like treatment of language on the social network and overall online usage of language.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">French Adjectives</head><p>In French, attributive adjectives "adjectifs épithètes" can be placed before or after the noun as seen in ( <ref type="formula">1</ref>) <ref type="bibr" target="#b8">(Grevisse &amp; Goosse, 2011;</ref><ref type="bibr" target="#b10">Laenzlinger, 2005)</ref>, an example modified from <ref type="bibr" target="#b23">(Thuilier, 2013)</ref>.</p><p>(1) a. un beau requin b. un requin sympa a beautiful shark a shark nice</p><p>In example (1a), the adjective "beau," found in the prenominal position and modifies the noun "requin." Conversely, in (1b), the adjective "sympa" is postnominal <ref type="bibr" target="#b0">(Alexiadou et al., 2007;</ref><ref type="bibr" target="#b7">Delbecque, 1990;</ref><ref type="bibr" target="#b23">Thuilier, 2013)</ref>. French adjectives are widely believed to exist in one of three categories: fixed prenominal, fixed postnominal, or accepted alternator, with recognized semantic shift attributed to either position.</p><p>Although these categories are frequently used, "[i]t has proven notoriously difficult to define the functions of the two positions" <ref type="bibr" target="#b19">(Sleeman, Van de Velde, &amp; Perridon, 2014)</ref> and variation is oft attributed to semantic shifts <ref type="bibr" target="#b0">(Alexiadou et al., 2007;</ref><ref type="bibr" target="#b10">Laenzlinger, 2005)</ref>. For example, the adjective "brave" means good or decent in the preposition, but courageous or brave in the postposition. In this way, positions are correlated with meaning. Yet, Thuilier, a native speaker of French, provides several examples of exceptions of words that are considered as regular alternators with specified meanings correlated to positions (2013). Furthermore, many adjectives retain the same meaning regardless of placement, such as the adjective "charmant," meaning charming <ref type="bibr" target="#b23">(Thuilier, 2013)</ref>. In many ways, this variation in placement thus is a choice of the user and varies widely on the usage context. It should be noted that the primary concern of this research is variation in placement and patterns of adjective corpora rather than semantic interpretations, which are outside the scope of this study.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Syntactic Theory</head><p>In considering the syntactic basis of adjective placement, modern frameworks explain the variation through Branching Direction Theory <ref type="bibr">(Song, 2012)</ref>. It is posited that nouns move leftward and up the syntactic tree with "piedpiping" and "snowballing" movement and interact with linear and mirror image matching among languages <ref type="bibr" target="#b5">(Cinque, 1994</ref><ref type="bibr" target="#b6">(Cinque, , 2010;;</ref><ref type="bibr" target="#b10">Laenzlinger, 2005;</ref><ref type="bibr" target="#b22">Stavrou, 2012)</ref>. However, much is left to be desired in these theoretical understandings of syntactic underpinnings. If adjectives that alternate in position and have semantic shifts relegated to those positions, why does the noun moving up and down the tree account for the adjective's semantic change <ref type="bibr" target="#b0">(Alexiadou et al., 2007)</ref>? How does this theory account for languages that display varied adjective-noun ordering systems? In the words of <ref type="bibr">Alexiadou et al., " [a]</ref> question that remains open is why various languages have to resort to different means (order, excessive articles, morphology) in order to encode different interpretations of adjective-noun combinations" (parentheses original ( <ref type="formula">2007</ref>)). Do to the recent growth in Optimality Theory <ref type="bibr">(Song, 2012)</ref>, there is hope that elegant, more fully explanatory theories can be developed.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Historical Background</head><p>French evolved from Latin, a language with no fixed adjective placement <ref type="bibr" target="#b19">(Sleeman et al., 2014)</ref>. Between the 13th and 19th Century, adjective-noun placement shifted from prenominal preference in Old French to postnominal dominance in Modern French <ref type="bibr" target="#b2">(Boucher, 2006;</ref><ref type="bibr" target="#b19">Sleeman et al., 2014)</ref>.</p><p>Several theories postulate reasons for this phenomenon.</p><p>According to Glatigny, "preposed adjectives belong in very great majority to the ancient foundations of the language" (translated by the author <ref type="bibr" target="#b23">(Thuilier, 2013)</ref>). The shift may resulted from the influence and exposure of French and Romance languages to Germanic languages <ref type="bibr" target="#b8">(Grevisse &amp; Goosse, 2011</ref>). Yet some reject these "influenced by another language" interpretations due to the lack of documented evidence <ref type="bibr" target="#b12">(Lightfoot, 1979)</ref>. Another theory indicates that prenominal adjectives have marked positions that licenses specialized reading and greater frequency <ref type="bibr" target="#b11">(Ledgeway, 2012)</ref>. Evidence for this theory comes from the stronger syntactic bond of prenominal adjectives to nouns as compared to the relationship of their postnominal counterparts and nouns. The phonological connections between prenominal adjectives and nouns are demonstrated through the use of liaisons and irregular adjective agreements <ref type="bibr" target="#b11">(Ledgeway, 2012)</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Usage Norms in Corpora</head><p>For decades, newspapers, literary texts, university essays, and spoken corpora have been analyzed for their adjective placement norms. While patterns arise, gaps in information and uniformity of analysis abound in the research. Most studies cite raw counts of adjectives found to be prenominal or postnominal. They do not take into account adjective lemma repetition, aspects of frequency, and the possibility of alternation of position. Recognizing the importance of the source of corpora, Delbecque writes that prenominal and postnominal proportions of placement are "mainly to be attributed to the text genre <ref type="bibr">" (1990)</ref>. Echoing these sentiments, Thuilier writes that intense difference exist according to the type of production (2013).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Literary Texts</head><p>In Wydler's study of the famous text "La Chanson de Roland," 70% of overall adjectives were prenominal (Gerard J. <ref type="bibr" target="#b3">Brault, 1978;</ref><ref type="bibr" target="#b12">Lightfoot, 1979)</ref>. "The Song of Roland" is the written version of the epic tale that was once performed orally, though its precise origins are clouded in "considerable speculation" (G.J. <ref type="bibr" target="#b4">Brault, 2010)</ref>. Delbecque notes that modern French novels demonstrate the same proportional split in placement, but provides no data to support the claim <ref type="bibr">(1990)</ref>.</p><p>In another analysis of French adjectives, Wagner separated adjectives into three basic categories. "Cardinals" are basic, essential, and high frequency adjectives throughout the French language.</p><p>"Populaires" are mainstream, colloquial, and wide spoken adjectives. Finally, "savants" are "learned" adjectives of scholarly registers (translations by the author <ref type="bibr" target="#b12">(Lightfoot, 1979)</ref>). For clarity in the following section, these three categories will be referred to as Frequent, Colloquial, and Scholarly. In an examination of 13 th and 14 th century prose, Wagner found that Frequent adjectives strongly favored prenominal positions: 2,393 prenominal and 11 postnominal occurrences, a 99.5% to 0.5% split <ref type="bibr" target="#b12">(Lightfoot, 1979)</ref>. 75% of Colloquial adjectives were prenominal whereas roughly 70% of Scholarly adjectives were postnominal.</p><p>When the Frequent and Colloquial categories were combined, the average prenominal to postnominal division was 95% to 5%. This combined category was of extremely high frequency, almost 16 times as common as the Scholarly category <ref type="bibr" target="#b12">(Lightfoot, 1979)</ref>. It is important to note that all of the data literature cited in this section explores the raw counts of the data and categorization of lemmas, but does not specify the frequency or other attributes of the adjectives. Wagner's study underlines the importance of adjective categorization when considering adjective placement. In addition, Lightfoot notes that the difference between source adjectives and those that are "fairly recent" borrowings reinforcing Glatigny's theories of ancient French versus modern linguistic acquisitions (1979).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">University Essays</head><p>In 1980, Wilmet investigated 90 philology essays of university students dating from 1977 to 1978 <ref type="bibr" target="#b7">(Delbecque, 1990)</ref>. Of the 3,835 adjectives found, Wilmet reported that 2.3% were exclusively prenominal, 16.9% alternated in position, and 80.8% were exclusively postnominal (1980). Wilmet did not account for categories of adjectives, but what can be interpreted from this study is the propensity for the work of university students, who focus their studies on literary criticism, history, and linguistics, to use literary, worked, polished language that displays postnominal preferences, an idea supported by <ref type="bibr">Thuilier's work (2013)</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3">Newspapers</head><p>Postnominal preference is also found in most of the studies in newspaper corpora. Five studies of French adjective placement in newspapers published between 1911 and 1978 by Damourette and Pichon, Hug, and Forsgren returned homogenous results of 65% postnominal placement. With an average of 3,000 adjectives per study, all of this research compared raw counts of adjective-noun placement <ref type="bibr" target="#b7">(Delbecque, 1990)</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.4">Spoken and Modern Corpora</head><p>Contemporary analyses provide more thorough and nuanced approaches to corpora studies as seen through the work of <ref type="bibr" target="#b23">Thuilier (2013)</ref>. For reduced redundancies, section 2.8, refers to Thuilier's 2013 work. In 2013, Thuilier compiled a corpus of over 1.3 million French words from written and spoken French. The written corpus was composed of Le Monde newspaper articles dating from 1989-1993 sourced from the French Treebank (FTB). For the spoken corpus, Thuilier used the 2005 edition of the CORAL Romance corpus of transcribed speech. Thuilier's data reveals placement norms between the French written and spoken usages.</p><p>Spoken data displayed placement that was more rigid and more likely to follow prescriptivist norms than written language. In explanation, Thuilier hypothesizes that written language is "travaillée," worked and polished, and therefore may have greater opportunity for flexibility, nuance, and variations of style (translation by the author). Conversely, it was hypothesized that spoken language is spontaneous and instinctive tendency follows mainstream norms.</p><p>Thuilier's corpus comprised of 1,750 adjective lemmas of which 170 were found to alternate in position. Oral corpora lemmas demonstrated a prenominal preference of 74% while their written corpora counterparts demonstrated a prenominal preference of 67%.</p><p>Additionally, Thuilier coined a term about the syllable length patterning demonstrated by the data, "court avant long," short before long (translated by the author). Prenominal adjectives were frequently monosyllabic and "adjectives of one syllable are more than 80% in anteposition" (translated by the author).</p><p>From these studies and especially that of Thuilier's work, further experiments can be designed to investigate several factors in adjectivenoun placement in French corpora. To summarize the above sections succinctly the percentages quoted in this section are summarized in Table <ref type="table" target="#tab_0">1</ref>. The following factors have been previously demonstrated to be significant in classifying French adjective placement: lemmas as a better gauge of corpora rather than raw counts of adjectives, variation of placement, categorization of adjectives, frequency, origin of corpora, and syllable length.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.5">Twitter</head><p>Twitter is a real-time social networking and microblogging service <ref type="bibr" target="#b13">(Lomicka &amp; Lord, 2012)</ref>. Users can read, post, and repost messages called "tweets." Tweets are limited to 140 characters that are retained on a user's profile page, called a "timeline" <ref type="bibr" target="#b13">(Lomicka &amp; Lord, 2012)</ref>.</p><p>A global phenomenon, Twitter has 288 million users as of 2015 <ref type="bibr" target="#b15">(Popper, 2014;</ref><ref type="bibr" target="#b21">Statistica, 2015)</ref>. While the company is American and based in San Francisco, 77% of Twitter accounts are outside of the United States and over 35 Twitter company offices are outside of the United States, including one in Paris <ref type="bibr">(Twitter, 2015a)</ref>. Despite its American origins, which might imply high volumes of English language tweets, only 34% of tweets are in English <ref type="bibr" target="#b20">(Smith, 2015)</ref>. Twitter users from France tripled from 2009 to 2013 <ref type="bibr" target="#b21">(Statistica, 2013)</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Defining the Corpus</head><p>Approximately 6,000 tweets were analyzed from the Twitter feeds of the French newspaper Le Monde (@lemondefr) and the French comedian Cyprien (@MonsieurDream). These users were chosen because of 1) their comparable four million Twitter followers ("Twitter statistics for France," 2015), 2) Le Monde's connection to Thuilier's (2013) and <ref type="bibr" target="#b25">Wilmet's (1980)</ref> corpora studies, and 3) the users have the 2nd and 3rd largest audiences of Twitter in France. The top 24 French Twitter accounts are attributed to men or groups (Socialbakers, 2015) thus the Twitter timelines analyzed in this study are a sample of the most popular, mainstream content on French Twitter.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1">Le Monde</head><p>Launched in March 2009, Le Monde's Twitter timeline feed of tweets reports events related to France, the world, and politics. Written by journalists Pauline Croquet and Clément Martel, the timeline has over 132,000 tweets and over 6,000 photos and videos <ref type="bibr">(Twitter, 2015c)</ref>. As the online presence of one of the largest French newspapers, the tweets from this source are objective, political, newsworthy, and diplomatic in nature.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2">Cyprien</head><p>While Cyprien has significantly fewer tweets than the Le Monde timeline, Cyprien's popularity was determined to be a more important factor than number of tweets. Launched in June 2007, MonsieurDream has over 10,000 tweets and is written by Cyprien Iov who goes by Cyprien <ref type="bibr" target="#b24">(Twitter, 2015d)</ref>. Cyprien is a prolific 24-year-old French blogger, illustrator, comedian, and poster of videos on YouTube.com <ref type="bibr" target="#b9">(Iov, 2015a</ref><ref type="bibr" target="#b10">(Iov, , 2015b))</ref>. His work ranges from the banal to the political, blogging about the experience of moving apartments, gay marriage, and general selfpromotion. Cyprien has more Twitter followers than the Twitter accounts of movie celebrities, for example Gad Elmaleh with 300,000 fewer followers <ref type="bibr">(Socialbakers, 2015)</ref>. As a young comedian, Cyprien's humor is colloquial, sensationalized, dramatic, and irreverent in nature. His most popular YouTube video is a rap song where Cyprien responds a man who criticized his jokes <ref type="bibr" target="#b7">(Cyprien, 2011)</ref>. As of May 2016, the video has almost 37 million views and includes jokes about sex, online reputation, clothing style, and poor education, concluding with a joke about killing handicapped people <ref type="bibr" target="#b7">(Cyprien, 2011)</ref>.</p><p>By juxtaposing the linguistic trends from these two users, adjective placement norms on Twitter can be examined. It was hypothesized that French tweets would display higher levels of adjective placement restriction than Thuilier's corpora findings <ref type="bibr">(2013)</ref>. It was expected that Cyprien's usage of language would demonstrate even greater restriction in adjective placement than Le Monde's tweets do to the higher register and scholastic nature of the latter's content.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Methodology</head><p>The research looked into the placement norms of Le Monde and Cyprien on Twitter to determine whether corpora more closely resembled written or spoken corpora norms. In addition, questions of distribution, alternation, syllable length and frequency were considered in the data collection and analysis.</p><p>Tweets were downloaded from twitter using Python scripts and the Python module "Tweepy" <ref type="bibr" target="#b17">(Roesslein, 2009)</ref>. To access the tweets, a Twitter account was created with a corresponding Twitter website giving the user access to personalized Application Program Interface (API) credentials. Modified from scripts posted on GitHub <ref type="bibr" target="#b26">(Yanofsky, 2013)</ref>, roughly 6,000 tweets were downloaded from Twitter users Le Monde and MonsieurDream. Due to restrictions on tweet access to the general public, roughly 3,200 tweets were downloadable per Twitter timeline. Each user has a homepage, a unique page of their tweets and retweets called a "timeline," synonymous with the term "Twitter feed" <ref type="bibr">(Twitter, 2015b)</ref>. The scripts created csv files of the tweet's date published and text content. Due to the prolific tweeting of Le Monde, all tweets analyzed were published between April 1 st , 2015 and May 4 th , 2015. For the Cyprien corpus, the tweets from his timeline were published between December 9 th , 2009 and May 2 nd , 2015. Hereafter for reasons of clarity, the corpora will be termed "Le Monde" and "Cyprien" and will no longer be italicized.</p><p>For the scope of this study, in the Twitter corpus 30 adjective lemmas were investigated, see Appendix. These lemmas were chosen based on their prevalence in the linguistic literature and were placed into categories: High Frequency, Colors, Contemporary/Neologisms, Famous Alternators, and Others, see Appendix A.</p><p>Tweets containing the specific adjective lemmas were appended to separate csv files to isolate all instances of the adjective. Due to the added complexity of multiple adjectives, see mirror imaging <ref type="bibr" target="#b5">(Cinque, 1994</ref><ref type="bibr" target="#b6">(Cinque, , 2010))</ref>, only tweets with a single adjective were analyzed. Tallies were created of how many instances were found for each adjective lemma within the two Twitter feeds.</p><p>These tallies were then sorted to categorize adjectives found in a determiner phrase. From those that fit the correct syntactic environment, determiner phrase, they were annotated as prenominal and postnominal. Tallies were noted for all the above values. In Python, a Chi square test was conducted on the raw counts of the 30 adjectives found in the determiner phrase of either timeline were statistically significant.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">Findings</head><p>6,406 tweets were data-mined from the Cyprien and Le Monde Twitter feeds. After isolating the 30 adjective lemmas, 1,181 adjective occurrences were analyzed. Roughly 82% of these single adjectives were found in a determiner phrase or noun phrase, a total of 949 adjective occurrences. Some adjective lemmas were not attested within the corpora. In the analysis of the data, variation in placement, categorization of adjectives, adjective frequency, syllable length and origin of the corpora were analyzed. The following analysis considers the Le Monde and Cyprien datasets separately as well as combined, to consider the implications of Twitter as a usage context.</p><p>It is crucial to note that despite the 6,406 tweets data-mined, the resulting adjectives found in determiner phrases was lower than initially anticipated. It is therefore recognized the findings from this corpus must be considered a preliminary study that has recommendations for further investigations.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.1">Placement Variation</head><p>Of adjectives found within the corpora, 8% of adjective lemmas in Cyprien tweets and 30% adjective lemmas in Le Monde's tweets varied in placement, Figure <ref type="figure" target="#fig_0">1</ref>.</p><p>Of the lemmas which alternated in this corpus, they demonstrated prenominal preference at an average of 80%/20% prenominal/postnominal split. The distribution of this variation ranged from 60% to 99.67% prenominal placement. These finding are in line with Thuilier's finding that alternators are more inclined to be prenominally placed (2013).</p><p>Trends of the placement of alternators differed between the Cyprien and Le Monde corpora. Le Monde's adjectives displayed more variation in placement whereas very few of Cyprien's adjectives varied in placement. Supporting the initial hypothesis, adjective placement is more rigid in Cyprien's tweets than in Le Monde's tweets. The margin of difference in usage norms is a significant finding of this study and would be a strong starting point, to analyze Twitter feeds of news sources as compared to the tweets of individuals, for future investigations.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.2">Adjective Categories</head><p>In consideration of the initial categories of the 30 adjective lemmas, adjectives were analyzed within their categorizations as a comparison point to Wagner's categories of Frequency, Colloquial, and Scholarly, see Table 2 <ref type="bibr" target="#b12">(Lightfoot, 1979)</ref>. Of the Famous Alternators, most of these were prenominally placed, though many of this category were adjective lemmas that were unattested in the corpus. Contemporary/Neologisms and Others displayed the greatest amount of variation in placement within the corpus. Mirroring Wagner's results, High Frequency adjectives were placed prenominally for the most part, on average 92% of the adjective lemmas. Colors were exclusive found in postnominal positions. This complements Wilmet's findings which demonstrated that "blanc, bleu, noir et rouge" were found postnominally 97.4% of the time <ref type="bibr" target="#b25">(1980;</ref><ref type="bibr" target="#b23">Thuilier, 2013)</ref>. Thus, category of adjective was helpful for grouping some adjectives, but not for all the categories of the study.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.3">Frequency</head><p>Adjectives used with highest frequency are correlated with those that are prenominally placed. Of the top 10 adjective lemmas in the Cyprien tweets, all were prenominal dominant at an average of 97.79% overall. Of the top 10 adjective lemmas in the Le Monde tweets, 70% were prenominal dominant, see Table <ref type="table" target="#tab_2">3</ref>. These high frequency adjectives in the Le Monde tweets were exclusively postnominal, "politique," "blanc," and "rouge." As seen also with the above section,  adjectives describing colors are famously postnominally placed <ref type="bibr" target="#b8">(Grevisse &amp; Goosse, 2011)</ref>.</p><p>Of the highest frequency adjectives from both timelines, 6 out of 10 overlap between the Cyprien and Le Monde tweets.</p><p>In the Cyprien corpus, there was a significant divide between the high frequency adjectives prenominal dominance to low frequency postnominal dominance. The Le Monde corpus is more nuanced in this separation of variables. In the Le Monde corpus, 7 of the 10 high frequency adjectives displayed prenominal dominance. For the low frequency adjectives, 10 of 13 also displayed prenominal preference, with only 3 that were exclusively postnominal in placement.Overall, the Le Monde corpus has a larger number of adjectives that alternate in placement than those that are exclusively postnominally placed, which leads to lower levels of stratification in frequency and placement. When the corpora were combined, the mixed nature of the Le Monde corpus was demonstrated. Frequency was strongly correlated with prenominal adjective placement in the Cyprien corpus where it was highly stratified, but this trend was less significant in the Le Monde corpus.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.4">Syllable Length</head><p>The results of this study suggest that syllable count was not a significant factor in adjective placement. Previous literature indicated that "short before long" patterns were observed in larger data sets <ref type="bibr" target="#b23">(Thuilier, 2013)</ref>, however these trends were not observed in these data, see Appendix B. The average percentage for the average prenominal placement for lemmas with one syllable was 60%, two syllables at 82%, and three syllables at 61%. While two syllable adjectives display prenominal preference, this is not a major finding of this study.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.5">Twitter Corpora Overall</head><p>As seen in the variation of placement, Le Monde's adjective lemmas demonstrated greater flexibility of placement than Cyprien's, which supports the hypothesis. Le Monde's flexibility was more in line with written corpora norms as seen in Thuilier, while Cyprien's follows more spoken norms of restriction ( <ref type="formula">2013</ref>). As compared to the corpora in Table <ref type="table" target="#tab_0">1</ref>, section 2.8, this Twitter corpora demonstrated significant stratification at a 76%/31% prenominal split. From a raw count of the 30 adjectives found in either position within the corpora, Cyprien's adjective placement was more stratified than Le Monde's placement, Table <ref type="table" target="#tab_3">4</ref>. When combining the counts in both Twitter feeds, there was a strong tendency toward prenominal placement: 504 prenominal to 100 postnominal, 5:1 a ratio.</p><p>It should be noted that the lemma "nouveau" was removed from the raw count comparisons due to its nature as an outlier. While the lemma posed no problem as an example of categories, high frequency, and syllable length, its high raw count skews the data set. Within the Cyprien data set, "nouveau" was found 301 instances in the determiner phrase compared to all the other adjectives with values ranging from 0 to 59 occurrences. Due to the limited scope of this article "nouveau" was not analyzed independently, a case for future study.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7">Conclusions</head><p>Twitter corpora displayed adjective placement norms that were more closely aligned with spoken corpora norms than written corpora norms when compared to the patterns explored in <ref type="bibr" target="#b23">Thuilier (2013)</ref>. As hypothesized, the Cyprien corpus was more restricted in adjective placement alternation than the Le Monde corpus. In the Le Monde tweets, 30% of the adjectives were found in both prenominal and postnominal positions, much higher than the 8% alternating in the Cyprien corpus. This suggests the greater flexibility of adjective placement in the Le Monde corpus. Syllable count was not a significant factor in adjective placement, though there was a tendency for adjectives of two syllables to be prenominally place. Frequency was a significant factor in adjective placement and was highly correlated with prenominal placement in the Cyprien corpus.</p><p>Limitations of this study include small final numbers of adjectives attested per lemma. Despite the approximately 6,000-tweet sample size, some adjectives occurred less than 5 times. In the future, larger data sets would be preferred. In addition, the 30 adjectives chosen may have influenced the data collected and in the future, different sampling methods could remedy this issue. Overall, these findings provide preliminary insight into the French adjective placement norms on Twitter.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7.1">Further Research</head><p>Sampling: Using a parts of speech tagger, a larger analysis of all the adjectives in the corpora could be conducted. Tweets could be data mined over a specific time interval rather than from user timelines. In addition, this study showed the longitudinal trends of specific users. Further research could also examine cross-sectional data of streams of tweets and isolate specific collocations or adjective usage. Collocations of adjective-noun pairings and a study of ngrams of the corpus could illuminate stylistic preferences of Twitter users. Gender and other demographics data could be compiled to generate studies that compare different types of Twitter users. Nouveau: Further study might investigate specific adjective lemmas, such as nouveau, for a microanalysis interpretation of the data. Dialects and Languages: While it is clear that French is spoken around the world, varieties of the language based on location were not chosen for the this study <ref type="bibr" target="#b23">(Thuilier, 2013)</ref>, but could be analyzed to compare language norms of this corpus. Semantic shifts in placement were not analyzed within this study, but could be an aspect of future work. French is not the only language that displays adjective-noun prenominal and postnominal alternation; Spanish does as well <ref type="bibr" target="#b0">(Alexiadou et al., 2007;</ref><ref type="bibr" target="#b7">Delbecque, 1990</ref>). An analysis of the adjectives in Spanish tweets, or other languages with comparable alternation trends could be compared to this study. This study provides a jumping-board for further adjective-noun Twitter corpora studies.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Appendix B</head><p>Full Combined Data Sets of Syllable Counts</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Raw Counts and Percentages of TweetsTable 2: Combined Cyprien &amp; Le Monde Data Sets</figDesc><graphic coords="6,313.43,389.61,220.65,133.89" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0"><head></head><label></label><figDesc></figDesc><graphic coords="8,313.43,238.02,222.35,468.96" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0"><head></head><label></label><figDesc></figDesc><graphic coords="9,47.33,118.63,256.95,303.30" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 :</head><label>1</label><figDesc>Summary of Literature Review Percentages</figDesc><table /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2 :</head><label>2</label><figDesc>Combined Cyprien &amp; Le Monde Data Sets</figDesc><table /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3 :</head><label>3</label><figDesc>Combined Cyprien &amp; Le Monde Data Sets</figDesc><table /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 4 :</head><label>4</label><figDesc>Raw Counts and Percentages of Tweets</figDesc><table /></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>Thank you to the reviewers for their insightful comments and to Dr. Raúl Aranovich, Dr. Susan Palmiter, and Alan Wong for their remarks on earlier versions of this work.</p></div>
			</div>

			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0" />			</div>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Noun Phrase in the Generative Perspective</title>
		<author>
			<persName><forename type="first">A</forename><surname>Alexiadou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Haegeman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Stavrou</surname></persName>
		</author>
		<idno type="DOI">10.1515/9783110207491</idno>
	</analytic>
	<monogr>
		<title level="j">Noun Phrase in the Generative Perspective</title>
		<imprint>
			<biblScope unit="volume">71</biblScope>
			<biblScope unit="page" from="1" to="664" />
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">The place of the attributive adjective in French: what we learn speech corpora</title>
		<author>
			<persName><forename type="first">C</forename><surname>Benzitoun</surname></persName>
		</author>
		<idno type="DOI">10.1051/shsconf/20140801066</idno>
	</analytic>
	<monogr>
		<title level="j">Congres Mondial De Linguistique Francaise</title>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="page">16</biblScope>
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
	<note>4e</note>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Mapping Function to Form: Adjective Positions in French</title>
		<author>
			<persName><forename type="first">P</forename><surname>Boucher</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Lingvisticae Investigationes</title>
		<imprint>
			<biblScope unit="volume">29</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="43" to="60" />
			<date type="published" when="2006">2006</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<title level="m" type="main">The Song of Roland</title>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">J</forename><surname>Brault</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1978">1978</date>
			<publisher>Pennsylvania State University Press</publisher>
		</imprint>
		<respStmt>
			<orgName>University Park</orgName>
		</respStmt>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Song of Roland: An Analytical Edition: Introduction and Commentary</title>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">J</forename><surname>Brault</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Knowledge of Language: Its Nature, Origin, and Use</title>
				<editor>
			<persName><surname>Chomsky</surname></persName>
		</editor>
		<meeting><address><addrLine>N.</addrLine></address></meeting>
		<imprint>
			<publisher>Praeger</publisher>
			<date type="published" when="1986">2010. 1986</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<title level="m" type="main">On the evidence for partial Nmovement in the Romance DP. Paths towards universal grammar</title>
		<author>
			<persName><forename type="first">G</forename><surname>Cinque</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1994">1994</date>
			<biblScope unit="page" from="85" to="110" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<title level="m" type="main">The Syntax of Adjectives: A Comparative Study</title>
		<author>
			<persName><forename type="first">G</forename><surname>Cinque</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2010">2010</date>
			<publisher>MIT Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Word Order as a Reflection of Alternate Conceptual Construals in French and Spanish. Similarities and Divergences in Adjective Position</title>
		<author>
			<persName><forename type="first">Cyprien</forename><surname>Delbecque</surname></persName>
		</author>
		<ptr target="https://www.youtube.com/watch?v=dKwzZZKIbUs" />
	</analytic>
	<monogr>
		<title level="j">Cognitive Linguistics</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="349" to="416" />
			<date type="published" when="1990">2011. May 21, 2016. 1990</date>
		</imprint>
	</monogr>
	<note>Cyprien répond à Cortex</note>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<author>
			<persName><forename type="first">M</forename><surname>Grevisse</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Goosse</surname></persName>
		</author>
		<title level="m">Le bon usage : grammaire française : 75 ans</title>
				<meeting><address><addrLine>Bruxelles; Paris</addrLine></address></meeting>
		<imprint>
			<publisher>De Boeck-Duculot</publisher>
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title/>
		<author>
			<persName><forename type="first">C</forename><surname>Iov</surname></persName>
		</author>
		<ptr target="http://www.cyprien.fr/index.php/presse/" />
	</analytic>
	<monogr>
		<title level="j">Presse</title>
		<imprint>
			<date type="published" when="2015-05-30">2015a. 30 May 2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">French adjective ordering: perspectives on DP-internal movement types</title>
		<author>
			<persName><forename type="first">C</forename><surname>Iov</surname></persName>
		</author>
		<author>
			<persName><surname>Laenzlinger</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.lingua.2003.11.003</idno>
		<ptr target="https://www.youtube.com/user/MonsieurDream" />
	</analytic>
	<monogr>
		<title level="j">Lingua</title>
		<imprint>
			<biblScope unit="volume">115</biblScope>
			<biblScope unit="issue">5</biblScope>
			<biblScope unit="page" from="645" to="689" />
			<date type="published" when="2005">2015b. 30 May 2015. 2005</date>
		</imprint>
	</monogr>
	<note>YouTube Channel: Cyprien</note>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<title level="m" type="main">From Latin to romance : morphosyntactic typology and change</title>
		<author>
			<persName><forename type="first">A</forename><surname>Ledgeway</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2012">2012</date>
			<publisher>Oxford University Press</publisher>
			<pubPlace>Oxford; New York, NY</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<title level="m" type="main">Principles of Diachronic Syntax</title>
		<author>
			<persName><forename type="first">D</forename><surname>Lightfoot</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1979">1979</date>
			<publisher>Cambridge University Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">A tale of tweets: Analyzing microblogging among language learners</title>
		<author>
			<persName><forename type="first">L</forename><surname>Lomicka</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Lord</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">System</title>
		<imprint>
			<biblScope unit="volume">40</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="48" to="63" />
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<monogr>
		<title level="m" type="main">Syntactic relations: A critical survey</title>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">H</forename><surname>Matthews</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2007">2007</date>
			<publisher>Cambridge University Press</publisher>
			<biblScope unit="volume">114</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<title level="m" type="main">Twitter Now Has 255 Million Users, but Activity Has Dropped Year over Year</title>
		<author>
			<persName><forename type="first">B</forename><surname>Popper</surname></persName>
		</author>
		<ptr target="http://www.theverge.com/2014/4/29/5665752/twitter-q1-2014-earnings" />
		<imprint>
			<date type="published" when="2014-03-20">2014. 20 Mar. 2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">The Acquisition of French: The Development of Inflectional Morphology and Syntax in L1 Acquisition</title>
		<author>
			<persName><forename type="first">P</forename><surname>Prévost</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Bilingualism, and L2 Acquisition</title>
				<imprint>
			<publisher>John Benjamins Publishing Company</publisher>
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<monogr>
		<title level="m" type="main">Tweepy Documentation</title>
		<author>
			<persName><forename type="first">J</forename><surname>Roesslein</surname></persName>
		</author>
		<ptr target="http://docs.tweepy.org/en/v3.2.0/" />
		<imprint>
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<monogr>
		<title level="m" type="main">Trust the text : language, corpus and discourse</title>
		<author>
			<persName><forename type="first">J</forename><surname>Sinclair</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Carter</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2004">2004</date>
			<publisher>Routledge</publisher>
			<pubPlace>London; New York, N.Y.</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<monogr>
		<title level="m" type="main">Adjectives in Germanic and Romance</title>
		<author>
			<persName><forename type="first">P</forename><surname>Sleeman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Van De Velde</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Perridon</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2014">2014</date>
			<publisher>John Benjamins Publishing Company</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<monogr>
		<author>
			<persName><forename type="first">C</forename><surname>Smith</surname></persName>
		</author>
		<ptr target="http://www.socialbakers.com/statistics/twitter/profiles/france/Song" />
		<title level="m">By the Numbers: 150 Amazing Twitter Statistics</title>
				<editor>
			<persName><forename type="first">J</forename><forename type="middle">J</forename></persName>
		</editor>
		<imprint>
			<publisher>Cambridge University Press</publisher>
			<date type="published" when="2012">2015. 20 May. 2015. May 2015. 30 May 2015. 2015. 30 May 2015. 2012</date>
		</imprint>
	</monogr>
	<note>Most Popular Twitter Accounts in France. Twitter Statistics for France. Word order</note>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Facebook, Twitter and Google Penetration 2009-2013</title>
		<author>
			<persName><surname>Statistica</surname></persName>
		</author>
		<ptr target="http://www.statista.com/statistics/272014/global-social-networks-ranked-by-number-of-users/" />
	</analytic>
	<monogr>
		<title level="m">Social Networks: Global Sites Ranked by Users 2015. Leading Social Networks Worldwide as of March 2015, Ranked by Number of Active Users (in Millions</title>
				<meeting><address><addrLine>France</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2013-05-30">2013. 30 May 2015. 2015. 30 May 2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">The syntax of adjectives: A comparative study</title>
		<author>
			<persName><forename type="first">M</forename><surname>Stavrou</surname></persName>
		</author>
		<idno type="DOI">10.1353/lan.2012.0024</idno>
	</analytic>
	<monogr>
		<title level="j">Language</title>
		<imprint>
			<biblScope unit="volume">88</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="419" to="423" />
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<monogr>
		<title level="m" type="main">Syntaxe du français parlé vs. écrit : le cas de la position de l&apos;adjectif épithète par rapport au nom</title>
		<author>
			<persName><forename type="first">J</forename><surname>Thuilier</surname></persName>
		</author>
		<ptr target="https://twitter.com/lemondefr" />
		<imprint>
			<date type="published" when="2013">2013. 2015a. 30 May 2015. 04 Jun 2015. 30 May 2015</date>
			<publisher>About Twitter, Inc</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<monogr>
		<title level="m" type="main">Monsieur Dream Twitter</title>
		<author>
			<persName><surname>Twitter</surname></persName>
		</author>
		<ptr target="https://twitter.com/MonsieurDreamTwitterstatisticsforFrance" />
		<imprint>
			<date type="published" when="2015-05-30">2015d. 30 May 2015. 2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">Anteposition and Postposition of Qualificative Epithets in Contemporary French</title>
		<author>
			<persName><forename type="first">M</forename><surname>Wilmet</surname></persName>
		</author>
		<ptr target="http://www.socialbakers.com/statistics/twitter/profiles/france/" />
	</analytic>
	<monogr>
		<title level="j">Travaux de Linguistique</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="page" from="179" to="201" />
			<date type="published" when="1980">March.2015. 1980</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<monogr>
		<title level="m" type="main">A Script to Download All of a User&apos;s Tweets into a Csv</title>
		<author>
			<persName><surname>Yanofsky</surname></persName>
		</author>
		<ptr target="https://gist.github.com/yanofsky/5436496" />
		<imprint>
			<date type="published" when="2013-05-30">2013. 1 Nov. 2013. 30 May 2015</date>
		</imprint>
	</monogr>
	<note>Yanofsky/tweet_dumper</note>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
