<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Gender Stereotypes in Film Language: A Corpus-Assisted Analysis</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Lucia Busso §</string-name>
          <email>l.busso0@fileli.unipi.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Gianmarco Vignozzi §</string-name>
          <email>gianmarco.vignozzi@fileli.unipi.it</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>CoLingLab-Università di Pisa*</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Università di Pisa**</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>English: The present study concentrates on the representation and the reception of gender stereotypes. The analysis was first carried out on an ad hoc corpus of cult romantic comedies and dramedies of AngloAmerican pop contemporary culture and secondly with a perception test. Both the corpus-driven analysis and the test results provide useful insights into the representation, recognition and entrenchment of gender stereotypes in language and in western culture. The preliminary findings generally confirm and validate the scientific literature, although showing some notable new elements.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Italiano: Il lavoro si incentra sulla
rappresentazione e la percezione degli
stereotipi di genere. La ricerca è stata prima
condotta su un corpus costruito ad hoc di
film cult della cultura pop contemporanea
anglo-americana appartenenti ai generi
romantic comedy e dramedy, ed in seguito
con un test di percezione. Il duplice
approccio utilizzato fa luce sulla
rappresentazione, il riconoscimento e il
radicamento degli stereotipi di genere nella
lingua e nella cultura occidentale. I risultati
si trovano in linea con la letteratura,
sebbene mostrino alcuni nuovi elementi.
In the era of digital revolution and screen
proliferation, movies have undoubtedly acquired,
thanks to their significance, a pivotal role in
shaping our worldviews. In fact, popular films have the
power to sway our collective imagination and
influence our attitudes on crucial issues related to
race, class, gender, etc. Characters in films reflect
and perpetuate the status and options of them in
today’s society and culture, and thus play an
active part in creating symbolic role models
        <xref ref-type="bibr" rid="ref19 ref5">(Kord
2005, Bednarek 2015)</xref>
        . Accordingly, it is
interesting to examine the ways in which both females
and males are represented on celluloid to better
understand the ideologies they bear, and how
gender identities are idealized. There seems to be
wide agreement on the fact that characterization
in filmic discourse heavily relies on archetypes
and simplification
        <xref ref-type="bibr" rid="ref10 ref4">(Culpeper 2001; Bednarek
2010)</xref>
        . This is especially true in gender
representation, as stereotypical roles simplify
characterization in a way that it is easier to be received by
the viewing audience. This, however, often results
in an extreme polarization of gender roles. Film
dialogues are therefore an ideal ground on which
to study gender stereotypes and their linguistic
representation and reception. Hence, this paper
aims to fathom the discursive representation and
the perception of well-established gender
stereotypes in the dialogues of a sample of cult British
and American romantic comedies, by integrating
the tools of discourse analysis, corpus linguistics
and perception analysis.
2 Films, language and gender
The nature of film language is still an object of
debate. Movie scripts can be classified as texts
that are “written‐to‐be‐spoken‐as‐if‐not‐written”
        <xref ref-type="bibr" rid="ref14">(Gregory &amp; Carroll 1978: 42)</xref>
        . Dialogues, in fact,
portray a sort of “prefabricated orality” in that
they are carefully written to be performed and
sound natural to the audience, who longs for
authenticity
        <xref ref-type="bibr" rid="ref8">(Chaume 2012: 81)</xref>
        . Corpus-based
studies have proved that spontaneous conversation
and scripted dialogues are very similar in nature,
sharing almost the same array of
lexico-grammatical features
        <xref ref-type="bibr" rid="ref13 ref20 ref26 ref3 ref4">(Quaglio 2009, Bednarek 2010,
Forchini 2012, Baker 2014, amongst others)</xref>
        , but
due to the evident need for clarity and speed in
audio-visual texts, there may be changes in terms
of their frequency. In fact, film scripts, sometimes
tend to over-use features of spontaneous
conversation (e.g.: greetings and leave-takings, Bruti &amp;
Vignozzi (2016)) both for dramatic reasons and to
render the speech of characters as
natural-sounding as possible.
      </p>
      <p>
        Starting from the premises that gender is socially
constructed
        <xref ref-type="bibr" rid="ref7">(Cameron 2010)</xref>
        and that a large part
of its perception relies on the observation of
preestablished models, television and films provide
the perfect field for examining generalized
western social representation of accepted human
behaviour
        <xref ref-type="bibr" rid="ref30">(Shrum 2008)</xref>
        . In this vein, verbal
language becomes one of the pivotal means to create,
reinforce and most importantly perpetuate
stereotypical representations. Canonical research on
language and gender has shown that traits such as
hedges, empty adjectives, excessively polite
forms, intensifiers, troubles talk etc. are more
typical of women
        <xref ref-type="bibr" rid="ref21 ref33 ref9">(Lakoff, 1975; Tannen 1994;
Coates 1993)</xref>
        , whereas males are associated with
substandard and diatopically marked registers
        <xref ref-type="bibr" rid="ref32 ref34">(Trudgill 1972; Tannen 1991)</xref>
        and a use of
language that is aimed at retaining status and
attention. However, nowadays many of these ideas
have been partially rejected and framed as
stereotypical norms around feminity and masculinity,
which do not leave space for diversity
        <xref ref-type="bibr" rid="ref25 ref5 ref7">(Cameron
2010, Mullany 2007; Bednarek, 2015)</xref>
        . In recent
times, corpus linguistics and computational
linguistics have shown interest in analysing
differences in language between genders
        <xref ref-type="bibr" rid="ref15 ref2 ref23 ref24">(Argamom et
al, 2003, Baker 2006, Herring &amp; Paolillo 2006,
McEnery 2006, Monroe et al. 2008, amongst
others)</xref>
        . This body of literature represents the
backbone structure of our work, which aims to put
together “corpus linguistics and gender analysis:
two strands of linguistic research that do not go
together frequently”
        <xref ref-type="bibr" rid="ref20">(Kreyer 2014: 570)</xref>
        .
      </p>
    </sec>
    <sec id="sec-2">
      <title>3 Data and corpus driven analysis</title>
      <p>
        The corpus. We compiled a corpus out of the
orthographic transcriptions of eight English and
American romantic comedies, using the web
software SketchEngine
        <xref ref-type="bibr" rid="ref17 ref18">(Kilgarriff et al. 2004, 2014)</xref>
        .
The films were chosen not only for their themes,
but also for chronological coherence, as they
cover approximately the first decade of the 21st
century (table 1).
      </p>
      <p>Title
Sliding Doors
Billy Elliot
Bridget Jones’ Diary
Year
1998
2000
2001</p>
      <p>Nation
UK
UK
UK/USA</p>
      <p>Bend It Like Beckham
The Devil Wears Prada
Juno
Eat, Pray, Love
Letters to Juliet
2006
2007
2010
2010
USA
USA
USA
USA
The resulting corpus is therefore a synchronic ad
hoc corpus of 95,036 tokens. We further
subdivided it into two subcorpora consisting of the
turns of female and male characters – respectively
55,766 (58.7%) and 39,270 (41.3%) tokens
(henceforth: M and F). We chose to gather a new
corpus – instead of relying on existing ones – to
obtain a higher control on the data. Moreover,
popular romantic comedies are the perfect humus
for a polarized representation of gender roles,
because of their content and intrinsic structure. As
will be seen, however, our results are comparable
with the ones extracted from much the larger film
corpus Cornell Movie-Dialogs Corpus.1</p>
      <sec id="sec-2-1">
        <title>Keywords and semantic domains clouds analy</title>
        <p>
          sis. We used the online text analysis software
WMatrix
          <xref ref-type="bibr" rid="ref28">(Rayson 2003, 2004)</xref>
          to compare M and
F both against each other and a reference corpus
– the BNC-spoken. WMatrix performs automatic
semantic analysis (of English) texts. This
semantic analysis is carried out by a first POS tagging
phase; the output is then semantically tagged from
a set of 21 predefined semantic fields, further
subdivided into 232 category labels for more
finegrained classification. Thus, from the comparative
analyses starting from males and females’
subcorpora, keywords and semantic domains clouds
(calculated with log-likelihood statistic).
Statistically significant items are the ones with LL values
near or over 7, since 6.63 is the cut-off for 99%
confidence of significance. The automatically
obtained clouds were manually analysed to filter
possible errors and select the more significant
semantic domains associated with our sub-corpora.
From the comparisons of the two sub-corpora
against each other and against the BNC Spoken,
we selected the most relevant semantic domains
and keywords (i.e. with the higher LL values) for
more qualitative-like evaluation. Tables 2 and 3
report the domains and the keywords that we
selected.
1The fact that F is bigger than M should not come as a
surprise. The film genre of romantic comedy is generally
addressed to women and has therefore more female leading
characters.
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Sem. domains F</title>
        <p>Business: Selling
As it can be seen, in our corpus women tend to
speak about shopping, cleaning, personal care,
and family, whereas men appear to discuss
money, sports, work and male friendship. In table
2 are also present semantic domains which were
relevant for both M and F speech, i.e. “Anatomy
and Physiology” and “Intimacy and Sex” (in
bold). These last two domains may emerge as
strongly relevant due to corpus-specific reasons.
Romantic comedies, in fact, are most often
centred around romantic and quite physical
relationships. However, what we think is of interest when
analysing the overlapping between semantic
domains between females and males is the different
wording. Women and men refer to their bodies
and their relationships in different ways, which
are consistent with a polarization of gender roles
(E.g.: breasts vs. boobs). Keywords are also worth
mentioning. Their evaluation showed that women
make larger use of politeness forms, while men
resort to more swearwords and interjections, such
as “right, all right”.
2 The stimuli-sentences were chosen to be as representative
as possible of the entire corpus: they are evenly distributed
among all the films of the corpus, with two or three instances
from each film for each subcorpus.</p>
        <p>
          Interestingly, the tendencies that emerged from
our small corpus are in line with Schofield and
Mehr (2016)’s analysis of the Cornell
Movie-Dialogs Corpus
          <xref ref-type="bibr" rid="ref11">(Danescu-Niculescu-Mizil et al.
2012a)</xref>
          , a vast corpus of more than 600 films of
different genres. The similarity of the results gave
us confidence in using the stereotypical
representations of genders’ speech to investigate its
reception by means of a test.
        </p>
        <p>The test. With the aim of testing the reception
and entrenchment of gender stereotypes in
speakers, we developed a perception test based on the
results of our corpus-driven analysis. We
manually extracted 18 lines per subcorpus2, each
containing one or more of the stereotypical semantic
domains and keywords that emerged from the
previous WMatrix analysis. The resulting 36
extracted lines were used as stimuli in the perception
test3. The choice of such limited number of
sentences was determined by two reasons. The first,
theoretically motivated, was not to repeat the
same keywords and stereotypes too many times.
Such repetition, in our opinion, could have
influenced or biased the participants. The second
reason, of a more practical nature, was to construct a
reasonably-sized test to maintain participants’
attention and avoid fatigue, which could have
influenced the responses. We extracted film lines
containing a variable concentration of stereotypes,
ranging from sentences referring to only one to
several stereotypical domains. The selection was
done manually, based on the rather obvious
hypothesis that sentences more “stereotypically
dense” would be recognised more easily. The
stimuli-sentences were also chosen as deprived of
context as possible, in order not to give any clue
about the film of origin. Proper names were
omitted, and when this was not possible, substituted
with the string [XXX]. For example, in (1) the
name of the male romantic partner was obscured
so that the only clue to the gender of the speaker
would be the linguistic stereotypes (shopping,
mitigated swearwords, weaving).</p>
        <p>1)</p>
        <p>When [XXX] and I broke up for two
weeks, I bought a loom, a frigging loom
The test was presented to 22 native, bilingual or
highly proficient speakers of English, 15 women
and 7 men (mean age: 39.5). The task was to
decide whether a given sentence had been uttered by
3 For reasons of space we do not include the complete list of
the sentences extracted and used for the test. Several
examples are reported in the text and in following footnotes.
a man or a woman. In order not to force
participants to a necessarily binary choice, the option “I
don’t know” was also included. We additionally
asked speakers to specify words, expressions or
general concepts that influenced their answers.
This provided us with interesting insights into
participants’ process of thinking and categorizing.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>4 Results</title>
      <p>Several interesting considerations arise from the
analysis of the data. Firstly, it appears that overall
the stereotypes were correctly spotted and
categorized.</p>
      <p>Chart. 1: Percentage of recognised stereotypes (in red)
However, it also emerges that female stereotypes
were more unambiguously recognisable, with
fewer answers assigned to the other gender or to
the “I don’t know” category (chart.1).</p>
      <p>By examining more closely the results, a
subdivision of the data can be made to account for the
differences in it: recognised (more than 50%
correct), ambiguous (between 25-50% correct) and
completely misunderstood (less than 25% correct)
stereotypes. Table 4 illustrates the distribution of
answers in the three frequency slots.</p>
      <p>&gt; 50% 25-50% &lt; 25%
F LINES
As was firstly hypothesized, sentences with a
higher “density” of stereotypical keywords or
semantic domains were usually the ones that
speakers better recognised. Stimuli in the first group,
therefore, consist of clear-cut and well
4 E.g.: “Give me the bag! I've got to get some proper shoes
for the wedding now” (71%) (f); “What are you doing, eh?
You're me best mate!” (82%) (m).
5 E.g.: “God! My mum had a fit when she saw the boots!”
(47%) (f); “He's a kid. He's just a fucking little kid.” (47%)
(m).
6 The reverse stereotypes utterances are the following.
recognisable clusters of linguistic and conceptual
stereotypes4. The second group is instead formed
by stereotypes that were recognised by a
substantial part of the informants, but not by the majority.
This, in our opinion, may be due to several
factors: some concepts, for example, could be
perceived as less prototypical than others. In
addition, some linguistic features (e.g. discourse
markers) were not fully recognised as
stereotypical due to our limitation to the written dimension.
Prosody, contextual information and
multimodality are in fact fundamental aspects of language
that were inevitably excluded from our
experimental design5. Finally, the last group consists of
stereotypes that were not perceived as such by
speakers (e.g.: family as a typical argument of
women’s speech), and of what we called reverse
stereotypes. That is, utterances that conceptually
represented ambiguous events or anti-prototypical
situations: a woman swearing, a man talking about
his feelings.6 As predicted, these stereotypes were
not recognised at all by participants, who tended
to assign them to the opposite gender. It is
interesting to note that also some male-produced
sentences were not recognized by our informants,
perhaps due to the composition of our corpus.
Several predominant keywords and domains in M,
in fact, may be strictly related to the chosen film
genre. For example, the massive presence of the
WMatrix domain Evaluation_inaccurate -- i.e.
apologies --reflects the archetypical situation in
romantic comedies of men apologizing for their
mistakes to women. Being so context-related,
however, speakers were not able to correctly
locate sentences containing expressions from this
domain.7
Another aspect that was taken into consideration
in our analysis was the gender of the informants,
to see if a relation with the data could be
recognised. There was a statistically significant
difference between the gender of the participant and the
answer to the test (H (2) = 9.2388, p-value =
0.0024, Kruskal-Wallis test with Wilcoxon
posthoc, Bonferroni p-value correction).</p>
      <p>A chi-square test of independence was performed
as well to examine the relation between gender of
the speaker and responses given.</p>
      <p>I. Oh, shit! I stubbed my foot on the side of the shagging
bath! (f)
II. This is the first time in 18 years I'm going to be able
to call the shots in my own life! (m)
7- I made a mistake, such a big, BIG mistake and I'm sorry.
I'm truly, truly sorry.
- We accept that we fight a lot, and we hardly have sex
anymore, but we don't wanna live without each other.
The relation between these variables was
significant. (χ2 = 10.298, p-value= 0.0058).</p>
      <p>Chart. 2: mosaic plot of the results divided by gender.
Chart 2 shows the difference in male and female
informants’ answers. The numbers of the variable
“responses” indicate the three possible answers of
the test: “male” (1), “female” (2), “I don’t know”
(3). As it can be seen, men assigned overall more
utterances to the “I don’t know” option rather than
to one of the two genders. Women, instead, show
a fairly equal distribution of responses among the
three conditions. Furthermore, both men and
women assigned more utterances to female
characters than to male ones (see table 5). This result
is in line with the fact that women stereotypes
were better recognised overall, in the sense that
fewer answers were assigned to the other gender.</p>
      <sec id="sec-3-1">
        <title>MEN WOMEN</title>
        <p>m 23% 30%
f 29% 34%
idk 48% 36%
Table 5: distribution of informants’
answers divided by gender of the speaker
Other useful insights into the data came from the
words our informants identified as relevant to
their decision. In fact, two tendencies emerged:
speakers either indicated specific words,
collocations or phrases, or answered with abstract
concepts and pragmatic inferences based on the
utterances. Interestingly, words and expressions
exactly replicated keywords, while general and
abstract concepts reflected the semantic domains
that emerged in the corpus analysis. In addition,
several speakers performed actual pragmatic
inferences based on the stereotypical concepts
contained in the sentences. For example, to (2)
subjects reacted either with a specific word like in a)
or with a more general consideration as in b).
2) Ooh, you must feel like you're about to find
your long-lost soul mate!
a) "soul mate"
b) talking about feelings in general</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>5 Conclusions</title>
      <p>
        The present paper proposes an original take on
investigating gender stereotypes in language. The
novelty in our approach lies in the hybrid
methodology that falls neither in the tradition of the
literature on “gendered discourse” nor in the more
recent field of corpus linguistics, but combines the
two and adds insights from psycholinguistics as
well. This kind of integrated analysis provided us
with preliminary results that help identify gender
archetypical roles, behaviours and linguistic
representations in modern western culture. What is
interesting to note is that the gender
representations coming to light from our corpus of
pop-culture films are based on features that are now
dismissed as clichéd and stereotypical by the
literature
        <xref ref-type="bibr" rid="ref7">(see Cameron 2005, 2010; Bexter 2006)</xref>
        , but
which seem to be nonetheless entrenched in our
interpretation of reality.
      </p>
      <p>The archetypical depiction of characters is
particularly evident in popular comedies, which do not
examine characters’ psychology in depth. The test
validated our assumption that film language
stereotypically portrays the way in which men and
women talk drawing on recognisable traits
attached to femininity and masculinity in our
culture. In fact, speakers were mostly able to
correctly assign the utterances to the right gender.
In addition, all our informants showed
metalinguistic –or second-level –awareness about
stereotypical concepts and linguistic clues, and several
of them also provided us with insightful and
creative inferences based on the event described in the
utterance. We interpret this as a sign of
stereotypes being conceptual in nature, deeply
entrenched in our representation of the world and
accessed via linguistic clues. The “reverse
stereotypes” also reinforce this idea.</p>
      <p>Acknowledgements: We would like to thank
professors Silvia Bruti, Alessandro Lenci, Belinda Crawford
and Monika Bednarek for useful comments. We also
gratefully acknowledge the reviewers for their
valuable comments.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <given-names>Shlomo</given-names>
            <surname>Argamon</surname>
          </string-name>
          , Moshe Koppel, Jonathan Fine, and Anat Rachel Shimoni.
          <year>2003</year>
          .
          <article-title>Gender, genre, and writing style in formal written texts</article-title>
          .
          <source>Text &amp; Talk</source>
          ,
          <volume>23</volume>
          (
          <issue>3</issue>
          ),
          <fpage>321</fpage>
          -
          <lpage>346</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>Paul</given-names>
            <surname>Baker</surname>
          </string-name>
          .
          <year>2006</year>
          .
          <article-title>Using Corpora in Discourse Analysis</article-title>
          . London: Continuum.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>Paul</given-names>
            <surname>Baker</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Using Corpora to Analyze Gender</article-title>
          . London; New York: Bloomsbury Academic.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <given-names>Monika</given-names>
            <surname>Bednarek</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>The Language of Fictional Television: Drama and Identity</article-title>
          . London: Continuum.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <given-names>Monika</given-names>
            <surname>Bednarek</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Corpus-Assisted Multimodal Discourse Analysis of Television and Film Narratives</article-title>
          . In P. Baker,
          <string-name>
            <surname>T.</surname>
          </string-name>
          McEnery (Eds.),
          <source>Corpora and Discourse Studies: Integrating Discourse and Corpora</source>
          ,
          <volume>63</volume>
          -
          <fpage>87</fpage>
          . Basingstoke, UK: Palgrave Macmillan.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <given-names>Silvia</given-names>
            <surname>Bruti</surname>
          </string-name>
          and
          <string-name>
            <given-names>Gianmarco</given-names>
            <surname>Vignozzi</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Routines as social pleasantries in period dramas: a corpus linguistic analysis</article-title>
          . in R. Ferrari,S. Bruti (eds),
          <source>A Language of One's Own: Idiolectal English</source>
          , pp.
          <fpage>207</fpage>
          -
          <lpage>239</lpage>
          , Bologna: I libri di Emil.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>Deborah</given-names>
            <surname>Cameron</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Gender, Language and the New Biologism</article-title>
          . Constellations,
          <volume>17</volume>
          (
          <issue>4</issue>
          ),
          <fpage>526</fpage>
          -
          <lpage>39</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <given-names>Frederic</given-names>
            <surname>Chaume</surname>
          </string-name>
          .
          <year>2012</year>
          . Audiovisual Translation: Dubbing. Manchester, St Jerome.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <given-names>Jennifer</given-names>
            <surname>Coates</surname>
          </string-name>
          .
          <year>1993</year>
          .
          <article-title>Women, Men and Language</article-title>
          . London: Longman.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <given-names>Jonathan</given-names>
            <surname>Culpeper</surname>
          </string-name>
          .
          <year>2001</year>
          .
          <article-title>Language and characterisation: people in plays and other texts</article-title>
          . Harlow: Longman.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <given-names>Cristian</given-names>
            <surname>Danescu-Niculescu-Mizil</surname>
          </string-name>
          , Justin Cheng, Jon Kleinberg, and
          <string-name>
            <given-names>Lillian</given-names>
            <surname>Lee</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>You had me at hello: How phrasing affects memorability</article-title>
          .
          <source>In Proceedings of ACL</source>
          ,
          <fpage>892</fpage>
          -
          <lpage>901</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <given-names>Penelope</given-names>
            <surname>Eckert and Sally McConnell-Ginet</surname>
          </string-name>
          .
          <year>2003</year>
          .
          <article-title>Language and Gender</article-title>
          . Cambridge: Cambridge University Press.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <given-names>Pierfranca</given-names>
            <surname>Forchini</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Movie Language Revisited. Evidence from Multi-Dimensional Analysis and Corpora</article-title>
          . Bern: Peter Lang.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <given-names>Michael</given-names>
            <surname>Gregory</surname>
          </string-name>
          and
          <string-name>
            <given-names>Susanne</given-names>
            <surname>Carroll</surname>
          </string-name>
          .
          <year>1978</year>
          .
          <article-title>Language and Situation: TV Heroines: Contemporary Screen Images of Women</article-title>
          . Lanham: Rowman &amp; Littlefield.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>Susan C Herring and John C Paolillo</surname>
          </string-name>
          .
          <year>2006</year>
          .
          <article-title>Gender and genre variation in weblogs</article-title>
          .
          <source>Journal of Sociolinguistics</source>
          ,
          <volume>10</volume>
          (
          <issue>4</issue>
          ),
          <fpage>439</fpage>
          -
          <lpage>459</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <given-names>Janet</given-names>
            <surname>Holmes</surname>
          </string-name>
          .
          <year>2006</year>
          . Gendered Talk at Work:
          <article-title>Constructing Gender Identity through Workplace Discourse</article-title>
          . Oxford: Blackwell.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <given-names>Adam</given-names>
            <surname>Kilgarriff</surname>
          </string-name>
          , Vít Baisa, Jan Bušta, Miloš Jakubíček, Vojtěch Kovvář, Jan Michelfeit, Pavel Rychlý,
          <string-name>
            <given-names>Vít</given-names>
            <surname>Suchomel</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>The Sketch Engine: ten years on</article-title>
          .
          <source>Lexicography</source>
          ,
          <volume>1</volume>
          ,
          <fpage>7</fpage>
          -
          <lpage>36</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <given-names>Adam</given-names>
            <surname>Kilgarriff</surname>
          </string-name>
          , Pavel Rychlý, Pavel Smrž, David Tugwell.
          <source>The Sketch Engine</source>
          .
          <year>2004</year>
          . Information Technology. Available online at: www.sketchengine.co.uk
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <given-names>Susanne</given-names>
            <surname>Kord</surname>
          </string-name>
          and
          <string-name>
            <given-names>Elisabeth</given-names>
            <surname>Krimmer</surname>
          </string-name>
          .
          <year>2005</year>
          .
          <article-title>Hollywood Divas, Indie Queens, and Language Varieties and their Social Contexts</article-title>
          . London/New York: Routledge.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <given-names>Rolf</given-names>
            <surname>Kreyer</surname>
          </string-name>
          .
          <year>2014</year>
          . Review:
          <string-name>
            <given-names>P.</given-names>
            <surname>Baker</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Using Corpora to Analyze Gender</article-title>
          . London/New York: Bloomsbury.
          <source>International Journal of Corpus Linguistics</source>
          <volume>19</volume>
          ,
          <fpage>570</fpage>
          -
          <lpage>575</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <given-names>Robin</given-names>
            <surname>Lakoff</surname>
          </string-name>
          .
          <year>1975</year>
          .
          <article-title>Language and woman's place</article-title>
          . New York, NY: Harper &amp; Row.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <string-name>
            <given-names>Tom</given-names>
            <surname>McArthur</surname>
          </string-name>
          .
          <year>1981</year>
          .
          <article-title>Lexicon of Contemporary English</article-title>
          . London: Longman.
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <surname>Anthony M. McEnery</surname>
            ,
            <given-names>Richard Z. Xiao &amp; Yukio</given-names>
          </string-name>
          <string-name>
            <surname>Tono</surname>
          </string-name>
          .
          <year>2006</year>
          .
          <article-title>Corpus-based Language Studies: An Advanced Resource Book</article-title>
          . London/New York: Routledge.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <string-name>
            <surname>Burt L Monroe</surname>
          </string-name>
          ,
          <string-name>
            <surname>Michael P Colaresi</surname>
          </string-name>
          , and
          <string-name>
            <surname>Kevin M Quinn</surname>
          </string-name>
          .
          <year>2008</year>
          .
          <article-title>Fightin'words: Lexical feature selection and evaluation for identifying the content of political conflict</article-title>
          .
          <source>Political Analysis</source>
          ,
          <volume>16</volume>
          (
          <issue>4</issue>
          ),
          <fpage>372</fpage>
          -
          <lpage>403</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <string-name>
            <given-names>Louise</given-names>
            <surname>Mullany</surname>
          </string-name>
          .
          <year>2007</year>
          .
          <article-title>Gendered Discourse in the Professional Workplace</article-title>
          . Basingstoke, NY: Palgrave Macmillan.
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          <string-name>
            <given-names>Paulo</given-names>
            <surname>Quaglio</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Television Dialogue: The Sitcom Friends vs</article-title>
          .
          <source>Natural Conversation</source>
          . Philadelphia: John Benjamins.
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          <string-name>
            <given-names>Paul</given-names>
            <surname>Rayson</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Wmatrix: A Web-based Corpus Processing Environment</article-title>
          . Computing Department, Lancaster University. Available online at: http://ucrel.lancs.ac.uk/wmatrix/
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          <string-name>
            <given-names>Paul</given-names>
            <surname>Rayson</surname>
          </string-name>
          , Dawn Archer, Scott Piao, &amp;
          <string-name>
            <surname>Anthony M. McEnery</surname>
          </string-name>
          .
          <year>2004</year>
          .
          <article-title>The UCREL semantic analysis system. Proceedings of the beyond named entity recognition semantic labelling for NLP tasks workshop</article-title>
          , Lisbon, Portugal,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          <string-name>
            <given-names>Alexandra</given-names>
            <surname>Schofield</surname>
          </string-name>
          and
          <string-name>
            <given-names>Leo</given-names>
            <surname>Mehr</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Gender distinguishing features in film dialogue</article-title>
          .
          <source>NAACL CLfL.</source>
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          <string-name>
            <given-names>L.j.</given-names>
            <surname>Shrum</surname>
          </string-name>
          .
          <year>2008</year>
          .
          <article-title>Media consumption and perceptions of social reality</article-title>
          . In J. Bryant &amp;
          <string-name>
            <surname>M.B. Oliver</surname>
          </string-name>
          (eds.),
          <source>Media Effects: Advances in Theory and Research, 3rd Edition</source>
          . New York, NY: Routledge.
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          <string-name>
            <surname>Mary M Talbot</surname>
          </string-name>
          .
          <year>2003</year>
          . “
          <article-title>Gender Stereotypes: Reproduction and Challenge”</article-title>
          . In Holmes, J. &amp;
          <string-name>
            <surname>Meyerhoff</surname>
          </string-name>
          , M. (eds.),
          <source>The Handbook of Language and Gender</source>
          . Oxford: Blackwell,
          <fpage>468</fpage>
          -
          <lpage>86</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          <string-name>
            <given-names>Deborah</given-names>
            <surname>Tannen</surname>
          </string-name>
          .
          <year>1991</year>
          .
          <article-title>You just don't understand: Women and men in conversation</article-title>
          . Virago London.
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          <string-name>
            <given-names>Deborah</given-names>
            <surname>Tannen</surname>
          </string-name>
          .
          <year>1994</year>
          .
          <article-title>Gender and Discourse</article-title>
          . New York: Oxford University Press.
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          <string-name>
            <given-names>Peter</given-names>
            <surname>Trudgill</surname>
          </string-name>
          .
          <year>1972</year>
          .
          <article-title>Sex, covert prestige and linguistici change in the urban British English of Norwich</article-title>
          .
          <source>Language in Society 1</source>
          ,
          <fpage>179</fpage>
          -
          <lpage>195</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>