<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Natural Language Processing for Literary Text Analysis: Word-Embeddings-Based Analysis of Zofka Kveder's Work</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Senja Pollak</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Matej Martinc</string-name>
          <email>matej.martincg@ijs.si</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Katja Mihurko Poniz</string-name>
          <email>katja.mihurko.poniz@ung.si</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Jozef Stefan Institute</institution>
          ,
          <addr-line>Ljubljana</addr-line>
          ,
          <country country="SI">Slovenia</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Jozef Stefan International Postgraduate School</institution>
          ,
          <addr-line>Ljubljana</addr-line>
          ,
          <country country="SI">Slovenia</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Nova Gorica</institution>
          ,
          <addr-line>Nova Gorica</addr-line>
          <country country="SI">Slovenia</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this paper we use word embeddings to analyse the corpus of work of a Slovenian modernist writer Zofka Kveder. For a selection of central concepts of her work, we compute the closest FastText embeddings. The interpretation shows that many of the word relations can be interpreted in line with the ndings in literary studies (e.g., woman is discussed in relation to men and Catholicism), or open novel paths for interpretation (e.g., woman in relation to European). On the other hand we also point at some problems resulting from the lemmatization and from using FastText embeddings.</p>
      </abstract>
      <kwd-group>
        <kwd>Digital literary studies</kwd>
        <kwd>Word embeddings</kwd>
        <kwd>Distant reading</kwd>
        <kwd>Gender</kwd>
        <kwd>Zofka Kveder</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The computational methods have importantly enriched the studies of literary
history in the last two decades. However, it can also be stated that the digital
humanities depend in large part on literary studies [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]. Since 1980s, the
feminist theory and gender studies have played an important role in the analysis of
literary texts. Moreover, the feminist literary history has made visible a large
number of forgotten women authors through digitalization projects, such as The
Orlando Project (https://www.artsrn.ualberta.ca/orlando/), The Women
Writers Project (https://www.wwp.northeastern.edu/) and the Virtual
Research Enviroment NEWW Women Writers in History (http://resources.
huygens.knaw.nl/womenwriters/), which is focusing not only on collection of
women's writings but especially on the collection of reception data. This
illustrates the fruitful connection between feminist literary history and digital
humanities. Projects that connect computational methods and feminist approaches
with literary texts, enable better understanding of women's roles, especially in
the literary life of earlier periods. Through the use of approaches such as distant
reading, developed by F. Moretti [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] in the Stanford Literary Lab, and by novel
data visualization and data analysis methods applied to large textual corpora,
new light can be shed on the work of forgotten or neglected women writers.
      </p>
      <p>
        The writings of women have always been an integral part of the literary eld
although they encountered many obstacles caused by the male dominance [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
Feminist digital literary studies have impacted the eld of digital humanities, but
regrettably also here, their contributions have not always been recognised and
acknowledged [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. Recent studies [
        <xref ref-type="bibr" rid="ref17 ref4">17,4</xref>
        ] prove how feminist approaches within
digital humanities enable new ndings and more complex understanding of the
literary history. Also in our work, we investigate a female author, and position
our work in the eld of digital humanities in combination with with the
feminist literary history and gender studies. We focus on the work of Zofka Kveder
who was one of the most important female writers in the multicultural space
of Habsburg Empire. Zofka Kveder's work has been previously transformed to
digitalized form and identi ed as a good source for digital humanities
investigations [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], but the presented work is the rst actual study analysing the work
with digital humanities and natural language processing methods.
      </p>
      <p>
        In this paper, we test how natural language processing methods can facilitate
the investigation of a literary text, in particular how using word embeddings can
reveal interesting relations between concepts in the work of Zofka Kveder. Word
embeddings are vector representations of words, where each word is assigned a
real-valued vector in a vector space. Due to the distributional hypothesis (see
Zellig Harris [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]), which states that words used in similar context will have similar
meanings, embeddings can capture a certain degree of semantics by translating
semantic relations in text into vector space relations. Word embeddings have
been previously applied to literary text corpora. For example, Grayson et al. [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]
use them to compare the characters in 19th-century ction by the authors Jane
Austen, Charles Dickens, and Arthur Conan Doyle, while Wohlgenannt et al. [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]
test several word embeddings methods on the task of extracting a social network
for literary texts, speci cally on the series of fantasy novels. In our experiments,
we train FastText embeddings [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] on the corpus of works of Zofka Kveder and for
selected concepts identify twenty nearest semantic neighbours according to the
cosine similarity between the embedding vectors. These are then interpreted from
a literary perspective, showing the potential of simple computational approaches
to literary investigation, as well as some de ciencies of automated methods.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Corpus of Zofka Kveder</title>
      <p>
        Zofka Kveder (1878-1926) was a multilingual and multicultural author who lived
in three Central European capitals: Ljubljana, Prague and Zagreb and wrote in
three languages: Slovenian, Croatian and German. Many of her works were
published in newspapers and literary magazines in the Central and South-Eastern
Europe in Czech, Slovak, Bulgarian, Serbian, Polish and Serbian language. She
was also a cultural mediator and an ardent feminist. In most of her stories,
a female character in various roles is in the foreground. She also touched on
the concept of free love, which was an important issue at the time, and
acknowledged the problems of forced marriages, women's urges, illegitimate motherhood,
abortion, suicide, prostitution, early death at childbirth and many other themes
from the lives of women. As many authors from the late 19th and early 20th
century, Zofka Kveder depicted the incompatibility of women's emancipation
with marriage and motherhood, and criticized the double moral of the
middleclass society, which could not accept an unmarried woman to enjoy sex without
feelings of guilt [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ].
      </p>
      <p>She also looked for concrete possibilities that would allow women to overcome
their position as the Other, to change their relationship with their own bodies
and to overcome feelings of guilt and uselessness, which, as she demonstrated,
could lead to the disintegration of identity or even death. Having opted for
stylistic pluralism, Kveder shaped her narrative using naturalist-realistic
stylistic devices, while also feeling an a nity for stylistic procedures typical of New
Romanticism. Her works have been translated to many European languages,
including Bulgarian, English, German, Polish, Czech.</p>
      <p>
        The corpus in our study contains all Kveder's Slovenian writings: two novels,
two plays, two one-act plays, some dramatic scenes, a large number of short
stories and tales and articles (literary reviews, feminist writings, etc.). The corpus
was published in ve volumes of the Collected works of Zofka Kveder - as printed
and electronic books edited by K. Mihurko Poniz [
        <xref ref-type="bibr" rid="ref10 ref9">10,9</xref>
        ]. In total, the corpus
contains 1,217,517 tokens.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Selected concepts</title>
      <p>For our analysis, we have selected 12 concepts (words): zenska [woman], zena
[wife/woman], moski [man], moz [husband/man], moderen [modern], dusa [soul],
pijanec [drunkard], ljubezen [love], otrok [child], emancipiran [emancipated],
potovati [to travel], mati [mother].</p>
      <p>
        The concepts have been selected according to the prevailing topics in Kveder's
works. The selection criterion was also Kveder's relationship to the concepts of
the so-called Wiener Moderne (Viennese modern age). In the last decade of the
19th century and in the rst decade of the 20th century in the majority of the
European literatures di erent stylistic formations had been developed that are
associated under the umbrella-term of the literary modernity (cf. Le Rider [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]).
We wanted to nd out how Kveder positioned herself and her literary gures
towards the concept of \modern". The authors of the Viennese modern age
revisioned the concepts of gender roles, therefore the concepts of woman, man and
love were chosen. The femaleness was in the majority of writings of the
modern age still connected with the motherhood, therefore we were interested how
Kveder connected di erent topics around this thematic eld and consequently
if we can conclude that she represented the conservative or progressive views on
this problem (concepts of mother and child). The writers who belonged to the
literary modernism (especially to the literary currents of symbolism and new
romanticism) developed new views on the relationship between body and internal
world - in Slovenian language the word soul was used at that time for the
individual's psyche. In the feminist but also in the literary writings Kveder re ected
upon women's emancipation (therefore we selected the concept of
emancipation), which was also one of the central topics of the modern age. Since in the
late 19th and early 20th century women traveled and explored new worlds more
often than ever before, we were also interested, how the concepts of travel and
migrations were coded in Kveder's texts (concept of travel). The concept of the
drunkard was chosen because of Kveder's strong (autobiographical) interest in
the topic of alcoholism.
4
      </p>
    </sec>
    <sec id="sec-4">
      <title>Method</title>
      <p>
        Even if usually word embeddings are trained on very large corpora, there are
several studies that show that embeddings can also lead to useful results on smaller
text collections. For example, a recent study by Diaz et al. [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] suggests that
leveraging embeddings trained on a large general corpus for modelling semantic
relations on a specialized corpus is problematic due to strong language use
variation. The study shows that embeddings trained only on a small topic speci c
corpus outperform non-topic speci c general embeddings trained on very large
general corpora for a somewhat speci c task of query expansion in specialized
text. Similar, FastText embeddings trained on small domain corpora were used
in Pollak et al. [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. Since we are also dealing with a specialized historical text
written by a single author known for its distinct writing style, we follow this line
of research and train embeddings only on the author's text.
      </p>
      <p>
        To counteract the negative a ects of a small training corpus, we use
FastText embeddings [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] as they are capable of capturing subword information and
modeling of a xes and su xes by representing a word as an average of its
character n-grams of speci c length. This allows the model to compensate for the
lack of available semantic information due to small corpus size by leveraging also
morphological similarity, which in many cases translates to semantic relatedness.
Also, since Slovenian is a morphologically rich language, FastText embeddings
are trained on a lemmatized corpus of Zofka Kveder's texts. Lemmatization was
employed in order to prevent the scenario in which the majority of nearest
semantic neighbours we want to obtain for each seed concept would be di erent forms
of the input concept. Text was lemmatized using Lemmagen lemmatizer (Jursic
et al., 2010). After that, for each of the chosen concepts we obtain 20 nearest
semantic neighbours according to the cosine similarity between the embedding
vectors. The selected sets are then used for literary investigation, presented in
next section.
      </p>
      <p>Seed word 20 closest semantic neighbours (Slovenian)</p>
      <p>zeneva, evropejec, moski, mosnja, mera,
zenska konkurent, zkelnerai,kamleocs,tvkole,rmikoasli*z,efmar,ikzeojn,kruarzemnecraa,,modistinja,
konkurencen, ovsenjak, bosnjak, lolog, mozitev</p>
      <p>moz, zenin, zenitev, mozen, sina,
zena neomzmozoezne,nm,uozbeovs,tvzoe,nemvoaz,imteovs,trvood, boimnao,z*ze[nosmkao,zen],
druzina, milodar, evropejec, nakljucje, vdova
joski, refoski, zenski, loloski, kmetiski,</p>
      <p>zenska, loski, spol, miski, bloski,
moski psiholoski, kakrsen, fanuski*, zenstvo, emancipiran,
siski [Siska], evropejec, nezenski, privilegiran, mostvo</p>
      <p>defravdant, vdova, zena, zenin, vnuk,
moz sinaz,betersai,nv*n,onvaidc,lezgean,itvenvu,koinceja,,sivnd,ov,
sorodnistvo, sluzincad, guvernant, rojak, stars
monumentalen, operen, uporen, efekten, tipicen,
moderen uovroigdinena,leonr,iebnutbaelenn,,oosbpnicoesne,np,avtzehtoicdeenn,, omkuosdeyn*,,
koncerten, teren, impozanten, asketicen, frapanten</p>
      <p>hrepenenje, dusec, srce, hrepenec, verujoc,
dusa ctuotpelcin,av,essoulsjast,vtoe,sngorebnak,ombraz,loattao,mn,aspleeptoostta,,
hrepeneti, omamljenost, gorkota, pesimizem, socutje
jeruza, zganec* [zganje], dedec, indijanec, galicijanec,
pijanec krezggaarn,jdare,bseovdceovv,kag,rupnivte,cp,oudmgraazdaanre,ch,rkiobsoivr,ka,
ostir, mesickovka, pokora, matijec*, kristjan
zvestoba, vkljub, neljub, vdan, samoljuben,
ljubezen ljuta,zsvaemsto,lvjuerbujjeo,cp, rseoljcuubtjeez,ntievs,nporserlcjeunb,, dzoabljruobtlaje,nost,
lazniv, objetje, milina, poljuben, ponizujoc</p>
      <p>sirotica, vnuk, vnukinja, angel, njivica,
otrok aondgrealas,eld, eski ble,tleecn,ombaez,icnrevci,ceskn,ahu,bsoiznicceak,,</p>
      <p>terezinka, rajnki, deca, sinka, sinek
talentiran, improviziran, absolviran, izoliran, privilegiran,
emancipiran trakdaircaiokntearlieznir,arteis,pceimktp, enramtuarna,likzoemmp, lriceizrearnv,ircainvi,likzoirnavne,nciloonloagle,n,
tiranizirati, individualiteta, individualen, afektiran, manir
pestovati, sluzbovati, izposlovati, posvetovati, odsvetovati,
potovati prnoraozkaodvoavtait,iz,bvoarsoovvaattii,, puocsinrekdoovvaatit,i,ogborestkoovvaattii,, poobiizsvkeodvoavtia,ti,
obedovati, nadzorovati, napredovati, vojskovati, tekmovati</p>
      <p>obubozati, hci, obdrzati, omehcati, poizkusati,
mati ocenati*sv[aorcileon,aosb],ljpuobkoovpaatvi,athic,ehrcine,rkdiend, opvr,iksrtaarjsaatti,i, vnuk,
objokovati, odklepati, zblizati, sinek, obupati
20 closest semantic neighbours (English translation)</p>
      <p>geneva, european, man, pouch, measure,
woman/wife, manhood/team, husband*, pharisee, situation,
competitor, cleric, clericalism, competition, milliner,
competitive, oatcake, bosniak, philologist, wedlock</p>
      <p>husband, groom, marriage, possible, son,
capable, poorness, wedding, relatives, woman,
unmarried, husband's, geneva, maleness, marry*,
family, charity, european, coincidence, widow
boobs, refosk, female, philological, peasant,</p>
      <p>woman, loski, gender, mice, bloski,
psychological, kind, fanuski, womanhood, emancipated,
siska, european, unmarried, privileged, maleness</p>
      <p>defalcator, widow, wife, groom, grandson,
son, tesin*, problem, granddaughter, widow,</p>
      <p>gather, again, marry, father, son,
kinship, domestics, governess, compatriot, parent
monumental, opera, rebellious, showy, typical,
introductory, oriental, worn-out eastern, mody*,</p>
      <p>original, bubna, naughty, pathetic, tasty,
concert, terrain, imposing, ascetic, fascinating
longing, su ocating, heart, yearning, believing,
feeling, universe, bitterness, atom, blindness,
warmth, drought, anxiety, coldness, tension,
longing, stoned, bitterness, pessimism, compassion
spirit, alcohol, buster, indian, galician,</p>
      <p>distiller, soda, drinker, dirty, kosir,
kregar, debevec's, estate, podgradar, highlander,
innkeeper, mesickovka, penance, matijec*, christian
delity, despite, undesirable, devoted, self-impetuous,
angry, self-centeredness, kindest, dearest, infatuation,
faithful, believing, compassion, coldhearted, kindness,
lying, embracement, grace, arbitrary, humiliating
orphan, grandson, granddaughter, angel, eld,</p>
      <p>adult, rod, laziness, worm, pauper,
angel, girl, little nger, daughter-in-law, little son,
terezinka, deceased, children, little son, little son
talented, improvised, absorbed, isolated, privileged,
characterized, cimperman, complicated, civilized, philologist,
traditional, respect, naturalism, reserved, conventional,
tyrannize, individuality, individual, a ected, manner
hold in arms, serve, arrange, consult, advise against,
prophesy, convene, mediate, host, inquire,</p>
      <p>regress, feast, e ect, slander, visit,
dine, control, progress, wage war, compete
loose all possessions, daughter, retain, soften, try,
warn, promise, daughter's, grandfathers, grow old,
Lord's prayer, bury, daughter's, deprive, grandson,</p>
      <p>grieve, unlock, bring together, little son, despair</p>
    </sec>
    <sec id="sec-5">
      <title>Analysis and discussion</title>
      <p>In Table 1 we present the results of experiments and in Table 2 the English
translations. In this section, we interpret selected concepts and relations from
the perspective of literary studies, and reveal some limitations of the method.</p>
      <p>The concept woman (see table rows 1 and 2) is placed in a relationship with
man (man, husband, groom) and with Catholicism: Pharisee, cleric, clericalism.
Words related to writer's family situation (family, widow, son) also appear.
In connection with the concept woman, there is also the word milliner, which
indicates a common women's profession in the modern age. The connection with
the word European is also interesting because literary studies have not discussed
it yet.</p>
      <p>
        In the concept of emancipated, no word appears in direct connection with a
woman. The concept of mother appears in connection with daughter, which is
expected, given the frequent topic of the mother-daughter relationship in Kveder's
writings. The concept mother connects with words that express emotions
associated with su ering: to bury, to grieve, to despair. The literary studies on the
topic of motherhood in Kveder's work connected Kveder's work with negative
representations of motherhood [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
      </p>
      <p>
        A man is positioned in the relationship to a woman (wife, marriage) and
in his family role (son, grandson, father, parent, kinship). Interesting is the
connection with the word gender, which does not occur in resulting words for
the the seed concept woman. The concept love is associated with delity/loyalty
and emotions. Surprisingly, it does not connect with the body, although this
was discussed as a frequent topic in Kveder's oeuvre in the eld of literary and
gender studies [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>The concept of modern is associated with words such as rebellious, e
ective, original but also pathetic, imposing, ascetic and fascinating, which are the
expected connections according to previous research of Kveder's writings. The
connection with the Oriental is interesting and calls for further research. The
relation with the concept soul (longing, heart, compassion) are unsurprising since
this concept was often discussed in the modern age. However, the connection
with the word atom is interesting and unexpected.</p>
      <p>Subword information, which in uences the similarity in FastText
embeddings, generally improves the semantic modelling in morphologically rich
languages and small corpora, as it helps with nding semantically similar
concepts with similar morphological structure. For example, similar lexemes in zena
[wife], zenin [groom], zenitev [marriage] are re ecting also semantic similarity of
words. On the other hand, the results indicate that this feature can also lead to
strong correlations between semantically (mainly) unrelated words containing
many common characters (e.g., strong correlation between zenska (woman in
Slovenian) and Zeneva (Slovenian name for the city in Switzerland)). An
interesting example is also ljubezen [love], where we can nd many etymologically
related words with common lexemes, which can be easily interpreted as
semantically related, such as samoljuben [self-centred], samoljubje [self-centeredness],
preljubezniv [kindest], but on the other hand we have words with similar subword
information that are not closely semantically related (e.g. poljuben [arbitrary]).</p>
      <p>Another aspect related to string similarity is related to word types. We can
notice that the part-of-speech (POS) of seed words and returned words is to a
large extent preserved. In our case the majority of seed words are nouns, and
corresponding returned embeddings return nearly only nouns, while when
adjectives or verbs were selected as seeds, the related words also belong mainly to the
respective POS categories. This is due to stronger association between words
with the same characteristic word type a xes. For example, the seed
adjective emancipiran has returned the set of semantically closest words talentiran,
improviziran, absolviran, . . . , while the verb potovati is associated to verbs
pestovati, sluzbovati, etc. However, when a noun has a typical verb ending (e.g. mati ),
there are many verbs in the returned list. Even if in general grouping of words in
relation to their POS can be interpreted as a feature, we found that in our study
(on a small corpus) this could be a source of confusion and that nouns provided
more interesting and reliable results than verbs and adjectives. Nouns would also
be our rst choice for selection of seed words for future analysis of literary works
in Slovenian language with FastText embeddings. Another thing that made the
interpretation of results harder are mistakes in the lemmatization. For example,
rst name Josko was lemmatized into joske (Slovenian for boobs), showing that
named entities should be marked or better handled in lemmatization.
6</p>
    </sec>
    <sec id="sec-6">
      <title>Conclusion and future work</title>
      <p>
        In this paper, we investigate how natural language processing methods can
facilitate the investigation of literary work. We have trained FastText embeddings
on the corpus of the text of Zofka Kveder who was a principal Slovenian
modernist, multilingual and multicultural author and a feminist. We have selected
twelve concepts (words), according to the prevailing topics in Kveder's works
and in relationship to the concepts of the so-called Wiener Moderne (Viennese
modern age). For these words, we have computed twenty nearest semantic
neighbours according to the cosine similarity between the embedding vectors. These
were then interpreted from a literary perspective, showing the potential of
simple computational approaches to literary investigation. From the point of view
of the research of Zofka Kveder's work, the results of our research con rm the
ndings of literary history studies, and also open new research directions (e.g.
the relation between woman and European, and modern and Oriental), which
means that computer analysis provides interesting results that can be a useful
approach in literary studies. In future work, we plan to enlarge the set of
concepts, investigate how to remove noise by using improved lemmatization, testing
the results without lemmatization and recognizing named entities. In addition to
observing similarities between concepts, word embeddings allow for investigating
analogies, which could be very interesting when applied to literary texts.
Further, we plan to use the methods on larger literary corpora, which would allow
for embeddings-based analysis of di erences between authors, authors' gender
(e.g. male vs. female authors in the same period) and diachronic based studies,
which would allow to analyse how certain concepts evolved during time. Last but
not least, we will analyse di erent word embeddings methods (w2v, GloVe,...) to
analyse strengths and weaknesses of di erent methods, and compare the results
to statistical word association measures, such as PMI [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] and PPMI [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ].
      </p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgements</title>
      <p>The work presented in this paper has been supported by European Unions
Horizon 2020 research and innovation programme under grant agreement No.
825153, project EMBEDDIA (Cross-Lingual Embeddings for Less-Represented
Languages in European News Media). The authors acknowledge also the nancial
support from the Slovenian Research Agency core research programme
Knowledge Technologies (P2-0103) and the research programme Historical
Interpretations of the 20th century (P6-0347). The authors also acknowledge the COST
Action Distant Reading (Grant No. CA 16204).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Bojanowski</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grave</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Joulin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Enriching word vectors with subword information</article-title>
          . In:
          <article-title>Transactions of the Association for Computational Linguistics</article-title>
          . pp.
          <volume>135</volume>
          {
          <issue>146</issue>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Bourdieu</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>La domination masculine</article-title>
          .
          <source>Actes de la recherche en sciences sociales 84(4)</source>
          ,
          <volume>2</volume>
          {
          <fpage>31</fpage>
          (
          <year>1990</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Church</surname>
            ,
            <given-names>K.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hanks</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Word association norms, mutual information, and lexicography</article-title>
          .
          <source>Computational Linguistics</source>
          <volume>16</volume>
          (
          <issue>1</issue>
          ),
          <volume>22</volume>
          {
          <fpage>29</fpage>
          (
          <year>1990</year>
          ), https://www.aclweb. org/anthology/J90-1003
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Coker</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ozment</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Building the women in book history bibliography, or digital enumerative bibliography as preservation of feminist labor</article-title>
          .
          <source>Digital Humanities Quarterly</source>
          <volume>13</volume>
          (
          <issue>3</issue>
          ) (
          <year>2019</year>
          ), http://digitalhumanities.org/dhq/vol/13/3/000428/ 000428.html
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Diaz</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mitra</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Craswell</surname>
          </string-name>
          , N.:
          <article-title>Query expansion with locally-trained word embeddings</article-title>
          .
          <source>In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)</source>
          . pp.
          <volume>367</volume>
          {
          <fpage>377</fpage>
          . Berlin, Germany (
          <year>August 2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Grayson</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mulvany</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wade</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Meaney</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Greene</surname>
            ,
            <given-names>D.:</given-names>
          </string-name>
          <article-title>Novel2Vec: Characterising 19th century ction via word embeddings</article-title>
          .
          <source>In: Proceedings of 24th Irish Conference on Arti cial Intelligence and Cognitive Science (AICS'16)</source>
          . Dublin, Ireland (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Harris</surname>
            ,
            <given-names>Z.S.:</given-names>
          </string-name>
          <article-title>Distributional structure</article-title>
          .
          <source>Word</source>
          <volume>10</volume>
          (
          <issue>23</issue>
          ),
          <volume>146</volume>
          {
          <fpage>162</fpage>
          (
          <year>1968</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Jensterle-Dolezal</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Problem identitete in travma telesa v prozi Zofke Kveder (A Problem of identity and the trauma of the body in the prose of Z. Kveder</article-title>
          . In:
          <string-name>
            <surname>Jensterle-Dolezal</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Honzak</surname>
            <given-names>Jahic</given-names>
          </string-name>
          ,
          <string-name>
            <surname>J</surname>
          </string-name>
          . (eds.) Zofka Kvedrova (
          <year>1878</year>
          {
          <year>1926</year>
          ):
          <article-title>Recepce jej tvorby ve 21</article-title>
          . stolet , p.
          <volume>81</volume>
          {
          <fpage>99</fpage>
          . Praha:
          <article-title>Narodn knihovna CR{ Slovanska knihovna (</article-title>
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Kveder</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          :
          <article-title>The collected works of Zofka Kveder. 1</article-title>
          . In: Poniz, K.M. (ed.)
          <article-title>The collected works of Slovene poets and writers</article-title>
          . Litera,
          <string-name>
            <surname>Maribor</surname>
          </string-name>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Kveder</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          :
          <article-title>The collected works of Zofka Kveder. 2-5</article-title>
          . In: Poniz, K.M. (ed.)
          <article-title>The collected works of Slovene poets and writers</article-title>
          .
          <source>Zalozba ZRC</source>
          , Ljubljana (
          <year>2014</year>
          -2018)
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Le Rider</surname>
          </string-name>
          , J.:
          <article-title>Modernity and Crises of Identity: Culture and</article-title>
          Society in Fin-de-siecle Vienna. Continuum, New York (
          <year>1993</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <given-names>Mihurko</given-names>
            <surname>Poniz</surname>
          </string-name>
          ,
          <string-name>
            <surname>K.</surname>
          </string-name>
          :
          <article-title>Digitalisation of the literary creativity : a case study of Zofka Kveder</article-title>
          . In: Vranes,
          <string-name>
            <surname>A</surname>
          </string-name>
          . (ed.)
          <article-title>Digital humanities</article-title>
          and Slavic cultural heritage : international scienti c conference, Belgrade, 6-7 May
          <year>2019</year>
          :
          <article-title>[the book of summaries]</article-title>
          . University of Belgrade, Faculty of Philology, Belgrade (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <given-names>Mihurko</given-names>
            <surname>Poniz</surname>
          </string-name>
          ,
          <string-name>
            <surname>K.</surname>
          </string-name>
          :
          <article-title>The depictions of motherhood in the writings of Zofka Kveder and German literary women in the 19th century</article-title>
          . In: Kiiskinen, H. (ed.)
          <article-title>ISCH 2010 Conference book</article-title>
          (May
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <given-names>Mihurko</given-names>
            <surname>Poniz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Parente-Capkova</surname>
          </string-name>
          ,
          <string-name>
            <surname>V.</surname>
          </string-name>
          :
          <article-title>The new women from the margins</article-title>
          .
          <source>Interlitteraria</source>
          <volume>20</volume>
          (
          <issue>2</issue>
          ),
          <volume>184</volume>
          (Dec
          <year>2015</year>
          ). https://doi.org/10.12697/il.
          <year>2015</year>
          .
          <volume>20</volume>
          .2.
          <fpage>15</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Moretti</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Distant reading</article-title>
          . Verso, London and New York (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Niwa</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nitta</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Co-occurrence vectors from corpora vs. distance vectors from dictionaries</article-title>
          .
          <source>In: COLING 1994 Volume 1: The 15th International Conference on Computational Linguistics</source>
          (
          <year>1994</year>
          ), https://www.aclweb.org/anthology/C94- 1049
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Ozment</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>From recovery to restoration: Aphra behn and feminist bibliography</article-title>
          .
          <source>Early Modern Women: An Interdisciplinary Journal</source>
          <volume>13</volume>
          (
          <issue>1</issue>
          ),
          <volume>105</volume>
          {
          <fpage>116</fpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Pollak</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Repar</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Martinc</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Podpecan</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Karst exploration: Extracting terms and de nitions from karst domain corpus</article-title>
          .
          <source>In: Proceedings of eLex 2019</source>
          . pp.
          <volume>934</volume>
          {
          <fpage>956</fpage>
          .
          <string-name>
            <surname>Sintra</surname>
          </string-name>
          ,
          <string-name>
            <surname>Portugal</surname>
          </string-name>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Pressman</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Swanstrom</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>The literary and/as the digital humanities</article-title>
          .
          <source>Digital Humanities Quarterly</source>
          <volume>7</volume>
          (
          <issue>1</issue>
          ) (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Wernimont</surname>
          </string-name>
          , J.:
          <article-title>Whence feminism? assessing feminist interventions in digital literary archives</article-title>
          .
          <source>Digital Humanities Quarterly</source>
          <volume>7</volume>
          (
          <issue>1</issue>
          ) (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Wohlgenannt</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chernyak</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ilvovsky</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Extracting social networks from literary text with word embedding tools</article-title>
          .
          <source>In: Proceedings of the Workshop on Language Technology Resources and Tools for Digital Humanities (LT4DH)</source>
          . pp.
          <volume>18</volume>
          {
          <fpage>25</fpage>
          .
          <string-name>
            <surname>Osaka</surname>
          </string-name>
          ,
          <string-name>
            <surname>Japan</surname>
          </string-name>
          (
          <year>December 2016</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>