<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Natural Language Processing for Literary Text Analysis: Word-Embeddings-Based Analysis of Zofka Kveder&apos;s Work</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Senja</forename><surname>Pollak</surname></persName>
							<email>senja.pollak@ijs.si</email>
							<affiliation key="aff0">
								<orgName type="institution">Jožef Stefan Institute</orgName>
								<address>
									<settlement>Ljubljana</settlement>
									<country key="SI">Slovenia</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Matej</forename><surname>Martinc</surname></persName>
							<email>matej.martinc@ijs.si</email>
							<affiliation key="aff0">
								<orgName type="institution">Jožef Stefan Institute</orgName>
								<address>
									<settlement>Ljubljana</settlement>
									<country key="SI">Slovenia</country>
								</address>
							</affiliation>
							<affiliation key="aff2">
								<orgName type="department">Jožef Stefan International Postgraduate School</orgName>
								<address>
									<settlement>Ljubljana</settlement>
									<country key="SI">Slovenia</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Katja</forename><forename type="middle">Mihurko</forename><surname>Poniž</surname></persName>
							<affiliation key="aff1">
								<orgName type="institution">University of Nova Gorica</orgName>
								<address>
									<settlement>Nova Gorica</settlement>
									<country key="SI">Slovenia</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Natural Language Processing for Literary Text Analysis: Word-Embeddings-Based Analysis of Zofka Kveder&apos;s Work</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">6FE197FDB529A4595FDEE4289CABCB2E</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-25T05:36+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Digital literary studies</term>
					<term>Word embeddings</term>
					<term>Distant reading</term>
					<term>Gender</term>
					<term>Zofka Kveder geneva, european, man, pouch, measure, woman/wife, manhood/team, husband*, pharisee, situation, competitor, cleric, clericalism, competition, milliner, competitive, oatcake, bosniak, philologist, wedlock wife/woman husband, groom, marriage, possible, son, capable, poorness, wedding, relatives, woman, unmarried, husband&apos;s, geneva, maleness, marry*, family, charity, european, coincidence, widow man boobs, refošk, female, philological, peasant, woman, loški, gender, mice, bloški, psychological, kind, fanuški, womanhood, emancipated, šiška, european, unmarried, privileged, maleness husband/man defalcator, widow, wife, groom, grandson, son, tesin*, problem, granddaughter, widow, gather, again, marry, father, son, kinship, domestics, governess, compatriot, parent modern monumental, opera, rebellious, showy, typical, introductory, oriental, worn-out eastern, mody*, original, bubna, naughty, pathetic, tasty, concert, terrain, imposing, ascetic, fascinating soul longing, suffocating, heart, yearning, believing, feeling, universe, bitterness, atom, blindness, warmth, drought, anxiety, coldness, tension, longing, stoned, bitterness, pessimism, compassion drunkard spirit, alcohol, buster, indian, galician, distiller, soda, drinker, dirty, košir, kregar, debevec&apos;s, estate, podgradar, highlander, innkeeper, mešičkovka, penance, matijec*, christian love fidelity, despite, undesirable, devoted, self-impetuous, angry, self-centeredness, kindest, dearest, infatuation, faithful, believing, compassion, coldhearted, kindness, lying, embracement, grace, arbitrary, humiliating child orphan, grandson, granddaughter, angel, field, adult, rod, laziness, worm, pauper, angel, girl, little finger, daughter-in-law, little son, terezinka, deceased, children, little son, little son emancipated talented, improvised, absorbed, isolated, privileged, characterized, cimperman, complicated, civilized, philologist, traditional, respect, naturalism, reserved, conventional, tyrannize, individuality, individual, affected, manner to travel hold in arms, serve, arrange, consult, advise against, prophesy, convene, mediate, host, inquire, regress, feast, effect, slander, visit, dine, control, progress, wage war, compete mother loose all possessions, daughter, retain, soften, try, warn, promise, daughter&apos;s, grandfathers, grow old, Lord&apos;s prayer, bury, daughter&apos;s, deprive, grandson, grieve, unlock, bring together, little son, despair</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>In this paper we use word embeddings to analyse the corpus of work of a Slovenian modernist writer Zofka Kveder. For a selection of central concepts of her work, we compute the closest FastText embeddings. The interpretation shows that many of the word relations can be interpreted in line with the findings in literary studies (e.g., woman is discussed in relation to men and Catholicism), or open novel paths for interpretation (e.g., woman in relation to European). On the other hand we also point at some problems resulting from the lemmatization and from using FastText embeddings.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>The computational methods have importantly enriched the studies of literary history in the last two decades. However, it can also be stated that the digital humanities depend in large part on literary studies <ref type="bibr" target="#b17">[19]</ref>. Since 1980s, the feminist theory and gender studies have played an important role in the analysis of literary texts. Moreover, the feminist literary history has made visible a large number of forgotten women authors through digitalization projects, such as The Orlando Project (https://www.artsrn.ualberta.ca/orlando/), The Women Writers Project (https://www.wwp.northeastern.edu/) and the Virtual Research Enviroment NEWW Women Writers in History (http://resources. huygens.knaw.nl/womenwriters/), which is focusing not only on collection of women's writings but especially on the collection of reception data. This illustrates the fruitful connection between feminist literary history and digital humanities. Projects that connect computational methods and feminist approaches with literary texts, enable better understanding of women's roles, especially in the literary life of earlier periods. Through the use of approaches such as distant reading, developed by F. <ref type="bibr">Moretti [15]</ref> in the Stanford Literary Lab, and by novel data visualization and data analysis methods applied to large textual corpora, new light can be shed on the work of forgotten or neglected women writers.</p><p>The writings of women have always been an integral part of the literary field although they encountered many obstacles caused by the male dominance <ref type="bibr" target="#b1">[2]</ref>. Feminist digital literary studies have impacted the field of digital humanities, but regrettably also here, their contributions have not always been recognised and acknowledged <ref type="bibr" target="#b18">[20]</ref>. Recent studies <ref type="bibr" target="#b15">[17,</ref><ref type="bibr" target="#b3">4]</ref> prove how feminist approaches within digital humanities enable new findings and more complex understanding of the literary history. Also in our work, we investigate a female author, and position our work in the field of digital humanities in combination with with the feminist literary history and gender studies. We focus on the work of Zofka Kveder who was one of the most important female writers in the multicultural space of Habsburg Empire. Zofka Kveder's work has been previously transformed to digitalized form and identified as a good source for digital humanities investigations <ref type="bibr" target="#b11">[12]</ref>, but the presented work is the first actual study analysing the work with digital humanities and natural language processing methods.</p><p>In this paper, we test how natural language processing methods can facilitate the investigation of a literary text, in particular how using word embeddings can reveal interesting relations between concepts in the work of Zofka Kveder. Word embeddings are vector representations of words, where each word is assigned a real-valued vector in a vector space. Due to the distributional hypothesis (see Zellig Harris <ref type="bibr" target="#b6">[7]</ref>), which states that words used in similar context will have similar meanings, embeddings can capture a certain degree of semantics by translating semantic relations in text into vector space relations. Word embeddings have been previously applied to literary text corpora. For example, Grayson et al. <ref type="bibr" target="#b5">[6]</ref> use them to compare the characters in 19th-century fiction by the authors Jane Austen, Charles Dickens, and Arthur Conan Doyle, while Wohlgenannt et al. <ref type="bibr" target="#b19">[21]</ref> test several word embeddings methods on the task of extracting a social network for literary texts, specifically on the series of fantasy novels. In our experiments, we train FastText embeddings <ref type="bibr" target="#b0">[1]</ref> on the corpus of works of Zofka Kveder and for selected concepts identify twenty nearest semantic neighbours according to the cosine similarity between the embedding vectors. These are then interpreted from a literary perspective, showing the potential of simple computational approaches to literary investigation, as well as some deficiencies of automated methods.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Corpus of Zofka Kveder</head><p>Zofka <ref type="bibr">Kveder (1878</ref><ref type="bibr">Kveder ( -1926) )</ref> was a multilingual and multicultural author who lived in three Central European capitals: Ljubljana, Prague and Zagreb and wrote in three languages: Slovenian, Croatian and German. Many of her works were published in newspapers and literary magazines in the Central and South-Eastern Europe in Czech, Slovak, Bulgarian, Serbian, Polish and Serbian language. She was also a cultural mediator and an ardent feminist. In most of her stories, a female character in various roles is in the foreground. She also touched on the concept of free love, which was an important issue at the time, and acknowledged the problems of forced marriages, women's urges, illegitimate motherhood, abortion, suicide, prostitution, early death at childbirth and many other themes from the lives of women. As many authors from the late 19th and early 20th century, Zofka Kveder depicted the incompatibility of women's emancipation with marriage and motherhood, and criticized the double moral of the middleclass society, which could not accept an unmarried woman to enjoy sex without feelings of guilt <ref type="bibr" target="#b13">[14]</ref>.</p><p>She also looked for concrete possibilities that would allow women to overcome their position as the Other, to change their relationship with their own bodies and to overcome feelings of guilt and uselessness, which, as she demonstrated, could lead to the disintegration of identity or even death. Having opted for stylistic pluralism, Kveder shaped her narrative using naturalist-realistic stylistic devices, while also feeling an affinity for stylistic procedures typical of New Romanticism. Her works have been translated to many European languages, including Bulgarian, English, German, Polish, Czech.</p><p>The corpus in our study contains all Kveder's Slovenian writings: two novels, two plays, two one-act plays, some dramatic scenes, a large number of short stories and tales and articles (literary reviews, feminist writings, etc.). The corpus was published in five volumes of the Collected works of Zofka Kveder -as printed and electronic books edited by K. Mihurko Poniž <ref type="bibr" target="#b9">[10,</ref><ref type="bibr" target="#b8">9]</ref>. In total, the corpus contains 1,217,517 tokens.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Selected concepts</head><p>For our analysis, we have selected 12 concepts (words):</p><formula xml:id="formula_0">ženska [woman], žena [wife/woman], moški [man], mož [husband/man], moderen [modern], duša [soul], pijanec [drunkard], ljubezen [love], otrok [child], emancipiran [emancipated], po- tovati [to travel], mati [mother].</formula><p>The concepts have been selected according to the prevailing topics in Kveder's works. The selection criterion was also Kveder's relationship to the concepts of the so-called Wiener Moderne (Viennese modern age). In the last decade of the 19th century and in the first decade of the 20th century in the majority of the European literatures different stylistic formations had been developed that are associated under the umbrella-term of the literary modernity (cf. Le Rider <ref type="bibr" target="#b10">[11]</ref>). We wanted to find out how Kveder positioned herself and her literary figures towards the concept of "modern". The authors of the Viennese modern age revisioned the concepts of gender roles, therefore the concepts of woman, man and love were chosen. The femaleness was in the majority of writings of the modern age still connected with the motherhood, therefore we were interested how Kveder connected different topics around this thematic field and consequently if we can conclude that she represented the conservative or progressive views on this problem (concepts of mother and child). The writers who belonged to the literary modernism (especially to the literary currents of symbolism and new romanticism) developed new views on the relationship between body and internal world -in Slovenian language the word soul was used at that time for the individual's psyche. In the feminist but also in the literary writings Kveder reflected upon women's emancipation (therefore we selected the concept of emancipation), which was also one of the central topics of the modern age. Since in the late 19th and early 20th century women traveled and explored new worlds more often than ever before, we were also interested, how the concepts of travel and migrations were coded in Kveder's texts (concept of travel). The concept of the drunkard was chosen because of Kveder's strong (autobiographical) interest in the topic of alcoholism.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Method</head><p>Even if usually word embeddings are trained on very large corpora, there are several studies that show that embeddings can also lead to useful results on smaller text collections. For example, a recent study by Diaz et al. <ref type="bibr" target="#b4">[5]</ref> suggests that leveraging embeddings trained on a large general corpus for modelling semantic relations on a specialized corpus is problematic due to strong language use variation. The study shows that embeddings trained only on a small topic specific corpus outperform non-topic specific general embeddings trained on very large general corpora for a somewhat specific task of query expansion in specialized text. Similar, FastText embeddings trained on small domain corpora were used in Pollak et al. <ref type="bibr" target="#b16">[18]</ref>. Since we are also dealing with a specialized historical text written by a single author known for its distinct writing style, we follow this line of research and train embeddings only on the author's text.</p><p>To counteract the negative affects of a small training corpus, we use Fast-Text embeddings <ref type="bibr" target="#b0">[1]</ref> as they are capable of capturing subword information and modeling of affixes and suffixes by representing a word as an average of its character n-grams of specific length. This allows the model to compensate for the lack of available semantic information due to small corpus size by leveraging also morphological similarity, which in many cases translates to semantic relatedness. Also, since Slovenian is a morphologically rich language, FastText embeddings are trained on a lemmatized corpus of Zofka Kveder's texts. Lemmatization was employed in order to prevent the scenario in which the majority of nearest semantic neighbours we want to obtain for each seed concept would be different forms of the input concept. Text was lemmatized using Lemmagen lemmatizer (Juršič et al., 2010). After that, for each of the chosen concepts we obtain 20 nearest semantic neighbours according to the cosine similarity between the embedding vectors. The selected sets are then used for literary investigation, presented in next section. In Table <ref type="table" target="#tab_0">1</ref> we present the results of experiments and in Table <ref type="table">2</ref> the English translations. In this section, we interpret selected concepts and relations from the perspective of literary studies, and reveal some limitations of the method. The concept woman (see table <ref type="table" target="#tab_0">rows 1 and 2</ref>) is placed in a relationship with man (man, husband, groom) and with Catholicism: Pharisee, cleric, clericalism. Words related to writer's family situation (family, widow, son) also appear. In connection with the concept woman, there is also the word milliner, which indicates a common women's profession in the modern age. The connection with the word European is also interesting because literary studies have not discussed it yet.</p><p>In the concept of emancipated, no word appears in direct connection with a woman. The concept of mother appears in connection with daughter, which is expected, given the frequent topic of the mother-daughter relationship in Kveder's writings. The concept mother connects with words that express emotions associated with suffering: to bury, to grieve, to despair. The literary studies on the topic of motherhood in Kveder's work connected Kveder's work with negative representations of motherhood <ref type="bibr" target="#b12">[13]</ref>.</p><p>A man is positioned in the relationship to a woman (wife, marriage) and in his family role (son, grandson, father, parent, kinship). Interesting is the connection with the word gender, which does not occur in resulting words for the the seed concept woman. The concept love is associated with fidelity/loyalty and emotions. Surprisingly, it does not connect with the body, although this was discussed as a frequent topic in Kveder's oeuvre in the field of literary and gender studies <ref type="bibr" target="#b7">[8]</ref>.</p><p>The concept of modern is associated with words such as rebellious, effective, original but also pathetic, imposing, ascetic and fascinating, which are the expected connections according to previous research of Kveder's writings. The connection with the Oriental is interesting and calls for further research. The relation with the concept soul (longing, heart, compassion) are unsurprising since this concept was often discussed in the modern age. However, the connection with the word atom is interesting and unexpected.</p><p>Subword information, which influences the similarity in FastText embeddings, generally improves the semantic modelling in morphologically rich languages and small corpora, as it helps with finding semantically similar concepts with similar morphological structure. For example, similar lexemes in žena [wife], ženin [groom], ženitev [marriage] are reflecting also semantic similarity of words. On the other hand, the results indicate that this feature can also lead to strong correlations between semantically (mainly) unrelated words containing many common characters (e.g., strong correlation between ženska (woman in Slovenian) and Ženeva (Slovenian name for the city in Switzerland)). An interesting example is also ljubezen [love], where we can find many etymologically related words with common lexemes, which can be easily interpreted as semantically related, such as samoljuben [self-centred], samoljubje [self-centeredness], preljubezniv [kindest], but on the other hand we have words with similar subword information that are not closely semantically related (e.g. poljuben [arbitrary]).</p><p>Another aspect related to string similarity is related to word types. We can notice that the part-of-speech (POS) of seed words and returned words is to a large extent preserved. In our case the majority of seed words are nouns, and corresponding returned embeddings return nearly only nouns, while when adjectives or verbs were selected as seeds, the related words also belong mainly to the respective POS categories. This is due to stronger association between words with the same characteristic word type affixes. For example, the seed adjective emancipiran has returned the set of semantically closest words talentiran, improviziran, absolviran, . . . , while the verb potovati is associated to verbs pestovati, službovati, etc. However, when a noun has a typical verb ending (e.g. mati ), there are many verbs in the returned list. Even if in general grouping of words in relation to their POS can be interpreted as a feature, we found that in our study (on a small corpus) this could be a source of confusion and that nouns provided more interesting and reliable results than verbs and adjectives. Nouns would also be our first choice for selection of seed words for future analysis of literary works in Slovenian language with FastText embeddings. Another thing that made the interpretation of results harder are mistakes in the lemmatization. For example, first name Joško was lemmatized into joške (Slovenian for boobs), showing that named entities should be marked or better handled in lemmatization.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">Conclusion and future work</head><p>In this paper, we investigate how natural language processing methods can facilitate the investigation of literary work. We have trained FastText embeddings on the corpus of the text of Zofka Kveder who was a principal Slovenian modernist, multilingual and multicultural author and a feminist. We have selected twelve concepts (words), according to the prevailing topics in Kveder's works and in relationship to the concepts of the so-called Wiener Moderne (Viennese modern age). For these words, we have computed twenty nearest semantic neighbours according to the cosine similarity between the embedding vectors. These were then interpreted from a literary perspective, showing the potential of simple computational approaches to literary investigation. From the point of view of the research of Zofka Kveder's work, the results of our research confirm the findings of literary history studies, and also open new research directions (e.g. the relation between woman and European, and modern and Oriental), which means that computer analysis provides interesting results that can be a useful approach in literary studies. In future work, we plan to enlarge the set of concepts, investigate how to remove noise by using improved lemmatization, testing the results without lemmatization and recognizing named entities. In addition to observing similarities between concepts, word embeddings allow for investigating analogies, which could be very interesting when applied to literary texts. Further, we plan to use the methods on larger literary corpora, which would allow for embeddings-based analysis of differences between authors, authors' gender (e.g. male vs. female authors in the same period) and diachronic based studies, which would allow to analyse how certain concepts evolved during time. Last but not least, we will analyse different word embeddings methods (w2v, GloVe,...) to analyse strengths and weaknesses of different methods, and compare the results to statistical word association measures, such as PMI <ref type="bibr" target="#b2">[3]</ref> and PPMI <ref type="bibr" target="#b14">[16]</ref>.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 .</head><label>1</label><figDesc>20 closest word embeddings for 12 seed concepts in Slovenian (asterisk * denotes words which were incorrectly lemmatized; in some cases the correct lemma is given in square brackets).</figDesc><table><row><cell>Seed word</cell><cell>20 closest semantic neighbours (Slovenian)</cell></row><row><cell></cell><cell>ženeva, evropejec, moški, mošnja, mera,</cell></row><row><cell>ženska</cell><cell>žena, moštvo, moš*, farizej, razmera, konkurent, klerikalec, klerikalizem, konkurenca, modistinja,</cell></row><row><cell></cell><cell>konkurenčen, ovsenjak, bošnjak, filolog, možitev</cell></row><row><cell></cell><cell>mož, ženin, ženitev, možen, sina,</cell></row><row><cell>žena</cell><cell>zmožen, uboštvo, možitev, rodbina, ženska, neomožen, možev, ženeva, moštvo, omož* [omožen],</cell></row><row><cell></cell><cell>družina, milodar, evropejec, naključje, vdova</cell></row><row><cell></cell><cell>joški, refoški, ženski, filološki, kmetiški,</cell></row><row><cell>moški</cell><cell>ženska, loški, spol, miški, bloški, psihološki, kakršen, fanuški*, ženstvo, emancipiran,</cell></row><row><cell></cell><cell>šiški [ Šiška], evropejec, neženski, privilegiran, moštvo</cell></row><row><cell></cell><cell>defravdant, vdova, žena, ženin, vnuk,</cell></row><row><cell>mož</cell><cell>sina, tesin*, nadlega, vnukinja, vdov, zbera, vnovič, ženitev, oče, sin,</cell></row><row><cell></cell><cell>sorodništvo, služinčad, guvernant, rojak, starš</cell></row><row><cell></cell><cell>monumentalen, operen, uporen, efekten, tipičen,</cell></row><row><cell>moderen</cell><cell>uvoden, orientalen, obnošen, vzhoden, mody*, originalen, buben, ošpičen, patetičen, okusen,</cell></row><row><cell></cell><cell>koncerten, teren, impozanten, asketičen, frapanten</cell></row><row><cell></cell><cell>hrepenenje, dušeč, srce, hrepeneč, verujoč,</cell></row><row><cell>duša</cell><cell>čuteč, vesoljstvo, grenkoba, atom, slepota, toplina, suša, tesnoba, mrzlota, napetost,</cell></row><row><cell></cell><cell>hrepeneti, omamljenost, gorkota, pesimizem, sočutje</cell></row><row><cell></cell><cell>jeruža, žganec* [žganje], dedec, indijanec, galicijanec,</cell></row><row><cell>pijanec</cell><cell>žganjar, sodovka, pivec, umazanec, košir, kregar, debevčev, grunt, podgradar, hribovka,</cell></row><row><cell></cell><cell>oštir, mešičkovka, pokora, matijec*, kristjan</cell></row><row><cell></cell><cell>zvestoba, vkljub, neljub, vdan, samoljuben,</cell></row><row><cell>ljubezen</cell><cell>ljuta, samoljubje, preljubezniv, preljub, zaljubljenost, zvest, verujoč, sočutje, tesnosrčen, dobrota,</cell></row><row><cell></cell><cell>lažniv, objetje, milina, poljuben, ponižujoč</cell></row><row><cell></cell><cell>sirotica, vnuk, vnukinja, angel, njivica,</cell></row><row><cell>otrok</cell><cell>odrasel, šib, lenoba, črviček, ubožica, angela, dekletec, mezinec, snah, sinček,</cell></row><row><cell></cell><cell>terezinka, rajnki, deca, sinka, sinek</cell></row><row><cell></cell><cell>talentiran, improviziran, absolviran, izoliran, privilegiran,</cell></row><row><cell>emancipiran</cell><cell>karakterizirati, cimperman, kompliciran, civiliziran, filolog, tradicionalen, respekt, naturalizem, rezerviran, konvencionalen,</cell></row><row><cell></cell><cell>tiranizirati, individualiteta, individualen, afektiran, manir</cell></row><row><cell></cell><cell>pestovati, službovati, izposlovati, posvetovati, odsvetovati,</cell></row><row><cell>potovati</cell><cell>prorokovati, zborovati, posredovati, gostovati, poizvedovati, nazadovati, vasovati, učinkovati, obrekovati, obiskovati,</cell></row><row><cell></cell><cell>obedovati, nadzorovati, napredovati, vojskovati, tekmovati</cell></row><row><cell></cell><cell>obubožati, hči, obdržati, omehčati, poizkušati,</cell></row><row><cell>mati</cell><cell>svarilo, obljubovati, hčerin, dedov, starati, očenati* [očenaš], pokopavati, hčerkin, prikrajšati, vnuk,</cell></row><row><cell></cell><cell>objokovati, odklepati, zbližati, sinek, obupati</cell></row></table></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgements</head><p>The work presented in this paper has been supported by European Unions Horizon 2020 research and innovation programme under grant agreement No. 825153, project EMBEDDIA (Cross-Lingual Embeddings for Less-Represented Languages in European News Media). The authors acknowledge also the financial support from the Slovenian Research Agency core research programme Knowledge Technologies (P2-0103) and the research programme Historical Interpretations of the 20th century (P6-0347). The authors also acknowledge the COST Action Distant Reading (Grant No. CA 16204).</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Enriching word vectors with subword information</title>
		<author>
			<persName><forename type="first">P</forename><surname>Bojanowski</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Grave</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Joulin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Mikolov</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Transactions of the Association for Computational Linguistics</title>
				<imprint>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="135" to="146" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">La domination masculine</title>
		<author>
			<persName><forename type="first">P</forename><surname>Bourdieu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Actes de la recherche en sciences sociales</title>
		<imprint>
			<biblScope unit="volume">84</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="2" to="31" />
			<date type="published" when="1990">1990</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Word association norms, mutual information, and lexicography</title>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">W</forename><surname>Church</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Hanks</surname></persName>
		</author>
		<ptr target="https://www.aclweb.org/anthology/J90-1003" />
	</analytic>
	<monogr>
		<title level="j">Computational Linguistics</title>
		<imprint>
			<biblScope unit="volume">16</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="22" to="29" />
			<date type="published" when="1990">1990</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Building the women in book history bibliography, or digital enumerative bibliography as preservation of feminist labor</title>
		<author>
			<persName><forename type="first">C</forename><surname>Coker</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Ozment</surname></persName>
		</author>
		<ptr target="http://digitalhumanities.org/dhq/vol/13/3/000428/000428.html" />
	</analytic>
	<monogr>
		<title level="j">Digital Humanities Quarterly</title>
		<imprint>
			<biblScope unit="volume">13</biblScope>
			<biblScope unit="issue">3</biblScope>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Query expansion with locally-trained word embeddings</title>
		<author>
			<persName><forename type="first">F</forename><surname>Diaz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Mitra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Craswell</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics</title>
		<title level="s">Long Papers</title>
		<meeting>the 54th Annual Meeting of the Association for Computational Linguistics<address><addrLine>Berlin, Germany</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2016-08">August 2016</date>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="367" to="377" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Novel2Vec: Characterising 19th century fiction via word embeddings</title>
		<author>
			<persName><forename type="first">S</forename><surname>Grayson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Mulvany</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Wade</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Meaney</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Greene</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of 24th Irish Conference on Artificial Intelligence and Cognitive Science (AICS&apos;16)</title>
				<meeting>24th Irish Conference on Artificial Intelligence and Cognitive Science (AICS&apos;16)<address><addrLine>Dublin, Ireland</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Distributional structure</title>
		<author>
			<persName><forename type="first">Z</forename><forename type="middle">S</forename><surname>Harris</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Word</title>
		<imprint>
			<biblScope unit="volume">10</biblScope>
			<biblScope unit="issue">23</biblScope>
			<biblScope unit="page" from="146" to="162" />
			<date type="published" when="1968">1968</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Problem identitete in travma telesa v prozi Zofke Kveder (A Problem of identity and the trauma of the body in the prose of Z. Kveder</title>
		<author>
			<persName><forename type="first">A</forename><surname>Jensterle-Doležal</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Recepce její tvorby ve 21. století</title>
				<editor>
			<persName><forename type="first">A</forename><surname>Jensterle-Doležal</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>Honzak Jahić</surname></persName>
		</editor>
		<meeting><address><addrLine>Praha</addrLine></address></meeting>
		<imprint>
			<publisher>Národní knihovna ČR-Slovanská knihovna</publisher>
			<date type="published" when="1878">1878-1926. 2008</date>
			<biblScope unit="page" from="81" to="99" />
		</imprint>
	</monogr>
	<note>Zofka Kvedrová</note>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">The collected works of Zofka Kveder. 1</title>
		<author>
			<persName><forename type="first">Z</forename><surname>Kveder</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">The collected works of Slovene poets and writers</title>
				<editor>
			<persName><forename type="first">K</forename><forename type="middle">M</forename><surname>Poniž</surname></persName>
		</editor>
		<meeting><address><addrLine>Maribor</addrLine></address></meeting>
		<imprint>
			<publisher>Litera</publisher>
			<date type="published" when="2006">2006</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">The collected works of Zofka Kveder. 2-5</title>
		<author>
			<persName><forename type="first">Z</forename><surname>Kveder</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">The collected works of Slovene poets and writers</title>
				<editor>
			<persName><forename type="first">K</forename><forename type="middle">M</forename><surname>Poniž</surname></persName>
		</editor>
		<meeting><address><addrLine>Ljubljana</addrLine></address></meeting>
		<imprint>
			<publisher>Založba ZRC</publisher>
			<date type="published" when="2014">2014-2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<monogr>
		<author>
			<persName><forename type="first">Le</forename><surname>Rider</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename></persName>
		</author>
		<title level="m">Modernity and Crises of Identity: Culture and Society in Fin-de-siècle Vienna</title>
				<meeting><address><addrLine>New York</addrLine></address></meeting>
		<imprint>
			<publisher>Continuum</publisher>
			<date type="published" when="1993">1993</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Digitalisation of the literary creativity : a case study of Zofka Kveder</title>
		<author>
			<persName><forename type="first">K</forename><surname>Mihurko Poniž</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Digital humanities and Slavic cultural heritage : international scientific conference</title>
				<editor>
			<persName><forename type="first">A</forename><surname>Vraneš</surname></persName>
		</editor>
		<meeting><address><addrLine>Belgrade; Belgrade</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2019-05-07">6-7 May 2019. 2019</date>
		</imprint>
		<respStmt>
			<orgName>University of Belgrade, Faculty of Philology</orgName>
		</respStmt>
	</monogr>
	<note>the book of summaries</note>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">The depictions of motherhood in the writings of Zofka Kveder and German literary women in the 19th century</title>
		<author>
			<persName><forename type="first">K</forename><surname>Mihurko Poniž</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ISCH 2010 Conference book</title>
				<editor>
			<persName><forename type="first">H</forename><surname>Kiiskinen</surname></persName>
		</editor>
		<imprint>
			<date type="published" when="2010-05">May 2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">The new women from the margins</title>
		<author>
			<persName><forename type="first">K</forename><surname>Mihurko Poniž</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Parente-Čapková</surname></persName>
		</author>
		<author>
			<persName><surname>Moretti</surname></persName>
		</author>
		<idno type="DOI">10.12697/il.2015.20.2.15</idno>
		<ptr target="https://doi.org/10.12697/il.2015.20.2.1515" />
	</analytic>
	<monogr>
		<title level="m">Distant reading</title>
				<meeting><address><addrLine>London and New York</addrLine></address></meeting>
		<imprint>
			<publisher>Verso</publisher>
			<date type="published" when="2013">Dec 2015. 2013</date>
			<biblScope unit="volume">20</biblScope>
			<biblScope unit="page">184</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Co-occurrence vectors from corpora vs. distance vectors from dictionaries</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Niwa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Nitta</surname></persName>
		</author>
		<ptr target="https://www.aclweb.org/anthology/C94-1049" />
	</analytic>
	<monogr>
		<title level="m">The 15th International Conference on Computational Linguistics</title>
				<imprint>
			<date type="published" when="1994">1994. 1994</date>
			<biblScope unit="volume">1</biblScope>
		</imprint>
	</monogr>
	<note>COLING</note>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">From recovery to restoration: Aphra behn and feminist bibliography</title>
		<author>
			<persName><forename type="first">K</forename><surname>Ozment</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Early Modern Women: An Interdisciplinary Journal</title>
		<imprint>
			<biblScope unit="volume">13</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="105" to="116" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Karst exploration: Extracting terms and definitions from karst domain corpus</title>
		<author>
			<persName><forename type="first">S</forename><surname>Pollak</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Repar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Martinc</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Podpečan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of eLex 2019</title>
				<meeting>eLex 2019<address><addrLine>Sintra, Portugal</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="934" to="956" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">The literary and/as the digital humanities</title>
		<author>
			<persName><forename type="first">J</forename><surname>Pressman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Swanstrom</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Digital Humanities Quarterly</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="issue">1</biblScope>
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Whence feminism? assessing feminist interventions in digital literary archives</title>
		<author>
			<persName><forename type="first">J</forename><surname>Wernimont</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Digital Humanities Quarterly</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="issue">1</biblScope>
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Extracting social networks from literary text with word embedding tools</title>
		<author>
			<persName><forename type="first">G</forename><surname>Wohlgenannt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Chernyak</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Ilvovsky</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Workshop on Language Technology Resources and Tools for Digital Humanities (LT4DH)</title>
				<meeting>the Workshop on Language Technology Resources and Tools for Digital Humanities (LT4DH)<address><addrLine>, Japan</addrLine></address></meeting>
		<imprint>
			<publisher>Osaka</publisher>
			<date type="published" when="2016-12">December 2016</date>
			<biblScope unit="page" from="18" to="25" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
