<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Tagging Narrative with Propp&apos;s Character Functions Using Large Language Models</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Pablo</forename><surname>Gervás</surname></persName>
							<email>pgervas@ucm.es</email>
						</author>
						<author>
							<persName><forename type="first">Gonzalo</forename><surname>Méndez</surname></persName>
							<email>gmendez@ucm.es</email>
						</author>
						<author>
							<affiliation key="aff0">
								<orgName type="department">Facultad de Informática</orgName>
								<orgName type="institution">Universidad Complutense de Madrid</orgName>
								<address>
									<postCode>28040</postCode>
									<settlement>Madrid</settlement>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff1">
								<orgName type="department">Instituto de Tecnología del Conocimiento</orgName>
								<orgName type="institution">Universidad Complutense de Madrid</orgName>
								<address>
									<postCode>28223</postCode>
									<settlement>Madrid</settlement>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Tagging Narrative with Propp&apos;s Character Functions Using Large Language Models</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">AF56D5C46CD3D5D37EB62BE18F44BCF3</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T19:31+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>automated tagging</term>
					<term>Propp&apos;s Morphology of the Folktale</term>
					<term>character functions</term>
					<term>large language models</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The character functions proposed by Vladimir Propp as abstraction of plot structure to understand Russian folk tales have been popular as means of analysing narrative. Several e orts have been carried out to annotate narratives in this way manually or to apply machine learning techniques over texts previously annotated with syntactic and semantic information. The present paper explores the feasibility of annotating directly from the bare text of synopses of the stories by relying on large language models.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>When trying to understand narrative, a very powerful tool used in the past has been the identi cation of abstractions of the meaning of the story that describe its plot in a way that is more generic than a summary of the story. A major tool to help in that aim was the Morphology of the Folktale proposed by Russian formalist Vladimir Propp <ref type="bibr" target="#b0">[1]</ref>. The set of character functions proposed by Propp to describe the structure of Russian folktales has been subsequently applied to analyse French fairy tales <ref type="bibr" target="#b1">[2]</ref>, fantasy plays <ref type="bibr" target="#b2">[3]</ref> and television science ction <ref type="bibr" target="#b3">[4]</ref>. Research e orts have focused on developing detailed annotation schemes <ref type="bibr" target="#b4">[5,</ref><ref type="bibr" target="#b5">6]</ref> or ontologies <ref type="bibr" target="#b6">[7,</ref><ref type="bibr" target="#b7">8,</ref><ref type="bibr" target="#b8">9]</ref> that can be used to annotate stories manually. More recent e orts consider the possibility of developing automatic processes of annotation at the level of Proppian functions that operate over a prior layer of syntactic and semantic annotation of the stories <ref type="bibr" target="#b9">[10,</ref><ref type="bibr" target="#b10">11,</ref><ref type="bibr" target="#b11">12,</ref><ref type="bibr" target="#b12">13,</ref><ref type="bibr" target="#b13">14]</ref>.</p><p>The present paper explores the possibility of relying on the few-shot learning capabilities of large language models to annotate stories with Proppian functions by operating directly over the text of synopses of the stories. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Receipt of agent</head><p>The hero acquires a magical agent</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Punishment</head><p>The villain receives punishment or consequences.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Transfer</head><p>The hero is hero directed or taken to new location</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Wedding</head><p>The hero marries or is rewarded Struggle</p><p>The hero and villain engage in direct conflict.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Previous Work</head><p>The work relevant to this paper that needs reviewing is the representation of narrative using Propp's character functions, existing e orts to annotate narrative with them, and solutions for annotating text with large language models.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">Narrative Structure and Propp's Character Functions</head><p>Based on a detailed study of 100 Russian folk tales from the anthology by Afanasiev, the Russian formalist Vladimir Propp proposed a set of abstractions of plot-relevant functions played by the characters of a tale, which he called character functions <ref type="bibr" target="#b0">[1]</ref>. These character functions represent basic contributions to the plot such as going on a journey, having a con ict with another character, being sent on a mission, or being rewarded. The character functions proposed by Propp are shown in Table <ref type="table" target="#tab_0">1</ref>.</p><p>Certain character functions are associated with particular types of character, such as the hero or the villain. Propp also postulated seven such types of character: the villain, the donor (provider), the helper, the princess (or sought-for person), the dispatcher, the hero or victim and the false hero.</p><p>Early uptake of Propp's character functions as means of annotating a corpus of stories relied on XML <ref type="bibr" target="#b4">[5]</ref>. Propp's character functions have been re-represented as ontologies de ned in Description Logics <ref type="bibr" target="#b6">[7]</ref>. The more relevant e ort at annotating with character function information the very stories for which Propp designed his character functions was carried out by Finlayson as part of his PhD thesis <ref type="bibr" target="#b11">[12]</ref>, and later extended by Yarlott and Finlayson <ref type="bibr" target="#b5">[6]</ref>. They propose ProppML, "an annotation scheme designed to capture all the components of a Proppian-style morphological analysis of narratives". Using this scheme, they annotated fteen Russian folk tales from the original corpus, which amounts to 18,862 words. The corpus was annotated separately by two highly trained annotators, and the results then merged with supervision by a third expert annotator.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">Automated Annotation of Propp Character Functions</head><p>Finlayson's e orts at deeply annotating a corpus of Russian folk tales at various levels-syntax, semantics, and narrative structure-made it possible to develop a machine-learning algorithm capable of extracting culturally-relevant plot patterns from sets of folktales. Such an algorithm takes as input a story annotated as an event timeline and it can predict for it abstract narrative structures such as Villainy, Struggle-Victory, and Reward <ref type="bibr" target="#b13">[14]</ref>. This an instance of a computational system learning a theory of narrative structure directly from a corpus of data.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3.">Text Annotation with Large Language Models</head><p>The recent advances demonstrated by applying Large Language Models (LLMs) to various Natural Language Processing (NLP) tasks <ref type="bibr" target="#b14">[15]</ref> has opened for exploration a number of avenues that appeared to be impossible just a few years ago. The main idea is that a model pre-trained on very large amounts of textual data can be queried in very simple ways to carry out speci c NLP tasks and perform surprisingly well either with no additional information (zero-shot learning) or with very simple queues that exemply how the task should be solved inserted in the prompt that poses the query (few-shot learning).</p><p>In particular, the task of annotating texts shows that tools such as ChatGPT show very high performance <ref type="bibr" target="#b15">[16]</ref>. ChatGPT is an AI language model developed as part of the GPT (Generative Pre-trained Transformer) series of models and speci cally designed for generating human-like text responses in conversational contexts. This type of approach has been applied to diverse tasks such as annotating legal text <ref type="bibr" target="#b16">[17]</ref> or propagandistic text <ref type="bibr" target="#b17">[18]</ref>.</p><p>Gemini <ref type="bibr" target="#b18">[19]</ref> (previously Bard <ref type="bibr" target="#b19">[20]</ref>) is a family of generative AI models that lets developers generate content handling both text and images as input. These models can generate text, handle multi-turn conversations and also zero, one, and few-shot tasks. These models have been trained with data existing up to the beginning of 2023. The Gemini Pro 1.0 version is available for use through an open web interface or through a commercial web API. It allows for similar queries as those presented to ChatGPT <ref type="bibr" target="#b20">[21]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Automated Tagging of Narrative Synopses with Proppian Character Functions</head><p>We want to explore the relative performance of large language models when used to annotate stories with Propp's character functions via zero-shot learning and few-shot learning approaches.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Zero-Shot Tagging Using ChatGPT</head><p>We are going to try to get ChatGPT to annotate the synopses of the tales that Propp includes as examples of annotations in his book. This should allow us to compare the annotation proposed by ChatGPT with the one indicated by Propp himself.</p><p>As a rst step, we ask ChatGPT if it knows about Propp's functions. This is the reply:</p><p>Vladimir Propp, a Russian folklorist and scholar, identi ed and analyzed common narrative elements in Russian folktales. One of his most notable contributions is his identi cation of "morphology of the folktale, " which includes what he termed "character functions". Propp identi ed 31 character functions that frequently appear in folktales. Here's a brief overview of Propp's character functions:</p><p>It then provides a list with 31 items, containing 7 character types and 24 character functions, some of which are Propp's and others are not; there are also some missing Propp's character functions:</p><p>• Character types: hero, villain, donor, helper, princess/prize, false hero, dispatcher It is interesting to note that ChatGPT, although knowledgeable in general in a surprising way, mixes up the types of character identi ed by Propp with the character functions, gets the number right, but misses out a number of functions because it includes the types of characters instead of those functions.</p><p>To test the ability of the system to annotate a story with the characters functions, the system is given the following prompt:</p><p>Can you annotate this tale using Propp's character functions? &lt;story&gt;</p><p>In response, the system outputs a text that describes its version of the annotation. For the following example (tale 104 as described in Propp's book):</p><p>A girl called Vasilisa has a miraculous doll. The tsar does not have a wife. Vasilisa goes to town. Vasilisa settles in an old woman's house. The old woman buys Vasilisa some ax. Vasilisa spins unusually ne yarn from the ax. The doll makes a loom for Vasilisa in one night . Vasilisa weaves unusually ne linen on the loom. The old woman takes the linen to the tsar. The tsar orders that shirts be sewn by the one who has spun and woven the linen. The girl sews the shirts from the linen. The tsar sends for Vasilisa. Vasilisa marries the tsar.</p><p>The assignment of character functions that Propp suggests is:</p><p>A girl called Vasilisa has a miraculous doll (Acquisition of Magical Agent). The tsar does not have a wife (Lack). Vasilisa goes to town. Vasilisa settles in an old woman's house (Unrecognised Arrival). The old woman buys Vasilisa some ax. Vasilisa spins unusually ne yarn from the ax. The doll makes a loom for Vasilisa in one night (Acquisition of Magical Agent). Vasilisa weaves unusually ne linen on the loom. The old woman takes the linen to the tsar. The tsar orders that shirts be sewn by the one who has spun and woven the linen (Di cult Task). The girl sews the shirts from the linen (Task Resolved). The tsar sends for Vasilisa. Vasilisa marries the tsar (Wedding).</p><p>In contrast, the annotation that ChatGPT suggests as a zero-shot solution is:</p><p>Lack: The tsar lacks a wife. Dispatcher: Vasilisa goes to town. Helper: Vasilisa settles in the old woman's house and receives aid from the miraculous doll. Task: The old woman buys flax for Vasilisa, who spins unusually fine yarn and weaves linen. Delivery: The old woman takes the linen to the tsar. Task: The tsar orders shirts to be sewn from the linen spun and woven by the same person. Performance of task: Vasilisa sews the shirts. Arrival: The tsar sends for Vasilisa. Final union: Vasilisa marries the tsar.</p><p>Zero-shot learning presents the di culty of having the system provide the result in a format of its own devising, which may di er signi cantly from the one being used as reference.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">Zero-Shot Tagging Using Gemini</head><p>Similarly to what we did using ChatGPT, we prompted Gemini for what it knew about Propp's character types and character functions. It provided a list of the seven character types and then another with 31 character functions:<ref type="foot" target="#foot_0">1</ref> initial situation, absentation, interdiction, violation, reconnaissance, delivery of information, trickery, complicity, mediation, beginning of the counteraction, hero's departure, rst function of the donor, tests of the hero, acquisition of magic agent, hero's reaction, spatial translocation, struggle, branding, victory, liquidation of the lack, return, pursuit, rescue, unrecognized arrival, false hero's claims. recognition, reward, di cult task, exposure of the false hero, trans guration, wedding. punishment, and resolution. This response does in essence match Propp's set of character functions, but it presents an issue that adds complexity to the task. The system is sometimes not using the exact names that Propp assigned to his character functions, but rather paraphrasing some of them. This happens for instance for the ones shown in Table <ref type="table" target="#tab_2">2</ref>.</p><p>Zero-shot learning approaches to this task will likely su er from having to identify an appropriate mapping between the set of tags used by the system and the set of tags being used as reference. This presents a particular problem because it makes it very di cult to carry out automated processing of the tagging.</p><p>To test the ability of the system to annotate a story with the characters functions, the system was given the same prompt as in the previous case. For the same example considered above (tale 104 as described in Propp's book) the annotation that Gemini suggests as a zero-shot solution is:</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Characters:</head><p>Hero: Vasilisa Dispatcher: None explicitly identified (opportunity arises naturally) Donor: Doll (magical helper) Prize/Princess: Tsar (implicitly)</p><p>It is interesting to note that, in the zero-shot learning, Gemini decides to provide information about how the content of the story relates to the Spheres of Action that it considers Propp to have de ned.</p><p>We tested this with ChatGPT and for the previous example (tale 104 as described in Propp's An important point to note here is that the system response does not actually annotate the text in the sense of respecting the text and assigning labels to speci c segments of it. Instead it is declaring what labels might be applicable to the text in some sense, and for each one it provides a description of the events in the text that justify the assignment. For instance, the 1st donor function label is justi ed with the sentence "The miraculous doll aids Vasilisa in spinning and weaving", which can be understood as a condensed description of the various tasks that Vasilisa carries out in the story, taken in the context that we have been told that the magical doll is helping her. The summary is actually incorrect, or maybe just not altogether precise, as the doll actually only provides the magical loom with which Vasilisa weaves, and it does not actually help with the spinning. This presents a problem in how this type of response should be annotated. First, because it is di cult to establish whether the system assignment of the label matches accurately Propp's assignment-the boundaries of the segment of text considered to justify the label are not de ned. Second, because the assignment appears to be based on an incorrect interpretation of the facts in the story.</p><p>In this particular case, the 1st donor function label is incorrectly assigned, because Propp de nes the rst action of the donor to be testing of the hero by the donor to decide whether he is worthy of gift or assistance. The system has also made an incorrect inference here. A similar case can be observed for the inference supporting the Delivery tag: the system justi es it by saying Vasilisa delivers the linen to the tsar, when the story says the old woman does.</p><p>A more re ned option might be to include in the prompt an example of how the annotations should be presented in combination with the input.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.4.">Few-Shot Tagging Using Gemini</head><p>We tested the same prompt-enriched with the explicit list of character functions to use-with GeminiPro and for the previous example (tale 104 as described in Propp's book), the annotation produced is: This tale is a simplified version and omits some of the more complex elements of Propp's morphology. The villain and some other functions (e.g., struggle, victory) are absent. The focus is on Vasilisa's exceptional skills, aided by the magical doll, which ultimately lead to her marriage with the Tsar.</p><p>It is interesting to see that Gemini has some di culty in identifying the di erence between character types and narrative functions as described in the prompt. This results in an annotation that mixes both kinds of labels indistincly.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Discussion</head><p>Regardless of the impressive nature at rst glance of the replies provided by these large language models, it is very easy for them to be incorrect to a surprising degree. This is especially true for queries that require responses involving detailed recall of de nitions. The examples reported above of how the di erent LLMs recall the actual list of Propp's character functions shows both the strengths and the weaknesses of these models.</p><p>The very nature of these systems implies that the text of any response is dynamically constructed on demand from a neural representation in an underlying latent space. For this reason, responses are very likely to be paraphrases rather than verbatim transcriptions. This has the advantage of allowing for a certain leeway in the interpretation of de nitions. These systems are more likely to identify instances of a particular character function even when they are phrased in a signi cantly di erent way from the original de nition. However, it also has the disadvantage of relying on potentially divergent understandings of speci c function. The set of de nitions provided for the character functions when prompted suggests that this is the case at least for ChatGPT. This suggests that the set of tags being used to annotate by ChatGPT in a zero-shot approach to the task is simply incorrect.</p><p>To provide some quantitative measure of the degree of divergence, the set of tags assigned to the sentences in the tale is compared with the assignment that Propp provides for it in his book. In appendices II and III, Propp provides examples of how the plots of a number of tales can be described in terms of his set of character functions. The tales so described are referred to only by number, and the assignment of character functions is given over a synopses of the Table <ref type="table">3</ref> Metrics for performance of di erent models and prompting strategies on assignment of character functions to tale synopses with respect to Propp's assignment. Metrics for di erent model + prompting strategy pairs is shown in a di erent column. For each model + prompting strategy pair, the first pair of rows in the table show: precision and recall against Propp's assignment, the second pair of rows shows percentage of labels that show some linguistically plausible relation to corresponding story content, and percentage of positions identified by Propp as involving a character function that the system o ers a label for, and the third pair of rows shows percentage of incorrect assignments of Propp's categories, and percentage of labels assigned not equivalent to any Proppian character function. The assignments provided of character functions to segments of the synopses required a detailed analysis. The responses by the various models include some kind of label together with a justi cation of which part of the story the label is assigned to, both labels and descriptions tend to be paraphrases rather than references to speci c character function names or speci c segments of the text. This makes simple measures like precision and recall against Propp's original assignment di cult to compute and even more di cult to interpret. To circumvent this problem, in all cases where the model suggested a label that could reasonably be considered to refer to the one assigned by Propp it has been considered valid. In the hope of making the analysis more informative, four additional metrics have been computed. First, the number of times that identi able references to Propp's character functions have been assigned to segments of the story that they do not apply to, which measures correct match between label and text. Second, the number of times that labels have been used that cannot in be related in some way to the character functions de ned by Propp, which measures familiarity with the set of character functions. Third, the number of times that labels assigned (whether identi able as Propp's character functions or not) can be seen to relate to the corresponding story segment by some valid inference, which measures the language understanding ability of the system. Finally, we report a metric to measure the number of times that the system has attempted to assign a label to segments of the text to which Propp had assigned one, which measure the ability of the system to identify plot-relevant events, regardless of whether it can tag them correctly. The results for these set of metrics are shown in Table <ref type="table">3</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>ChatGPT</head><p>The rst observation to make is that the rst four metrics show relatively low values. This indicates poor performance overall.</p><p>It is interesting to note that for several of the metrics the zero-shot approach performs better than the few-shot approach. This happens to ChatGPT for all metrics save the identi cation of plot-relevant segments, and to Gemini at least for the number of incorrect assignments (44 from 16). This is consistent with observed behaviour of LLMs, with quality of results dropping as the size of prompts increases.</p><p>The fact that ChatGPT zero-shot had a very low recall is not surprising given that it was not really using the full set of character functions. The zero-shot approach using Gemini had both higher precision and higher recall than the other options, which is consistent with the fact that it is not confused about the character functions. Nevertheless, the scores are surprisingly low.</p><p>The additional metrics reported provide some insight into the reasons for these low values. We notice very poor performance (33%) for ChatGPt at the identi cation of plot-relevant segments, and signi cant percentages of incorrect .assignments of existing labels (26% for ChatGPT and 16 for Gemini%). The number of labels assigned that do not correspond to Propp's set is surprisingly high (10%) for Gemini zero shot given that it supposedly knows the correct set, but it drops signi cantly (to 4%) when prompted with the set of names. In contrast, ChatGPT reacts badly to the enriched prompt, with percentages for both incorrect assignments and invented labels.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Conclusions</head><p>The experiments reported suggest that the use of large language models for automated tagging of stories with Propp character function may yield very di erent results depending on the level of familiarity of the particular model employed with the set of character functions in question. They also suggest that, if the model is not su ciently familiar with the de nitions of the character functions, a few-shot approach in which brief versions of the de nitions are provided may not be enough to compensate for this.</p><p>Further experiments are needed both in terms of exploration of a broader range of variations in the prompting and in the consideration of di erent models or versions of the models. Important aspects to consider in this sense are the di erences between public web interfaces and publicly available sources for the models, and di ences in model size.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head></head><label></label><figDesc>Annotating the Tale of Vasilisa using Propp's Framework Characters:Hero: Vasilisa Donor: The Doll (magical helper) Dispatcher (indirect): The Tsar's lack of wife creates an opportunity for Vasilisa. Narrative Functions: Absentation: The Tsar lacks a wife (lack). Hero's possession of a magical agent: Vasilisa has the miraculous doll. Departure: Vasilisa goes to town (possibly seeking a new life). Helper: The old woman provides Vasilisa with flax. Difficult Task (indirect): Vasilisa spins unusually fine yarn, implying exceptional skill. Receipt of Magical Agent (indirect): The doll creates a loom for Vasilisa, demonstrating its magical abilities. Difficult Task: Vasilisa weaves unusually fine linen, again showcasing her skill. Mediation: The old woman takes the linen to the Tsar, revealing Vasilisa's talent. Transfer: The Tsar sends for Vasilisa, bringing her closer to her potential reward. Marriage: Vasilisa marries the Tsar (fulfills the lack and resolves the story). Notes:</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>Character functions proposed byPropp.    </figDesc><table><row><cell>Absentation</cell><cell>The villain or a member of the</cell><cell>Branding</cell><cell>The hero is marked or identi-</cell></row><row><cell></cell><cell>family leaves the home</cell><cell></cell><cell>fied</cell></row><row><cell>Interdiction</cell><cell>A prohibition is imposed on</cell><cell>Victory</cell><cell>The hero defeats the villain.</cell></row><row><cell></cell><cell>the hero</cell><cell></cell><cell></cell></row><row><cell cols="2">Interdiction violation The interdiction is violated</cell><cell>Liquidation of lack</cell><cell>Villainy or lack resolved</cell></row><row><cell>Reconnaissance</cell><cell>The villain attempts to obtain</cell><cell>Return</cell><cell>The hero returns</cell></row><row><cell></cell><cell>information</cell><cell></cell><cell></cell></row><row><cell>Delivery</cell><cell>The villain gains information</cell><cell>Pursuit</cell><cell>The villain chases the hero</cell></row><row><cell></cell><cell>or an object</cell><cell></cell><cell></cell></row><row><cell>Trickery</cell><cell>The villain deceives the hero</cell><cell>Rescue from pursuit</cell><cell>The hero is saved from pur-</cell></row><row><cell></cell><cell>to gain an advantage</cell><cell></cell><cell>suit or danger</cell></row><row><cell>Complicity</cell><cell>The victim submits to decep-</cell><cell>Unrecognized arrival</cell><cell>The hero arrives unrecog-</cell></row><row><cell></cell><cell>tion.</cell><cell></cell><cell>nized</cell></row><row><cell>Villainy or lack</cell><cell>The villain causes harm or in-</cell><cell>Unfounded claims</cell><cell>A false hero claims hero sta-</cell></row><row><cell></cell><cell>jury or someone lacks some-</cell><cell></cell><cell>tus</cell></row><row><cell></cell><cell>thing they want</cell><cell></cell><cell></cell></row><row><cell>Mediation</cell><cell>Misfortune is made known</cell><cell>Di icult task</cell><cell>The hero is given a di icult</cell></row><row><cell></cell><cell></cell><cell></cell><cell>task</cell></row><row><cell>Counteraction</cell><cell>The hero reacts to the vil-</cell><cell>Solution</cell><cell>The task is accomplished or</cell></row><row><cell></cell><cell>lain's actions</cell><cell></cell><cell>resolved</cell></row><row><cell>Departure</cell><cell>The hero leaves home</cell><cell>Recognition</cell><cell>The hero is recognized or ac-</cell></row><row><cell></cell><cell></cell><cell></cell><cell>knowledged</cell></row><row><cell>1st donor function</cell><cell>The hero is tested by a poten-</cell><cell>Exposure</cell><cell>The villain's identity or de-</cell></row><row><cell></cell><cell>tial donor</cell><cell></cell><cell>ception is exposed</cell></row><row><cell>Hero's reaction</cell><cell>The hero reacts to the test</cell><cell>Transfiguration</cell><cell>The hero undergoes a trans-</cell></row><row><cell></cell><cell></cell><cell></cell><cell>formation.</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 2</head><label>2</label><figDesc>Mapping between Propp's character function names and Bard's paraphrases.</figDesc><table><row><cell cols="2">Propp's name Bard's paraphrase</cell></row><row><cell>Delivery</cell><cell>Delivery of information</cell></row><row><cell>Transfer</cell><cell>Spatial translocation</cell></row><row><cell>Exposure</cell><cell>Exposure of false hero</cell></row><row><cell>Solution</cell><cell>Resolution</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">Gemini listed the character functions grouped by sphere of action, following Propp's description of them.</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>This paper has been partially supported by the CANTOR project (PID2019-108927RB-I00) funded by the Spanish Ministry of Science and Innovation.</p></div>
			</div>


			<div type="funding">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The miraculous doll Helper: The old woman Princess: Vasilisa herself, who becomes the tsar's wife Dispatcher: The tsar False Hero: None explicitly mentioned Narrative Functions: Dispatcher: The tsar sends for Vasilisa. 1st donor function: The miraculous doll aids Vasilisa in spinning and weaving. Receipt of agent: The doll makes a loom for Vasilisa. Delivery: Vasilisa presents the fine linen to the tsar. Difficult task: The tsar orders shirts to be sewn from the fine linen. Solution: Vasilisa sews the shirts. Wedding: Vasilisa marries the tsar.</p></div>
			</div>


			<div type="funding">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>False Hero: None explicitly identified Narrative Functions: Sphere of Action 1: Preparation Initial Situation: Vasilisa has a miraculous doll and lives without a father figure. Lack: Tsar lacks a wife. (Note: not directly affecting Vasilisa, but creates initial context) Sphere of Action 2: Departure Beginning of the Counteraction: Vasilisa goes to town (implicitly seeking opportunity). Sphere of Action 3: The Hero's Actions Hero's Departure: Vasilisa leaves her home environment. First Function of the Donor: Old woman provides flax (indirect aid). Tests of the Hero: Task of spinning fine yarn (challenge). Acquisition of Magic Agent: Doll magically creates a loom for Vasilisa. Hero's Reaction: Uses the loom to weave fine linen. Spatial Translocation: Linen reaches the Tsar. Sphere of Action 4: Return Return: Tsar acknowledges the linen and initiates contact with Vasilisa. Return: Vasilisa arrives at the Tsar's court. Sphere of Action 5: Recognition and Reward Unrecognized Arrival: Vasilisa implicitly demonstrates her skills. Difficult Task: Sewing the shirts (implicit test of skill). Exposure of the False Hero: N/A Transfiguration: Vasilisa gains new status as Tsar's wife. Wedding: Marriage to the Tsar (reward). Sphere of Action 6: Resolution Resolution: Lack of the Tsar is addressed (gains a wife), and Vasilisa finds a new life.</p></div>
			</div>

			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>It is also identifying a number of inferences of implicit content that is not mentioned in the story but is indeed inferable from the text. Examples of this are the fact that Vasilisa has left home (to go to town) or that Vasilisa gains new status as a result of having married the Tsar.</p><p>Another relevant detail is that Gemini assigns 11 character functions to this story, whereas Propp only assigned 7. Propp himself had assigned to this tale only 7 character functions, though not the same as ChatGPT.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3.">Few-Shot Tagging Using ChatGPT</head><p>Finally, we wanted to test the performance of the system at a few-shot learning task. So we replicated the experiment above with an enriched prompt that explicitly included the list of character functions and their de nitions.</p><p>I need to annotate some tales using Propp's character types and narrative functions. These are the tags and definitions I need to use:</p><p>For the character types: + Hero: The role model of a story. + Villain: The negative character who creates struggles for the hero. + Donor: The character who provides some magical object to the hero. + Helper: The character who helps the hero. + Princess: The character who becomes a companion of the hero. + Dispatcher: The character who illustrates the need for the hero's quest and sends the hero off. + False Hero: The character who takes credit for the hero's actions For the narrative functions: + Absentation: The villain or a member of the family leaves the home + Branding: The hero is marked or identified + Interdiction: A prohibition is imposed on the hero + Victory: The hero defeats the villain + Interdiction violation: The interdiction is violated + Resolution of trigger: Villainy or lack resolved + Reconnaissance: The villain attempts to obtain information + Return: The hero returns + Delivery: The villain gains information or an object + Pursuit: The villain chases the hero + Trickery: The villain deceives the hero to gain an advantage + Rescue from pursuit: The hero is saved from pursuit or danger + Complicity: The victim submits to deception + Unrecognized arrival: The hero arrives unrecognized + Villainy or lack: The villain causes harm or injury or someone lacks something they want + Unfounded claims: A false hero claims hero status + Mediation: Misfortune is made known + Difficult task: The hero is given a difficult task + Counteraction: The hero reacts to the villain's actions + Solution: The task is accomplished or resolved + Departure: The hero leaves home + Recognition: The hero is recognized or acknowledged + 1st donor function: The hero is tested by a potential donor + Exposure: The villain's identity or deception is exposed + Hero's reaction: The hero reacts to the test + Transfiguration: The hero undergoes a transformation + Receipt of agent: The hero acquires a magical agent + Punishment: The villain receives punishment or consequences + Transfer: The hero is hero directed or taken to new location + Wedding: The hero marries or is rewarded + Struggle: The hero and villain engage in direct conflict Annotate this tale: &lt;story&gt;</p></div>			</div>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<title level="m" type="main">Morphology of the Folktale</title>
		<author>
			<persName><forename type="first">V</forename><surname>Propp</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1968">1968</date>
			<publisher>University of texas Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">D</forename><surname>Williams</surname></persName>
		</author>
		<title level="m">The Fairy Tales by Madamme d&apos;Aulnoy</title>
				<meeting><address><addrLine>Houston, Texas</addrLine></address></meeting>
		<imprint>
			<date type="published" when="1982">1982</date>
		</imprint>
		<respStmt>
			<orgName>Rice University</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Ph.D. thesis</note>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Mcgowan</surname></persName>
		</author>
		<title level="m">An analysis of the fantasy plays of James M. Barrie utilizing Vladimir Propp&apos;s structural model of the fairy tale</title>
				<imprint>
			<date type="published" when="1984">1984</date>
		</imprint>
		<respStmt>
			<orgName>New York University</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Ph.D. thesis</note>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">of myth/folktale and the television science ction genre</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">J</forename><surname>Latourette</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Doctor Who meets Vladimir Propp: A comparative narrative analysis</title>
				<imprint>
			<date type="published" when="1990">1990</date>
		</imprint>
		<respStmt>
			<orgName>Northwestern University</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Ph.D. thesis</note>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Proppian structural analysis and xml modeling</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">A</forename><surname>Malec</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of Computers, Literature and Philology</title>
				<meeting>of Computers, Literature and Philology<address><addrLine>CLiP</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2001">2001. 2001</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Proppml: A complete annotation scheme for proppian morphologies</title>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">V H</forename><surname>Yarlott</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Finlayson</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">7th Workshop on Computational Models of Narrative (CMN 2016</title>
		<title level="s">Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik</title>
		<imprint>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">A description logic ontology for fairy tale generation</title>
		<author>
			<persName><forename type="first">F</forename><surname>Peinado</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Gervás</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Díaz-Agudo</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Procs. of the Workshop on Language Resources for Linguistic Creativity, LREC</title>
				<meeting>s. of the Workshop on Language Resources for Linguistic Creativity, LREC</meeting>
		<imprint>
			<date type="published" when="2004">2004</date>
			<biblScope unit="volume">4</biblScope>
			<biblScope unit="page" from="56" to="61" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Multilingual ontologies for the representation and processing of folktales</title>
		<author>
			<persName><forename type="first">T</forename><surname>Declerck</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Aman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Banzer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Machá Ek</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Schäfer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Skachkova</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the LT4DHCSEE in conjunction with RANLP</title>
				<meeting>the LT4DHCSEE in conjunction with RANLP</meeting>
		<imprint>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="20" to="23" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<title level="m" type="main">An ontology-driven information system based on Vladimir Propp&apos;s morphology of the folktale for southern African folktales</title>
		<author>
			<persName><forename type="first">F</forename><surname>Pannach</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2019">2019</date>
			<pubPlace>Germany</pubPlace>
		</imprint>
		<respStmt>
			<orgName>University of Göttingen</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Master&apos;s thesis</note>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Deriving narrative morphologies via analogical story merging</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Finlayson</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">New Frontiers in Analogy Research</title>
		<imprint>
			<biblScope unit="page" from="127" to="136" />
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Natural language processing for storytelling and role playing: a training system based on the propp model</title>
		<author>
			<persName><forename type="first">M</forename><surname>Despontin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Sbattella</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Tedesco</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">ICERI2010 Proceedings, IATED</title>
				<imprint>
			<date type="published" when="2010">2010</date>
			<biblScope unit="page" from="5036" to="5045" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<title level="m" type="main">Learning narrative structure from annotated folktales</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">M A</forename><surname>Finlayson</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2012">2012</date>
		</imprint>
		<respStmt>
			<orgName>Massachusetts Institute of Technology</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Ph.D. thesis</note>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Inferring propp&apos;s functions from semantically annotated text</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Finlayson</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of American Folklore</title>
		<imprint>
			<biblScope unit="volume">129</biblScope>
			<biblScope unit="page" from="55" to="77" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Propplearner: Deeply annotating a corpus of russian folktales to enable the machine learning of a russian formalist theory</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Finlayson</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Digital Scholarship in the Humanities</title>
		<imprint>
			<biblScope unit="volume">32</biblScope>
			<biblScope unit="page" from="284" to="300" />
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Recent advances in natural language processing via large pre-trained language models: A survey</title>
		<author>
			<persName><forename type="first">B</forename><surname>Min</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Ross</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Sulem</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">P B</forename><surname>Veyseh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">H</forename><surname>Nguyen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Sainz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Agirre</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Heintz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Roth</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ACM Computing Surveys</title>
		<imprint>
			<biblScope unit="volume">56</biblScope>
			<biblScope unit="page" from="1" to="40" />
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<author>
			<persName><forename type="first">M</forename><surname>Alizadeh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Kubli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Samei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Dehghani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">D</forename><surname>Bermeo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Korobeynikova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Gilardi</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2307.02179</idno>
		<title level="m">Open-source large language models outperform crowd workers and approach chatgpt in text-annotation tasks</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">The unreasonable e ectiveness of large language models in zero-shot semantic annotation of legal texts</title>
		<author>
			<persName><forename type="first">J</forename><surname>Savelka</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">D</forename><surname>Ashley</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Frontiers in Arti cial Intelligence</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<monogr>
		<author>
			<persName><forename type="first">M</forename><surname>Hasanain</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Ahmed</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Alam</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2311.09812</idno>
		<title level="m">Large language models for propaganda span annotation</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b18">
	<monogr>
		<author>
			<persName><forename type="first">G</forename><surname>Team</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2312.11805</idno>
		<title level="m">Gemini: A family of highly capable multimodal models</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">An overview of bard: an early experiment with generative ai</title>
		<author>
			<persName><forename type="first">J</forename><surname>Manyika</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Hsiao</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">AI. Google Static Documents</title>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Factors to consider when selecting a large language model: A comparative analysis</title>
		<author>
			<persName><forename type="first">S</forename><surname>Mandvikar</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">International Journal of Intelligent Automation and Computing</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="page" from="37" to="40" />
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
