Benchmarking the Semantics of Taste: Towards the Automatic Extraction of Gustatory Language

Benchmarking the Semantics of Taste: Towards the Automatic Extraction of Gustatory Language TeresaPaccosi tpaccosi@fbk.eu Fondazione Bruno Kessler

Via Sommarive, 18 Trento

Università degli studi di Trento

Via Calepina 14 Rovereto

DHLab / KNAW Humanities Cluster

Oudezijds Achterburgwal 185, 1012 DK Amsterdam The Netherlands

SaraTonelli satonelli@fbk.eu Fondazione Bruno Kessler

Via Sommarive, 18 Trento

Dec 04 -06 2024 Pisa Italy

Benchmarking the Semantics of Taste: Towards the Automatic Extraction of Gustatory Language 1613-0073 BED1AE1D38FE01D4EE107BAAD2111230 GROBID - A machine learning software for extracting information from scholarly documents Sensory semantics gustatory language information extraction digital humanities

In this paper, we present a benchmark containing texts manually annotated with gustatory semantic information. We employ a FrameNet-like approach previously tested to address olfactory language, which we adapt to capture gustatory events. We then propose an exploration of the data in the benchmark to show the possible insights brought by this type of approach, addressing the investigation of emotional valence in text genres. Eventually, we present a supervised system trained with the taste benchmark for the extraction of gustatory information from historical and contemporary texts.

Introduction

Despite the central role of nutrition in our lives, taste has been often classified as an inferior sense in the Western philosophical tradition. This downplayed role is reflected in the vocabulary used to describe the gustatory experience, which, together with smell, is characterized by a scarcity of domain-specific terms [1]. The difficulty in capturing the semantics of taste could help explain why there are few works in the fields of Natural Language Processing (NLP) and Digital Humanities (DH) that deal with this sense and, in particular, the language used to describe its experience. While there has been renewed interest in the automatic extraction of nutrients and ingredients from texts for health and medicinal purpose [2], less attention has been devoted to the development of tools and models focused on capturing the semantics of sensory experiences, especially in a diachronic fashion.

In this paper, we present an English benchmark for the study of gustatory language and a supervised system for the automatic extraction of taste-related events in English, which we trained using this benchmark. The benchmark was built to be a counterpart to the olfactory one presented in [3], with the idea of making the study of the language of these two senses comparable. The system is designed as a means to study the language used to describe the experience of tasting from both synchronic and diachronic perspectives. The selected formal representation for the semantics of taste is based on Frame Semantics [4], and the system is trained to identify the lexical units and the possible semantic roles contributing to the construction of a gustatory event. We present the results of the experiments and an exploration of the benchmark data, aiming to demonstrate the potential of frame-based analysis for sensory studies.

Related Work

In recent years, there has been a growing interest within the NLP community in developing resources designed to capture the sensory content of language [5]. In particular, in the framework of the three-year European Project "Odeuropa" 1 aimed at preserving intangible cultural heritage, several works have focused on analyzing smell descriptions [6] and extracting olfactory information from texts. For instance, [3] created a manually annotated benchmark with smell events, which has been subsequently used to train a system for olfactory information extraction [7,8]. The benchmark focuses on the language used to describe olfactory experiences and covers a period of four centuries (1600-1900), making it useful for historical research. An extension in this direction is SENSE-LM, a system for extracting sensory information from texts, which shows that combining language models with lexical resource-based approaches yields better results in extracting sensory references from texts compared to systems that do not integrate these two components [9]. The authors were the first to combine sensorimotor representations with the textual features of language models for the task of sensory information extraction in text documents. Even if they propose the system for all the 5 senses, they only tested it on olfactory

Frame Element

Definition

Taste_Source

The food items that are ingested Quality Any property used to describe the taste (usually adjectives) Taste_Carrier

Anything that can contain the taste source Taster

The person/animal who ingests the food Evoked_Taste

The taste that is evoked but it is not present (e.g., it tastes like onions) Location

The place in which the food is tasted Taste_Modifier An ingredient that can modify the perception of the taste of a taste source Circumstances

The condition or circumstance in which the taste event occurs Effect Any effect provoked by the tasting experience

Table 1

List of Gustatory Frame Elements and auditory language, using respectively the benchmark of [3] and an artificial dataset they generated with GPT-4 [10]. Most existing work on food representation in the field of NLP focuses on health-related applications. A notable work with a linguistic focus is [2], where the authors concentrate on identifying noun-compound headnouns for developing conversational agents in the e-commerce domain. They propose a supervised approach based on a neural sequence-to-sequence model to identify the most informative token in Italian food compound-nouns, obtaining promising results despite the complexity of the task. Taste has been also addressed from a diachronic point of view in [11], in which the author reconstructs the evolution of food language focusing on the history of some dishes and ingredients across continents using computational linguistic tools. Several studies have developed named-entity recognition (NER) models to automatically extract food entities for medicinal purposes and food science applications [12,13], creating domainspecific corpora by sourcing data from culinary websites and online recipe books [14,15].

Benchmark for Taste

The training data we use for the models in this paper is a benchmark created according to the annotation guidelines presented in [16]. The formalization adopted to annotate the benchmark is inspired by Frame Semantics [4] and their implementation through the FrameNet annotation project [17]. In FrameNet, events and situations are constructed as frames, structures that represent the knowledge necessary to understand the meaning of words. Frames include two main components, namely lexical units, domain-specific words or expression that trigger the frame, and frame elements, domain-specific semantic roles usually attached as dependents to the lexical unit. In our case, taste events are captured through a so-called Gustatory frame, which is triggered in a document by Taste_Words (i.e., domain-specific lexical units). Each lexical unit is annotated in the bench-mark together with the frame elements associated with it, which the taste extraction system should then identify automatically. For instance, in the sentence "[Slimy milk]𝑇 𝑎𝑠𝑡𝑒_𝑆𝑜𝑢𝑟𝑐𝑒 has an [unpleasant] 𝑄𝑢𝑎𝑙𝑖𝑡𝑦 taste", the system has to identify the Taste_Word ('taste'), and then the possible frame elements (in this case, Taste_Source and Quality). A list of the possible frame elements and their definition is provided in In Table 2 we report the statistics of the annotated benchmark (note that in [16] we presented only a preliminary version of the benchmark containing around 1,400 Taste_Words). The most frequent frame element is the Taste_Source, followed by Quality and Taste_Modifier, which represent the core frame elements, while the rest of the frame elements are much sparser. Even if the distribution of the frame elements is not balanced, the system is trained to extract the taste words and all the 9 frame elements. Two expert linguists, trained on [16]'s guidelines, annotated three documents from 1670, 1720, and 1920 to assess Inter Annotator Agreement (IAA). The Krippendorff's alpha score [18] at span level was 0.70, indicating a moderate agreement.

Exploration of olfactory and gustatory benchmarks

It has been observed that words used to describe olfactory and gustatory experiences tend to appear more frequently in emotionally charged contexts and carry a stronger evaluative content compared to words related to other senses [19]. By 'evaluative content', we refer in this paper to the concept of 'emotional valence', which is defined as "the pleasantness of a word in terms of positive and negative meaning" ([1], p. 201). We therefore conducted an exploration of the gustatory benchmark to investigate the positive and negative connotations of gustatory events across different text genres. We perform the same analysis for olfactory events, using the olfactory benchmark of [3] in order to compare the outcome for the two senses. To perform this analysis, we first divide Taste_Words and Smell_Words into positive and negative.

To this purpose, we use the categories proposed in the Historical Thesaurus of English of Savouriness and Unsavouriness for Taste and Fragrant/Fragrance and Stench for Smell 10 . This thesaurus contains almost every recorded word in English from medieval times to the present day, ordered into detailed hierarchies of meaning. In the Thesaurus, every category of the hierarchy is divided per part of speech (PoS). For our analysis, we manually selected all the nouns, adjectives and adverbs used in the period we cover with our documents, namely from 16 th century to 20 th century. We then assigned the words labeled as Taste_Words and Smell_Words in the documents to one of the two categories (positive or negative) and calculated the normalized frequency of each category across different text genres. As reported in Section 3, the genres represented in the gustatory bench- We display the output of this analyses in Fig. 1 (for taste words) and Fig. 2 (for smell words), aimed at showing which emotional valence prevails in each genre for the two senses. We observe that two genres exhibit opposite tendencies: medicine/botany shows a more negative orientation in the smell benchmark and a more positive one in the taste benchmark, whereas travel/ethnography is more positive concerning smell and more negative for taste (see Fig. 1 and Fig. 2, where the light blue refers to negative valencies and the dark blue to positive ones). We then analyzed the most frequent smell / taste sources in the two selected genres to motivate why they exhibit such difference in emotional valence. We notice that smell sources in medicine/botany tend to be common to hospital and disease-related domains having words such as 'urine' and 'fetid bronchitis', while taste sources more easily belong to the realm of common food, with words such as 'almonds' and 'apples'. For what concerns travel/ethnography instead, among the most frequently described taste sources there are exotic and rare foods such as 'coconut' and 'plantain', likely resulting unpleasant to the palates of foreign travelers. Smell sources tend to refer instead to plants, like 'flowers' or 'roots', hence usually pleasant or neutral to the noses of the writers. This analysis of categories and sources' distribution in the genres underlines the importance of a frame-base analysis for understanding and comparing sensory descriptions, in particular their emotional valence.

System for Gustatory Information Extraction

The benchmark introduced in the previous sections is used to train a classifier whose goal is to detect gustatory information in English texts. The system is based on multi-task learning (Section 5.1), and is then compared with a "single task" classifier, which we consider our baseline (Section 5.2).

Multitask configuration

To build our system for gustatory information extraction, we adopted a multitask learning approach [20,21], a configuration successfully tested for olfactory information extraction in [7,8]. This approach treats the classification of lexical units and each frame element as different tasks. Additionally, we explored a "single task" classification approach, where both lexical units and frame elements are classified within a multiclass token classification task. The results of these experiments served as a baseline for evaluating the effectiveness of the multitask approach. In both configurations, we employed a transformer-based model fine-tuned for a token classification task [22]. This methodology has proved effective across various NLP tasks, including olfactory information extraction [8] and the extraction of food-related ingredients [13]. We experiment the two configurations with monolingual (English) and multilingual versions of BERT and RoBERTa and with an English historical model, MacBERTh. The models we use are listed below:

-English BERT: bert-base-cased11 [23] -Multilingual BERT (mBERT): bert-base-multilingualcased 12 [23] -English historical model: MacBERTh13 [24] -English RoBERTa: roberta-base14 [25] -Multilingual RoBERTa (RoBERTa xlm): xlmroberta-large15 [26] We fine-tuned each model using the same data, maintaining identical training, validation, and test splits, and evaluated them using 5-fold cross-validation. Each fold contained 80% of the lexical units and their related frame elements for training, 10% for validation (dev), and 10% for testing. These splits were consistent across all configurations and not entirely random. This configuration ensured a balanced distribution of frame elements and comparability in every run. For labeling the data, we adopted the IOB (Inside-Outside-Beginning) labeling format, as used in [7,8]. This method facilitates a comprehensive analysis of sentences and lexical expressions by

Table 3

Results (F1) of the classifiers on the lexical unit (T_Word) and 9 frame elements with single (italics) and multitask configurations.

The results are the average of the f1 results of each label across the 5 folds.

labeling each token with either Inside, Outside, or Beginning labels as appropriate. To fine-tune the models, we used MaChAmp [27], a specialized toolkit designed for multi-task fine-tuning scenarios. In this approach, each label classification is treated as a distinct task. This setup ensures that simpler tasks, such as recognizing lexical units, contribute as auxiliary tasks to more complex label classifications like "Circumstances" or "Effect" which include entire sentences rather than individual words. MaChAmp enables the choice of different parameters, such as loss weight, epochs and batch size, and we tested different configurations 16 . The results in Table 3 for the multitask approach share the configuration which yielded the best results. The configuration is the same for all the models and it is reported in Appendix A.

"Single Task" configuration as Baseline

Similar to the system for smell information extraction presented in [8], we designed our baseline approach as a single-task multiclass classification, where the model assigns one of 21 possible labels to each token. These labels include 20 representing either "begin" or "inside" of each lexical unit and frame element, and 1 label representing "outside". As we did for the multitask approach, each model is fine-tuned with a token classification head on top 17 . During the training of each model, a hyperparameter search was conducted on the first fold of our data. The search space included learning rates 16,32], and training epochs up to 20, with warmup applied for 10% of the training steps. After determining the optimal hyperparameters for each model, it is fine-tuned 16 Loss weight with different combinations over the labels [1, 0.75], epochs [10,20,30], and batch size [16,32] 17 https://huggingface.co/docs/transformers/tasks/token_ classification five times, each time with a different data fold, and the average scores were computed. We present the results of for the single task approach of each model in italics in Table 3. We observe high performance variations across different frame elements, with the best results obtained for "Quality" and "Taste_Modifier". This is probably due to the fact that their syntactic realization tends to be consistent in the different documents, with "Quality" mainly expressed by adjectives and "Taste_Modifier" by prepositional phrases introduced by with. On the contrary, classification results for "Taste_Source" are quite low despite it being the most frequent FE in the training set, probably because they can be expressed by many different role fillers and syntactic constructions. Upon reviewing the test and prediction results, we find that most mistakes concerning Taste_Source are due to a wrong span extent, for instance the system predicts "the taste of [lollilop]" while the gold standard is "the taste [of lollipop]". This issue is also likely reflected in the inter-annotator agreement (IAA) of the benchmark. In the future, we will consider alternative ways to evaluate text spans beside exact match, for instance by computing the cosine similarity between gold instances and system predictions. Overall, MacBERTh is the best model for Taste_Word detection, but the different FEs are mostly detected with higher accuracy using RoBERTa xlm. For this reason, we plan to adopt this model for our future research on gustatory language.

[1𝑒 − 5, 2𝑒 − 5, 3𝑒 − 5, 4𝑒 − 5, 5𝑒 − 5], batch sizes [8,

Conclusions and Future Direction

In this paper, we presented a benchmark for gustatory events containing manually annotated taste-related information, built as a counterpart to the one proposed in [3].

The benchmark is constructed with the same approach adopting a frame-based methodological framework to analyze sensory language. We emphasized the importance of frame-based analysis to capture sensory events by exploring the characterization of positive and negative valence in the benchmarks through the analysis of taste and smell words and sources. The analysis based on frames seems to bring relevant insights into capturing sensory valence from different perspectives, likely supporting the suitability of this approach to deal with humanistic inquiries. We then presented a supervised system to automatically extract taste-related frames, trained on this benchmark. This preliminary exploration and the results obtained with our experiments seem promising for future exploration with automatically extracted data. Indeed, the limited data of the benchmark are not enough to draw relevant conclusions, and for this reason we plan to use our system to extract more data and conduct largescale analyses of the evolution of sensory information over time.

Appendices

A. Lexical Units and Frame Elements

In Table 4, we display the list of lexical units or taste words presented in [16].

B. Hyperparameter Values

The hyperparameter setting for all our models is presented in Table 5. The setting is the default MaChAmp's hyperparameter values, with the addition of loss weights at 1, and 20 epochs of training.

mark are: Literature, Science & Philosophy, Household & Recipes, Travel & Ethnography, Medicine & Botany. In the olfactory benchmark presented in [3], there are instead 10 different genres: Household & Recipes, Law & Regulations, Literature, Medicine & Botany, Perfumes & Fashion, Public health, Religion, Science & Philosophy, Theatre, Travel & Ethnography.

Figure 1 :1Figure 1: Savoury (dark blue) and Unsavoury (light blue) frequencies of taste words in genres

Figure 2 :2Figure 2: Fragrant/Fragrance (dark blue) and Stench (light blue) frequencies of smell words in genres

Table 11. The documents annotated in the benchmark cover 5 different domains or genres, almost evenly distributed with 3/4 documents for century in every domain for a total of 72 documents. The genres are: Literature, Science & Philosophy, Household & Recipes, Travel & Ethnography, and Medicine & Botany.To select the documents we automatically search for texts presenting a greater density of lexical units (taste words) 2 spanning through several English corpora and tasterelated websites. The corpora form which we extract the documents we annotated are: (1) Early English Books Online (EEBO)3 , a collection of documents published between 1475 and 1700 covering different domains such as literature, philosophy, politics, religion, geography, history, politics, and mathematics; (2) Project Gutenberg4 , a digitized archive of cultural works, containing different repositories, mainly in the literary domain; (3) medievalcookery.com 5 a list of texts freely available online relating to medieval food and ancient cooking recipes; (4) foodsofengland.co.uk6 an online library which holds the complete texts of several cook books from 1390 to 1974;(5) Wikisource 7 , an online digital library of free-content textual sources managed by the Wikimedia Foundation; (6) British Library 8 , a collection of 65,227 digitised volumes from the 16th to the 19th Century; (7) London Pulse Frame Elements (FEs)

1500 1600 1700 1800 1900 Overall

Taste_Words440241750014988035,648Taste_Source372162737510815994,393Quality19714952558814891,732Taste_Modifier13514266154781,357Taster6517385185100638Evoked_Taste20127315316247Location1144122416116Taste_Carrier9389261298Circumstances192063822882656Effect2456323431174

Table 22Statistics of the Taste BenchmarkMedical Reports 9 , a collection of 5800 Medical Officer ofHealth reports from the Greater London area from 1848to 1972.

Table 44The limited number of documents is likely a contributing factor to the significant discrepancies in accuracy among the different frame elements, necessitating more instances to enable a good generalization. Future steps should involve increasing the number of documents and providing less sparse annotations, aiming for better temporal balance. The focus should be on annotating frame elements with lower scores and fewer instances in the benchmark, such as Taste_Carrier and Location. Additionally, alternative metrics and techniques should be employed to capture and explain performance variations across different models. As a further comparison, we plan also to assess the performance of general-purpose frame semantic parsers like LOME[28] on our benchmark. Nouns Acidity, aftertaste, aroma, bitterness, dainty, delicacy, disgust, distaste, flavor, flavour, flavorful, flavourful, flavoring, flavouring, flavorsome, flavoursome, flavorous, flavourous, gustation, insipidity, mistaste, over-eating, palatableness, piquancy, pungency, rancidity, relish, rellish (obsolete), saltness, sapidity, sapor, savor, savoriness, savour, sharpness, smack, smatch, sourness, sowreness (archaic form of sourness), sweetness, tang, tarage, tartness, tast (obsolete), taste, tastelessness, tasting, unsavoriness, unsavouriness Adjectives Acid, acidic, appetizing, appetizing, bitter, bitter-sweet, bland, dainty, delectable, delicious, delightsom(e), disgusting, flavorless, flavorful, flavourful, flavourless, flavoursome, gamy, indigestible, insipid, juicy, mellow, palatable, piquant, pungent, racy, rancid, rank, salt/salty, sapid, savory, savoury, savourly, seasoned, sharp, sour, soured, sower (archaic form of sour), spicy, stale, sweet, tangy, tart, tasteless, tasty, toothsome, unpalatable, unsavor, unsavour, unsavoury, unsavory, unseasoned, unsweet, unsweetened, wearish, wersh, yummy Verbs Drink (up), drinking (up), drank (up), drunk (up), eat (up), ate (up), eateth (archaic), eaten (up), eating (up), distaste, distasting, distasted, mistaste, mistasted, mistasting, partake, partaking, partook, partaken, relish, relisheth (archaic), relishing, relished, season, seasoning, seasoned, smack, smacking, smacked, smatch (obsolete), sweeten, sweetening, sweetened, taste, tasting, tasted Adverbs Sweetly, sourly, tastefully, bitterly, tastingly, unsavourily, unsavourly, insipidly, savourously, savourily, flavourfully Lexical units for TastePart of Speech Lexical Units

Table 55Hyperparameter value used for the experiments which yield the best resultsThe list of lexical units is provided in Appendix Ahttps://textcreationpartnership.org/tcp-texts/ eebo-tcp-early-english-books-online/https://www.gutenberg.org/https://www.medievalcookery.com/etexts.html?Englandhttp://www.foodsofengland.co.uk/references.htmhttps://en.wikisource.org/wiki/Main_Pagehttps://data.bl.uk/digbks/https://wellcomelibrary.org/moh/about-the-reports/ about-the-medical-officer-of-health-reports/In the categories at https://ht.ac.uk/category/: The world>physical sensation>Taste/Flavour>Savouriness&Unsavouriness; The world>physical sensation>Smell/Odour>Fagrant/Fragrance&Stenchhttps://huggingface.co/google-bert/bert-base-casedhttps://huggingface.co/google-bert/bert-base-multilingual-casedhttps://huggingface.co/emanjavacas/MacBERThhttps://huggingface.co/FacebookAI/roberta-basehttps://huggingface.co/FacebookAI/xlm-roberta-base

Aknowledgments

Funded by the European Union under grant agreement 101088548 -TRIFECTA. Views and opinions expressed are however those of the author only and do not necessarily reflect those of the European Union or the European Research Council. Neither the European Union nor the granting authority can be held responsible for them. The authors would also like to thank Marieke Van Erp, the head of the project, for her support.

Sensory linguistics: Language, perception and metaphor BWinter 2019 John Benjamins Publishing Company 20 What's in a food name: Knowledge induction from gazetteers of food main ingredient BMagnini VBalaraman SMagnolini MGuerini FBKessler TPovo Proceedings of CLiC-it 2018 CLiC-it 2018 2018 241 A multilingual benchmark to capture olfactory situations over time SMenini TPaccosi STonelli MVan Erp ILeemans PLisena RTroncy WTullett AHürriyetoğlu GDijkstra Proceedings of the 3rd Workshop on Computational Approaches to Historical Language Change the 3rd Workshop on Computational Approaches to Historical Language Change 2022 Frame semantics and the nature of language CJFillmore Annals of the New York Academy of Sciences 280 1976 A computational approach to generate a sensorial lexicon SSTekiroğlu GÖzbal CStrapparava 10.3115/v1/W14-4716 Proceedings of the 4th Workshop on Cognitive Aspects of the Lexicon (CogALex), Association for Computational Linguistics and the 4th Workshop on Cognitive Aspects of the Lexicon (CogALex), Association for Computational Linguistics and

Dublin City University; Dublin, Ireland

2014 Towards olfactory information extraction from text: A case study on detecting smell experiences in novels RBrate PGroth MVan Erp Proceedings of the The 4th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, International Committee on Computational Linguistics the The 4th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, International Committee on Computational Linguistics 2020 Scent mining: Extracting olfactory events, smell sources and qualities SMenini TPaccosi SSTekiroğlu STonelli Proceedings of the 7th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature the 7th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature 2023 Semantic frame extraction in multilingual olfactory events SMenini Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024. 2024 Sense-lm: A synergy between a language model and sensorimotor representations for auditory and olfactory information extraction CBoscher CLargeron VEglin EEgyed-Zsigmond Findings of the Association for Computational Linguistics: EACL 2024 2024 OAi arXiv:2303.08774 Gpt-4 technical report 2023 arXiv preprint The language of food : a linguist reads the menu / Dan Jurafsky DJurafsky W.W. Norton Company New York first edition. ed Butter: Bidirectional lstm for food named-entity recognition GCenikj GPopovski RStojanov BKSeljak TEftimov 2020 A fine-tuned bidirectional encoder representations from transformers model for food named-entity recognition: Algorithm development and validation RStojanov GPopovski GCenikj BKoroušić TSeljak Eftimov Journal of Medical Internet Research 23 e28229 2021 Foodbase corpus: a new resource of annotated food entities GPopovski BKSeljak TEftimov Database 121 2019. 2019 AWróblewska AKaliska MPawłowski DWiśniewski WSosnowski AŁawrynowicz arXiv:2204.07775 Tasteset-recipe dataset and food entities recognition benchmark 2022 arXiv preprint A new annotation scheme for the semantics of taste TPaccosi STonelli Proceedings of the 20th Joint ACL-ISO Workshop on Interoperable Semantic Annotation@ LREC-COLING 2024 the 20th Joint ACL-ISO Workshop on Interoperable Semantic Annotation@ LREC-COLING 2024 2024 JRuppenhofer MEllsworth MSchwarzer-Petruck CRJohnson JScheffczyk FrameNet II: Extended theory and practice 2016 International Computer Science Institute Technical Report KKrippendorff Computing krippendorff's alphareliability 2011 Taste and smell words form an affectively loaded and emotionally flexible part of the english lexicon, Language BWinter Cognition and Neuroscience 31 2016 Multitask learning: A knowledge-based source of inductive bias1 RCaruana Proceedings of the Tenth International Conference on Machine Learning the Tenth International Conference on Machine Learning Citeseer 1993 Multitask learning RCaruana Machine learning 28 1997 Attention is all you need AVaswani NShazeer NParmar JUszkoreit LJones ANGomez ŁKaiser IPolosukhin Advances in neural information processing systems 30 2017 Bert: Pre-training of deep bidirectional transformers for language understanding JDevlin M.-WChang KLee KToutanova Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies Long and Short Papers the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies 2019 1 MacBERTh: Development and evaluation of a historically pretrained language model for English (1450-1950 EManjavacas Arévalo LFonteyn Proceedings of the Workshop on Natural Language Processing for Digital Humanities (NLP4DH), Association for Computational Linguistics the Workshop on Natural Language Processing for Digital Humanities (NLP4DH), Association for Computational Linguistics 2021 Roberta: A robustly optimized BERT pretraining approach YLiu MOtt NGoyal JDu MJoshi DChen OLevy MLewis LZettlemoyer VStoyanov CoRR abs/1907.11692 2019 Unsupervised crosslingual representation learning at scale AConneau KKhandelwal NGoyal VChaudhary GWenzek FGuzmán EGrave MOtt LZettlemoyer VStoyanov CoRR abs/1911.02116 2019 RVan Der Goot AÜstün ARamponi ISharaf BPlank arXiv:2005.14672 Massive choice, ample tasks (machamp): A toolkit for multi-task learning in nlp 2020 arXiv preprint LOME: Large ontology multilingual extraction PXia GQin SVashishtha YChen TChen CMay CHarman KRawlins ASWhite BVan Durme 10.18653/v1/2021.eacl-demos.19 Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, Association for Computational Linguistics DGkatzia DSeddah the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, Association for Computational Linguistics 2021