<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Contextual Modelling of &quot;Propaganda&quot;, &quot;Information&quot; and &quot;Upplysning&quot; in Swedish Parliamentary Speeches, 1920-2019</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Johan</forename><surname>Jarlbrink</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Umeå University</orgName>
								<address>
									<addrLine>Universitetstorget 4</addrLine>
									<postCode>90187</postCode>
									<settlement>Umeå</settlement>
									<country key="SE">Sweden</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Fredrik</forename><surname>Norén</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Umeå University</orgName>
								<address>
									<addrLine>Universitetstorget 4</addrLine>
									<postCode>90187</postCode>
									<settlement>Umeå</settlement>
									<country key="SE">Sweden</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Robin</forename><surname>Saberi</surname></persName>
							<affiliation key="aff1">
								<orgName type="institution">Uppsala University</orgName>
								<address>
									<postBox>Box 256</postBox>
									<postCode>751 05</postCode>
									<settlement>Uppsala</settlement>
									<country key="SE">Sweden</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Contextual Modelling of &quot;Propaganda&quot;, &quot;Information&quot; and &quot;Upplysning&quot; in Swedish Parliamentary Speeches, 1920-2019</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">F2C12B7E3A494CF16199876A3E7D6239</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T13:19+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>propaganda</term>
					<term>information</term>
					<term>conceptual history</term>
					<term>parliamentary debates</term>
					<term>Sweden</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>This paper explores the contexts of the keywords "propaganda", "information" and "upplysning" in the Swedish parliamentary debate protocols, from 1920 to 2019. The digitized protocols have recently been annotated with metadata for speakers' gender and party affiliation. Based on perspectives developed within conceptual history, we have traced the concepts in the parliamentary debates and used computational methods to cluster the contexts in which they occur. Word windows around the three keywords were compiled into a subcorpus, and topic modelling was used to cluster the contexts. The findings show that the distribution of the topics gets more even over time, partly explained by the spread of the term "information" in various political areas in the mid-20th century. Furthermore, the only distinct topic shared between the three keywords relates to campaigns to limit and prevent the consumption of alcohol, narcotics and tobacco. While a conceptual shift takes place within this topic, from "upplysning" to "information", it is also shown that it was possible to discuss and frame these issues in terms of "propaganda" post-WWII -it even became more common to do so in the 1950s and 1960s.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Today, "information" and "propaganda" are perceived as almost contradictory terms. Information is supposed to bring facts and rational order. If not, it needs to be corrected or be dismissed as disinformation. Propaganda, in turn, only carries lies and confusion. Scholars who have studied the history of propaganda often argue that this conceptual distinction started to emerge during WWI. According to this narrative, the notion became cemented after WWII, partly as a consequence of Goebbels' authoritarian propaganda apparatus <ref type="bibr" target="#b0">[1]</ref> <ref type="bibr" target="#b1">[2]</ref>. However, this seemingly obvious and clear distinction between "propaganda" and "information" should not be taken for granted. In fact, the narrative of the propaganda concept going bad after 1945 seems empirically incorrect, at least if we look at the Swedish parliamentary debates. How could it otherwise be that a parliamentarian, as late as in the 1980s, was able to say this during a debate: "Within the Centre party, there is a strong belief that young people's attitudes towards alcohol can be influenced through information and propaganda and that a very important channel for this influence is the school". 2   Digital Parliamentary Data in Action (DiPaDA 2022) workshop, Uppsala University, Sweden, <ref type="bibr">March 15, 2022</ref>. EMAIL: johan.jarlbrink@umu.se (Johan Jarlbrink); fredrik.noren@umu.se (Fredrik Norén); robin.saberinasseri@statistik.uu.se (Robin Saberi) ORCID: 0000-0002-1167-046X (Johan Jarlbrink); 0000-0001-8820-1082 (Fredrik Norén); 0000-0002-4026-9122 (Robin Saberi)</p><p>The purpose of this paper is to explore the contexts in which "propaganda", "information" and "upplysning" (a word more common before the emergence of "information", and with a similar meaning: it translates as both "information" and "enlightenment" in English) were used in the Swedish parliamentary debate protocols over a hundred-year period, from 1920 to 2019. Two research questions are being examined: Which political areas were "propaganda", "information" and "upplysning" associated with? Were there common topics in which all three terms were used? And how did the distribution of topics and associated concepts change over time?</p><p>The analysis is based on all chamber protocols of the Swedish Riksdag between 1920 and 2019 -401 million tokens divided over some 11,000 protocols. Theoretically, the paper builds on ideas developed within the field of conceptual history, perspectives that have been operationalized using computational methods in recent years. Methodologically, the paper builds on models developed for word sense induction. More specifically, we have constructed a sub-corpus of extracted word windows around the three keyword lemmas, which we have processed using LDA topic modelling. This methodological approach provides a way to model the contexts around the terms "propaganda", "information" and "upplysning".</p><p>The results presented here are based on a corpus still being curated. During 2021, individual speeches were identified and metadata for individual speakers, party affiliation and gender were added. Yet, there are still unidentified speakers and speeches in the dataset and some bugs remain to be fixed. What this paper presents is ongoing work and preliminary results.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Previous results</head><p>This paper builds on our previous work in which we explored how "propaganda" and "information" were used in Swedish parliamentary speeches, from the establishment of the bicameral Riksdag in 1867 to 2019 <ref type="bibr" target="#b2">[3]</ref>. Through basic statistical measurements -word window co-occurrence, bigram extraction and keyword extraction -we examined what these concepts referred to in the parliamentary debates. Our results revealed that the negative notion of "propaganda" was dominant already in the 19th century. This is a finding in line with what conceptual historians have shown in relation to Germany and the UK <ref type="bibr" target="#b3">[4]</ref> <ref type="bibr" target="#b4">[5]</ref>. Still, our results pointed to the fact that it was also possible to use the concept of propaganda in a positive and neutral sense between the 1910s and 1980s. Interestingly, scholars of propaganda history have often described this particular period as a time when "propaganda" became more or less useless as a neutral concept in western democracies <ref type="bibr" target="#b5">[6]</ref>[7] <ref type="bibr" target="#b0">[1]</ref>.</p><p>In our previous study we also saw that when the frequencies of "propaganda" decreased in the 1960s and 1970s, the neutral sense of the concept was partly replaced by "information". The spread of "information" in its singular form turned it into a flexible concept that could be used in a variety of political contexts. This result echoes what other scholars studying the history of "information" have shown, that the term first spread to various academic disciplines <ref type="bibr" target="#b7">[8]</ref> and then into the political sphere, where it was used to make sense of and handle various problems <ref type="bibr" target="#b8">[9]</ref>. Providing "adequate information" was presented as a solution to all kinds of problems in the "information age", but information would also generate problems of its own: an "information divide", "information overload", "information pollution" etc. <ref type="bibr" target="#b9">[10]</ref>.</p><p>In the Swedish parliamentary debates, "information" usually carried positive connotations: nobody was against "information" per se. Hence, a redescription of "propaganda" as "information" was close at hand. Following Skinner <ref type="bibr" target="#b10">[11]</ref>, we argued that "information" seems to have provided a space for political action that "propaganda" was not able to, partly because of the political radicalization during the 1960s and the shift towards an ideal of two-way communication during the 1970s. However, the methods we used in our previous work were not able to capture the political contexts or themes of "propaganda" and "information" in a systematic way. Hence, in this paper, we turn to another method to expand the exploration of our conceptual investigation -also including the information-related term "upplysning".</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Theoretical perspectives and method</head><p>"Propaganda" could be seen as a key (or basic) concept. "Information" may not qualify as such, since the concept (re)entered politics from communication theory, where it was defined primarily in technical terms <ref type="bibr" target="#b9">[10]</ref>. In the tradition of conceptual history, key concepts are those that are both indispensable and contested in political discourse. Defining them, restricting, expanding or challenging their meaning and use, is part of a political struggle. The redefinition of concepts is a way to redefine social reality, interpret it and frame it, and perhaps legitimize goals and point out the necessity of certain actions <ref type="bibr" target="#b11">[12]</ref> <ref type="bibr" target="#b12">[13]</ref>. By focusing on parliamentary speeches and examining key concepts of communication -how and when they are defined and used, and by whom, in which context -we get a better understanding of the struggles and strategies that have defined communication between the state and the citizens.</p><p>While "propaganda" has been a concept to disagree about, used both as a negative label and a communication strategy to embrace, the use of "information" has to a large degree been characterized by consensus: few actors were ever against information per se. Geschichtliche Grundbegriffe, the dictionary of political concepts edited by Reinhart Koselleck and a team of researchers between 1972 and 1997, included an extensive chapter on "propaganda" <ref type="bibr" target="#b3">[4]</ref>, but "information" was not covered at all. Still, "information" has been used both as a synonym and as a counter-concept to propaganda" <ref type="bibr" target="#b8">[9]</ref>, and this is why "information" is included in our paper, along with its older synonym "upplysning".</p><p>In the last decade, several efforts have been made to utilize digital text databases and computational methods to continue -and perhaps challenge -what Koselleck started. These efforts also include research based on parliamentary records <ref type="bibr" target="#b13">[14]</ref>[15] <ref type="bibr" target="#b15">[16]</ref>. The computational approaches are usually based on the analysis of different word co-occurrence measures, and the idea that you "shall know a word by the company it keeps" <ref type="bibr" target="#b16">[17]</ref>. Peter de Bolla and his team at the Cambridge Concept Lab have examined different word co-occurrence metrics in order to capture the way "concepts are constructed through 'constellations' or 'bundles' of individual words" <ref type="bibr" target="#b17">[18]</ref>. Michael Gavin et al. have proposed an alternative method based on vector semantics, but it is founded on similar principles: "concepts are clusters of terms that co-occur throughout a corpus" <ref type="bibr" target="#b18">[19]</ref>. The model makes it possible to distinguish between different clusters, indicating different contexts of use.</p><p>Our own method is based on similar principles, but we have used topic modelling to cluster the words co-occurring with our target words. This approach is borrowed from research on word sense induction, where words co-occurring within a window around a target word are clustered in order to identify different senses of ambiguous words <ref type="bibr" target="#b19">[20]</ref> <ref type="bibr" target="#b20">[21]</ref>. However, reviewing the results of this research, "sense" does not seem to be the most appropriate word to characterize what is captured. Often, it seems that many of the clusters identified present the same word sense. What the models do capture, however, is the different contexts -that is themes, or topics -where the target word is used, and that is precisely what we have aimed for in this paper using the LDA model, implemented in Mallet <ref type="bibr">[22][23]</ref>. For this paper, we have experimented with context windows of 5, 10 and 20 words, and produced models with 5, 10, 20 and 50 topics. After a manual assessment, a context window of 20 words was considered to capture a meaningful semantic structure around the three keywords. A context window of 50 words was considered too wide and thus risked losing more specific themes of our three keywords (more attention towards exploring optimal context window sizes for sub-corpus topic modelling would, however, be desirable). Moreover, based on a manual interpretation of distinct top words for each topic, as well as an examination of the documents with high scores for individual topics, a model of 20 topics was chosen. For the purpose of this paper and our research questions, we found that this model captured a satisfying number of distinct themes and provided an overview of the political areas associated with "information", "propaganda" and "upplysning" (the 50-topic model generated too many overlapping topics that were too alike). Still, as Appendix 1 shows, some of the 20 topics were also of a general kind and thus more difficult to give a distinct label.</p><p>In previous research within computational conceptual history, context is usually defined as word context: "The context may be a short window of words around the target word or defined by a grammatical relation discovered by automatic syntactic parsing of the text" <ref type="bibr" target="#b17">[18]</ref>. This word context is important, but what is missing is often the social and political context, the different genres and authors, etc. Our own research is based on a single source, the proceedings of the Swedish parliament, but we have used metadata that has recently been made available to examine and contextualize how topics are distributed based on gender and political parties.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Sources and corpus data</head><p>Before we move on to the analysis, some details are needed about the chamber protocols as a source and as a corpus. The chamber protocol series (Kammarens protokoll) consists of the minutes from the parliament and contains both speeches and various notes on activities in the chambers, such as votes and reports of attendance. The speeches have an inherent polemical character, and speakers' expressions are sometimes provocative and rhetorical, which in turn affects the nature of the semantic content, for example in terms of the adjectives used and the phrasing of questions. Moreover, the political language also reflects the parliamentary procedures. These procedures and the speech-genre itself generate high frequencies of certain phrases and words: "Mr speaker", "question", "committee" etc. The yearly cycle of the parliament also produces repetitive activities and elements -hearings of ministers, budget debates, the closing of the parliament during summer -all following their own particular rules and formalities. The recorded speeches, furthermore, follow certain conventions, for example how speakers are introduced, and how a speech starts (by addressing the speaker of the parliament). One needs to be aware of these aspects since they have an influence on the computational results derived from the corpus as a whole.</p><p>The Swedish parliamentary records have been digitized by the Swedish Riksdag Library, and represent a vast and rich source to study parliamentary culture, dating back to the diets of the 16th century (available for free at riksdagstryck.kb.se and data.riksdagen.se). For this paper, we have used all the protocols from 1920 to 2019, 11,195 protocols of varying length, and 401 million tokens in total. During the digitization process, the quality of the optical character recognition was controlled through random samples of the material. No additional cleaning has been conducted during the work on this paper. However, within the research project Welfare State Analytics (westac.se), in which all three authors are involved, we have been working together in a team of humanities scholars and statisticians to identify speeches and speakers, and annotate metadata for name, party affiliation, gender and geographical representation. A first version of this annotated corpus, stretching from 1920 to 2019, was recently presented at the 2021 conference on Digital History in Sweden <ref type="bibr" target="#b23">[24]</ref>. Obstacles with the metadata still exist and will be dealt with continuously within the project. For example, while writing this paper, it was discovered that the metadata for party affiliation was missing for speakers representing the Left Party pre-1966.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">The topics of "information", "propaganda" and "upplysning"</head><p>In the 20-topic model, we find a handful of generic topics somewhat difficult to label, but also several topics related to specific issues and contexts (see Appendix 1). Among the generic ones are topics referring to names and titles (topic 3, 12), to the rhetoric and formalities of the proceedings (6, 10), and to various committees and different branches of government <ref type="bibr" target="#b0">(1,</ref><ref type="bibr" target="#b4">5</ref>). Yet, the most prominent topic is related to consumers <ref type="bibr" target="#b15">(16)</ref>. Other dominant topics capture party-political propaganda and election campaigns <ref type="bibr" target="#b7">(8)</ref>, issues concerning media and freedom of speech <ref type="bibr" target="#b10">(11)</ref>, and foreign relations <ref type="bibr" target="#b16">(17)</ref>.</p><p>Examining the distribution of topics over time, we see that the 1920s, 1930s and 1940s were dominated by few and mostly generic topics (Figure <ref type="figure" target="#fig_0">1</ref>). Topic 1 is the most significant, primarily referring to the inner workings of the parliament and government. The most distinct words within the topic are "majt", "majts", and "kungl", which are words partly used within the context of His Majesty the King and the function of royal decision making in the Council of State pre-1974. The second and third largest topics are topic 3 (names and titles) and topic 10 (debate rhetoric). The general character of these topics, and the fact that none of them refer explicitly to actions and activities outside of the parliamentary sphere, is consistent with findings in our previous research. In the 1920s, 1930s and 1940s, for example, "information" was primarily used in reference to information that members of parliament (MP) needed in order to make decisions -not external information to educate citizens <ref type="bibr" target="#b2">[3]</ref>. The fourth topic identified here is more specific, however: media and freedom of speech ( <ref type="formula">11</ref>) -an important issue during WWII. Beginning in the mid-20th century, the key concepts are used in relation to a wider field of topics (Figure <ref type="figure" target="#fig_1">2</ref>). In the 1950s and 1960s, they were frequently used within the topic of election campaigns and party-political propaganda (8), foreign policy <ref type="bibr" target="#b16">(17)</ref>, and the labor market <ref type="bibr" target="#b6">(7)</ref>. Consumer issues ( <ref type="formula">16</ref>) became important in the 1970s, along with alcohol, narcotics and tobacco <ref type="bibr" target="#b12">(13)</ref>. Economy ( <ref type="formula">19</ref>) became a significant topic in the 1980s. The last few decades have seen a further increase in the consumer topic ( <ref type="formula">16</ref>) -explained perhaps by privatizations within many sectors, and the need to inform consumers about available options (Figure <ref type="figure" target="#fig_2">3</ref>).  </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Keyword-specific topics</head><p>Of the three keywords, "information" occurs in most topics for the whole sub-corpus of all word windows. Out of 20 topics, 14 are connected to this keyword with a value of at least 0.01. For "upplysning" the same ratio is 9/20 topics, and for "propaganda" only 4/20 topics (Tables <ref type="table">1 and 2</ref>). While "information" is present in most topics, it is often with a low and even rate compared to the other keywords. This might be an indication that "information" became a general discursive element -but not always the most prominent one -within a greater variety of political areas, whereas "propaganda" and "upplysning" were connected to fewer issues but with stronger ties. Yet, what is also shown is that the topics that became prominent in the 1950s, 1960s and 1970s -such as labour market <ref type="bibr" target="#b6">(7)</ref>, alcohol, narcotics and tobacco <ref type="bibr" target="#b12">(13)</ref>, consumers <ref type="bibr" target="#b15">(16)</ref> and foreign policy <ref type="bibr" target="#b16">(17)</ref> (see Figure <ref type="figure" target="#fig_1">2</ref>) -are those dominated by the keyword "information". This attachment of "information" to an increasing number of political issues in Sweden confirms what previous research has shown: increased use of "information" as a discursive element within a growing number of political areas <ref type="bibr" target="#b24">[25]</ref>. Furthermore, in the 20-topic model, six topics are only connected to "information" and not to "propaganda" and "upplysning". Of these, three topics connote more distinct political areas: consumers ( <ref type="formula">16</ref>), foreign relations <ref type="bibr" target="#b16">(17)</ref>, and school and pupils <ref type="bibr" target="#b17">(18)</ref>.</p><p>Fewer topics are connected to "propaganda", but those that are tend to have a higher score. Here we find, for example, one topic relating to threatening propaganda (2), and one relating to party-political propaganda and election campaigns <ref type="bibr" target="#b7">(8)</ref>. The different topic distribution of "propaganda", compared to that of "information", indicates that the contexts in which "propaganda" is found were both more limited and more specifically connected to this keyword. Moving on to "upplysning", the top three topics connected to the term connote a more generic content: various branches of government (1), names and titles (3), and descriptions and generic phrases (4). </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7.">Alcohol, narcotics and tobacco -the shared theme of "information", "propaganda" and "upplysning"</head><p>In both the 20-topic model and the 50-topic model, very few topics are shared between the three keywords. In the 50-topic model, three topics are connected to "information", "propaganda" and "upplysning": two generic topics capturing parliamentarian names and terms related to economy, and one more distinct topic about drugs and alcohol. The latter topic also has its equivalence in the 20 model: topic 13, labelled "alcohol, narcotics, tobacco". (The 20-topic model also contains another shared topic, number 4, however it is a less meaningful one labelled "generic".) In the 20-topic model, the topic related to alcohol, narcotics and tobacco becomes gradually more significant up until the 1990s. During the last decades it is slightly less prominent (Figure <ref type="figure" target="#fig_5">4</ref>).</p><p>In our previous study on the history of "propaganda" and "information" in Swedish parliamentary debates, we found that "propaganda" was used primarily in a negative way. However, between the 1910s and the 1980s, it was also possible to use the concept in a number of other contexts -mainly related to uncontroversial issues such as traffic safety, promotion of healthy food, and campaigns to restrict or prevent the use of alcohol, drugs and alcohol <ref type="bibr" target="#b2">[3]</ref>. The latter issue resembles topic 13. The graph in Figure <ref type="figure" target="#fig_6">5</ref> shows the absolute distribution of the three keyword lemma of "information", "propaganda" and "upplysning" within topic 13. As the topic becomes more prominent (Figure <ref type="figure" target="#fig_5">4</ref>), a conceptual shift seems to occur. During the 1950s and the 1960s, "upplysning" dominated the topic. Then, from the 1970s onwards, "information" takes over as the most significant keyword (Figure <ref type="figure" target="#fig_6">5</ref>). "Propaganda" is not as present in the topic as "information" and "upplysning". Still, the graph indicates that it was indeed possible to discuss issues concerning alcohol, narcotics and tobacco in terms of "propaganda" in post-war Sweden -it even became more common to do so in the 1950s and 1960s. "Propaganda" was part of the solution, as indicated by the documents where topic 13 scores highly: "increased information and uncompromising propaganda against narcotics".   The issues where it was possible to suggest "propaganda" as a legitimate solution seem to have one thing in common: they were non-controversial and characterized by political consensus. Eat more fruit, wear a seatbelt, save fuel, learn how to swimthese are examples of such issues debated in parliament in post-war Sweden <ref type="bibr" target="#b2">[3]</ref>. Issues related to the restriction and prevention of alcohol, narcotics and tobacco were non-controversial in a similar way. Edman <ref type="bibr" target="#b25">[26]</ref> has argued that few issues were as depoliticized as those concerning alcohol and narcotics. MPs from left to right agreed on both the problems and the solutions. Suggesting "propaganda" against drug use and excessive drinking in such a context would not cause any conflictseveryone agreed that it was for the common good.</p><p>Using the metadata for gender and party, it is possible to make other observations regarding the distribution of topic 13. Figure <ref type="figure" target="#fig_8">6</ref>, for example, shows how much female and male parliamentarians talked about the issue. The bar chart for women represents the percentage of all speeches by female parliamentarians that are allocated to topic 13. Hence, during the 1970s, six percent of all female speeches are allocated to this topic. The remaining 94 percent belong to the remaining 19 topics in the model. This approach takes the unequal representation of men and women into account.</p><p>The normalized numbers used in Figure <ref type="figure" target="#fig_8">6</ref> make it possible to compare, across different time periods, how much of the female and male speeches are allocated to a certain topic. Hence, this is a way to explore gender-coded topics. If we examine the gender distribution of topic 13, the graph indicates that female parliamentarians spend slightly more time talking about alcohol, narcotics and tobacco using the selected keywords, especially from the 1950s to the 1970s. Then, after the 1970s, the distribution becomes more even. Moving on to the distribution of topic 13 for each party per decade (Figure <ref type="figure" target="#fig_9">7</ref>), differences are less distinct, although overall a higher percentage of speeches from the Social Democrats (S) and the Centre Party (C) are allocated to this topic compared to the other parties over time.   </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="8.">Concluding discussion</head><p>In this paper we have examined the contexts of "information", "propaganda" and "upplysning", based on all speech protocols in the Swedish parliament from 1920 to 2019. A sub-corpus was generated by extracting a context window of 20 words for all occurrences of the three keywords. Topic modelling was then used to explore the themes in the sub-corpus, and a 20-topic model was chosen for the analysis. The corpus is still being curated and some bugs remain, so the results presented here are preliminary. One finding is that the topic distribution gets more even over time. This result is related to the increased usage of "information" from the 1960s and onwards. More political issues were talked about in terms of "information" and the keyword starts to dominate a growing number of topics. In contrast, the context of "propaganda" was more limited and the use of the keyword decreased over time.</p><p>Few topics are shared between "information", "propaganda" and "upplysning". In fact, only one distinct topic is shared in the 20-topic model. This topic relates to the political area of alcohol, narcotics and tobacco. One reason why this particular issue is shared between the three keywords, and why it is possible to use the term "propaganda" in this context, might be that campaigns to limit and prevent the use of alcohol, narcotics and tobacco constituted a non-controversial political area, at least before the 1970s. This observation, in turn, is strengthened by results in previous research on the history of "propaganda" and "information" in the Swedish parliament <ref type="bibr" target="#b2">[3]</ref>, as well as research on the political consensus surrounding narcotics and alcohol <ref type="bibr" target="#b25">[26]</ref>.</p><p>Furthermore, this is the first paper that makes use of annotated metadata for gender and party affiliations in the corpus of Swedish parliamentary speeches over a longer period of time, from 1920 to 2019. While the metadata is not yet perfect, the analysis shows that these categories are potential tools for investigations of differences between female and male parliamentarians, as well as differences between parties. Hence, besides the empirical findings, this paper gives a hint about the possibilities of large-scale conceptual history using the curated parliamentary corpus, enabling researchers to contextualize the use of certain concepts through metadata categories. </p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 .</head><label>1</label><figDesc>Figure 1. Topic distribution in the 1930s based on the 20-topic model.</figDesc><graphic coords="5,72.00,143.29,248.65,140.65" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2 .</head><label>2</label><figDesc>Figure 2. Topic distribution in the 1960s based on the 20-topic model.</figDesc><graphic coords="5,72.00,424.21,248.58,141.45" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 3 .</head><label>3</label><figDesc>Figure 3. Topic distribution in the 2000s based on the 20-topic model.</figDesc><graphic coords="5,72.00,596.34,248.60,138.95" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Tables 1 - 2 .</head><label>12</label><figDesc>The relative distribution of each keyword lemma in the 20-topic model.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>3</head><label>3</label><figDesc></figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_5"><head>Figure 4 .</head><label>4</label><figDesc>Figure 4. The distribution of topic 13 (labelled "alcohol, narcotics, tobacco") over time.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_6"><head>Figure 5 .</head><label>5</label><figDesc>Figure5. The absolute lemma distribution counts for "information", "propaganda" and "upplysning" within topic 13.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_7"><head>3</head><label></label><figDesc>Prot. AK 1969:9.    </figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_8"><head>Figure 6 .</head><label>6</label><figDesc>Figure 6. The distribution of topic 13 for men and for women.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_9"><head>Figure 7 .</head><label>7</label><figDesc>Figure 7. The distribution of topic 13 for Social Democratic Party (S), Moderate Party (M), Liberals (L), Centre Party (C), Green Party (MP), Christian Democrats (KD), New Democracy (NYD), and Sweden Democrats (SD). The graph excludes the Left Party due to shortcomings in the metadata annotations.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_10"><head>Topic 13 :</head><label>13</label><figDesc>alcohol, narcotics, tobacco S M L C MP KD NYD SD</figDesc></figure>
		</body>
		<back>
			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Appendix 1: The 20-topic model with manually interpreted labels</head></div>			</div>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<author>
			<persName><forename type="first">K</forename><surname>Moloney</surname></persName>
		</author>
		<title level="m">Rethinking Public Relations: PR Propaganda and Democracy</title>
				<meeting><address><addrLine>Abingdon</addrLine></address></meeting>
		<imprint>
			<publisher>Routledge</publisher>
			<date type="published" when="2006">2006</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<author>
			<persName><forename type="first">L</forename><surname>Larsson</surname></persName>
		</author>
		<title level="m">Upplysning och propaganda: Utvecklingen av svensk PR och information</title>
				<meeting><address><addrLine>Lund</addrLine></address></meeting>
		<imprint>
			<publisher>Studentlitteratur</publisher>
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Forthcoming. When Propaganda Became Legitimate -and Replaced by Information: A Digital Reading of the Swedish Parliamentary Records</title>
		<author>
			<persName><forename type="first">J</forename><surname>Jarlbrink</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Norén</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of History</title>
		<imprint>
			<biblScope unit="page">1867</biblScope>
			<date type="published" when="2022-02">February 2022</date>
		</imprint>
	</monogr>
	<note>-2019. Submitted to Scandinavian</note>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Propaganda</title>
		<author>
			<persName><forename type="first">W</forename><surname>Schieder</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Dipper</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Geschichtliche Grundbegriffe: Historisches Lexikon zur politisch-sozialen Sprache in Deutschland</title>
				<editor>
			<persName><forename type="first">O</forename><surname>Brunner</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">W</forename><surname>Conze</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">R</forename><surname>Koselleck</surname></persName>
		</editor>
		<meeting><address><addrLine>Stuttgart</addrLine></address></meeting>
		<imprint>
			<publisher>Klett-Cotta</publisher>
			<date type="published" when="1984">1984</date>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="page" from="69" to="112" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">J</forename><surname>Meller</surname></persName>
		</author>
		<title level="m">The Development of Modern Propaganda in Britain, 1854-1902</title>
				<imprint>
			<date type="published" when="2010">2010</date>
		</imprint>
		<respStmt>
			<orgName>Durham University</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">PhD diss.</note>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Introduction</title>
		<author>
			<persName><forename type="first">R</forename><surname>Jackall</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Propaganda, 1-9</title>
				<editor>
			<persName><forename type="first">R</forename><surname>Jackall</surname></persName>
		</editor>
		<meeting><address><addrLine>Basingstoke</addrLine></address></meeting>
		<imprint>
			<publisher>Palgrave Macmillan</publisher>
			<date type="published" when="1995">1995</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<author>
			<persName><forename type="first">T</forename><surname>Clark</surname></persName>
		</author>
		<title level="m">Art and Propaganda in the Twentieth Century: The Political Image in the Age of Mass Culture</title>
				<meeting><address><addrLine>London</addrLine></address></meeting>
		<imprint>
			<publisher>Weidenfeld &amp; Nicolson</publisher>
			<date type="published" when="1997">1997</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Shannons&apos;s Information Theory: The Spread of an Idea</title>
		<author>
			<persName><forename type="first">R</forename><surname>Dahling</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Studies of Innovation and of Communication to the Public</title>
				<editor>
			<persName><forename type="first">W</forename><surname>Schramm</surname></persName>
		</editor>
		<meeting><address><addrLine>Stanford</addrLine></address></meeting>
		<imprint>
			<date type="published" when="1962">1962</date>
			<biblScope unit="page" from="117" to="140" />
		</imprint>
		<respStmt>
			<orgName>Stanford University</orgName>
		</respStmt>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Reforming Government Communications in Britain</title>
		<author>
			<persName><forename type="first">B</forename><surname>Maartens</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Contemporary British History</title>
		<imprint>
			<biblScope unit="volume">30</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="542" to="562" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
	<note>From Propaganda to &quot;information</note>
</biblStruct>

<biblStruct xml:id="b9">
	<monogr>
		<title level="m" type="main">The Information: A History, a Theory, a Flood</title>
		<author>
			<persName><forename type="first">J</forename><surname>Gleick</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2011">2011</date>
			<publisher>Pantheon Books</publisher>
			<pubPlace>New York</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<monogr>
		<author>
			<persName><forename type="first">Q</forename><surname>Skinner</surname></persName>
		</author>
		<title level="m">Visions of Politics: Regarding Method</title>
				<meeting><address><addrLine>Cambridge</addrLine></address></meeting>
		<imprint>
			<publisher>Cambridge University Press</publisher>
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">About Key Concepts and How to Study Them</title>
		<author>
			<persName><forename type="first">J</forename><surname>Ifversen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Contributions to the History of Concepts</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="65" to="88" />
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Ideology and Conceptual History</title>
		<author>
			<persName><forename type="first">B</forename><surname>Stråth</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">The Oxford Handbook of Political Ideologies</title>
				<editor>
			<persName><forename type="first">M</forename><surname>Freeden</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">L</forename><forename type="middle">T</forename><surname>Sargent</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><surname>Stears</surname></persName>
		</editor>
		<meeting><address><addrLine>Oxford</addrLine></address></meeting>
		<imprint>
			<publisher>Oxford University Press</publisher>
			<date type="published" when="2013">2013</date>
			<biblScope unit="page" from="17" to="37" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Pivots and Levers: Political Rhetoric around Capitalism in Britain from the 1970s to the Present</title>
		<author>
			<persName><forename type="first">N</forename><surname>Foxlee</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Contributions to the History of Concepts</title>
		<imprint>
			<biblScope unit="volume">13</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="75" to="99" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Evolving Conceptualisations of Internationalism in the UK Parliament: Collocation Analyses from the League to Brexit</title>
		<author>
			<persName><forename type="first">P</forename><surname>Ihalainen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Sahal</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Digital Histories: Emergent Approaches within the New Digital History</title>
				<editor>
			<persName><forename type="first">M</forename><surname>Fridlund</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><surname>Oiva</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">P</forename><surname>Paju</surname></persName>
		</editor>
		<meeting><address><addrLine>Helsinki</addrLine></address></meeting>
		<imprint>
			<publisher>Helsinki University Press</publisher>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="199" to="219" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">How Ideology Became Isms: A History of a Conceptual Coupling</title>
		<author>
			<persName><forename type="first">J</forename><surname>Kurunmäki</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Marjanen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Passions, Politics and the Limits of Society (volume 1 in the series Helsinki Yearbook of Intellectual History)</title>
				<editor>
			<persName><forename type="first">H</forename><surname>Haara</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">K</forename><surname>Stapelbroek</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><surname>Immanen</surname></persName>
		</editor>
		<imprint>
			<publisher>De Gruyter Oldenbourg</publisher>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="291" to="318" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">A Synopsis of Linguistic Theory 1930-1955</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">R</forename><surname>Firth</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Studies in Linguistic Analysis</title>
				<meeting><address><addrLine>Oxford</addrLine></address></meeting>
		<imprint>
			<publisher>Blackwell</publisher>
			<date type="published" when="1957">1957</date>
			<biblScope unit="page" from="1" to="32" />
		</imprint>
	</monogr>
	<note>Philological Society</note>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Distributional Concept Analysis: A Computational Model for History of Concepts</title>
		<author>
			<persName><forename type="first">P</forename><surname>De Bolla</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Jones</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Nulty</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Recchia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Regan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Contributions to the History of Concepts</title>
		<imprint>
			<biblScope unit="volume">14</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="66" to="92" />
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Spaces of Meaning: Conceptual History, Vector Semantics, and Close Reading</title>
		<author>
			<persName><forename type="first">M</forename><surname>Gavin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Jennings</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Kersey</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Pasanek</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Debates in the Digital Humanities 2019</title>
				<editor>
			<persName><forename type="first">M</forename><surname>Gold</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">L</forename><surname>Klein</surname></persName>
		</editor>
		<meeting><address><addrLine>Minneapolis</addrLine></address></meeting>
		<imprint>
			<publisher>University of Minnesota Press</publisher>
			<date type="published" when="2019">2019</date>
			<biblScope unit="page" from="243" to="267" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Bayesian Word Sense Induction</title>
		<author>
			<persName><forename type="first">S</forename><surname>Brody</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Lapata</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 12th Conference of the European Chapter of the ACL</title>
				<meeting>the 12th Conference of the European Chapter of the ACL<address><addrLine>Athens, Greece</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2009-04-03">30 March -3 April 2009</date>
			<biblScope unit="page" from="103" to="111" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Word Sense Induction for Novel Sense Detection</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">H</forename><surname>Lau</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Cook</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Mccarthy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Newman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Baldwin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics</title>
				<meeting>the 13th Conference of the European Chapter of the Association for Computational Linguistics<address><addrLine>Avignon, France</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2012-04-27">23-27 April 2012</date>
			<biblScope unit="page" from="591" to="601" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Latent Dirichlet Allocation</title>
		<author>
			<persName><forename type="first">D</forename><surname>Blei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Ng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Jordan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">The Journal of Machine Learning Research</title>
		<imprint>
			<biblScope unit="volume">3</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="993" to="1022" />
			<date type="published" when="2003">2003</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<monogr>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">K</forename><surname>Mccallum</surname></persName>
		</author>
		<ptr target="http://mallet.cs.umass.edu" />
		<title level="m">MALLET: A Machine Learning for Language Toolkit</title>
				<imprint>
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<monogr>
		<author>
			<persName><surname>Westac</surname></persName>
		</author>
		<ptr target="https://github.com/welfare-state-analytics/riksdagen-corpus" />
		<title level="m">Swedish Parliamentary Proceedings -Riksdagens Protokoll 1920-2021 v0.3.0</title>
				<imprint>
			<date type="published" when="2021-02-16">2021. 16 February 2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">Information som lösning, information som problem: En digital läsning av tusentals statliga utredningar</title>
		<author>
			<persName><forename type="first">F</forename><surname>Norén</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Nordicom Information</title>
		<imprint>
			<biblScope unit="volume">38</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="9" to="26" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">The Ideological Drug Problem</title>
		<author>
			<persName><forename type="first">J</forename><surname>Edman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Drugs and Alcohol Today</title>
		<imprint>
			<biblScope unit="volume">13</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="9" to="19" />
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
