<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Comprehensive Terms Board Visualization for News Analysis and Editorial Story Planning (Demo)</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Ishrat</forename><forename type="middle">Rahman</forename><surname>Sami</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Goldsmiths</orgName>
								<orgName type="institution">University of London</orgName>
								<address>
									<addrLine>New Cross</addrLine>
									<postCode>SE14 6NW</postCode>
									<settlement>London</settlement>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><roleName>Dr</roleName><forename type="first">Tony</forename><surname>Russell-Rose</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Goldsmiths</orgName>
								<orgName type="institution">University of London</orgName>
								<address>
									<addrLine>New Cross</addrLine>
									<postCode>SE14 6NW</postCode>
									<settlement>London</settlement>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Prof</forename><forename type="middle">Larisa</forename><surname>Soldatova</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Goldsmiths</orgName>
								<orgName type="institution">University of London</orgName>
								<address>
									<addrLine>New Cross</addrLine>
									<postCode>SE14 6NW</postCode>
									<settlement>London</settlement>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Comprehensive Terms Board Visualization for News Analysis and Editorial Story Planning (Demo)</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">257208B005E79A787EAA089028051773</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-04-29T06:30+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Natural Language Processing</term>
					<term>Visualization</term>
					<term>Story Planning</term>
					<term>News Writing</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Knowledge providers, such as authors, teachers, researchers and journalists rely on researching facts to convey evidence-driven information about a selected topic and story planning in the pre-writing phase enhances engagement and understanding of the audience through a better content organization. Typical search engines support finding relevant facts, but they do not aid an individual's metacognition process of a topic. In this demo, we introduce the concept of the Terms Board, a topic-driven comprehensive visualization for presenting terms to provide a cognitive guide for news analysis and formulating their plans for storytelling in editorial writing. Terms Board is composed of six cards reflecting the major storytelling aspects: what the story is about, who are the characters of the story, where the story is located, why there are challenges, what has been done to address the challenges and why the actions were effective. Each card shows three top terms based on three factual timeline aspects: historical, consistent and latest. For this demo, we extracted emphasised terms from a collection of documents in a news archive and produced a Terms Board for the most frequent topics which were then presented to a group of study participants. Participants' performances on several tasks have been measured and analysed. The study results are encouraging. The major contribution of this research is presenting a Terms Board visualisation approach as a cognitive guide for news analysis and editorial story planning and presenting an experimental evaluation of this approach via cognitive reading and writing user experiment tasks.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Writing is a communication stream through which authors enrich, entertain and educate the audience about a specific topic. For ascertaining facts in the pre-writing researching phase, various search engines provide relevant results with titles, descriptions, links and tags along with a range of statistical charts and snippets. But these results do not comprehend underlying resources to aid individuals' metacognition. Metacognition (individual's reflections about their own knowledge <ref type="bibr" target="#b0">[1]</ref>) can be improved by cognitive control which can be introduced by purposeful goal-oriented behaviour and decision-making process <ref type="bibr" target="#b1">[2]</ref>. The "Terms Board" (TB) demo presented in this paper accommodates cognitive control from two perspectives: story planning and timeline. While story planning templates like Joseph Campbell's "The Hero's Journey" guide the authors in organizing their plans <ref type="bibr" target="#b2">[3]</ref> for better writing, the timeline In: R. Campos, A. Jorge, A. Jatowt, S. Bhatia, M. Litvak (eds.): Proceedings of the Text2Story'23 Workshop, Dublin (Republic of Ireland), 2-April-2023 Envelope isami001@gold.ac.uk (I. R. Sami); t.russell-rose@gold.ac.uk (D. T. Russell-Rose); l.soldatova@gold.ac.uk (Prof. L. Soldatova) relativity of the terms plays a major role in understanding their factual relativity. As the use of a storyboard improves individual/collective engagement and expressing ability <ref type="bibr" target="#b3">[4]</ref>, TB visualizes the terms as a board. Based on the "The Hero's Journey" template, TB displays six cards to reflect story planning aspects: what the story is about, who are the characters of the story, where the story is located, why there are challenges, what has been done to address the challenges and why the actions were effective. Each card contains three packed bubbles of three top terms in that category to reflect the timeline aspect: historical, consistent and latest. By doing this, TB is visualizing a palette of thoughts for metacognition during news analysis and editorial writing. Therefore, the major contributions of this article are: the introduction of a concept of comprehensive Terms Board (TB) visualization as a cognitive guide and reporting an experimental evaluation of TB via cognitive reading and writing user experiment tasks.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Related Work</head><p>Applications can facilitate information seekers by providing information through search facilities or automatically generated comprehensive categorization/visualization <ref type="bibr" target="#b4">[5]</ref>. Global and professional domain-specific search engines play a vital role in knowledge-seeking by providing various advanced search options, categorical filters and statistical information <ref type="bibr" target="#b4">[5]</ref>. On the other hand systems like Open Knowledge Maps (OKM) <ref type="bibr">[6][7]</ref>, WordStream <ref type="bibr" target="#b7">[8]</ref> and VISTopic <ref type="bibr" target="#b8">[9]</ref> comprehend metadata of the search result categorically using comprehensive visualization. Existing search and comprehensive technologies are focused on knowledge discovery, while TB is focused on analyzing and planning knowledge delivery, stimulating creativity and improving the engagement of the audience by aiding individuals to create a guided story plan.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Methodology</head><p>TB categorizes terms into the following story planning aspects. Who: role players in the documents are identified by the category "Person" or "Organization". Where: key locations in the documents are identified by the category "Location". They are represented in proper nouns. What (topics): main topics are identified by emphasized nouns and proper nouns. Why (negatives): challenges are identified by a static list for the prototype system. What (actions): verbs that reflect the actions in the documents. Why (positives): words identified by a static list for the prototype system. Each story planning category forms a card in TB. A card is further separated into three timeline-based aspects. Historical: if a term appears early in the given date range, it gets a higher weight. Latest: if a term appears later in the given date range, it gets a higher weight. Consistent: these are the terms that appear consistently both historically and in the latest documents. The weight is the average of historical and latest weight.</p><p>For displaying terms in TB, emphasized terms are extracted from a topic-based collection of documents and displayed as shown in Figure <ref type="figure" target="#fig_0">1</ref>. Each document is analyzed in the archive using the topological approach presented in the paper "A simplified topological representation of text for local and global context" <ref type="bibr" target="#b9">[10]</ref> to identify topics and actions (verbs) from a document. The identified terms are categorized into six story-planning aspects. For an extracted term, the classification type (Person/Organization/Location) of the term is identified by the categorical information provided by Google Knowledge Graph API <ref type="bibr" target="#b10">[11]</ref>. The extracted emphasized term are combined as a list of highly frequent related terms using frequency analysis. The algorithm uses the category of terms to place them in appropriate cards and weights each term on the timeline-based aspects to select the top terms of that aspect. TB visualization is generated using the D3 library's packed bubble chart <ref type="bibr" target="#b11">[12]</ref>. On hovering over each timeline bubble, full terms are displayed. Clicking a term opens a relational view modal with a related set of documents (title, meta-description and link to the main document). We applied the scanning approach to display related terms. Scanning helps to locate specific facts rapidly <ref type="bibr" target="#b12">[13]</ref>. For a given set of terms, all the sentences in the relative context are scanned based on word co-occurrence, top 5 relations for each given term are displayed using the D3 library's Sankey <ref type="bibr" target="#b12">[13]</ref> diagram when a timeline bubble is clicked. On clicking a relation, the set of documents (title, meta-description) that reflects the relation is displayed. When clicking a document, a user navigates to the original document/news.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Cognitive reading and writing experiment</head><p>We evaluated TB via an online participation-based experiment. We used a pharmaceutical news website to build a collection and produce the visualizations. Each participant from a homogeneous group was given two story writing tasks on pre-decided topics. The selected two topics were Pfizer and AstraZeneca. For both of the tasks, the participants were given a list of documents (title, introduction and link to full document text) to plan a story and write a minimum of 500 characters about the topic. But for one of the tasks, we presented TB visualization generated based on the documents having the topic before the list. The topics were randomly assigned to the control condition. The order of the tasks was generated using "Latin Square Design" to balance the systematic difference between successive conditions. We invited three academic reviewers to blindly score the writing tasks. They scored the quality of the story on a scale from 1 to 5, where 1 stands for poor and 5 stands for very good. We took the average score given by the reviewers for each task. We performed a paired t-test on the average scores of quality of writing, ease of use and completion time. We recruited 32 participants for this experiment.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.">Evaluation</head><p>The result is reported in Table <ref type="table" target="#tab_0">1</ref>. As shown in Table <ref type="table" target="#tab_0">1</ref>, we have achieved 95% confidence in the reported result on all criteria apart from "Quality of Story". According to the evaluation, the participants spent more time writing their stories and achieved a better quality of writing when they are assisted with TB visualization compared to the tasks where they were given only a list of documents. They found the task assisted by TB comparatively difficult as they had to process additional information than only scanning the documents.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2.">Google Analytics Evaluation</head><p>Clearly, participants' thoughts can't be directly observed. Therefore, information about metacognition must be collected in indirect ways <ref type="bibr" target="#b0">[1]</ref>. We used Google Analytics to record participants' activity while doing the task with their permission. We collected 17 participants' activities.</p><p>We tracked clicks and hovers on cards, timeline bubbles and documents during reading and selecting documents in the story planning phase. We also tracked how long the selected document was open during the session. Using this information, we generated directed graphs. The red nodes represent the topic of the story (Pfizer or AstraZeneca)which is the root node. The amber nodes represent story planning aspects (whatTopicBox, whatActionBox, whoBox, whereBox, whyNegativeBox, whyPositive Box). The yellow nodes represent the timeline aspects (consistent, old_to_new/historical, new_to_old/latest). The green nodes represent documents. The size of the nodes represents the number of events/time associated with the node during the pre-writing story planning phase. Figure <ref type="figure">2</ref> (a) displays the story planning directed graph of a participant while writing a story using only a list of documents. The participant selected 6 documents from the list to plan their story and then gradually increased their focus on 2 documents to base the writing. participant while writing a story using TB. The figure shows that the participant investigated five cards to start with and then selected the Who card's consistent terms to select a document for reading in the planning phase. All collected observations from analytics show that the participants selected fewer documents via TB than the list of documents which refers to the fact that they selected the documents using a strategic planning process. Therefore, combining this observation along with the results reported in Table <ref type="table" target="#tab_0">1</ref>, we can state that for a group of participants, TB contributed to more metacognition (increased time for finishing the task) and better quality of writing by aiding planning before writing a story.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: The TB visualization of the topic "Nasdaq". Hovering over a term displays the full term and the number of occurrences in the topic collection is displayed.</figDesc><graphic coords="3,89.29,84.19,416.69,256.55" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2 (Figure 2 :</head><label>22</label><figDesc>Figure 2: Directed graph representing strategy-making during reading and selecting documents. The bubble size represents the amount of time or event associated with the topic (red)/ reading documents (green).</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>Writing Performance.</figDesc><table><row><cell>Criterion</cell><cell>Scale</cell><cell cols="3">List(mean) TB and List(mean) P-value</cell><cell>Hypothesis testing with p = 0.05</cell></row><row><cell>Quality of story</cell><cell cols="2">1(Poor) -5 (Very good) 3.46875</cell><cell>3.515625</cell><cell>0.4974603636</cell><cell>Can not reject null hypothesis</cell></row><row><cell cols="2">Completion time Suggested 20 mins</cell><cell>13.8 min</cell><cell>16.0 min</cell><cell cols="2">0.03805485437 Reject null hypothesis</cell></row><row><cell>Ease</cell><cell cols="2">1(Hard) -5 (Very easy) 3.21875</cell><cell>2.84375</cell><cell cols="2">0.02596133121 Reject null hypothesis</cell></row></table></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Conclusion</head><p>TB is designed to comprehend facts and aid in improving an individual's metacognition to create a plan for a designated task based on a strategy. TB has the potential to contribute to news analysis and editorial writing. TB can expose the metacognition process while writing. In this paper, we demonstrated TB using a news corpus and showed that TB contributed to metacognition and quality of writing by aiding planning before writing a story. TB can also be used for other text corpora such as articles and books as an aid for brainstorming in learning and teaching.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Metacognition in Language Learning and Teaching</title>
		<author>
			<persName><forename type="first">A</forename><surname>Haukås</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Bjørke</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Dypedahl</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="s">Routledge Studies in Applied Linguistics</title>
		<imprint>
			<date type="published" when="2018">2018</date>
			<publisher>Taylor Francis</publisher>
		</imprint>
	</monogr>
	<note>1 ed</note>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">S</forename><surname>Gazzaniga</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">B</forename><surname>Ivry</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">R</forename><surname>Mangun</surname></persName>
		</author>
		<title level="m">Cognitive neuroscience: the biology of the mind</title>
				<meeting><address><addrLine>New York, N.Y</addrLine></address></meeting>
		<imprint>
			<publisher>W. W. Norton &amp; Company, Inc</publisher>
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
	<note>fourth edition ed</note>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">The Hero&apos;s Journey -Template-Based Storytelling for Ubiquitous Multimedia Management</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Cao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Klamma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Jarke</surname></persName>
		</author>
		<idno type="DOI">10.4304/jmm.6.2.156-169</idno>
	</analytic>
	<monogr>
		<title level="j">Journal of Multimedia</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="page" from="156" to="169" />
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Improving Students&apos; Writing Ability Through Storyboard</title>
		<author>
			<persName><forename type="first">M</forename><surname>Janah</surname></persName>
		</author>
		<idno>2356-203X</idno>
	</analytic>
	<monogr>
		<title level="s">Universitas Muhammadiyah Pringsewu</title>
		<idno type="ISSN">2356-2048</idno>
		<imprint>
			<biblScope unit="volume">3</biblScope>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
	<note>1 Publisher</note>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<title level="m" type="main">Designing the Search Experience</title>
		<author>
			<persName><forename type="first">T</forename><surname>Russell-Rose</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Tate</surname></persName>
		</author>
		<idno type="DOI">10.1016/C2011-0-07401-X</idno>
		<imprint>
			<date type="published" when="2013">2013</date>
			<publisher>Elsevier</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<author>
			<persName><forename type="first">O</forename><forename type="middle">K</forename><surname>Maps</surname></persName>
		</author>
		<ptr target="https://openknowledgemaps.org/index" />
		<title level="m">Open Knowledge Maps -A visual interface to the world&apos;s scientific knowledge</title>
				<imprint>
			<date type="published" when="2021-09">2021. 09/03/23</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Open Knowledge Maps: Creating a Visual Interface to the World&apos;s Scientific Knowledge Based on Natural Language Processing</title>
		<author>
			<persName><forename type="first">P</forename><surname>Kraker</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Kittel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Enkhbayar</surname></persName>
		</author>
		<idno type="DOI">10.12685/027.7-4-2-157</idno>
	</analytic>
	<monogr>
		<title level="j">Zeitschrift für Bibliothekskultur</title>
		<imprint>
			<biblScope unit="volume">027</biblScope>
			<biblScope unit="issue">7</biblScope>
			<biblScope unit="page" from="98" to="103" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<author>
			<persName><forename type="first">T</forename><surname>Dang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">N</forename><surname>Nguyen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Pham</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Johansson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Sadlo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Marai</surname></persName>
		</author>
		<title level="m">Wordstream: Interactive visualization for topic evolution</title>
				<imprint>
			<publisher>EuroVis</publisher>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Vistopic: A visual analytics system for making sense of large document collections using hierarchical topic modeling</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Q</forename><surname>Yao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Qu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Visual Informatics</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="page" from="40" to="47" />
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">A simplified topological representation of text for local and global context</title>
		<author>
			<persName><forename type="first">I</forename><forename type="middle">R</forename><surname>Sami</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Farrahi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 25th ACM international conference on Multimedia</title>
				<meeting>the 25th ACM international conference on Multimedia</meeting>
		<imprint>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="1451" to="1456" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<monogr>
		<ptr target="https://developers.google.com/knowledge-graph" />
		<title level="m">Google Knowledge Graph Search API</title>
				<imprint>
			<date type="published" when="2021-09">2021. 09/03/23</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<title level="m" type="main">Circular Packing | the D3 Graph Gallery</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Holtz</surname></persName>
		</author>
		<ptr target="https://www.d3-graph-gallery.com/circularpacking" />
		<imprint>
			<date type="published" when="2018-09">2018. 09/03/23</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<title level="m" type="main">Skimming and Scanning -TIP Sheet -Butte College</title>
		<ptr target="http://www.butte.edu/departments/cas/tipsheets/readingstrategies/skimming_scanning.html" />
		<imprint>
			<date type="published" when="2021-09">2021. 09/03/23</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
