<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Joint Glossary of Forensic Document Examination and Pattern Recognition</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Erika</forename><surname>Griechisch</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">Inés Baldatti Banco Central de la República Argentina Buenos Aires</orgName>
								<address>
									<country key="AR">Argentina</country>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="institution">University of Szeged</orgName>
								<address>
									<addrLine>Árpád tér 2</addrLine>
									<postCode>H-6720</postCode>
									<settlement>Szeged</settlement>
									<country key="HU">Hungary</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Joint Glossary of Forensic Document Examination and Pattern Recognition</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">84AE64153B183D8EC7DE5C9406DF562F</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T11:46+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>In this paper we introduce an open, scientific glossary which uses MediaWiki engine to the forensic examiner and pattern recognition scientific communities. Besides our aim to find editors from these communities who contribute to extend the glossary and make it as complete as possible, we would like to translate the terms from English to other languages, e.g. Portuguese, German, Chinese, Japanese, Arabian. The contribution can be started with translating the existing words at the glossary. The second part of our work when the glossary become be more completed, will consist into create the very understanding and useful glossary.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>I. INTRODUCTION</head><p>In the last few years there were more and more communication and joint research between the forensic examiners and pattern recognition scientists. These two communities can efficiently work together, if they understand the terms from both sciences. We have seen and heard on meetings, conferences, workshops, and during discussions it is a necessity to have a useful, extendable glossary, and dictionary which helps the common work.</p><p>Our aim is to create a glossary and a dictionary with the important terms of the forensic science for the forensic document examiners and pattern recognition experts in different languages. We are considering even the different expressions among countries who share the same language in order to obtain a better understanding into our fields no matter where we are from. Thinking and hoping this work will be a useful tool for both: the forensic and pattern recognition communities.</p><p>The long name of the glossary is Glossary of Forensic Document Examination and Pattern Recognition and the short name is GoFDER. The site is availabe on the http://projects. dfki.uni-kl.de/gofder/index.php URL. Figure <ref type="figure" target="#fig_0">1</ref> shows the logo of the glossary which depicts an important tool of the forensic scientist.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A. Related work</head><p>An offline glossary from 1999 is <ref type="bibr" target="#b1">[2]</ref> which integrates first time in publishing terms from forensic science. The Forensic Science Central is a great contribution with links and forum, but its own glossary [3] contains only a small portion of definition, there is no dictionary and it is not extendible. Similar holds for the website of ThinkQuest <ref type="bibr" target="#b2">[4]</ref>, in addition there are less terms and the terms there were not described by experties, the reference marked on the page is only Google search. The multilingua lexicon of European Network of Forensic Science Institutes (ENFSI) is a great contribution created by several forensic institute, founded in 1999. It contains several words in many languages, but it is only a dictionary, without explanation, descriptions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. Authors</head><p>The first author Inés Rosa Baldatti is a forensic document examiner, analyst of Payment Systems at the Central Bank of Argentina Republic. The second author Erika Griechisch is a PhD student at the University of Szeged (Institute of Informatics), her topic is online signature verification. Further authors are every colleague who would like to contribute.</p><p>1) Beginning: The authors met at First International Workshop on Automated Forensic Handwriting Analysis (AFHA) in 2011, Beijing, China. In that opportunity it was clear the necessity to get a good understanding and a fluid contact among professionals worldwide. So, we decide to make this work, that takes a long time and dedication, and responsibility.</p><p>2) Creating: We read papers and books and websites, and extract the terms from them keeping the meaning.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>II. TECHNICAL BACKGROUND</head><p>Nowadays a glossary or dictionary which is available only in printed version is not really useful. An online glossary is more useful than a printed one and it can be easily printed if it is necessary. Several criteria should meet, which are feasible only if the glossary is an online one.  The most important criterion is the expandability of the glossary. There will be always terms which can be added, not necessary because there incompleteness of the glossary itself rather the expension of the scientist. Regarding the expansion, it is much easier to correct errors in an existing online website than a book.</p><p>Other consideration was the possibility of localization of the glossary. We know for the translation part it is necessary to be clean and easy otherwise we can expect only few translators.</p><p>In this section we introduce and describe the MediaWiki package and its extensions which were most suitable for our purposes.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A. MediaWiki</head><p>MediaWiki (MW) is a versatile package, written in PHP and originally developed for use on Wikipedia since 2002. It is very widespread, well-documented, can handle any kind of media easily (links, images, videos, etc), moreover it is opensource which makes it easier to discover and fix any kind of bugs. The MediaWiki package quite flexible and further functionalities can be easily integrated to a basic MediaWiki website.</p><p>MediaWiki uses a markup language called wikitext to use basic formatting so the users without knowledge of HTML can edit the pages easily. We added the WikiEditor extension to the GoFDER website, which allows the users to see 3 different view during the editing. The first one is the plain wikitext, the second is a preview, the third one shows the differences between the previous version of the page and the current (edited) version. Thus if someone is new in wikitext markup language, s/he can simply check the Preview before submission, see Figure <ref type="figure" target="#fig_1">2</ref>.</p><p>In order to achieve our goals, to create a multilingual glossary with dictionary, a basic MediaWiki is not sufficient. Thus we added other extension to improve the efficiency. In the following we describe each of them.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. Semantical MediaWiki</head><p>Semantical MediaWiki (often noted by SMW) is an extension of MediaWiki that helps to search, organise, tag, browse, evaluate and share the wiki's content <ref type="bibr" target="#b5">[8]</ref> since 2005.</p><p>While a traditional Wikipedia site contains text which is useful and can be processed easily by humans, it is not easy to understand or evaluate for a computers. The Semantical Me-diaWiki helps to extend a capability of a Wiki site by adding annotations, which makes wiki a collaborative database.</p><p>Semantical MediaWiki itself has several extension too, we added the Semantic Glossary to our website, which helps to describe terms with the Terminology page of the wiki. The reader of a page just point to a word with the mouse and if the Terminology page contains that word, it's description will appear in a small box below the word.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>C. Translate extension</head><p>The Translate extension makes MediaWiki a powerful tool to translate every kind of text. <ref type="bibr" target="#b4">[7]</ref> It runs inside MediaWiki and has many features for translators, however its usage is very simple. After a page is marked with the &lt;translation&gt; tag, the extension automatically splits the text between the translation tag to translation units. The arrangment of the translation units can be approved or redefined if it is necessary. According to the default settings each translation unit is one paragraph. After the arrangement of the translation units are saved, the page can be translated via a translation tool. Figure <ref type="figure">3 and 4</ref> show two view of a translation page, translators can use which are more convinient for them. Users can choose the language they intend to translate from a list. On the GoFDER website there is a predefined list on the top of all the translatable webpages which shows the languages we primarily intend to find translators (the first author of this paper is responsible mainly for Spanish translations, the second is about Hungarian). Nevertheless there is no language restrictions, contributors are welcome to translate to any language.</p><p>If an original English wikipage (which is marked for translation) has any changes, on the top of the page there will be a note about that and each unit which is effected will be marked as outdated translation.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>III. CONTRIBUTION</head><p>Recently (end of May, 2013) the glossary has 20 pages (terms), the dictionary has 561 English words with Spanish translation and some Hungarian translation as well.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A. New terms</head><p>We encourage forensic document examiners to extend the glossary and the dictionary as well, comment the recent terms on the discussion pages, ask if something is not clear or complete. We want to keep our glossary professional, so on our wiki site it is not possible to edit or modify pages without registration. After registration and login, users can modify the pages.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. Translation</head><p>We would like to translate the terms from English primarily to Spanish, German and Hungarian. In the same way to increase the dictionary, we plan to add new languages as well. We hope to attain contributors who can create the Portuguese, Chinese, Japanese, Arabian, etc translation.</p><p>The translations are available for a page if it is marked for translation. Every registered user can translate sites, but only translation administrators can mark pages for translation and confirm completed translations. If a user creates a page and s/he is member of the Translation group, s/he can mark it for translation. if s/he is not member of this group, she can ask someone from the Translation group.</p><p>More details about editing and translating are available on the Contribution page of the website.</p><p>IV. CONCLUSION Here we presented the conception and initation of our work. The technical background of the glossary have been prepared. However we are still open for suggestions and ideas to improve the website. From now on the main part of the project is to add new terms, specify and extend the existing ones, add examples and explanation as many as possible.</p><p>We expect as an outcome of our common effort with the contributors that we can provide a useful, up-to-date and beneficial glossary for the scientific community.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. The logo of the project</figDesc><graphic coords="1,399.84,582.32,75.32,121.52" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Fig. 2 .</head><label>2</label><figDesc>Fig. 2. WikiEditor: WikiText and Preview (above), Changes (below)</figDesc><graphic coords="2,48.96,197.10,514.08,89.01" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Fig. 3 .Fig. 4 .</head><label>34</label><figDesc>Fig. 3. Translate a page: List view</figDesc><graphic coords="3,48.96,199.68,514.08,163.53" type="bitmap" /></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>ACKNOWLEDGEMENT</head><p>The second author was supported by the T ÁMOP-4.2.4B/2-11/1-2012-0001 project.</p></div>
			</div>

			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Our aim is to create a glossary and a dictionary with the important terms of the forensic science for the forensic document examiners and pattern recognition experts in different languages. We are considering even the different expressions among countries who share the same language in order to obtain a better understanding into our fields no matter where we are from. Thinking and hoping this work will be a useful tool for both: the forensic and pattern recognition community. </p></div>			</div>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<ptr target="http://projects.dfki.uni-kl.de/gofder/" />
		<title level="m">GoFDER website</title>
				<imprint>
			<date type="published" when="2013-05">May 2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<author>
			<persName><forename type="first">John</forename><forename type="middle">C</forename><surname>Brenner</surname></persName>
		</author>
		<ptr target="http://www.crcpress.com/product/isbn/9780849311963" />
		<title level="m">Forensic Science Glossary</title>
				<imprint>
			<publisher>CRC Press</publisher>
			<date type="published" when="1999">1999</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<ptr target="http://library.thinkquest.org/04oct/00206/textglossary.htm" />
		<title level="m">ThinkQuest Glossary</title>
				<imprint>
			<date type="published" when="2013-05">May 2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<ptr target="http://www.ies.krakow.pl/multilingua/welcome" />
		<title level="m">MULTILINGUA lexicon</title>
				<imprint>
			<date type="published" when="2013-05">May 2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<ptr target="http://www.mediawiki.org/wiki/Extension:Translate" />
		<title level="m">Translate Extension</title>
				<imprint>
			<date type="published" when="2013-05">May 2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<ptr target="http://semantic-mediawiki.org" />
		<title level="m">Semantic MediaWiki</title>
				<imprint>
			<date type="published" when="2013-05">May 2013</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
