<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main"></title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">CBBCDEF6763197DE6470F3ED1B8683D5</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-23T23:24+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract/>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Introduction</head><p>Various methods and techniques have been explored in the past in the aim of automatically generating new bilingual (and multilingual) dictionaries based on existing ones, so that given L1&gt;L2 and L2&gt;L3 sets, a new L1&gt;L3 dictionary is produced. The intermediate language that is used in the process is called a pivot, and it is possible to use multiple pivots for this purpose. Although considerable work has been done to this end, it was usually conducted on different types of datasets and evaluated in different ways, applying various algorithms that are often not comparable.</p><p>The Translation Inference Across Dictionaries shared task (TIAD-2017) was launched with the intention of offering quality lexical resources within the context of a single coherent experiment, which enables reliable validation of results and solid comparison of methods and techniques for auto-generating translation equivalents across languages, as well as stimulating and enhancing further research.</p><p>The data for TIAD-2017 was released by K Dictionaries Ltd and extracted from its Global Series, which includes monolingual, bilingual and multilingual lexical sets for 24 languages <ref type="bibr">(Kernerman, 2011)</ref>. Each language core is compiled independently within a common overall framework and sharing the same technical infrastructure with all other languages. It consists of detailed, well-structured lexicographic information on the core language, which serves as a base for translation to any other language. The language pairs were thus created directly between each other, with no intermediate language bias (e.g. English, in WordNet), and joining the different pairs of each language core forms a multilingual network.</p><p>All the evaluation data and results is freely available on the Web (see https://tiad2017.wordpress.com/ )</p><p>The shared task</p><p>The objective of the shared task was to indirectly generate translations for three language pairs, based on the available translations among eight languages altogether, in 14 bilingual dictionaries, involving four possible paths -all from German to Brazilian Portuguese -that feature between 1 to 4 pivot languages.</p><p>The test dataset consisted of 100 randomly-selected German dictionary entries with their translations into a second language, and recursively exploring further translations in chained-up dictionaries -including up to 817 entries with 1,532 translation equivalents in the largest language pair that is provided. Besides the headwords and translations, the data includes information about parts of speech, subject domains and synonyms, as well as examples of usage and their translations.</p><p>The following language pairs were provided for the four different paths:</p><p>1. German&gt;English | English&gt;Portuguese The three new language pairs that were generated are:</p><formula xml:id="formula_0">• German&gt;Portuguese • Danish&gt;Spanish • Dutch&gt;French</formula><p>The Portuguese&gt;German sets were provided for the sole purpose of closing the loop as an aid for validating the results, but were not allowed to be reversed to improve results. However, it was allowed to reverse the Spanish&gt;Danish dataset -that was provided as part of path (4) -o helpt with improving the results.</p><p>Evaluation of the results of each system was carried out against KD's manually compiled dictionaries for these pairs from the Global Series and other resources, as well as by human translators.</p><p>Participants were invited to contribute on either or both of the following tracks:</p><p>• Systems that use only the KD data released for the task • Systems that exploit, in addition to the KD data, other freely available sources of background knowledge (e.g., lexical linked open data and parallel corpora) to improve performance Beyond performance, participants were encouraged to consider the following issues in particular:</p><p>• The role of the language family with respect to the newly generated pairs • The asymmetry of pairs, and how the translation direction affected the results • The behavior of different parts of speech among different languages • The role that the number of pivots played in the process</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Results</head><p>Four teams participated in the shared task, but one of them was not able to produce results on time for the workshop. All the teams were invited to submit a paper with a technical description of their system and their results. We received three submissions. After the peer reviewing process, one of them was withdrawn and two were accepted for publication. We gave all the participants the opportunity of sharing their ideas and results with an oral presentation at the workshop: long presentations for the two participant teams with accepted papers, and short presentations for the other two. The list of presented works is as follows:</p><p>Paper presentations: </p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head></head><label></label><figDesc>| Dutch&gt;Spanish | Spanish&gt;Danish | Danish&gt;French | French&gt;Portuguese Also included were four Portuguese&gt;German datasets, for closing the loop in each path, to help with the validation of the results.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head></head><label></label><figDesc>• Thomas Proisl, Philipp Heinrich, Stefan Evert and Besim Kabashi. "Translation inference across dictionaries via a combination of graph-based methods and co-occurrence statistics" • Kathrin Donandt, Christian Chiarcos and Maxim Ionov. "Using Machine Learning for Translation Inference Across Dictionaries" Short presentations: • Tom Knorr. "Translation Inference Across Dictionaries TIAD 2017 -Shared Task Data Analysis" • Uliana Sentsova. "Report on TIAD participation"</figDesc><table /></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_0">Preface of the TIAD workshop at LDK 2017</note>
		</body>
		<back>
			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0" />			</div>
			<div type="references">

				<listBibl/>
			</div>
		</back>
	</text>
</TEI>
