<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Results of GRAMS+ at SemTab 2024</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Binh</forename><surname>Vu</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">USC Information Sciences Institute</orgName>
								<address>
									<addrLine>Marina del Rey</addrLine>
									<postCode>90292</postCode>
									<region>CA</region>
									<country key="US">USA</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Craig</forename><forename type="middle">A</forename><surname>Knoblock</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">USC Information Sciences Institute</orgName>
								<address>
									<addrLine>Marina del Rey</addrLine>
									<postCode>90292</postCode>
									<region>CA</region>
									<country key="US">USA</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Fandel</forename><surname>Lin</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">USC Information Sciences Institute</orgName>
								<address>
									<addrLine>Marina del Rey</addrLine>
									<postCode>90292</postCode>
									<region>CA</region>
									<country key="US">USA</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Results of GRAMS+ at SemTab 2024</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">458B70D436E92D611F8C79D67CDB7300</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T19:03+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>SemTab 2024</term>
					<term>Semantic Description</term>
					<term>Semantic Table Interpretation</term>
					<term>Knowledge Graphs</term>
					<term>Semantic Web</term>
					<term>Data Integration</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>There is an enormous number of tables available on the Web. However, it is difficult to automatically use the tables in data analytic pipelines because of the lack of semantic understanding of their structure and meaning. To address this problem, our approach, GRAMS+, automatically creates semantic descriptions of tables using distant supervision. SemTab is an annual challenge that provides a diverse set of benchmarks for systems that match tabular data with knowledge graphs. In this paper, we present the results of GRAMS+ at SemTab 2024 in the Accuracy Track. The results show that GRAMS+ is scalable and achieves competitive performance in the tasks in which we participated.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Matching tabular data to an ontology or a knowledge graph is an essential problem in Data Integration. The task is to annotate types of columns in the tables using classes of the target ontology and relations between columns using the ontology properties. We developed a novel approach, GRAMS+ <ref type="bibr" target="#b0">[1]</ref>, addressing this problem using distant supervision. The approach leverages the fact that some data in a table will often overlap with data in a knowledge graph (KG), which can be used to discover candidate types and relationships in the table. Then, the approach uses two neural networks (NN) trained with a labeled dataset generated automatically from Wikipedia tables to predict the final column types and relationships.</p><p>The Semantic Web Challenge on Tabular Data to Knowledge Graph Matching (SemTab) is an annual challenge with the goal of providing benchmarks and evaluations of existing solutions to this problem. In this paper, we present the results of GRAMS+ at the SemTab 2024 challenge focusing on the Accuracy Track. Our approach successfully annotates a very large number of tables and achieves first place on the tasks in which we participated.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">The SemTab Challenge</head><p>The SemTab 2024 challenge consists of several tracks ranging from semantic table interpretation to dataset assessment and contributions. We focus on the Accuracy Track, which is relevant to our approach. This track contains four matching tasks: (1) the Cell Entity Annotation (CEA) matches a cell to a KG entity, (2) the Column Type Annotation (CTA) assigns a KG class to a column, (3) the Column Property Annotation (CPA) assigns a KG property to the relationship between two columns, and (4) Topic Detection (TD) assigns a KG class to a table. Figure <ref type="figure" target="#fig_1">1</ref> shows an example table annotation.</p><p>There are two types of tables in this track: horizontal tables (or relational tables) and entity tables. A horizontal table is a grid where each row represents an entity and each column shares the same semantic type (e.g., Figure <ref type="figure" target="#fig_1">1</ref>). An entity table describes a single entity, where each row contains a property of that entity.  Finally, the standard micro precision, recall, and F 1 are used to measure the performance of the participating systems <ref type="bibr" target="#b1">[2]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">GRAMS+ Approach</head><p>Figure <ref type="figure" target="#fig_2">2</ref> shows the overall approach of GRAMS+. It starts by finding KG entities that are mentioned in a table. Then, we use a neural network (NN) to compute the scores of candidate entities of each table cell. The NN model is trained with a labeled dataset automatically generated from Wikipedia tables. Using the discovered candidates and their scores, we predict column types (CTA) and column relationships (CPA). We generate the labeled dataset by leveraging the hyperlinks inside the Wikipedia tables to find corresponding Wikidata entities and predict columns' relation-ships based on the linked entities. We remove context-inconsistent hyperlinks by first automatically assigning a type to each column based on the most common type of its entities. Then, we employ a blocklist to remove all links in a column if the column header is incompatible with the predicted column types. The blocklist is constructed by manually verifying headers that appeared in multiple predicted types. As our approach is detailed in <ref type="bibr" target="#b0">[1]</ref>, the remainder of this section provides a brief overview of each component in GRAMS+, along with any changes to fit the SemTab 2024 challenge.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Entity Linking</head><p>Following typical entity linking (EL) systems, our EL approach consists of three main steps: (1) detect the entity columns, which are the cells that will be linked; (2) retrieve candidate entities for each cell; and (3) compute the candidates' likelihood.</p><p>For step 1, we directly use the target entity columns provided in SemTab's datasets instead of running the entity detection. To retrieve candidate entities, GRAMS+ combines multiple search strategies such as using public Wikidata Search API, keyword search using ElasticSearch, and fuzzy search using SymSpell. Given the huge number of tables in the Wikidata Tables dataset in Round 2 (78,745 tables), we cannot use the public Wikidata API to search and only use the two later strategies.</p><p>To compute the candidates' likelihood, we use a two-hidden-layer perceptron with RELU activations. It is trained using the auto-label dataset with the following groups of features:</p><p>Surface Features include four string similarity functions between a cell and an entity name: Levenshtein, Jaro-Winkler, Monge Elkan, and Generic Jaccard.</p><p>Entity-context Similarity Features capture the coherence between a candidate and the surrounding context of a cell. GRAMS+ uses two context similarity features: the weighted dot product of the column header and the candidate description, and the number of cells matched with the candidate's property divided by a large constant representing the maximum number of columns in a table (e.g., 20) for rescaling. The embeddings are computed from a Sentence Transformer model <ref type="bibr" target="#b2">[3]</ref> <ref type="foot" target="#foot_0">1</ref> , and the weights of embedding dimensions are learnable parameters. Note that GRAMS+ trains two entity linking models for tables with and without headers. Because tables from the SemTab datasets do not have column headers, GRAMS+ uses the model trained on tables without headers.</p><p>Entity Prior Features bias the predictions toward popular entities. Currently, we use the normalized log page rank of a candidate as the prior feature. The normalized log page rank of an entity 𝑒 is calculated as follows:</p><formula xml:id="formula_0">log(pagerank(𝑒)) − min 𝑒 ′ ∈ℰ log(pagerank(𝑒 ′ )) max 𝑒 ′ ∈ℰ log(pagerank(𝑒 ′ )) − min 𝑒 ′ ∈ℰ log(pagerank(𝑒 ′ ))</formula><p>where ℰ is the set of entities in KG, pagerank(𝑒) is the pagerank of an entity 𝑒.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">Column Type Prediction</head><p>To predict the type of a column, we use a greedy algorithm that first selects the type with the highest score from the set of types directly found in the candidate entities of a column. Then, it iteratively refines the prediction by replacing it with an ancestor type within 𝑑 distance of the directed types if the score difference is larger than a specific threshold 𝛿 until 𝑑 reaches the maximum chosen distance (max_distance). The score of a type is computed by summing the maximum likelihood of the candidate entities of the type for each cell and then dividing by the number of rows. We use the same threshold (𝛿 = 0.1) and maximum distance (max_distance = 2) as in <ref type="bibr" target="#b0">[1]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3.">Column Relationship Prediction</head><p>To predict the relationship of a column, GRAMS+ first constructs a candidate graph containing potential relationships between columns. Then, GRAMS+ uses a classifier to predict the likelihood of each link in the graph. As the SemTab challenge provides pairs of target columns for predictions, we directly use the most likely relationships between target columns as the final predictions.</p><p>The classifier employed to predict the likelihood of links is also a two-hidden-layer perceptron with RELU activations. It is trained on the auto-label dataset with features such as the relative frequency of discovering the link from top K entities, the average link likelihood, the relative frequency of finding contradicting information between the table data and KG data, and whether there is a many-to-many relationship between the source and target of the link. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">SemTab 2024 Results</head><p>Table <ref type="table" target="#tab_0">1</ref> reports the performance of GRAMS+ on the Wikidata Tables datasets. We cannot run GRAMS+ on the tBiodiv and tBiomed datasets because the values of the subject columns', which contain the main entities, were anonymized. Since the names are changed, these datasets focus on a different aspect of the problem, which is identifying the anonymous entities. This is not the focus of GRAMS+, and we leave it for future work.</p><p>At the time of writing the paper, GRAMS+ achieves first place among the participants on the Wikidata Tables datasets. The two datasets, in total, have approximately 109,000 tables. This shows that GRAMS+ is scalable and can handle a large number of tables.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Related Work</head><p>Table Understanding is an essential problem in Data Integration and has attracted many studies over the years. A comprehensive related work to GRAMS+ can be found in <ref type="bibr" target="#b0">[1]</ref>. In this section, we briefly discuss work related to GRAMS+ in the setting of the SemTab challenge.</p><p>Most systems participating in the SemTab, including GRAMS+, exploit the existing knowledge in a KG. Typically, they first identify KG entities in a table (CEA) and match the properties of entities with values in the table to find column types (CTA) and relationships between columns (CPA). The best performing systems in SemTab such as MTab <ref type="bibr" target="#b3">[4]</ref>, DAGOBAH <ref type="bibr" target="#b4">[5]</ref>, and others such as KGCode-Tab <ref type="bibr" target="#b5">[6]</ref>, LinkingPark <ref type="bibr" target="#b6">[7]</ref>, BBW <ref type="bibr" target="#b7">[8]</ref>, TorchicTab-Heuristic <ref type="bibr" target="#b8">[9]</ref>, and SemTex <ref type="bibr" target="#b9">[10]</ref> improve various aspects of the pipeline such as candidate entity retrieval, scoring functions to rank the matched results, or repeat the pipeline several times or until reaching equilibrium. Compared to GRAMS+, they often rely on hand-crafted scoring functions, while GRAMS+ uses distant supervision to learn to classify correct entities and column relationships. Moreover, GRAMS+ tackles a general setting where we need n-ary relationships to correctly model data in the tables.</p><p>The SemTab 2023 and 2024 also include other tasks, such as Table <ref type="table">Topic</ref> Detection and Matching Table Metadata to KG. These are not the focus problems of GRAMS+, and we leave them for future work.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Conclusion</head><p>This paper presents the results of GRAMS+, a distant supervised approach for annotating column types and relationships of tables, for the SemTab 2024 Accuracy Track. GRAMS+ achieves rank 1 for datasets on which it was evaluated.</p><p>In future work, we plan to improve the performance of GRAMS+ by jointly predicting column types and relationships. We also plan to extend GRAMS+ to leverage table context, metadata, and modeling instructions to support tables without overlapping data to a target knowledge graph.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head></head><label></label><figDesc>SemTab'24: Semantic Web Challenge on Tabular Data to Knowledge Graph Matching 2024, co-located with the 23rd International Semantic Web Conference (ISWC), November 11-15, 2024, Baltimore, USA Envelope binhvu@isi.edu (B. Vu); knoblock@isi.edu (C. A. Knoblock); fandel.lin@usc.edu (F. Lin) GLOBE https://binh-vu.github.io/ (B. Vu) Orcid 0000-0001-5808-9288 (B. Vu); 0000-0002-6371-4807 (C. A. Knoblock); 0000-0001-7024-2476 (F. Lin)</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: An example of a table with annotation</figDesc><graphic coords="2,72.00,65.61,451.29,215.14" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: Overall approach of GRAMS+</figDesc><graphic coords="2,72.00,491.80,451.28,95.06" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>Performance of GRAMS+ on CPA and CTA tasks. Precision and F 1 scores are reported in percentage</figDesc><table><row><cell>Dataset</cell><cell></cell><cell>CPA</cell><cell></cell><cell></cell><cell></cell><cell>CTA</cell><cell></cell><cell></cell></row><row><cell></cell><cell>F 1</cell><cell>Precision</cell><cell>Recall</cell><cell>Rank</cell><cell>F 1</cell><cell>Precision</cell><cell>Recall</cell><cell>Rank</cell></row><row><cell>Wikidata Tables round 1</cell><cell>89.8</cell><cell>98.8</cell><cell>82.30</cell><cell>1</cell><cell>92.9</cell><cell>92.9</cell><cell>92.9</cell><cell>1</cell></row><row><cell>Wikidata Tables round 2</cell><cell>89.9</cell><cell>99.2</cell><cell>82.19</cell><cell>1</cell><cell>95.6</cell><cell>95.6</cell><cell>95.6</cell><cell>1</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">We use the pretrained all-mpnet-base-v2 model.</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgements</head><p>This material is based upon research supported by the Defense Advanced Research Projects Agency (DARPA) under Agreement No. HR00112390132 and Contract No. 140D0423C0093. Any opinions, findings and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the Defense Advanced Research Projects Agency (DARPA); or its Contracting Agent, the U.S. Department of the Interior, Interior Business Center, Acquisition Services Directorate, Division V.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Exploiting distant supervision to learn semantic descriptions of tables with overlapping data</title>
		<author>
			<persName><forename type="first">B</forename><surname>Vu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">A</forename><surname>Knoblock</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Shbita</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Lin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">The Semantic Web-ISWC 2024: 23th International Semantic Web Conference, ISWC 2024</title>
				<imprint>
			<publisher>Springer International Publishing</publisher>
			<date type="published" when="2024">November 11-15, 2024. 2024</date>
		</imprint>
	</monogr>
	<note>Proceedings 20</note>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<author>
			<persName><forename type="first">O</forename><surname>Hassanzadeh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Abdelmageed</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Efthymiou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Cutrona</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Hulsebos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Jiménez-Ruiz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Khatiwada</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Korini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Kruit</surname></persName>
		</author>
		<title level="m">CEUR Workshop Proceedings</title>
				<imprint>
			<date type="published" when="2023">2023</date>
			<biblScope unit="volume">3557</biblScope>
			<biblScope unit="page" from="1" to="14" />
		</imprint>
	</monogr>
	<note>Results of semtab 2023</note>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<author>
			<persName><forename type="first">N</forename><surname>Reimers</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Gurevych</surname></persName>
		</author>
		<idno type="arXiv">arXiv:1908.10084</idno>
		<title level="m">Sentence-BERT: Sentence embeddings using siamese BERT-Networks</title>
				<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<author>
			<persName><forename type="first">P</forename><surname>Nguyen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Yamada</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Kertkeidkachorn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Ichise</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Takeda</surname></persName>
		</author>
		<ptr target="????" />
		<title level="m">Tabular data annotation with MTab tool</title>
				<imprint>
			<date type="published" when="2021">2021. 2023-10-6</date>
		</imprint>
	</monogr>
	<note>SemTab</note>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">From heuristics to language models: A journey through the universe of semantic table interpretation with DAGOBAH</title>
		<author>
			<persName><forename type="first">V.-P</forename><surname>Huynh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Chabot</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Labbé</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Troncy</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Semantic Web Challenge on Tabular Data to Knowledge Graph Matching</title>
				<imprint>
			<publisher>SemTab</publisher>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<author>
			<persName><forename type="first">X</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Zhou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Jiang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Hong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Wang</surname></persName>
		</author>
		<ptr target="https://ceur-ws.org/Vol-3320/paper5.pdf,????" />
		<title level="m">KGCODE-Tab results for SemTab 2022</title>
				<imprint>
			<date type="published" when="2023-10-06">2023-10-6</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<monogr>
		<author>
			<persName><forename type="first">S</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Karaoglu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Negreanu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Ma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J.-G</forename><surname>Yao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Williams</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Gordon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C.-Y</forename><surname>Lin</surname></persName>
		</author>
		<ptr target="????" />
		<title level="m">LinkingPark: An integrated approach for semantic table interpretation</title>
				<imprint>
			<date type="published" when="2023-10-06">2023-10-6</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<author>
			<persName><forename type="first">R</forename><surname>Shigapov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Zumstein</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Kamlah</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Oberlander</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Mechnich</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Schumm</surname></persName>
		</author>
		<ptr target="https://madoc.bib.uni-mannheim.de/57386/3/paper2.pdf,????" />
		<title level="m">bbw: Matching CSV to wikidata via meta-lookup</title>
				<imprint>
			<date type="published" when="2023-10-06">2023-10-6</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">TorchicTab: Semantic Table Annotation with Wikidata and Language Models</title>
		<author>
			<persName><forename type="first">I</forename><surname>Dasoulas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Duan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Dimou</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">CEUR Workshop Proceedings</title>
				<imprint>
			<date type="published" when="2023">2023</date>
			<biblScope unit="page" from="21" to="37" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<monogr>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">G</forename><surname>Henriksen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">M</forename><surname>Khorsid</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Nielsen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">M</forename><surname>Stück</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">S</forename><surname>Sørensen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Pelgrin</surname></persName>
		</author>
		<title level="m">Semtex: A hybrid approach for semantic table interpretation</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
