<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">5th Workshop on Patent Text Mining and Semantic Technologies (PatentSemTech) collocated with the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Ralf</forename><surname>Krestel</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">ZBW - Leibniz Information Centre for Economics</orgName>
								<orgName type="institution">Kiel University</orgName>
								<address>
									<addrLine>Düsternbrooker Weg 120</addrLine>
									<postCode>24105</postCode>
									<settlement>Kiel</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Hidir</forename><surname>Aras</surname></persName>
							<affiliation key="aff1">
								<orgName type="institution">FIZ Karlsruhe - Leibniz Institute for Information Infrastructure</orgName>
								<address>
									<addrLine>Hermann-von-Helmholtz-Platz 1</addrLine>
									<postCode>76344</postCode>
									<settlement>Eggenstein-Leopoldshafen</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Linda</forename><surname>Andersson</surname></persName>
							<affiliation key="aff2">
								<orgName type="department">Artificial Researcher IT GmbH</orgName>
								<address>
									<addrLine>Taubstummengasse 11 (i2c)</addrLine>
									<postCode>1040</postCode>
									<settlement>Wien</settlement>
									<country key="AT">Austria</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Florina</forename><surname>Piroi</surname></persName>
							<affiliation key="aff3">
								<orgName type="institution">RSA FG Studio Data Science</orgName>
								<address>
									<addrLine>Thurngasse 8/16</addrLine>
									<postCode>1090</postCode>
									<settlement>Vienna</settlement>
									<country key="AT">Austria</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Allan</forename><surname>Hanbury</surname></persName>
							<affiliation key="aff4">
								<orgName type="department">Institute of Information Systems Engineering</orgName>
								<orgName type="institution">TU Wien</orgName>
								<address>
									<addrLine>Favoritenstr. 9-11/194-04</addrLine>
									<settlement>Vienna</settlement>
									<country key="AT">Austria</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Dean</forename><surname>Alderucci</surname></persName>
							<affiliation key="aff5">
								<orgName type="department">Center for AI and Patent Analysis</orgName>
								<orgName type="institution">Carnegie Mellon University</orgName>
								<address>
									<addrLine>5000 Forbes Avenue</addrLine>
									<postCode>15213</postCode>
									<settlement>Pittsburgh</settlement>
									<region>PA</region>
									<country key="US">USA</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">5th Workshop on Patent Text Mining and Semantic Technologies (PatentSemTech) collocated with the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">B16792E962BE069969F5BC5C22E55352</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T19:10+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract/>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Ralf Krestel et al. CEUR Workshop Proceedings</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Preface</head><p>The fifth edition (PatentSemTech2024) of the workshop series Patent Text Mining and Semantic Technologies was held as a full-day event in conjunction with the SIGIR 2024 conference. As in previous editions, the workshop focused on new developments and research in patent retrieval and patent analytics. A particular focus was the adaptation of existing deep learning models, e.g., large language models, to the patent domain, which covers diverse scientific subject areas such as chemistry and pharmacology. In general, patent data is more difficult to analyse than corpora of other text genres. Despite these challenges, patent data offers a wealth of facets that text mining and semantic analysis methods can exploit: (1) Patents constitute a huge corpus of scientific-technical documents covering a variety of technological domains. (2) They are rich in metadata, such as spatial, bibliographic, classification, and temporal data. (3) They describe essential scientific-technical knowledge, including solutions for real-world applications. (4) They complement the scientific literature: knowledge such as chemical and physical properties, or bio-science knowledge on drug-target interaction, appears first in patents and is mostly not published elsewhere.</p><p>With the PatentSemTech2024 workshop we continued the series of workshops launched in 2019, which aims to establish a long-term collaboration and a two-way communication channel between the IP industry and academia from relevant fields. Accordingly, the 5th PatentSemTech workshop was organized as a full-day event with 10 research paper presentations, selected by peer review from 17 submissions. Six long papers were given as oral presentations, while four short papers were presented as posters.
In addition, Matthew Wahlrab, CEO of RapidAlpha, gave a keynote speech on "Unlocking Strategic Growth: The Role of AI Technology in Intellectual Property". In an open discussion on "How to transform research insights into products?", the workshop participants exchanged ideas and reported on their experience with applying AI in the patent domain. The workshop closed with Linda Andersson looking back at five successful PatentSemTech workshops and at how the field has developed over these years.</p><p>Germany, Austria, USA, July 2024 Ralf Krestel, Hidir Aras, Linda Andersson, Florina Piroi, Allan Hanbury, Dean Alderucci</p></div>		</body>
		<back>
			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Website</head><p>More information on the topics, schedule, and further developments of the PatentSemTech workshop is available on the website: http://ifs.tuwien.ac.at/patentsemtech/</p></div>			</div>
			<div type="references">

				<listBibl/>
			</div>
		</back>
	</text>
</TEI>
