<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">MLentory, an FDO registry for machine learning models</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Dhwani</forename><surname>Solanki</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">ZB MED Information Centre for Life Sciences</orgName>
								<address>
									<settlement>Cologne</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Nelson</forename><surname>Quiñones</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">ZB MED Information Centre for Life Sciences</orgName>
								<address>
									<settlement>Cologne</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Dietrich</forename><surname>Rebholz-Schuhmann</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">ZB MED Information Centre for Life Sciences</orgName>
								<address>
									<settlement>Cologne</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="institution">University of Cologne</orgName>
								<address>
									<settlement>Cologne</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Leyla</forename><forename type="middle">Jael</forename><surname>Castro</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">ZB MED Information Centre for Life Sciences</orgName>
								<address>
									<settlement>Cologne</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="institution">University of Cologne</orgName>
								<address>
									<settlement>Cologne</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><surname>Mlentory</surname></persName>
						</author>
						<author>
							<affiliation key="aff2">
								<address>
									<postCode>2024</postCode>
									<settlement>Leiden</settlement>
									<country key="NL">The Netherlands</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">MLentory, an FDO registry for machine learning models</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">3782D6E5CE4899FC6EDCD4B16F2B9E28</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T17:40+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Machine learning models, FAIR, FDOs, registry1 1529-0095 (D. Solanki)</term>
					<term>0000-0002-5037-0443 (N. Quiñones)</term>
					<term>0000-0002-1018-0370 (D. Rebholz-Schuhmann)</term>
					<term>0000-0003-3986-0510 (LJ. Castro)</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Here we introduce MLentory, an FDO registry for Machine Learning models and their corresponding workflows, from creation to deployment. MLentory relies on FAIR Digital Objects (FDOs) to improve Findability, Accessibility, Interoperability, and Reusability while also improving reproducibility and transparency. MLentory aggregates, harmonizes and FAIRifies data from various ML model and model-related repositories and platforms. Here we present the initial architecture for data extraction, transformation, and loading.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Background</head><p>Due to the proliferation of Machine Learning (ML) models, it is necessary a systematic approach to report them. To this end, the ML model cards were proposed in 2019 <ref type="bibr" target="#b0">[1]</ref>, complemented by Dataset cards <ref type="bibr" target="#b1">[2]</ref> by providing additional information on the training datasets. A parallel and complementary effort are platforms facilitating the storing, sharing and reporting of ML models and other artifacts needed for training and deployment, e.g., HuggingFace, neptune.ai, SpaceML, Kipoi, BioImagine Model Zoo, etc. Interoperability across platforms and connection to other ML experiments and related artifacts are still challenging as the corresponding metadata is stored in different ways by different platforms. Harmonization across different efforts is a gap being addressed by different communities, e.g., Research Data Alliance (RDA) FAIR4ML Interest Group, ELIXIR ML Focus Group (ELIXIR MLFG), and National Research Data Infrastructure for Data Science (NFDI4DS) in Germany.</p></div>		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgements</head><p>This work has been partially supported by NFDI4DataScience, a consortium funded by the German Research Foundation (DFG), project number 460234259.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Model Cards for Model Reporting</title>
		<author>
			<persName><forename type="first">M</forename><surname>Mitchell</surname></persName>
		</author>
		<idno type="DOI">10.1145/3287560.3287596</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Conference on Fairness, Accountability, and Transparency</title>
				<meeting>the Conference on Fairness, Accountability, and Transparency</meeting>
		<imprint>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Data Statements for Natural Language Processing: Toward Mitigating System Bias and Enabling Better Science</title>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">M</forename><surname>Bender</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Friedman</surname></persName>
		</author>
		<idno type="DOI">10.1162/tacl_a_00041</idno>
	</analytic>
	<monogr>
		<title level="j">Transactions of the Association for Computational Linguistics</title>
		<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">FAIR Digital Objects for Science: From Data Pieces to Actionable Knowledge Units</title>
		<author>
			<persName><forename type="first">De</forename><surname>Smedt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Koureas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Wittenburg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename></persName>
		</author>
		<idno type="DOI">10.3390/publications8020021</idno>
	</analytic>
	<monogr>
		<title level="j">Publications</title>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="page">21</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Schema.org: evolution of structured data on the web</title>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">V</forename><surname>Guha</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Brickley</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Macbeth</surname></persName>
		</author>
		<idno type="DOI">10.1145/2844544</idno>
	</analytic>
	<monogr>
		<title level="j">Communications of the ACM</title>
		<imprint>
			<biblScope unit="volume">59</biblScope>
			<biblScope unit="page" from="44" to="51" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
