<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A metadata analysis for machine-actionable Software Management Plans</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Olga Giraldo</string-name>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Renato Alves</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Dimitrios Bampalikis</string-name>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>José M. Fernández</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Eva Martin del Pico</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fotis Posomopoulos</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nelson Quiñones</string-name>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Dhwani Solanki</string-name>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Allegra Via</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Leyla Jael Castro</string-name>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Barcelona Supercomputing Center</institution>
          ,
          <addr-line>Barcelona</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Centre for Research and Technology Hellas</institution>
          ,
          <addr-line>Thessaloniki</addr-line>
          ,
          <country country="GR">Greece</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>European Molecular Biology Laboratory</institution>
          ,
          <addr-line>Heidelberg</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Institute of Molecular Biology and Pathology - National Research Council</institution>
          ,
          <addr-line>Rome</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>National Bioinformatics Infrastructure Sweden</institution>
          ,
          <addr-line>Uppsala</addr-line>
          ,
          <country country="SE">Sweden</country>
        </aff>
        <aff id="aff5">
          <label>5</label>
          <institution>ZB MED Information Centre for Life Sciences</institution>
          ,
          <addr-line>Cologne</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Data Management Plans (DMPs) describe the data management lifecycle for the data corresponding to a research project, including activities from collection to preservation. Machine-actionable DMPs improve text-based DMPs by adding a semantic layer representing the most common elements relevant to DMPs, from datasets to funders. Similar to DMPs, Software Management Plans (SMPs) follow the software management lifecycle. The ELIXIR SMP was developed to support life science researchers and making it easier to follow research software good practices aligned to the findable, accessible, interoperable and reusable principles for research software. Currently, the ELIXIR SMP is a questionnaire-based document that would benefit from a machine-actionable approach. Here, we present a preliminary metadata analysis including possible types and properties from Schema.org that could be used to model machine-actionable SMPs.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Research software management plans</kwd>
        <kwd>machine-actionability</kwd>
        <kwd>metadata analysis</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Metadata analysis</title>
      <p>
        As a first step for the machine-actionable version of the ELIXIR Software Management Plan
(maSMP) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], we have created an initial mapping from its questions to types and properties from
Schema.org [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. We summarize our first draft in Table 1.
      </p>
      <sec id="sec-1-1">
        <title>Section</title>
      </sec>
      <sec id="sec-1-2">
        <title>Accessibility and license Question</title>
      </sec>
      <sec id="sec-1-3">
        <title>How can the software be accessed by third</title>
        <p>parties?
Does your software have a license?
schema.org type and properties</p>
      </sec>
      <sec id="sec-1-4">
        <title>SoftwareSourceCode (codeRepository, conditionsOfAccess), CreativeWork (url) CreativeWork (license)</title>
      </sec>
      <sec id="sec-1-5">
        <title>SoftwareSourceCode (workExample, workTranslation) or CreativeWork (isBasedOn) Thing (disambiguatingDescription)</title>
        <p>What type of documentation is available?</p>
        <p>Is the purpose of the software stated in the
documentation?
Does the documentation describe how to test, use, Thing (disambiguatingDescription)
build, deploy, install
What type of testing do you use?
Are sample data and/or parameters provided?
—
SoftwareApplication (supportingData),
SoftwareSourceCode (usageInfo)
Interoperability Do you use existing and standard input/output</p>
        <p>formats?
Versioning Do you use a version control system? —</p>
        <p>
          Do you use Semantic Versioning? SoftwareApplication (softwareVersion)
Reproducibility Do you provide releases of your software? SoftwareApplication (releaseNotes)
How do you define language-specific SoftwareApplication
dependencies of your software and their version? (softwareRequirements)
How do you capture the environment necessary
to run the software?
Do you provide input and output examples that SoftwareApplication (supportingData)
can be used to reproduce the functioning of your
software?
Do you state how to report bugs and/or usability —
problems by the software user(s)?
Recognition Do you include citation information and ORCID? CreativeWork (citation)
Do the releases have a PID? —
Is the software registered in a domain-specific CreativeWork (isPartOf)
registry?
We will extend the model and ontology supporting machine-actionable version of Data Management
Plans (maDMPs) [
          <xref ref-type="bibr" rid="ref3 ref4">3,4</xref>
          ] to include the case posed by maSMP. We will further develop the mapping to
schema.org and Bioschemas [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ].
        </p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>2. Acknowledgments</title>
      <p>This project has received funding from the European Union’s Horizon 2020 research and innovation
programme under grant agreement No 101017536 and is part of the Research Data Alliance and
European Open Science Cloud Future call 2022. This project has been supported by the Good
Practices Focus Group part of the ELIXIR Tools Platform. The work presented in this poster was part
of Project 17 at the BioHackathon Europe 2022.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Alves</surname>
            <given-names>R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bampalikis</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Castro</surname>
            <given-names>LJ</given-names>
          </string-name>
          ,
          <string-name>
            <surname>González</surname>
            <given-names>JMF</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Harrow</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kuzak</surname>
            <given-names>M</given-names>
          </string-name>
          , et al.
          <article-title>ELIXIR Software Management Plan for Life Sciences</article-title>
          . BioHackrXiv;
          <year>2021</year>
          . doi:
          <volume>10</volume>
          .37044/osf.io/k8znb
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Guha</surname>
            <given-names>RV</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brickley</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Macbeth</surname>
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Schema</surname>
          </string-name>
          .
          <article-title>org: evolution of structured data on the web</article-title>
          .
          <source>Commun ACM</source>
          .
          <year>2016</year>
          ;
          <volume>59</volume>
          :
          <fpage>44</fpage>
          -
          <lpage>51</lpage>
          . doi:
          <volume>10</volume>
          .1145/2844544
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Miksa</surname>
            <given-names>T</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Simms</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mietchen</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jones</surname>
            <given-names>S.</given-names>
          </string-name>
          <article-title>Ten principles for machine-actionable data management plans</article-title>
          .
          <source>PLOS Computational Biology</source>
          .
          <year>2019</year>
          ;
          <volume>15</volume>
          : e1006750. doi:
          <volume>10</volume>
          .1371/journal.pcbi.1006750
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Cardoso</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Castro</surname>
            <given-names>LJ</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ekaputra</surname>
            <given-names>FJ</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jacquemot</surname>
            <given-names>MC</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Suchánek</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Miksa</surname>
            <given-names>T</given-names>
          </string-name>
          , et al.
          <article-title>DCSO: towards an ontology for machine-actionable data management plans</article-title>
          .
          <source>Journal of Biomedical Semantics</source>
          .
          <year>2022</year>
          ;
          <volume>13</volume>
          :
          <fpage>21</fpage>
          . doi:
          <volume>10</volume>
          .1186/s13326-022-00274-4
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Gray</surname>
            <given-names>AJG</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goble</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jimenez</surname>
            <given-names>RC</given-names>
          </string-name>
          .
          <article-title>From Potato Salad to Protein Annotation. ISWC Posters and Demo session</article-title>
          . Vienna, Austria;
          <year>2017</year>
          . p.
          <fpage>4</fpage>
          . Available: http://ceur-ws.
          <source>org/</source>
          Vol-1963/paper579.pdf
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>