<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Sara Bond</string-name>
          <email>sara.a.bond@jpl.nasa.gov</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Baptiste Cecconi</string-name>
          <email>baptiste.cecconi@obspm.fr</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Edwin Henneken</string-name>
          <email>ehenneken@cfa.harvard.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ryan McGranaghan</string-name>
          <email>ryan.m.mcgranaghan@jpl.nasa.gov</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>François Scharfe</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Brian Thomas</string-name>
          <email>brian.a.thomas@nasa.gov</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Donald Winston</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Center for Astrophysics, Harvard &amp; Smithsonian</institution>
          ,
          <addr-line>Cambridge, Massachusetts</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>NASA/Caltech/Jet Propulsion Laboratory</institution>
          ,
          <addr-line>Pasadena, California</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Polyneme LLC</institution>
          ,
          <addr-line>New York</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>The Data Chefs</institution>
          ,
          <addr-line>New York</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2024</year>
      </pub-date>
      <fpage>13</fpage>
      <lpage>15</lpage>
      <abstract>
        <p>This poster presents HelioKOS, a knowledge organization system for heliophysics. This system is an outcome of HelioKNOW, a project aiming to foster scientific discovery by building a knowledge commons for managing heliophysics metadata, and therefore enabling observational and model data integration and assimilation. The system provides a robust infrastructure for the ingestion, annotation, and analysis of scientific publications, enabling researchers to integrate and analyze data from diverse sources and disciplines. The system is centered around the Heliophysics Knowledge Graph, which serves as a central repository for all data and metadata within the system. To facilitate knowledge discovery, the system provides a range of tools and components for concept-scheme development, harmonization, and tagging, as well as tools for query registration, ground-truth development, and performance measurement. The poster presents the heliophysics knowledge organization system together with a playground to experiment with concept-scheme annotations.</p>
      </abstract>
      <kwd-group>
        <kwd>scientific community</kwd>
        <kwd>heliophysics</kwd>
        <kwd>knowledge commons</kwd>
        <kwd>open science</kwd>
        <kwd>metadata</kwd>
        <kwd>discovery</kwd>
        <kwd>knowledge graph</kwd>
        <kwd>data integration</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>CEUR
ceur-ws.org</p>
    </sec>
    <sec id="sec-2">
      <title>1. Introduction</title>
      <p>The advancement of science and technology is increasingly dependent on the principles of
open science, which promotes transparency, collaboration, and the sharing of knowledge.
Heliophysics (the study of the solar system and its efects on our lives, climate, and space
technology) is an important area of scientific research that stands to benefit significantly from
open science. The solar system is a complex and dynamic environment, with phenomena such
as solar flares and magnetic storms having the potential to impact our daily lives, while also
afecting the Earth’s climate in the long term.</p>
      <p>Moreover, the rapid progress in commercial space technology is expected to enable human
travel to the moon and Mars in the near future. A better understanding of the solar system
is necessary to ensure the safety and success of these missions. There is a vast amount of
scientific research being produced by observatories, labs, satellites, and probes monitoring the
solar system. This research spans various disciplines, including physics, astronomy, complex
systems, data science, and computer science, and is being conducted by international teams of
scientists in diferent organizations. Thus, Heliophysics data span many orders of magnitude in
space and time and their analysis occurs across scales of research from individual to national
and international.</p>
      <p>The integration of data and research results from these diverse sources is critical for advancing
our understanding of the solar system. For example, the magnetosphere-ionosphere coupling
phenomena are complex processes that involves the interaction of the solar wind with the Earth’s
magnetic field and upper atmosphere. Understanding this process is essential for understanding
fundamental physical processes in the universe as well as predicting space weather events,
which can have significant impacts on satellite operations, power grids, and communication
systems. However, the data and research results relevant to these phenomena are spread across
diferent scales, labs, and organizations, making integration and collaboration challenging.</p>
      <p>
        To address these challenges, the Helio-KNOW project [
        <xref ref-type="bibr" rid="ref1 ref2 ref3">1, 2, 3</xref>
        ] aims to build a knowledge
commons [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] for managing heliophysics metadata and fostering scientific discovery. This project
is being developed as part of a multi-year efort involving several organizations, including NASA
Jet Propulsion Laboratory, the Center for Astrophysics, Harvard and Smithsonian, and Paris
Observatory. The goal of the project is to provide an infrastructure for the management of
heliophysics knowledge at scale, enabling researchers to integrate and analyze data from diverse
sources and disciplines. For this poster we focus on HelioKOS, a knowledge organization system
for heliophysics.
      </p>
    </sec>
    <sec id="sec-3">
      <title>2. System overview</title>
      <p>The HelioKOS system is a comprehensive and modular platform designed to facilitate
knowledge discovery and management in the field of heliophysics. The system provides a robust
infrastructure for the ingestion and annotation of scientific publications, enabling researchers
to search and analyze data from diverse sources and disciplines.</p>
      <p>
        At a high level, the system is centered around the Heliophysics Knowledge Graph (HelioKG),
which serves as a central repository for all data and metadata within the system. The
Astrophysics Data System (ADS) [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] provides access to a vast corpus of scientific literature, which is
ingested into the system via the Dagster-ADS component (see Figure 1). The ingested data is
then stored in the HelioKG, providing a rich knowledge base for further analysis and processing.
      </p>
      <p>
        To facilitate knowledge discovery, the system provides a range of tools and components
for concept-scheme development, harmonization, and tagging. Based on Ontoportal [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], the
Helioportal component provides access to a range of retrieval and corpus-tagging ontologies
and taxonomies, for example the Unified Astronomy Thesaurus [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], which can be loaded into
the system and used to develop new concept-schemes within the HelioKOS component. The
harmonization component of HelioKOS enables the integration of multiple concept-schemes,
providing a unified view of the data and enabling more efective analysis.
      </p>
      <p>The system also provides tools for query registration, ground-truth development, and
performance measurement. These tools are available under the ADS Query Eval library. The query
evaluation component enables users to register queries within the system, while the
groundtruth component facilitates the development of a corpus-query ground-truth in consultation
with a user. The harmonization strategy component enables the registration of
harmonizationapplication strategies, while the performance measure component enables the registration of
performance measures. The performance measure job component runs performance
measurements by comparing query results with and without annotations, as well as between diferent
harmonization strategies, and stores the results back into the HelioKG. Combined together,
these components allow to evaluate the efects of tagging a corpus of scientific research with a
concept-scheme. They also allow to measure the efectiveness of the harmonization process
over two concept-schemes.</p>
    </sec>
    <sec id="sec-4">
      <title>3. Conclusion and Future Work</title>
      <p>Overall, the HelioKOS system provides a powerful and flexible platform for knowledge discovery
and management in the field of heliophysics. By integrating data from diverse sources and
disciplines, the system enables researchers to gain new insights and make more informed
decisions, ultimately advancing our understanding of the solar system and its impact on our
lives.</p>
      <p>Our future work as part of HelioKNOW includes providing a set of guidelines for publishing
and indexing heliophysics resources on the Web. We believe that these guidelines will promote
data sharing and interoperability, enabling researchers to integrate and analyze data from
diverse sources and disciplines. Additionally, we are developing an ontology of heliophysics
phenomena that will enable the enrichment of HelioKG with scientific datasets.</p>
      <p>We hope for this poster to provide a valuable overview of our project and will generate
insightful feedback from the community. We welcome suggestions and collaborations to help
guide our next steps and advance the state of the art in heliophysics knowledge management.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>R.</given-names>
            <surname>McGranaghan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Halford</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Narock</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Ringuette</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Bard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Thompson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Young</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Klein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Cameron</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Schonfeld</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Higginson</surname>
          </string-name>
          ,
          <article-title>The need for a Solar and Space Physics Knowledge Commons, in: Bulletin of the American Astronomical Society</article-title>
          , volume
          <volume>55</volume>
          ,
          <year>2023</year>
          , p.
          <fpage>276</fpage>
          . doi:
          <volume>10</volume>
          .3847/25c2cfeb.a4634da3.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>R. M. McGranaghan</surname>
            ,
            <given-names>B. A.</given-names>
          </string-name>
          <string-name>
            <surname>Thomas</surname>
            ,
            <given-names>B. J.</given-names>
          </string-name>
          <string-name>
            <surname>Thompson</surname>
            ,
            <given-names>R. A.</given-names>
          </string-name>
          <string-name>
            <surname>Ringuette</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <string-name>
            <surname>Young</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Klein</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Cameron</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Narock</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Bard</surname>
            ,
            <given-names>A. J.</given-names>
          </string-name>
          <string-name>
            <surname>Halford</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Shimizu</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Winston</surname>
            ,
            <given-names>E. A.</given-names>
          </string-name>
          <string-name>
            <surname>Henneken</surname>
          </string-name>
          ,
          <article-title>Infrastructure for 21st century science: Towards a Heliophysics knowledge commons</article-title>
          ,
          <source>in: AGU Fall Meeting Abstracts</source>
          , volume
          <year>2022</year>
          ,
          <year>2022</year>
          , pp.
          <fpage>SH56A</fpage>
          -
          <lpage>02</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>R. M. McGranaghan</surname>
          </string-name>
          ,
          <article-title>Complexity heliophysics: A lived and living history of systems and complexity science in heliophysics, 2024</article-title>
          . URL: https://arxiv.org/abs/2307.03287. arXiv:
          <volume>2307</volume>
          .
          <fpage>03287</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>R.</given-names>
            <surname>McGranaghan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Klein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Cameron</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Young</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Schonfeld</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Higginson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Ringuette</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Halford</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Bard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Narock</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Thompson</surname>
          </string-name>
          ,
          <article-title>The need for a Space Data Knowledge Commons, Structuring Collective Knowledge (</article-title>
          <year>2021</year>
          ). Https://knowledgestructure.pubpub.org/pub/space
          <article-title>-knowledge-commons.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>E. A.</given-names>
            <surname>Henneken</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Kurtz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Accomazzi</surname>
          </string-name>
          ,
          <source>The ADS in the Information Age - Impact on Discovery, in: Organizations</source>
          , volume
          <volume>1</volume>
          ,
          <year>2012</year>
          , pp.
          <fpage>253</fpage>
          -
          <lpage>263</lpage>
          . doi:
          <volume>10</volume>
          .48550/arXiv.1106. 5644.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>C.</given-names>
            <surname>Jonquet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Graybeal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bouazzouni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Dorf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Fiore</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Kechagioglou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Redmond</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Rosati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Skrenchuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. L.</given-names>
            <surname>Vendetti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Musen</surname>
          </string-name>
          ,
          <article-title>Ontology repositories and semantic artefact catalogues with the ontoportal technology</article-title>
          , in: T. R.
          <string-name>
            <surname>Payne</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          <string-name>
            <surname>Presutti</surname>
            , G. Qi,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Poveda-Villalón</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Stoilos</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Hollink</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          <string-name>
            <surname>Kaoudi</surname>
          </string-name>
          , G. Cheng, J.
          <source>Li (Eds.)</source>
          ,
          <source>The Semantic Web - ISWC 2023</source>
          , Springer Nature Switzerland, Cham,
          <year>2023</year>
          , pp.
          <fpage>38</fpage>
          -
          <lpage>58</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>K.</given-names>
            <surname>Frey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Accomazzi</surname>
          </string-name>
          ,
          <source>The Unified Astronomy Thesaurus: Semantic Metadata for Astronomy and Astrophysics</source>
          ,
          <volume>236</volume>
          (
          <year>2018</year>
          )
          <article-title>24</article-title>
          . doi:
          <volume>10</volume>
          .3847/
          <fpage>1538</fpage>
          -4365/aab760. arXiv:
          <year>1801</year>
          .01021.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>