<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A new alignment method based on FoodOn as pivot ontology to manage incompleteness in nutritional legacy data sources</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Patrice BUCHE</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Julien CUFI</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Stéphane DERVAUX</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Juliette DIBIE</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Liliana IBANESCU</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alrick OUDOT</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Magalie WEBER</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>BIA INRA</institution>
          ,
          <addr-line>Nantes</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>IATE, Univ Montpellier, INRA, CIRAD, Montpellier SupAgro</institution>
          ,
          <addr-line>Montpellier</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>LIRMM, Univ Montpellier, CNRS, INRIA GraphIK</institution>
          ,
          <addr-line>Montpellier</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>UMR MIA-Paris, AgroParisTech, INRA, University Paris-Saclay</institution>
          ,
          <addr-line>Paris</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>To correctly assess the nutritional quality of a meal or a manufactured food product in a given country, the first step is to assess the nutritional values of its ingredients. Food composition databases (FCDBs), available in many countries and managed at the national level, provide energy and nutrient values for food components. Unfortunately, values for some nutrients of interest may be missing from the FCDB of the country in which the nutritional quality must be assessed. Finding nutrient values for similar foods in other FCDBs is one way to deal with this incompleteness. An additional issue arises because the vocabulary used to describe the ingredients of a meal or a recipe in a given FCDB usually differs from the vocabulary used in the others. In this paper, we address the problem of identifying the nutritional values of a recipe's ingredients by querying different FCDBs through FoodOn as a pivot ontology. We present a new alignment method, based on syntactic and semantic approaches, between two distinct FCDBs whose vocabularies are first transformed into ontologies. Our method has been evaluated on Ciqual, the French food nutritional database, and USDA, the United States food nutritional database. The incompleteness management task based on FoodOn as a pivot ontology has been assessed on a real use case concerning iron, vitamin B12 and vitamin C.</p>
      </abstract>
      <kwd-group>
        <kwd>Ontology alignment</kwd>
        <kwd>Food composition databases</kwd>
        <kwd>FoodOn</kwd>
        <kwd>LanguaL</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>between similar food concepts. This functionality is not available in state-of-the-art tools
(e.g. EuroFIR FoodExplorer).</p>
      <p>
        As an example, this tool should be able to take a term (for example
‘Courgette, puree’ from the CIQUAL FCDB) and retrieve the corresponding products in
other FCDBs with their associated nutritional values (e.g. ‘Squash, winter, acorn, cooked,
boiled, mashed, without salt’ in the USDA FCDB). To achieve this, we use as background
knowledge the LanguaL description [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] associated with each food term defined by a national agency.
LanguaL stands for "Langua aLimentaria" or "language of food". These descriptions
provide a multi-faceted semantic definition of a given food, expressed in a standardized
vocabulary, that we use to find similarities between food products belonging to the
vocabularies of different agencies. More than 40,000 foods used in food composition
databases are described in LanguaL [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>
        Our method also takes into account the English labels associated with food products in
FCDBs. The pivot of all these vocabularies is FoodOn [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], an ontology dedicated to food
description. FoodOn was initially based on a conversion of the LanguaL thesaurus.
For instance, the specialization hierarchy of terms associated with each LanguaL facet
was translated in FoodOn into a specialization hierarchy of concepts.
Additionally, FoodOn includes 9,500 food terms imported from the food database of the
Scientific Information and Retrieval Exchange Network of the US Food and Drug
Administration; these terms are organized in families and described in LanguaL.
      </p>
      <p>
        Our approach aligns the food products of the different FCDBs with FoodOn, based on
LanguaL faceted descriptions (semantic approach) in addition to product labels
(syntactic approach). Combining the two approaches makes it possible to overcome both
the lack of faceted descriptions for some products and the gaps of a purely syntactic
comparison (the same food may be denoted differently in different FCDBs).
      </p>
      <p>
        The main originality of our alignment approach is that it reuses the LanguaL descriptions
associated with FCDB food terms, available on the LanguaL website, in combination with
relevant alignment methods from the state of the art [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. We will present the main
principles of the approach and the results obtained when compensating for missing values
of vitamin C, vitamin B12 and iron for a set of Ciqual food products by reusing the values
associated with similar foods in USDA.
      </p>
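      <p>
        To make the combination of a semantic and a syntactic comparison more concrete, the
following Python fragment gives a minimal sketch. It is only an illustration under stated
assumptions, not the alignment method evaluated in this work: the Jaccard overlap of facet
codes, the character-based label comparison, the weight of 0.7, the function names and the
facet codes are all hypothetical choices made for the example.
      </p>
      <preformat>
```python
from difflib import SequenceMatcher


def facet_similarity(facets_a, facets_b):
    """Jaccard overlap between two sets of facet descriptor codes."""
    if not facets_a or not facets_b:
        return None  # no faceted description available for one product
    shared = facets_a.intersection(facets_b)
    combined = facets_a.union(facets_b)
    return len(shared) / len(combined)


def label_similarity(label_a, label_b):
    """Character-based similarity between two lower-cased English labels."""
    return SequenceMatcher(None, label_a.lower(), label_b.lower()).ratio()


def combined_score(product_a, product_b, semantic_weight=0.7):
    """Weighted mix of both scores; uses labels alone when facets are missing."""
    syntactic = label_similarity(product_a["label"], product_b["label"])
    semantic = facet_similarity(product_a["facets"], product_b["facets"])
    if semantic is None:
        return syntactic
    return semantic_weight * semantic + (1 - semantic_weight) * syntactic


# Invented placeholder facet codes, not real LanguaL descriptors.
courgette = {"label": "Courgette, puree", "facets": {"X001", "X002", "X003"}}
squash = {
    "label": "Squash, winter, acorn, cooked, boiled, mashed, without salt",
    "facets": {"X001", "X002", "X004"},
}
unfaceted = {"label": "Courgette puree", "facets": set()}

print(combined_score(courgette, squash))
print(combined_score(courgette, unfaceted))
```
      </preformat>
      <p>
        In this sketch, the two labels of the first pair share few characters, so the score is
driven mainly by the shared facet codes, whereas the second pair has no faceted description
for one product and falls back to the label comparison alone.
      </p>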
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Pehrsson</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Haytowitz</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>Food composition databases</article-title>
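          . In
          <string-name>
            <surname>Caballero</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Finglas</surname>
            ,
            <given-names>P. M.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Toldrá</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          , editors,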
          <source>Encyclopedia of Food and Health</source>
          , pages
          <fpage>16</fpage>
          -
          <lpage>21</lpage>
          . Academic Press, Oxford.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Ireland</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Moller</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>Food classification and description</article-title>
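          . In
          <string-name>
            <surname>Caballero</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Finglas</surname>
            ,
            <given-names>P. M.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Toldrá</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          , editors,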
          <source>Encyclopedia of Food and Health</source>
          , pages
          <fpage>1</fpage>
          -
          <lpage>6</lpage>
          . Academic Press, Oxford.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Dooley</surname>
            ,
            <given-names>D. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Griffiths</surname>
            ,
            <given-names>E. J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gosal</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Buttigieg</surname>
            ,
            <given-names>P. L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hoehndorf</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lange</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schriml</surname>
            ,
            <given-names>L. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brinkman</surname>
            ,
            <given-names>F. S. L.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Hsiao</surname>
            ,
            <given-names>W. W. L.</given-names>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>FoodOn: a harmonized food ontology to increase global food traceability, quality control and data integration</article-title>
          .
          <source>npj Science of Food</source>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Shvaiko</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Euzenat</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2013</year>
          ).
          <article-title>Ontology matching: State of the art and future challenges</article-title>
          .
          <source>IEEE Transactions on Knowledge and Data Engineering</source>
          ,
          <volume>25</volume>
          :
          <fpage>158</fpage>
          -
          <lpage>176</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>LanguaL indexed Datasets</surname>
          </string-name>
          (
          <year>2020</year>
          )
          <article-title>The LanguaL indexed Datasets</article-title>
          . http://langual.org/langual_indexed_datasets.asp
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>