=Paper=
{{Paper
|id=Vol-2807/abstractW
|storemode=property
|title=An Insight into Food Semantics: Review, Analysis, and Lessons Learnt over Food-Related Studies (short paper)
|pdfUrl=https://ceur-ws.org/Vol-2807/abstractW.pdf
|volume=Vol-2807
|authors=Gorjan Popovski,Gordana Ispirova,Eva Valenčič,Riste Stojanov,Tome Eftimov,Barbara Koroušić Seljak
|dblpUrl=https://dblp.org/rec/conf/icbo/PopovskiIVSEK20
}}
==An Insight into Food Semantics: Review, Analysis, and Lessons Learnt over Food-Related Studies (short paper)==
July 2020 An Insight into Food Semantics: Review, Analysis, and Lessons Learnt over Food-related Studies Gorjan Popovski a,b , Gordana Ispirova a,b , Eva Valenčič a,b , Riste Stojanov c , Tome Eftimov a , Barbara Koroušić Seljak a,1 a Computer Systems Department, Jožef Stefan Institute, 1000 Ljubljana, Slovenia b Jožef Stefan International Postgraduate School, 1000 Ljubljana, Slovenia c Faculty of Computer Science and Engineering, Ss. Cyril and Methodius, University, 1000 Skopje, North Macedonia A great amount of work has been done in predictive modelling in the past decades. This is made possible by the existence of biomedical vocabularies and standards. De- spite the availability of such resources in the health domain, the domain of food and nutrition is low-resourced and can greatly benefit from such methods. Lancet Planetary Health published that starting from 2019 the focus will be on the links between food systems, human health, and the environment. Several food ontologies exist, but each de- veloped for a specific application scenario. Hence, in 2019, the Big Food and Nutrition Data Management and Analysis (BFNDMA) workshop started at the IEEE International Conference on Big Data 2 , focusing focuses on methodologies for big data management and analysis for food and nutrition data. Recently, in an effort to tackle the task of Information Extraction (IE) from unstruc- tured text, several methodologies were proposed. DrNer [1] is a rule-based NER method for extracting information from evidence-based dietary recommendations. The authors have extended the methodology by creating a novel food NER method named FoodIE [2], which solely focuses on extracting food entities. It incorporates computational linguistic rules and semantic information. The authors have compared it to other existing food NER methods, showing that FoodIE provides the most promising results [3]. The collection of such information from different data sources is represented in various unstandardized ways, leading to the task of data normalization. It is a crucial task to facilitate and enable further analyses. Hence, StandFood [4] has been introduced - a semi-automatic system for classifying and describing foods according to FoodEx2. It is based on lexical simi- larity between food names. In a recent paper [5], the authors have conducted a domain- coverage analysis of an existing language for describing foods (LanguaL) by using Rep- resentation Learning (RL) methods, finding that the coverage of the food domain does not link the concepts together well, accenting the need for future efforts in food data nor- malization. FoodBase [6] is the first data corpus consisting of recipes (total of 23,000) annotated with the food entities found in them. The authors extend this work by propos- 1 Corresponding E-mail: barbara.korousic@ijs.si. 2 http://cs.ijs.si/bfndma/BFNDMA.html Copyright © 2020 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). July 2020 ing a food data normalization method (FoodOntoMap [7]) that performs food concept mapping across different food semantic resources. It is based on the use of food NER methods to perform the mapping. In addition, a visualization tool (FoodViz [8]) provides a framework aimed at making the links between different food standards understandable by food subject-matter experts. Finally, the authors in a recent paper [9] have performed a non-English language (i.e. Slovenian) case study where they employ RL methods in order to address the semantic similarity between food products’ names. Addressing open gaps, the extracted food entities can further be linked with entities from other domains (e.g. health, biomedicine, consumer and social sciences). This can help in reducing knowledge gaps that inhibit public health goals as well as the optimal development of scientific, agricultural and industrial policies, which requires relevant information from all food science domains, such as food safety, food authenticity and traceability, food sustainability, etc. Coupled with Representation Learning techniques, this can pave the way for methods to extract information in order to improve personalized nutrition and medicine, as well as public health. Acknowledgments. This research was supported by the Slovenian Research Agency (research core grant number P2-0098), and the European Union’s Horizon 2020 research and innovation programme (FNS-Cloud, Food Nutrition Security) (grant agreement 863059). The information and the views set out in this publication are those of the authors and do not necessarily reflect the official opinion of the European Union. Neither the European Union institutions and bodies nor any person acting on their behalf may be held responsible for the use that may be made of the information contained herein. References [1] T. Eftimov, B. Koroušić Seljak, and P. Korošec, “A rule-based named-entity recognition method for knowledge extraction of evidence-based dietary recommendations,” PloS One, vol. 12, no. 6, p. e0179488, 2017. [2] G. Popovski, S. Kochev, B. Koroušić Seljak, and T. Eftimov, “Foodie: A rule-based named-entity recog- nition method for food information extraction,” in Proceedings of the 8th International Conference on Pattern Recognition Applications and Methods, (ICPRAM 2019), pp. 915–922, 2019. [3] G. Popovski, B. Koroušić Seljak, and T. Eftimov, “A survey of named-entity recognition methods for food information extraction,” IEEE Access, vol. 8, pp. 31586–31594, 2020. [4] T. Eftimov, P. Korošec, and B. Koroušić Seljak, “Standfood: Standardization of foods using a semi- automatic system for classifying and describing foods according to FoodEx2,” Nutrients, vol. 9, no. 6, p. 542, 2017. [5] G. Popovski, B. Paudel, T. Eftimov, and B. Koroušić Seljak, “Exploring a standardized language for describing foods using embedding techniques,” in 2019 IEEE International Conference on Big Data (Big Data), pp. 5172–5176, 2019. [6] G. Popovski, B. Koroušić Seljak, and T. Eftimov, “FoodBase corpus: a new resource of annotated food entities,” Database, vol. 2019, 11 2019. baz121. [7] G. Popovski., B. K. Seljak., and T. Eftimov., “Foodontomap: Linking food concepts across different food ontologies,” in Proceedings of the 11th International Joint Conference on Knowledge Discovery, Knowl- edge Engineering and Knowledge Management - Volume 2: KEOD,, pp. 195–202, INSTICC, SciTePress, 2019. [8] R. Stojanov, G. Popovski, N. Jofce, D. Trajanov, B. Koroušić Seljak, and T. Eftimov, “Foodviz: Visu- alization of food entities linked across different standards,” in Proceedings of The Sixth International Conference on Machine Learning, p. In Press., 2020. [9] G. Popovski., G. Ispirova., N. Hadzi-Kotarova., E. Valenčič., T. Eftimov., and B. K. Seljak., “Food data in- tegration by using heuristics based on lexical and semantic similarities,” in Proceedings of the 13th Inter- national Joint Conference on Biomedical Engineering Systems and Technologies - Volume 5: HEALTH- INF,, pp. 208–216, INSTICC, SciTePress, 2020.