=Paper=
{{Paper
|id=Vol-2978/casa-paper6
|storemode=property
|title=A Framework for Adaptive Context and User-Related Management of Multimedia Contents (short paper)
|pdfUrl=https://ceur-ws.org/Vol-2978/casa-paper6.pdf
|volume=Vol-2978
|authors=Mariagrazia Fugini,Jacopo Finocchi,Elisa Rossi
|dblpUrl=https://dblp.org/rec/conf/ecsa/FuginiFR21
}}
==A Framework for Adaptive Context and User-Related Management of Multimedia Contents (short paper)==
A Framework for Adaptive Context and User-Related Management of Multimedia Contents M.G. Fugini 1, J. Finocchi 1, E. Rossi 2 1 DEIB - Politecnico di Milano, Milano, Italy 2 DIG - Politecnico di Milano, Milano, Italy Abstract This paper presents the specification of a software platform where knowledge is associated to maps, through the organization and presentation of geo- and chronological-referenced multimedia contents, in such as a way that this association is automatically adaptive to different contexts (application areas) and to different typologies of users. The platform adaptivity has the purpose of reducing the information overload of the maps and of offering the user targeted navigation and search functions, tailored to his/her typology (e.g., domain expert, citizen, public administration operator, decision maker, stakeholder, etc.) and dependent on the loaded contents. To achieve this, we focus on the problem of selecting and aggregating the multimedia contents of the map, in a way that depends on the context and on the user. This reduction is obtained as a dynamic process, using a metamodel in the form of ontology of tags. Keywords Georeferenced Data, Data Discovery, Map Enrichment, Context Adaptivity, Multimedia Content, Multimedia Tagging, Metadata Analysis. 1. Introduction1 The set of tags associated to multimedia data are modelled in an ontology with: The typical aim of geographic-based sets of concepts shared among various multimedia applications is to enrich a map with contexts (e.g., “geo-location” tag, “time”, georeferenced contents, allowing one to explore “user”, “media_type”, etc.) and selected areas and run thematic analyses at sets of concepts typical of a context (e.g. various levels of detail, and with different aims “year_of_discovery” and “host_museum” and focus, so that is will be possible to perform tags in the Cultural Heritage context, analytics on multimedia map contents. Such “approval_document” tag in the Urban analyses usually aim at exploring various levels Planning context, and “level_of_robotization” of interest on the map contents, and at obtaining and “efficiency_formula” tags in the Industry zoom levels on the represented realm. context). In this paper, we propose an adaptive framework that enriches maps with knowledge Domain experts develop and specialize the using an ontology of tags, which describe the ontology for each context. map multimedia contents, the geo references, the The tags guide the dynamic generation of time references, the context, and the user thematic map layers, as defined in [1], which are behaviour, so that the contents can adapt to: 1) the basis for content navigation. In fact, the different contexts; 2) different users. navigation interface is based on a set of layers, some recommended by the system, others customized by the user. Each layer contains the Proceedings Name, Month XX–XX, YYYY, City, Country points of interest belonging to one or more EMAIL: mariagrazia.fugini@polimi.it (M.G.Fugini); categories, i.e., connected to contents associated jacopo.finocchi@ mail.polimi.it (J.Finocchi); with those tags. Each tag identifies a category of elisa5.rossi@mail.polimi.it (E.Rossi) ORCID: 0000-0002-0692-0153 (M.G.Fugini) contents. The choice of how to split tags among Copyright 2021 © for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). the various layers is taken dynamically CEUR Workshop Proceedings (CEUR-WS.org) considering which nodes of the ontology are actually instantiated by the contents present in teaching activity tasks, to support personalized the context. flipped learning. The paper is organized as follows. In Section 2, we review related work. In Section 3, we 3. Associating knowledge to maps: present the overall approach of associating knowledge to maps. In Section 4, we discuss issues and our approach about navigating tags for adaptivity and draw the conclusions. In our platform, knowledge associated to geographical maps links geographical elements with geo- and temporal-referenced multimedia 2. Related work contents. The framework is being developed to be adaptable to: Nowadays, geographic-based applications are - various contexts, such as history, cultural a widespread object of study. Developers are heritage, urban planning, digital twinning implementing many different solutions that of devices and artefacts, and citizen combine geo-referenced information related to a map, ending up with maps that can be used either journalism. as general-purpose exploration tools, or as - different user typologies. professional tools for specific application The key innovative features of the proposed contexts. framework are as follows. Some of these tools combine geographic and 1. Handling of multimedia content: the multimedia information to improve content framework focuses on the organization and search, e.g., in [2], where visual features of navigation of georeferenced multimedia, such pictures are integrated with geographic data, as pictures, video, audio tracks, point clouds, based on the points of interest concept. unstructured text or documents. The research in [3] tackles the semantic gap 2. Adaptivity to different contexts and between semantic needs of users and the users: the framework comprises a meta- visualization of multimedia content. The structure (semantic tags ontology), which proposed solution is focused on how to extract allows instantiation onto different contexts, information from the context of multimedia via self-adaption to loaded contents. documents and their metadata, rather than from 3. Temporal dimension: chronological media raw content itself. metadata are treated separately, so that Also [4] enhances the semantic annotation of specific navigation modes can be provided multimedia contents by leveraging their web along the time dimension. context and the user comments. Videos are indexed via a large set of labels, and are linked Contents are linked to the map through the with related contents. definition of a number of Points Of Interest The solution presented in [5] implements a (POI), which are selected and clustered semantic tag recommendation technique for according to our self-adaptive semantic tag image tagging, relying on a graph of ontology. Content classification adapts to the relationships among words. context-specific ontology, by matching the The development of an adaptive multimedia content metadata with the most relevant ontology content navigation is a main goal of adaptive node. hypermedia systems, mainly based on the user In the case of massive contents loaded from profiles, as discussed in [6]. an existing data source, the classification of The problem of adaptivity in accessing multimedia under our semantic tags is performed multimedia contents is often addressed in the by a supervised ML classifier, starting from the field of e-learning systems. For example in [7], metadata natively associated to each content item ML techniques are applied to adapt the difficulty in the data sources, along with geo and temporal level of the presentation to the learner style and tags. to student assessment. In [8], the authors adopt an ontology-based approach to integrate the relevant knowledge in 3.1. Modeling the content presentation. Their ontology not only models the domain knowledge but also the For the representation of geo and temporal the geographic dimension, the temporal information, we propose a flexible data structure dimension, and the thematic dimension. The first that joins existing cartographic formats to enable can be navigated with the interactive tools linking pieces of information one to another and typical of maps, i.e., panning and zooming. The to map. second is navigable through chronological The approach consists in adding the following selection filters. The third is navigable through issues to geographical objects: the definition of content layers, with the support - unstructured multimedia content, together of an ontology of semantic classes. with source of information attributes as well The first two dimensions are organized as domain-specific attributes, and independently of the context, while the thematic - temporal dimension, to reach the concept of dimension is strongly linked to the context of 4-dimensional map. application. We started from the definition of a data The prototype we are developing will be model, aimed at representing and managing the grounded on an existing open cartographic semantic contents associated to the map. The system (precisely, OpenStreetMap) with various data model allows integrating the cartographic added layers of data, representing the context- data with heterogeneous structured data (coming specific elements. Compared to a classic map, from different sources) and with unstructured this model allows browsing the data as a 4D data (multimedia), including the chronological map, enriched with multimedia content. dimension. Tags representing the temporal dimension The hinge between geographic elements and allow associating a point of interest to a set of contextual content is the definition of POI. Since events that occurred in its location or describing knowledge is often related to a geographical the evolution of the associated contents along element rather than a single point, a POI can also time. correspond to an area, accompanied by related The knowledge associated with the map context information. through the metadata of the multimedia contents is organized along three information dimensions: Figure 1: The ontology of tags Figure 1 shows an outline of the ontology we On the right-hand part of Figure 1, we are developing. A map element is defined with represent the content associated to the map, geographic coordinates and can represent a organized into different contexts and including single point (like a building), a line (like a street) the multimedia content and its related metadata. or a polygon (an area). Deriving the content classification from a while transversal modules deal with Data semantic ontology allows for a language- Quality and Validation. independent consultation, since each node of the In the topmost area of our architecture, the ontology describing the contents can be Interface layer allows users to interact with the expressed with a specific tag for each language analysis modules, to enter new data and to managed by the system. This is different from visualise the map, dealing with Security and textual search in metadata, which is instead Privacy issues by filtering accesses according to dependent on the language used to specify the the different user privileges. Here, some modules metadata. aim at validating the provenance of the data sources, and to model the additional data 3.1. Functional architecture according to the target representation, which is visualised in a personalised way according to the rights of the different users. The functional architecture of the proposed The framework is designed to be used by framework is shown in Figure 2. different categories of users: Public The Storage area contains a spatial database Administrations, professional users such as for geographical and temporal data connected to architects or urban designers, and individual the map, together with an additional database citizens. Each category is granted access to storing information about contexts and different contents and functionalities according multimedia. to the needs and roles. In the Analytics area, the Adaptation module implements the adaptivity of the framework, Figure 2: Framework overview testing purposes, we are considering importing a 3.2. Enriching a Map: series of publicly available georeferenced multimedia contents. Semantic Annotations In the content acquisition phase, the interface layer assigns to data a set of attributes derived After the definition of the general framework, from metadata and from source registration the following step is to design an interactive tool information, such as privacy, licensing, aimed at enriching the map layer with content confidence, and so on) and to content, to enable provided by external and voluntary sources, its thematic classification. according to crowdsourcing models, since, for Considering classification of the content, Selection, namely filtering, i.e., showing either we provide a tree of predefined tags for only some contents and filtering out homogeneous and language-independent irrelevant contents. classification, or we leave the contributors free to Grouping, namely clustering, i.e., enter user-defined tags in the tree. showing a single object that summarizes Concerning who can upload content, the various different contents. framework is open to ordinary users, voluntary Ranking, namely proposing only the citizens and qualified sources with credentials, most relevant contents according to a such as public administrations, universities, given order of relevance. associations, or professionals. Regarding who can read data, we envisage The framework could adapt to the data public, private, password-protected information entered into the archive (multimedia data, or payment-based access. The type of access metadata, and tags). This should occur not depends on the type of licence the system grants: simply by returning different data (otherwise it data can be public domain, or can be used under would just be a normal data-driven system) but creative-common licences, or provided under the by modifying the navigation and presentation authorisation of the owner. Some types of users methods depending on the type of data. For can add data, which they can decide to share with example, the modification can be made other users (a community – selective access, all dependent on: a) whether we are dealing with users – public access, no users – restricted images or videos or texts etc., or b) on the value access) and combine them with the existing data. of the tags. The combination is guided by the ML If applied to multimedia elements, some ML algorithms, that help classifying data according techniques can be used to generate and better to what defined in the ontology. structure the tags associated to the different media contents, in order to run further analyses 4. Discussion about Navigating Tags of clustering or aggregations. for Adaptivity and Conclusions Our proposed navigation and query mode is conceived to display information on the map, considering the different granularity of data, The presented framework aims at offering a depending on the displayed area. The higher the query and analysis environment that supports geographical zoom level, the higher the temporal navigation along tags and enables analyses, e.g., detail and the semantic detail of the returned for decision making, by intersecting content information. In this way, information on geo/temporal referenced data coming from the map is presented at different levels of different sources and with different formats and granularity, automatically linked to each other. informative content, including multimedia data. The zooming function can operate onto the The navigation and query interface would three dimensions: spatial, temporal and thematic. adapt to the context. The way to achieve The first is the classic geographical zoom; the adaptation is, for example, the selection of second groups events into macro-events or different query filters or the dynamic generation chronological periods; the third is a semantic of a layer containing context-dependent zoom, where one can navigate a taxonomy elements. dynamically derived from the context-specific As an alternative, we envision the dynamic ontology. generation of the POI and the clusters of It is also interesting to adapt the user analyses multimedia content associated with them. The to the context; this is a matter of current study. idea is that not only the content of the clusters can be dynamically navigated (which is somehow obvious), but also their structure 5. Acknowledgements should adapt to the context. Tags adapt to the uploaded content or vary based on user queries. This paper is partially supported by the We plan to have three navigation modes to Projects BigData and Seamless of Regione reduce information overload in a way that is Lombardia, Italy, and by the EU Horizon 2020 adaptable to the context, where the context is research and innovation programme under grant defined by the set of entered contents and by the agreement No. 826232, project WorkingAge user preferences: (Smart Working environments for all Ages). 6. References [1] H. Karimi, H. Zeinivand, Integrating runoff map of a spatially distributed model and thematic layers for identifying potential rainwater harvesting suitability sites using GIS techniques, Geocarto International, (2021), pp. 320-339. [2] E. Purificato, A. M. Rinaldi, Multimedia and geographic data integration for cultural heritage information retrieval, Multimedia Tools and Applications, 77(20), 2018. [3] T. Bracamonte, B. Bustos, B. Poblete, T. Schreck, Extracting semantic knowledge from web context for multimedia IR: a taxonomy, survey and challenges, Multimedia tools and applications, 77(11), 2018. [4] D. Fernández, D. Varas, J. Espadaler, I. Masuda, J. Ferreira, A. Woodward, E. Bou, Vits: video tagging system from massive web multimedia collections, in: Proceedings of the IEEE International Conference on Computer Vision Workshops, 2017, pp. 337-346. [5] H. K. Hong, G. W. Kim, D. H. Lee, Semantic tag recommendation based on associated words exploiting the interwiki links of Wikipedia, Journal of Information Science, 44(3), 2018, pp. 298-313. [6] A. C. T. Klock, I. Gasparini, M. S. Pimenta, J. P. M. De Oliveira, Adaptive hypermedia systems, in: Advanced Methodologies and Technologies in Media and Communications, IGI Global, 2019, pp. 217-228. [7] H. Anantharaman, A. Mubarak, B. T. Shobana, Modelling an adaptive e-Learning system using LSTM and Random Forest classification, in: 2018 IEEE Conference on e-Learning, e-Management and e-Services (IC3e), 2018, pp. 29-34. [8] Y. L. Chi, T. Y. Chen, C. Hung, Learning adaptivity in support of flipped learning: an ontological problem-solving approach. Expert Systems, 35(3), 2018.