Semantic Educational Web Portal © Victor Telnov National Research Nuclear University MEPhI, Obninsk, Russia telnov@bk.ru Abstract. The paper deals with the pilot project devoted to the application of the knowledge graphs in the educational activities of the universities. The ontology of the curriculum and the training courses, as well as the means of authoring, enrichment and adaptation of the learning objects are considered. The visual navigation on the knowledge graphs is carried out by using the special searching widgets and smart RDF browser. Working with semantic repository and text analytics is performed on the cloud platforms via SPARQL queries and RESTful services. The software architecture in UML-notation are presented, examples of real use of the educational portal are given. Keywords: semantic web, educational portal, ontology, knowledge graph, triplestore, RDF storage, graph database, cloud computing. A recent authoritative overview [14] deals with the 1 Introduction Graph and RDF databases makes it possible to navigate Students and professors spend a lot of time and among modern products and solutions in the field of the efforts finding useful information, instead of having to Semantic Web, where the leaders are AllegroGraph, comprehend and interpret the learning content. It was ArangoDB, BlazeGraph, Cray, DataStax, Ontotext rightly observed that the traditional web technologies GraphDB, IBM Graph, MarkLogic, OrientDB, Neo4j, (sometimes referred to as WEB 2.0) do not provide Stardog, Teradata, Aster, Virtuoso. adequate search and navigation in the environment of It looks very promising the cooperative project distributed knowledge at the semantic level. Ontotext and Impelsys on the joint using of the platforms Naturally the thought came about some personal GraphDB and Dynamic Semantic Publishing for the smart learning agents (software), which could identify development of personalized adaptive learning. relevant information from any accessible data source and The pilot project [7] which is considered in this provide an information synthesis tailored to personal article is based on the cloud semantic platform and uses learning objective. The idea of semantic educational network RESTful services. The preferred repositories for portals that could provide a meaningful integration of learning objects themselves are the remote data storages. educational objects with the adaptation and The predecessor of this project is the Cloud cabinet of the personalisation of training courses and curricula, Educational portal “Department online” [2]. The project appeared almost simultaneously with the advent of the under consideration has been implemented in the Semantic Web. educational practice of National Research Nuclear During the semantization the data are combined into University MEPhI, Russia. triplets in accordance with the RDF model and form a RDF browser is the main highlight of the Semantic graph. If the data are the learning objects, than that form Educational Web Portal [7], which distinguishes it from the so-called knowledge graph. It is obvious that the most most of the known solutions in the field of the Semantic adequate repository for the knowledge graphs are the Web. Once being in the desired place of the knowledge graph databases. graph via the corresponding widget, then you can to The semantic graph database, also referred to as an perform a visual navigation in this graph, simply walking RDF triplestore, stands out from the other types of graph along its nodes. databases due to the possibility to support ontologies. There is a possibility to make a visual walk through The semantic graph database is capable to integrate the knowledge graph as far as you want in any direction, heterogeneous data from many sources and create scooping up the data that appears. By focusing on a relationships between datasets. That database focuses on specific graph node, it is possible to obtain text metadata, the relationships between entities and is able to infer new media content and hypertext links that are associated knowledge out of existing information. It is a powerful with this node. Along with that the nearest neighborhood tool to use in relationship analytics and knowledge of the node becomes visible and accessible for discovery. navigation. 2 Related Work and Novelty 3 The Ontology The fundamental technologies of the Semantic Web, Proceedings of the XIX International Conference the knowledge graphs for example, are based on a set of “Data Analytics and Management in Data Intensive universal standards, as set down by the World Wide Web Domains” (DAMDID/RCDL’2017), Moscow, Russia, Consortium (W3C) international community [17]. From October 10–13, 2017 50 the point of view of semantic technology, the key these terms, limiting their meaning within a particular standards that apply are the Resource Description area. At the formal level, an ontology is a hierarchical Framework (RDF) and OWL (Web Ontology system consisting of a set of concepts and a set of Language). assertions about these concepts on the basis of which it RDF(S) [18], or triplets, is the format uses to store is possible to describe classes, relations, functions, and data in knowledge graphs. OWL [19] is based on the individuals (instances of classes). Description Logics language which is designed to show In the language of Description Logics (DL) [4], a set the data schema and to represent rich and complex of assertions of a general kind, or terminology, is called knowledge about hierarchies of things and the relations TBox (intensional knowledge). It is TBox that forms an between things. It is complementary to RDF and allows ontology in the proper sense of the word. In Description for formalizing a data schema/ontology in a given Logics, sets of assertions of a individual kind – ABox domain of knowlrdge, separately from the data itself. (extensional knowledge) are singled out separately. In the general case an ontology is a formal TBox together with ABox forms a meaningful specification that provides sharable and reusable knowledge base (knowledge graph). knowledge representation. An ontology includes Below in Figure 2 is an example of the relationship descriptions of concepts and properties in a concrete between the class and individuals. Here the individual domain of knowlrdge, relationships between concepts, named Semantic_Web belongs to the class named constraints on how the relationships can be used and Training_Course. In addition, this individual has a occasionally individuals as instances of concepts. number of relations with individuals of other classes. Figure 1 partially shows the ontology [11] that is used This can be a relations of different types and directions, in the Semantic Educational Web Portal [7]. In Figure 1, as can be seen from the color and direction of the arrows the Training_Course class is intentionally highlighted, in Figure 2. because later this class and its individuals will be used as The very kinds of relations, like classes, are usually explanatory examples. defined in TBox, whereas the facts of the existence of a certain kind of relationship between concrete individuals are intrinsically some RDF-assertion in ABox and each assertion has a triplet appearance. Below Figure 3 shows a diagram of the relationship between classes from the ontology. This diagram presents only the top-level relationships. Every beam of particular color is a set of relations between individual instances of two classes. Each individual relation in the ontology (that is in the knowledge graph) inherently is an RDF assertion where the subject is an instance of one class, the object is an instance of another class, and the reference is a predicate in the RDF format. Depending on the number of relations between instances of two classes, every beam on diagram in Figure 3 can be thicker or thinner and gets a color of the class with a large number of incoming relations. These relations can be in both directions (incoming, outcoming). The number of relations (links) between classes from the ontology is shown in the legend on the diagram in Figure 3. 4 Knowledge Graphs An ontology enriched with extensional knowledge from a specific subject area is also called the knowledge graph or knowledge base. Extensional knowledge forms Figur e 1 The class hierarchy in the ontology the contents of ABox. Practically, knowledge graphs are deployed in the graph database or in a different semantic Often, ontologies are understood as special repository (triplestore or RDF store). knowledge repositories that can be read and understood Specifically, the Semantic Educational Web Portal both by people and computers, alienated from the [7] is located on the Ontotext S4 GraphDB cloud developer and reused. Ontology in the context of platform [11] (physically on the Amazon Web Services information technology is usually a hierarchical system – AWS cloud platform [1]). of concepts and terms (structure, model) of a certain subject area. Informally, an ontology is a description of the world view as applied to a particular area of interest. This description consists of terms and rules for the use of 51 Figure 2 Individual of the class with neighborhood (example) Figure 3 Relationships between classes in the ontology Figure 4 Navigation on the knowledge graphs 52 The current prototype of the Semantic Educational supplemented by the international knowledge bases Web Portal [7] supports the curriculum presented in the DBpedia and Wikidata, as well as a number of more Cloud cabinet of the Educational portal «Department specialized knowledge repositories. online» [2]. Remote work with cloud version Ontotext Each of mentioned knowledge graphs contains tons GraphDB is carried out through the provided REST API. of triplets. The widgets shown below in Figure 4 are The most common operations are creating, reading, designed to allow a student or teacher easily get into the loading, and deleting semantic data. For the practical right place of the right knowledge graph, where it is implementation of network requests HTTP methods are likely find the required learning objects. used, such as GET, POST, PUT, DELETE. These The principle of working with these widgets is network requests contains essentially automatically largely similar to how information is searched through generated SPARQL queries of the following types. popular public search engines (Google, Yandex, etc.). As • SELECT to fetch data from the knowledge graph. the user types the letters of the keyword in the input line, • CONSTRUCT to create a new RDF graph. the system rolls out a list of relevant concepts from the knowledge graph. User can choose the most suitable • INSERT to add triples to a graph. concept and dive directly into the desired area of the • DELETE to remove triples from a graph. graph. Figure 4 shows the user interface of the Semantic Then, more accurate visual navigation on the Educational Web Portal, suitable for navigating through knowledge graph becomes possible, which is performed the available knowledge graphs. Primary graph of in an intuitively clear manner using the RDF browser, as knowledge contains the materials of the master's courses, described below. which are taught at the NRNU MEPhI on the profile “Computer networks and telecommunications”. It is Figure 5 Fragment of the knowledge graph in the RDF browser (example) 5 RDF browser the nearest neighborhood of the node becomes visible RDF browser is the main highlight of the and accessible for navigation. This environment includes Semantic Educational Web Portal, which nodes not only of that graph, through which you distinguishes it from most of the known solutions in originally has come in the semantic web, but also the the field of the Semantic Web. Having got to the right nodes of all other knowledge graphs of that are supported place of the necessary graph of knowledge through by the system. the corresponding widget, then you can perform a In Figure 5, some elements of the node's visual navigation in this graph, simply walking along neighborhood that correspond to the Semantic_Web its nodes. individual are displayed, as well as some related By focusing on a specific graph node, it is possible to metadata. If you focus on the next node that is displayed obtain text metadata, media content and hypertext links by the RDF browser, it also becomes available with its that are associated with this node. It is very important that neighborhood and metadata. 53 Thus you can to make a visual walk through the graph problem can be reduced to finding the shortest path in the of knowledge as long as you like in any direction, knowledge graph. This question has been studied, for scooping up the data that appears. In Figure 5, this is not example, in [9]. Neo4j Graph Database [10] has built-in shown, but in reality, when you hover over different means for calculating the shortest paths in the graph. sections of a particular node, pop-up menus, additional In real educational practice, the process of information and prompts for various options for constructing a specific training course in the process of continuing navigation through the knowledge graph formation of a curriculum largely is empirical procedure, becomes availible. based on the experience and knowledge of the lecturer. 6 Adaptive Learning Technology In order to assess the training outcomes and learning efficiency, the evaluation tools from the Cloud cabinet of The main challenge of e-learning systems is to provide the Educational portal “Department online” are used, see training courses tailored to different students with [2]. different learning rate and knowledge degree. Adaptive learning technologies are based on the fact that each student is unique, learns at varying rates and comes with different levels of knowledge. Traditional methodology of instruction may force the student down a learning path that is either too elementary, resulting in lack of interest or too heavy to grasp the nuances of the course. Adaptive learning, aided by semantic technologies [8], considers learner’s interaction with courses and assessment modules to create personalized learning paths. The adaptive learning system generally includes the following three subsystems. 1. The subsystem of forming a model of the learner (student model). 2. Learning planning subsystem (instructional model). 3. A subsystem for evaluating training outcomes. For the student model the most popular means of determining a student's skill level is the method employed in CAT (computerized adaptive testing). In Semantic Educational Web Portal «Department online» [5] various, not just computerized means for measuring a student's skill level are used. In fact, the same training course should be built in different ways, depending not only on the level of knowledge and abilities of students, but also on the learning objectives. For example, a training course in programming will look different for students who concentrate in the field of business informatics and in the field of computer networks. To build the actual instructional model and to fill it with learning objects, the Ontotext S4 Text Analytics RESTful service [13] is actively used. The purpose of text analysis is to create sets of structured data (machine- readable facts) out of heaps of unstructured, heterogeneous documents.Text analytics involves a set of techniques and approaches towards bringing various textual content to a point where it is represented as data and then mined for insights/trends/patterns. Contextual authoring provides lecturers with related texts, images and concepts which enhance the training course, reduces the time and costs of authoring and editing new learning content. Automated content enrichment improves the Figur e 6 Software architecture quality of curriculum and allows for continuous authoring without interruption. 7 Software Architecture When constructing adapted training courses, they are Figure 6 shows the Deployment Diagram for the usually optimized according to two criteria: the Semantic Educational Web Portal, performed in effectiveness and adaptability of training. From the accordance with the UML 2 standard [6]. This diagram mathematical point of view, in the idealized case the 54 can also be considered as an enlarged Component existence of the Semantic Web and Linked Open Data. Diagram for this software. As it can be seen from Figure They continue to use traditional Content Management 6, the component named “RDF Browser” does not have Systems (CMS), also known as Learning Management its own server code (back-end). It interacts with two Systems (LMS) or Virtual Learning Environments cloud RESTful services – Ontotext Text Analytics [13] (VLE), which are built primarily on simple taxonomies and Ontotext GraphDB [11], both physically are and thesauruses. deployed on the Amazon Web Services (AWS) [1] cloud Students and professors widely practice searching the platform.Cloud service Text Analytics [13] provides information on the World Wide Web for keywords, using tools for semantic annotation and update of educational public search engines for this purpose. Tradition plays a objects during the creation and adaptation of curricula. significant role here, as well as the simplicity and high Cloud service GraphDB [11] provides the semantic speed of the search query generation, in comparison with storage for knowledge graphs and is mainly used as a the search queries to the Semantic Web. SPARQL endpoint. As a universal repository for Despite the growing commercialization of the public educational objects of an arbitrary nature, Google Drive search engines, it can be assumed with a great deal of is used. The choice of this particular storage is not certainty, that they, along with Wikipedia, will remain principled, in parallel with it, arbitrary remote the most accessible “universal textbooks” for the repositories equipped with data display means, for foreseeable future for that numerous category of students example such as Microsoft OneDrive or Yandex.Disk who not always demand the quality and completeness of can be successfully applyed. the training material. An exception to this situation could The other two components, named “Wikidata Search be students (undergraduates) of universities who Agent” and “DBpedia Search Agent” both are advanced specialize in computer science and informatics. SPARQL endpoints to the corresponding international knowledge bases. Both mentioned components are 10 Acknowledgements provided with libraries of patterns of search queries, The work was supported by the NBO “Vladimir Potanin which largely facilitate the work of users, as well as are Charity Fund”, project No ГК160001360. capable to deliver and show the found content in a variety of formats, including graphics. References 8 Discussion [1] Amazon Web Services (AWS) – Cloud Computing Services (2017). https://aws.amazon. com/ The pilot project presented in this article is aimed not only to provide students and teachers with a flexible [2] Cloud cabinet of the Educational portal «Department knowledge management tool, but also to stimulate them online» (2017). http://cloud.obninsk. ru/ to get acquainted with the world of semantic [3] DBpedia (2017). https://ru.wikipedia.org/wiki/ technologies. DBpedia To the middle of 2017 a sufficient toolkit for working [4] Description Logics (2017). http://dl.kr.org/ with ontologies, knowledge graphs and semantic [5] Educational portal “Department online” (2017). repositories of triplets, including on cloud platforms, has http://ksst.obninsk.ru/ already been created. There is a great variety of public [6] ISO 19505 UML Part 2 Superstructure (2012). SPARQL endpoints. The English segment of the World https://drive.google.com/file/d/0B0jk0QU2E5q9NV Wide Web is filled with Linked Open Data. This is IwMFNieGxOZVU mainly reference data, bibliographic, media and other [7] Knowledge graph of the Educational portal information of encyclopedic nature. “Department online” (2017). Attempts to find the open semantic data in the http://semantic.obninsk. ru/ Russian segment of the World Wide Web infrequently [8] Learning Resource Metadata Initiative (2017). lead to success. We have to agree with the fact, that in http://lrmi.dublincore.net/ Russia there are still little Linked Open Data, suitable for [9] Marwah, Alian1, Riad, Jabri: A Shortest Adaptive educational activities. The main sources of data for Learning Path in eLearning Systems: Mathematical Russian users of the semantic web are still international View. J. of American Science, 5 (6), pp. 32-42 knowledge bases, including Russian-language content, (2009). doi:10.7537/marsjas050609.08 primarily DBpedia [3] and Wikidata [15]. The prototype [10] Neo4j Graph Database (2017). https://neo4j.com/ of the semantic educational web portal created is intended to partially fill this gap. [11] Ontology of the Semantic Educational Web Portal (2017). http://drive.google.com/file/d/ 9 Concluding Remarks 0B0jk0QU2E5q9Y0x6bTJaOEpXLWM A well-known skepticism about the fact that semantic [12] Ontotext S4 GraphDB (2017). http://docs.s4. educational portals will soon become widespread in the ontotext.com/display/S4docs/Fully+Managed+ university environment seems fair. The modern realities Database of higher education are such that the overwhelming [13] Ontotext S4 Text Analytics (2017). http://docs. number of students and teachers do not suspect the s4.ontotext.com/display/S4docs/Text+Analytics 55 [14] Philip Howard: Graph and RDF Databases 2016. Obninsk, Russia, pp. 195-204. http://ceur- Market Report Paper by Bloor. http://www. ws.org/Vol-1536/ bloorresearch.com/research/market-report/graph- [16] Wikidata (2017). http://www.wikidata.org and-rdf-databases-2016/ [17] W3C Semantic Web (2017). https://www.w3.org/ [15] Victor Telnov: Semantic Web and Search Agents for standards/semanticweb/ Russian Higher Education. A Pilot Project. CEUR [18] W3C RDF Schema 1.1 (2014). https://www.w3. Workshop Proc. 1536. Selected Papers of the XVII org/TR/rdf-schema/ Int. Conf. on Data Analytics and Management in [19] W3C OWL 2 Web Ontology Language (2012). Data Intensive Domains (DAMDID/RCDL 2015). https://www.w3.org/TR/owl2-overview/ 56