ICA Records in Contexts-Ontology (RiC-O): a Semantic Framework for Describing Archival Resources* Florence Clavaud1 and Tobias Wildi2 1 Archives nationales, Pierrefitte-sur-Seine, France 2 University of Applied Sciences of the Grisons, Chur, Switzerland florence.clavaud@culture.gouv.fr, tobias.wildi@fhgr.ch Abstract. This article gives an overview of the new Records in Contexts Ontol- ogy (RiC-O), which is available at https://www.ica.org/standards/RiC/ontology. This ontology is part of the Records in Contexts (RiC) standard, which has been developed by the International Council on Archives to describe and contextualize archival resources in a comprehensible way that goes beyond the possibilities of the existing archival standards. The article explains the rationale for developing a new standard for archival description, and particularly the ontology. It provides a quick overview of the RiC Conceptual Model and then focuses on RiC-O, its design principles and content and giving references to more precise documenta- tion that is publicly available online. Finally, it presents the roadmap and future perspectives of RiC. Keywords: Archival metadata, Archival description, Records in Contexts, RiC, RiC-O, Linked Data, ontology, data modelling. 1 Introduction This article gives an overview of the new Records in Contexts Ontology (RiC-O). This ontology is part of Records in Contexts (RiC) standard, that is being developed by EGAD,1 a group of experts mandated by the International Council on Archives (ICA),2 1 See: https://www.ica.org/en/about-egad (last accessed 2021/07/03). ICA has extended the in- itial mandate of EGAD to the end of 2021. This mandate will most probably be extended again. 2 ICA is an international, neutral, non-governmental organization, which for more than sixty \HDUV³KDVXQLWHGDUFKLYDOLQVWLWXWLRQVDQGSUDFWLWLRQHUVDFURVVWKHJOREHWRDGYRFDWHIRUJRRG archival management and the physical protection of recorded heritage, to produce reputable standards and best practices, and to encourage dialogue heritage, to produce reputable stand- ards and best practices, and to encourage dialogue, exchange, and transmission of this NQRZOHGJHDQGH[SHUWLVHDFURVVQDWLRQDOERUGHUV´(*$'LVRQHRIWKHH[SHUWJURups estab- lished by the ICA Executive Board, on the recommendation of the Programme Commission. For more information, see https://www.ica.org/en/international-council-archives-0 and https://www.ica.org/en/our-professional-programme/expert-groups (last accessed 2021/07/03). * Copyright 2021 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). 2 to describe and contextualize archival resources in a comprehensible way that goes be- yond the possibilities of the previous archival standards. The current version of RiC-O is version 0.2, that was published in February 2021.3 It is an OWL 2 implementation of RiC-Conceptual Model (RiC-CM), the first part of RiC, whose full 0.2 version, including a long introduction and diagrams, was published in July 2021 [1] along with a call for comments.4 This article explains the rationale for developing a new standard for archival descrip- tion, and particularly the ontology. It provides a quick overview of RiC-CM and then focuses on RiC-O, its design principles and content, giving references to more precise documentation that is publicly available online. Finally, it presents the roadmap and perspectives of the project. 2 Why Develop a New Standard for Archival Description? Like the existing ICA standards for archival description,5 RiC can be applied to any kind of archives, whatever their provenance, date, status, or nature is and whether they contain analogue or digital material. The perimeter of RiC is the same as it was for the previous standards: describing all kinds of archival material. However, RiC is very dif- ferent from the existing ICA standards that it replaces. The previous standards were neither pure conceptual models that would accurately define the domain described, nor technical, machine-readable implementations. They were something in between and aimed to specify description rules for exactly four cat- egories of objects: archival resources, archival institutions, activities from which the archival resources result, and persons or groups related to archival resources. These four previous standards were also published independently between 1994 and 2008, which finally resulted in some overlapping and a few inconsistencies [2]. Besides, from 1998, the archival community (to be more precise, an international group led by the Society of American Archivists6) has developed and maintained, for authoring or, more often, exchanging archival metadata conforming to these description rules, two XML grammars, EAD [3] and EAC-CPF [4], that correspond to ISAD(G) and ISAAR(CPF) (and another DTD, EAG [5], that corresponds to ISDIAH and is far less used). So most of the huge quantity of archival metadata that is managed by archival institutions, at least for public historical archives, is available under the form of XML files that con- form to two different XML grammars. These models together with the description rules mentioned above have shaped the landscape of archival metadata and enabled the 3 See: https://www.ica.org/standards/RiC/ontology (last accessed 2021/09/01). 4 See: https://www.ica.org/en/call-for-comments-on-ica-ric-cm-02 (last accessed 2021/09/01). 5 ISAD(G), ISAAR(CPF), ISDF and ISDIAH; see: https://www.ica.org/en/public-re- sources/standards (last accessed 2021/07/03). 6 The Technical Subcommittee on Archival Encoded Standards (TS-EAS), a subcommittee of the SAA Standards Committee. See: https://www2.archivists.org/groups/technical-subcom- mittee-on-encoded-archival-standards-ts-eas (last accessed 2021/09/01). 3 development of portals like Archives Portal Europe.7 However, even if these descrip- tion rules are considered essential by the archives, they are not based on a global, con- sistent and rigorous conceptual framework. The components of the standards as well as the corresponding elements and attributes in the three existing XML models do not have the appropriate granularity to cover the current needs of archivists or records man- agers. For instance, ISAD(G) description rules apply to sets or records and to unitary records the same way. Moreover, the relations that are listed in ISAAR(CPF), ISDF and ISDIAH are too few and too vague to express the complexity and nuances of the archival world. RiC as a completely new developed standard takes into account the latest develop- ments of archival theory and practice. It bundles the preexisting four different standards into only one and aims to provide everything needed for describing archives and to create more homogenous data. The key statement of RiC can be summarized as follows: archives are by essence aggregations of records; each record and aggregation has its history and is engaged in multiple layers of contexts, from its creation to the management by an archival in- stitution. Examples of contexts include: the provenance, both organic and functional; the agents that authored the records; the documentary context (the relations between the record resource described and other records within the same aggregation or outside of it); the things the records are about; the successive actions (enrichment and modifi- cation, transfer, destruction or removal, appraisal, arrangement, classification, descrip- WLRQGLJLWL]DWLRQPLJUDWLRQ« WKDWZHUHDSSOLHGWKURXJKWLPHWRWKHUHFRUGVHWVRUWKH HYHQWV ZDUVHDUWKTXDNHVIORRGV« WKDWDIIHFWHGWKHPHWF No archival aggregation (record set) can be well curated (arranged, described), or used by end users without the knowledge of all these layers of contexts. However, ,6$' * DQG($'PDLQO\IRFXVRQWKHFORVHGRFXPHQWDU\FRQWH[WXVLQJ³OHYHOVRI description´LQ,6$' * DQGWKH;0/WUHHWRUHSUHVHQWLWThis is a lack of precision of the existing standards. Another such example is the fact that ISAAR(CPF) only distin- guishes three agent types and five generic relation types between them. The new RiC standard aims to fill such gaps. It defines for example Record Sets and Records, and an H[SOLFLWµLQFOXGHs or included¶UHODWLRQEHWZHHQWKHPDVZHOODVDVHTXHQFHUHODWLRQDQ association relation, a genetic relation, etc. RiC also defines six agent entities, and a significant number of relations that may exist between them. Thus, RiC intends to gen- eralize, extend and express the underlying perspective on archives and opens possibil- ities beyond the previous description rules and XML structures. RiC also takes into account current ideas of data modelling. Archival metadata was for a long time encoded and stored in tables (Excel/CSV), relational databases or the markup language XML, where EAD is the most prominent representative. To describe records in their different contexts, a more advanced way of data modelling is needed. The most appropriate method for representing such a social and documentary network and its changes through time and space is an oriented, labeled graph of linked entities or, in other terms, a semantic network. 7 About Archives Portal Europe, see: https://www.archivesportaleurope.net/about-us (last ac- cessed 2021/09/01). 4 Current archival information systems consist most often of silos that were created through time and are now coexisting, from specific heterogeneous databases and EAD authoring and publishing tools to complete digital archiving information systems. Within these systems, the same contextual entity, e.g. a person, a place, an event, or even the core archival entities i.e. record resources, may be described many times in different silos, with no link between these descriptions. Some contextual entities are not described independently as autonomous objects, but are named repeatedly, using simple textual strings within finding aids, e.g. for some access or appraisal rules. This results in various problems in managing information, such as lack of efficiency, diffi- culty to know what data the archival institution holds exactly about the entity, thus what knowledge it has about it, quality management issues and others. When end users try to retrieve records, they run into problems like lack of accuracy and insufficient recall when using the search engines provided, interfaces and lists of results that are difficult to understand, links between items that are not visible nor processable when they exist, or even several interfaces provided for the same archival institution. RiC provides a global reference framework for moving away from such systems to data-oriented architectures, or at least for breaking up these internal silos and making them permeable. Once the metadata are reorganized per entity according to RiC, each entity described can be assigned a unique identifier and linked to other entities by using one of RiC relations. The graph obtained can be published and connected to other graphs maintained by other cultural institutions or research teams, as the same person or place, for example, can be of interest in another context. A record resource held in one institution can be directly linked to a resource in another archive. RiC is intended to be the foundation for creating interoperable archival metadata, that have the potential to be compliant with the FAIR principles.8 Interoperability is not only important between the repositories within an institution, but also across institutions and projects. Archival institutions or portals and their audi- ences will undoubtedly gain from setting up the conditions for interoperability at the level of metadata, and therefore from using RDF for publishing high quality metadata, conforming to a unique, homogeneous data structure. There are many examples of col- lections dispersed between several institutions or several countries; at the scale of ar- chival series of files, the administrative and business processes also lead to a distribu- tion of documents between the different actors, and whether it is a question of better evaluating these archives or identifying the most complementary sources, linked metadata may be of major interest. Not to mention that describing the huge quantity of archival contextual entities like agents, places, and functions, cannot be done without reusing the datasets prepared by other professional communities, that could in turn find new information in the knowledge graphs built by archivists. Finally, projects involving both researchers in humanities (or communities of users like associations or genealo- gists) and archivists or record managers are becoming more numerous, and being able 8 The FAIR Guiding Principles for scientific data management and stewardship (2016) are rec- RPPHQGDWLRQVWRLPSURYH³WKH)LQGDELOLW\WKH$FFHVVLELOLW\WKH,QWHURSHUDELOLW\DQGWKH5H XVHRIGLJLWDODVVHWV´DURXQGZKLFKDFRPPXQLWy and various initiatives have developed. See the website: https://www.go-fair.org/ (last accessed 2021/09/01). 5 to provide these projects with accurately modeled metadata becomes important. RiC will play a major role in building such interoperable sets. From the perspective of archive users of any kind (both professionals and the general public, including researchers and citizens), organizing descriptive metadata as a graph of interconnected entities is a great help, since every component of a graph can be que- ried and displayed to fulfill the needs to find, identify, select, obtain, and explore the records described. These five verbs are borrowed from the Library Reference Model (LRM) [6], that was developed by and for the libraries community, following the very same idea to model the world of edited items as a graph of entities. The verb understand should in our opinion be added to the user tasks for archival metadata, as a user needs to understand the layers of contexts for selecting the records that are relevant for his/her research.9 With this target in mind, EGAD decided to build both an abstract conceptual model for archival description, and an OWL implementation of it, so that high quality RDF graphs can be created by the archival community, published and linked to other graphs. 3 History of the Project EGAD had its first meeting in November 2013. The group released a first draft version of RiC-CM (RiC-CM 0.1) in August 2016 and gave a first presentation of this version at the international conference on archives held in Seoul (South Korea) in September 2016.10 A call for comments was issued at the same time; it was closed in January 2017 and resulted in a significant quantity of comments from more than sixty professional organizations and persons, which were of great help for preparing RiC-CM 0.2. Mean- while, early versions of RiC-O were developed, which also proved to be useful for designing the conceptual model; they were thoroughly tested by EGAD11 and also shared with early reviewers that had applied as volunteers.12 The first public release of RiC-O occurred in December 2019, at the same time as the release of RiC-CM 0.2 9 Let us also, of course, add the reuse verb, as data, and particularly RDF metadata sets, are designed to be reused outside of their original environment, particularly by researchers, and by machines. 10 See the slides and audio recording of the presentation: https://www.ica.org/en/records-in-con- texts-ric-a-standard-for-archival-description-presentation-congress-2016 (last accessed 2021/07/03). 11 In particular, in February 2018, three French institutions (the Archives nationales of France, the French national Library and the French ministry of Culture) published a proof of concept, the PIAAF prototype (https://piaaf.demo.logilab.fr), based on a version of RiC-O dated 2017. 12 The comments received in 2019 on early versions of RiC-O sent to the early reviewers con- sisted in seven PDF files. The authors were at this date working in archival institutions or service and software providers, that were located in Belgium, Bulgaria, Canada, France, Por- tugal, and the USA. 6 Preview.13 Since then, the current version of RiC-O has therefore been accessible both to machines and humans through its IRI, https://www.ica.org/standards/RiC/ontology. Both RiC-CM 0.2 Preview and RiC-O 0.1 had to be carefully checked and slightly modified, and RiC-CM 0.2 Preview lacked an introduction. As a result of this work, RiC-O 0.2 was published in February 2021; it is compliant with RiC-CM 0.2 full draft, which was published in July 2021 [1], along with a call for comments. Both are signif- icantly different from the previous public releases, and supersede them. The list of changes made to RiC-O from December 2019 is included in the internal documentation of the OWL file. From March 2020, the versioned sources of RiC-O have been available through the GitHub public repository of the project,14 which makes it possible for any person to easily contact the RiC-O development team,15 to create issues, to fork the repository and more generally speaking to follow up the development process. RiC-O also has an information website,16 that includes pages on the RiC-O roadmap, on events and on projects using RiC-O. Let us also mention that RiC-O is also available in LOV.17 It is important to note that RiC-O 0.2, and RiC-CM 0.2, are still draft versions. EGAD plans to release the first stable version of RiC, having the status of an ICA rec- ommendation, by the beginning of 2022. The comments that will be submitted to EGAD from now to the fall of 2021 will be incorporated into the standard and are therefore of high importance to EGAD. 4 Developing a Domain Standard When the EGAD took up its work, no generic model or ontology for describing the domain of archival resources existed.18 Various standards and models were analyzed and reviewed: 13 Records in Contexts-Conceptual model (RiC-CM) 0.2 preview (December 2019), https://www.ica.org/sites/default/files/ric-cm-0.2_preview.pdf, last accessed 2021/09/01. 14 See https://github.com/ICA-EGAD/RiC-O. The repository also already includes several full examples as well as diagrams. The current official version of RiC-O is to be found in the main branch, while the development of the next version is always being made using the µnext-ver- sion¶ branch. 15 The most active members of RiC-O development team are the two authors of this article. Daniel Pitti, chair of EGAD, is also involved in this task. 16 https://ica-egad.github.io/RiC-O/. 17 https://lov.linkeddata.es/dataset/lov/vocabs/rico (last accessed 2021/07/03). 18 Some national conceptual models already existed, like the Spanish NEDA conceptual model [13]. Some ontologies existed, developed by smaller communities for specific projects. Among them, let us quote the British, Jisc-funded, LOCAH project that lasted from 2010 to 2013 (http://locah.archiveshub.ac.uk/) and that resulted in datasets and a RDFS vocabulary (see http://data.archiveshub.ac.uk/); and Matterhorn RDF, which was a project to convert METS-based information packages to RDF (https://matterhorn.tools/). 7 x Among very generic ontologies or broader vocabularies: as an example, the widely used schema.org RDFS vocabulary is not precise and rich enough for enabling a complete archival description.19 x Among models for cultural heritage: FRBRoo [7] ontology or the IFLA LRM model [6] and RDA registry [8] do not fit for describing the vast majority of records, as records most often are not edited creative works; thus, the core LRM Work, Mani- festation, Expression and Item entities are not relevant. Also, records are not the primary focus of CIDOC CRM [9], which, for example, lacks methods for handling aggregations of records and the relations between them through time, and lacks a lot of datatype properties for describing archival resources.20 There are important dif- ferences between describing general cultural heritage objects and archival records, which is one reason why CIDOC CRM is not suitable for modelling archival content. x Both the PREMIS [10] as well as the EBUCore ontology [11] focus on technical and administrative metadata, in particular to ensure the long-term preservation of digital records; though preservation and descriptive metadata overlap, standards for tech- nical metadata cannot just be reused for the description of entities. x The PROV Ontology [12], which is a domain-agnostic ontology to model the prov- enance of data, is a good starting point, but too generic to be able to represent the specific requirements of archives for the modeling of traceability and provenance. An intensive analysis led EGAD to the decision to develop its own conceptual model with a domain ontology for archives. The members of the EGAD had not only to work on the necessary definitions, but also to include the various feedbacks from their peers and institutions in the archival world. Once RiC is released in its final form, the long term maintenance will be assured by the ICA on behalf of, and with, the archival com- munity. New entities, attributes and relations will have to be added, old ones to be clar- ified or changed. 5 Quick Overview of RiC-CM 0.2 RiC-CM consists of 22 entities, among which four are the core ones. The core entities are interconnected with each other: Record Resource and the closely related Instantia- tion entity, as well as the Agent and Activity entities. These four entities are considered essential in describing records and the contexts in which these records emerged and were used over time. Broadly speaking, the four core entities and their relations model 19 6FKHPDRUJLVSULPDULO\EHLQJGHYHORSHGIRULQFOXGLQJ³VWUXFWXUHGGDWDRQWKHLQWHUQHW´WKDW describes various kinds of resources. It includes an ArchiveComponent class (https://schema.org/ArchiveComponent), subclass of CreativeWork (https://schema.org/CreativeWork), whose properties, particularly object properties, are far from enabling an accurate description of records. 20 )RUH[DPSOH&50RQO\SURYLGHVµ3BLVBFRPSRVHGBRI¶REMHFWSURSHUW\IRUFRQQHFWLQJWZR LQVWDQFHVRIDQ\(B6\PEROLFB2EMHFWZKLOH5L&SURYLGHVµLQFOXGHVRULQFOXGHG¶IRUFRQ necting a Record Set to a Record Set or a Record that it includes or included during some period. 8 agents who act in time and space, fulfill administrative or other tasks and thus create records. The core entities are not new for archivists and align closely with the existing ICA standards: ISAD(G) -- Record Resource, ISAAR(CPF) -- Agent, and ISDF -- Activ- ity.21 Among the core entities, only Instantiation is new. This entity helps to model different materializations of records and corresponds to the PREMIS Representation. RiC-CM would not be a complete conceptual framework without a set of entities to model archival context further. Thus, apart from the Agent entities, RiC-CM also de- fines Event and Activity; Rule and Mandate; Place; Date. RiC-CM defines 41 attributes for these entities, some of them being shared by all, some being specific to a group of entities, and some to be used only with one entity (such as µScope and content¶, an attribute of only Record Resource). It provides a rich set of 78 relations (and the inverse ones when they are not symmetric) for linking the entities to each other; for example, µKDV ,QVWDQWLDWLRQ¶ which connects a Record Re- source and one of its Instantiations, or µGRFXPHQWV¶ which can connect a Record Re- source or Instantiation and an Activity from which it results. Let us finally mention that RiC-CM also provides sub-structures for some of the entities. For example, Record, Record Set and Record Part are sub-entities of Record Resource; Activity is a sub-entity of Event; Agent has in total six sub-entities (Person, Group that includes Family and Corporate Body, Position and Mechanism). Among Agent sub-entities, Position and Mechanism are rather specific to the archival domain compared with the models of other cultural heritage communities (libraries and muse- ums). This feature provides great flexibility for implementation, that which archivists and records managers have always absolutely needed when describing archives, because they will do it more or less precisely according to the nature of the record sets or items to be described, the knowledge that they have been able to accumulate on these ar- chives, the means and the time available. So, when you don¶t know whether you are dealing with a Record or a Record Part or a Record Set (as can be the case with only reading many old inventories), you can use the Record Resource super-entity; con- versely, the Record entity has its own attributes and is the domain of some specific relations, but it also inherits attributes from Record Resource and can be used with relations whose domain is Record Resource. This feature also facilitates linking: high-level, fairly generic entities and relations, VXFK DV *URXS (YHQW RU µLV RU ZDV SDUW RI¶ DUH JRRG FDQGLGDWHV IRU PDSSLQJV ZLWK entities or relations from other models like PREMIS, PROV-O, IFLA LRM or CIDOC CRM, which will be done in future versions of RiC. Finally, this poly-hierarchical organization facilitates extensions of the model to meet specific needs. Some attributes, moreover, are defined as extensible. If desired, one can define a relation which does not exist in RiC, but which would be needed lo- cally, as a sub-relation of a RiC relation; it will then inherit its characteristics, domain, scope, and can always be replaced by the super-relation defined in RiC. 21 See RiC-CM 0.2 [1], Introduction, section 1.6 (Relationships between RiC-CM and other models and standards for describing records), p. 7-13. 9 Finally, of course, this feature makes it easier to transpose RiC-CM into an OWL ontology. The diagram below shows the most important entities of RiC and their relations; it can also be downloaded from the RiC-O Git repository.22 Fig. 1. Overview of RiC-CM main entities 6 RiC-O Design Principles and Main Characteristics As explained in more details in the introduction of RiC-O,23 the members of EGAD who developed RiC-O followed some design principles, among which we will present the most important ones. 6.1 RiC-O Has to Be Fully Compliant with RiC-CM, of which it Is the Official OWL 2 Representation This means that RiC-O is a domain ontology, having the same scope as RiC-CM. It is developed in close interaction with the archival community. If needed, it will be up- dated and maintained by this very same community. This also means that every component of RiC-CM must have a corresponding com- ponent in RiC-O. The RiC-O component has the same textual definition (rdfs:comment, 22 https://raw.githubusercontent.com/ICA-EGAD/RiC-O/master/diagrams/diagrams_v0-2/RiC- CM-overview/diagram_RiC-CM-overview-RiC-v0-2.jpg. 23 RiC-O design principles and a quick overview, including what distinguishes the ontology from the model, are presented in the OWL source file, and can be read as the introduction to RiC-O HTML view (see https://www.ica.org/standards/RiC/RiC-O_v0-2.html#design-princi- ples, and https://www.ica.org/standards/RiC/RiC-O_v0-2.html#understanding-RiCO). 10 skos:scopeNote) as the RiC-CM component when there is a simple equivalence. The hierarchies of components must also be the same when applicable. Therefore, every entity defined in RiC-CM corresponds to a class in RiC-O. Any RiC-CM attribute that has text as schema value, corresponds to a datatype property in RiC-O (IRUH[DPSOHWKHµscopeAndContent¶ DQGµauthenticityNote¶ of a Record). Any relation in RiC-CM corresponds to an object property in RiC-O. These components have the same definition, the same hierarchical position, and the same domain and range when applicable, in both parts of the standard. RiC-CM attributes that have controlled value correspond to classes in RiC-O, which results in a hierarchy of classes as concerns the *type attributes (for example, the Doc- umentary Form Type of a Record, or the Carrier Type of an Instantiation). Moreover, classes or properties have been added in RiC-O, that have no equivalence in RiC-CM. Examples are the Physical Location and the Coordinates classes, that constitute, with the Place class, and their properties, a way to accurately describe a geo-historical en- tity.24 Another example is the Extent class that was added recently. 6.2 RiC-O Has to Be Immediately Usable Usability is considered a key feature, without which RiC will not be tested and adopted widely in the next years. In particular, it is very important that existing archival metadata, that are created or generated in current archival information systems, can be converted to RDF conforming to RiC-O and pushed to the web of data (where very few archival metadata are available for now). This process must be possible without losing any data, structural or partially implicit information. What is at stake here is that metadata conforming to the previous existing ICA standards can be processed success- fully. In order to guarantee that RiC-O conforms to this principle, during the ongoing de- velopment process, a lot of successful testing has been made, using XML/EAD finding aids and XML/EAC-CPF authority records, that have been converted to RDF datasets, either by hand or using scripts. While some existing metadata sets may have a very fine level of granularity and accuracy, and might store controlled vocabularies, or describing curation events separately, a lot of metadata dRQ¶WKDYHWKHYHU\SUHFLVHVWUXFWXUHWKDW RiC-CM recommends. Even then, a conversion process is possible. The usability of a data model also depends on its documentation. The current official release of RiC-O is fully documented in English, and a human-readable view in HTML has been published online. In the near future, this documentation will be translated into French and Spanish, the two other official languages of the ICA. RiC-O is also already 24 In RiC-O, a place is considered both a geographical and historical entity. As a historical entity, among other features, it has a history, and may be preceded or succeeded by other Places. A Place also may have zero to many Physical Location through time (for instance, its bounda- ries, if it is an administrative area or a country, may change). Each Physical Location may be connected to zero to many Coordinates. This model is quite close to the Linked Places Format (https://github.com/LinkedPasts/linked-places). 11 accompanied with examples (RDF datasets) and diagrams,25 which will be enriched through time. Some tutorials should also be written, and EGAD will organize practical workshops. 6.3 RiC-O Has to Provide a Flexible Framework This principle is related to the usability principle quoted above. Archival description is flexible by essence. It is quite common that within the same archive the level of gran- ularity of information varies from one finding aid to another (or from one authority record to another), or even within the same finding aid. Some series or agents are de- scribed summarily because little is known about them and there is little time for exten- sive research, while other series, even records, or agents are described in detail; some relations (e.g. the link to the provenance) may be described without any detail while others may be thoroughly documented, as ISAAR(CPF) and EAC-CPF allow it. Being generally flexible, for an OWL ontology, depends first on the polyhierarchical systems of classes and properties it provides. A superproperty or superclass is normally very generic, its underlying subproperties and subclasses then become more concrete for handling detailed information. RiC-O is based on this principle. It lets the archivist work with a certain subset of RiC-O components, where, for example, high level classes or properties, appropriate for handling vague information on agents, can coexist with other more precise ones, necessary in a specific project to describe very accurately a collection of photographs or of mediaeval charters for example. %HVLGHVDQGWKLVLVUHODWHGWRWKHWZRµLPPHGLDWHXVDELOLW\¶DQGµIOH[LELOLW\¶GHVLJQ principles, RiC-O sometimes provides more than one way for representing information. RiC-CM, for example, defines a Date entity. RiC-O provides on the one hand a Date class with several subclasses, on the other hand D µGDWH¶ GDWDW\SH SURSHUW\ WKDW lets users encode dates in a simpler way, just as they were used to doing it in EAD or EAC- CPF files. Users then can choose from one of the methods, depending on which they want to describe a date as a full entity possibly with additional information, or just as one simple attribute among others. RiC-O also deals with many relations between entities in a similar way: they are both represented as a binary object property and as a class, which is a subclass of the Relation class. Using a class for a relation allows adding additional information to a relation, like a date or a description. In the past, EAC-CPF has already modelled relations in a similar way. The direct object property is then defined as a shortcut of this more com- plex path, using OWL 2 property chain axiom.26 25 Complete examples are available at https://github.com/ICA-EGAD/RiC- O/tree/master/examples/examples_v0-2, and diagrams at https://github.com/ICA- EGAD/RiC-O/tree/master/diagrams/diagrams_v0-2. 26 See https://www.w3.org/TR/owl2-new-features/#F8:_Property_Chain_Inclusion (last ac- cessed 2021/09/01). An example of such a relation is available through a diagram in RiC-O repository on GitHub, see: https://github.com/ICA-EGAD/RiC- O/blob/master/diagrams/diagrams_v0-2/NationalArchivesOfFrance-examples/a-complex-re- lation-and-its-shortcut.jpg. 12 6.4 RiC-O Must Open New Potential for Archival Description and Discovery This principle takes up the interests and requirements of the end users. Linked Data tools and interfaces should enable users to go through RDF/RiC-O graphs, to query them using SPARQL in an efficient way and to consult archival metadata and their contexts in new ways. With the current archival information systems, a lot of such que- ries are not possible. As an example, users should be able to ask « What are (according to your dataset) the corporate bodies that succeeded to this given entity from its end of existence, by 1840, to nowadays (as concerns this given activity)? » or « Tell me what instantiations of this photograph exist? », or « What are the existing copies of this orig- inal charter? », and get a result list of the entities. In other words, institutions or projects that make the effort to implement RiC-O must get insights into the content and context of their archives that ZHUHQ¶Wpossible with the existing ICA standards. It should be even more interesting if you can infer new assertions from the RDF datasets you built, and of course link your datasets to resources outside of your institution. This important feature is currently investigated in several projects.27 6.5 RiC-O Should Be Extensible Institutions are free to extend the ontology by adding new subclasses or subproperties if needed. RiC-O has also the potential to be usable in other contexts than purely ar- chival ones. This implies that hierarchies of classes and properties are defined and that mappings are developed with other ontologies as mentioned above. It also implies that RiC-O provides ³KRRNV´HQDEOLQJFRQQHFWLRQVZLWKIRUH[DPSOHH[LVWLQJ6.26YR cabularies ± which has been made possible by defining a hierarchy of Type classes, whose instances can easily be declared as also being SKOS concepts.28 7 Conclusion: Roadmap and Perspectives The principles and choices presented above result in an OWL domain model that relays on a strong conceptual basis (RiC-CM). RiC-O as an implementation is richer and more accurate than its conceptual model, and at the same time grants all flexibility. RiC-O 0.2 (and the future 1.0 version) provides in many cases more than one method for rep- resenting information; from this point of view, it may be considered a transitional 27 See for example a presentation by Florence Clavaud, ³,PSOHPHQWLQJ,&$5HFRUGVLQ&RQ texts-2QWRORJ\DWWKH1DWLRQDO$UFKLYHVRI)UDQFHILUVWDVVHVVPHQWDQGSURVSHFWV´IRUWKH Study Day on The Semantic Web and Cultural Heritage: From Data Convergence to Knowledge Crossing (Lille, France, February 3, 2021) - slides in English and audio recording in French: https://geriico.univ-lille.fr/detail-event/le-web-semantique-et-le-patrimoine-cul- turel-de-la-convergence-des-donnees-au-croisement-des-connai/ (last accessed 2021/07/03). 28 As done by the Archives nationales of France, which have started to publish their vocabularies and authority records, using RiC-O and SKOS, in a GitHub public repository; see: https://github.com/ArchivesNationalesFR/Referentiels. 13 ontology, in which some components may be deprecated later on. In most cases, RiC- O will only partially be implemented and institutions will choose the elements that are useful for their local project, and simultaneously, might have to extend it. As a conclusion, we would like to insist on the fact that RiC-CM and RiC-O are still at an early stage and that this is only the beginning of the adventure. After a certain time, as with any domain model and ontology under construction, various institutions will have made their first experiences with adopting RiC-O -- and adapting it to their needs. The new ontology provides in many cases more than one option to model a char- acteristic or a relation; RiC-O users will have to decide in which way they will take to model their use case. The intent of EGAD is to collect practical examples of how RiC- O is put in use and to identify VRPH³EHVWSUDFWLFHV´that stand out and prevail. Besides, the roadmap of EGAD foresees alignments with other ontologies, particu- larly the ones we quoted in section 4. In our opinion good mappings can only be achieved by working with the communities that authored these other models. The further work might also include the development of SHACL Shapes29 for the RiC-O classes and properties, in order to provide archival institutions and projects with methods for validating RiC-O knowledge graphs, at least cardinality, domain and range checks. RiC is already being implemented in various projects,30 among which many aim to publish RDF/RiC-O graphs, or already do so. The number of projects will hopefully rapidly grow. Thus, various kinds of tools will be needed and developed -- and hope- fully published under licenses that makes reuse possible for other institutions.31 By pro- posing a multidimensional, novel, consistent, inclusive insight into the content of ar- chives, new perspectives open up and new territories will be explored. In order to facilitate the construction of such linked archival data beyond institutional and national boundaries, another task for archivists and the ICA will be the develop- ment of multilingual and international authority data and vocabularies, based on SKOS and RiC-O, that could be used for specifying the categories of, for instance, documents or groups of documents, functions and occupations, curation events, access rules ± in other words, what RiC calls archival contextual entities. References 1. International Council on Archives (ICA): Records in Contexts-Conceptual model (RiC-CM) 0.2 (July 2021), https://www.ica.org/sites/default/files/ric-cm-02_july2021_0.pdf, last ac- cessed 2021/09/01. 2. International Council on Archives (ICA) Committee on Best Practices and Standards: Pro- gress report for revising and harmonising ICA descriptive standards (2012), 29 These would use Shapes Constraint Language, see https://www.w3.org/TR/shacl/ (last ac- cessed 2021/09/01). 30 See https://ica-egad.github.io/RiC-O/projects-and-tools.html. 31 Like RiC-O Converter (https://github.com/ArchivesNationalesFR/rico-converter), an open source tool that is presented in another article authored by Thomas Francart, Florence Clavaud, and Pauline Charbonnier. 14 https://www.ica.org/sites/default/files/Report_for_revising_harmonising_descrip- tion_standards.pdf, last accessed 2021/09/01. 3. Encoded Archival Description (EAD): https://www.loc.gov/ead/, last accessed 2021/07/03. 4. Encoded Archival Context-Corporate Bodies, Persons, and Families (EAC-CPF) XML schema: https://eac.staatsbibliothek-berlin.de/, last accessed 2021/07/03. 5. Encoded Archival Guide (EAG): http://www.archivesportaleurope.net/eag, last accessed 2021/09/01. 6. International Federation of Libraries Associations and Institutions (IFLA): Library Refer- ence Model (LRM) (2017), https://www.ifla.org/files/assets/cataloguing/frbr-lrm/ifla-lrm- august-2017_rev201712.pdf, last accessed 2021/07/03. 7. International Working Group on FRBR and CIDOC CRM Harmonisation: Functional Re- quirements for Bibliographic Records-object oriented (FRBRoo) model version 2.4 (No- vember 2015), http://iflastandards.info/ns/fr/frbr/frbroo/, last accessed 2021/07/03. 8. Resource Description and Access (RDA) Steering Committee: RDA Registry, https://www.rdaregistry.info/, last accessed 2021/07/03. 9. International Council of Museums (ICOM) International Committee for Documentation (CIDOC): CIDOC Conceptual Reference Model (CIDOC CRM) version 7.1.1 (April 2021), http://www.cidoc-crm.org/sites/default/files/cidoc_crm_v.7.1.1_0.pdf, last accessed 2021/07/03. 10. PREMIS (PREservation Metadata Implementation Strategies) Editorial Committee: PREMIS 3 ontology for PREMIS Data Dictionary version 3, http://www.loc.gov/premis/rdf/v3/, last accessed 2021/07/03. 11. European Broadcasting Union (EBU): EBUCore - The Dublin Core for media, https://www.ebu.ch/metadata/ontologies/ebucore/, last accessed 2021/07/03. 12. The World Wide Web Consortium (W3C): The PROV ontology (PROV-O) (2013), http://www.w3.org/TR/prov-o/, last accessed 2021/07/03. 13. Ministerio de Educación, Cultura y Deporte, Comisión de Normas Españolas de Descripción Archivística (CNEDA): Modelo conceptual de descripción archivística (2017), https://www.culturaydeporte.gob.es/cultura/areas/archivos/mc/cneda/documentacion/nor- mas/neda-mc.html, last accessed 2021/09/01.