<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>SemanticHPST: Applying Semantic Web Principles and Technologies to the History and Philosophy of Science and Technology</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Olivier Bruneau</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Serge Garlatti</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Muriel Guedj</string-name>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sylvain Laube</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jean Lieber</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Telecom-Bretagne</institution>
          ,
          <addr-line>LabSTICC, CS 83818, F-29238 Brest Cedex 3</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Bretagne Occidentale, Centre Francois Viete (EA 1161)</institution>
          ,
          <addr-line>20, rue Duquesne, CS 98 837, F-29 238 Brest Cedex 3</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Lorraine, LHSP-AHP</institution>
          ,
          <addr-line>91 avenue de la Liberation, BP 454, F-54001 Nancy cedex</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>University of Lorraine, LORIA</institution>
          ,
          <addr-line>Campus scienti que, BP 239, F-54506 Vandoeuvre-les-Nancy Cedex</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>University of Montpellier 2, LIRDEF</institution>
          ,
          <addr-line>2 place Marcel Godechot, BP 4152, F-34092 Montpellier Cedex 5</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <fpage>65</fpage>
      <lpage>76</lpage>
      <abstract>
        <p>SemanticHPST is a project in which interacts ICT (especially Semantic Web) with history and philosophy of science and technology (HPST). Main di culties in HPST are the large diversity of sources and points of view and a large volume of data. So, HPST scholars need to use new tools devoted to digital humanities based on semantic web. To ensure a certain level of genericity, this project is initially based on three sub-projects: the rst one to the port-arsenal of Brest, the second one is dedicated to the correspondence of Henri Poincare and the third one to the concept of energy. The aim of this paper is to present the project, its issues and goals and the rst results and objectives in the eld of harvesting distributed corpora, in advanced search in HPST corpora. Finally, we want to point out some issues about epistemological aspects about this project.</p>
      </abstract>
      <kwd-group>
        <kwd>HPST (history and philosophy of science and technology)</kwd>
        <kwd>modern history</kwd>
        <kwd>Semantic web</kwd>
        <kwd>RDFS annotations</kwd>
        <kwd>HPST ontologies</kwd>
        <kwd>exact search</kwd>
        <kwd>approximate search</kwd>
        <kwd>harvesting distributed corpora</kwd>
        <kwd>epistemology</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The application of computer science to research in history has existed for a long
time [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ],[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] though it can be noticed that the recent research domain of \Digital
Humanities" (DH) is growing as result of a digital \revolution" at work that
impacts the whole society at the international level. In France, tools and utilities
dedicated to DH like the very large facility Huma-Num
(http://www.humanum.fr) have been created in order to favor \the coordination of the collective
production of corpora of sources (scienti c recommendations, technological best
practices)." It also provides research teams in the human and social sciences
with a range of utilities to facilitate the processing, access, storage and
interoperability of various types of digital data." The Dacos and Mounier report [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]
shows that the French research is active, however the authors recommend the
creation of \Centers of Digital Humanities". The research network
SemanticHPST is based on a strong coupling of laboratories in History and Philosophy
of Science and Technology (HPST) and in Computer Science (LHSP{AHP,
LORIA in Nancy) and (CFV, LabSTIIC in Brest) with research questions about
the use of semantic web for HPST. The SemanticHPST project takes part in the
emerging issues at the French and international levels in the domain of HPST.1
Actually, the Semantic Web technology appears as e cient in order to generate
tools adapted to the need of production and di usion of distributed \intelligent
digital" corpus in history [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].The objectives of the project are: (i) to integrate
the existing technologies to manipulate digital contents of large volume by
modeling knowledge as ontologies (annotation, request) for History and Philosophy
of Science and Technology; (ii) to extent these technologies. The goal of this
paper is to present the SemanticHPST project: its history, its objectives, the rst
results according to the information retrieval aspect and some epistemological
issues. Because the methods in History of science and Technology are covering
some elements of others domains in humanities (for example in history or in
archeology), another goal of the SemanticHPST group is to share questions and
results with the scienti c community.
      </p>
      <p>The paper is organized as follows. Section 2 presents the main goals of the
SemanticHPST project and its three French HPST sub-projects for which
semantic web technologies are useful. Section 3 presents some requirements and
corresponding tools supporting di erent resource retrieval processes according to
the researchers' practices. Section 4 presents some issues from an epistemological
viewpoint. Section 5 concludes the paper.
2</p>
    </sec>
    <sec id="sec-2">
      <title>The SemanticHPST Project</title>
      <p>
        In November 2010, the main topic of a European workshop was the uses of ICT
and history of science and technology in education.2 To improve research in
HPST on one hand, and to promote dissemination of the HPST in the eld of
education on the other hand, some participants were convinced by the necessity
to use new ICT tools [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
1 See the 18th session organised by some authors of this paper during the last
meeting of SFHST (French society for history of science and technology), April
2014 (http://sfhst2014lyon.sciencesconf.org/resource/page/id/5), and the last
meeting of the international consortium DigitalHPS at Nancy, September 2014,
(http://dhps2014.sciencesconf.org).
2 After this workshop, an extensive book written by participants and others has been
published in 2012 [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>In 2012, some historians of science and technology and computer scientists
have created a consortium called SemanticHPST.3</p>
      <p>The main goal of SemanticHPST project is to enrich the practices of
researchers and communities in HPST. According to the speci city of the practice
as historians of science, three main issues were tackled:
1. The management of large quantities of data especially for the most recent
periods (xixth, xxth centuries up to the present day). Knowing that the
historical approach involves to integrate relevant elements from the context
of production of these data into metadata.
2. The heterogeneity of sources and corpora constituted from these sources.
3. The production of new relevant digital corpora from several available digital
historical collections.</p>
      <p>To address our main goal and the three previous issues, our project is based on
the Semantic Web principles and technologies. Thus, it has three main sub-goals:
(i) Building intelligent digital corpora, that is to say corpora with primary and
secondary sources having semantic metadata and their corresponding ontologies;
(ii) Designing tools to access and enrich existing corpora and to create new ones;
(iii) Evaluating the resulting practices and building an epistemological viewpoint
about the use of TIC in HPST.</p>
      <p>To achieve these goals, it is necessary to ensure a certain level of genericity
for metadata, ontology, computer-based tools and practices.</p>
      <p>To deal with genericity and the diversity of sources, the project is applied in
three di erent use cases or sub-projects with the aim to cover di erent methods
and approaches that are typical in the domain of HPST. Those approaches are
covering only partially the methods used in history and archaeology. These
subprojects are described in the following paragraph.
2.1</p>
      <sec id="sec-2-1">
        <title>The port-arsenal of Brest</title>
        <p>
          This sub-project takes part in the research programs \History of marine science
and technology" and \Digital Humanities for History of Science and Technology"
developed in Brest in the Centre F. Viete. One topic concerns the comprehension
of the scienti c and technological evolution of the port-arsenal in Brest (France)
on a large period (xviith to xxth century) with a methodological approach
considering this military-industrial complex dedicated to shipbuilding as a large
technological system [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]. The objectives are:
1. To compose and publish a digital library (based on semantic web) about the
material culture of the port-arsenal of Brest associated to several projects
3 Participants at this consortium came initially from LaB-STICC (Telecom Bretagne,
Brest), Centre Francois Viete (University of Brest), LIRDEF (University of
Montpellier), LHSP-Archives Poincare (University of Lorraine, Nancy) and later LORIA
(University of Lorraine, Nancy). During the years 2012-2014, the INSHS (a French
national institute of human and social sciences), the national network of Maisons
des Sciences de l'Homme and University of Lorraine supported this consortium.
about 3D replications of artifacts and to cultural mediations dedicated to
science and technology heritage.
2. To develop digital tools (based on semantic web) dedicated to a comparative
history of science and technology of the port on a large area and a large
period (since ancient times until now).
        </p>
        <p>
          The hypothesis is to consider the large technological system of the
portarsenal as a large spatiotemporal and multi-scale artifact which is possible to
decompose in elements of smaller scale (which are also artifacts) like industrial
workshops, shipbuilding areas, storage areas, etc. Each of these elements are
themselves composed by elements/artifact of smaller scale. The system has to
be seen as the sum of all these artifacts and of all the relationships between them.
The research in Brest [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ], [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ] has shown the interest to propose an historical
evolution model of the port (inspired by works in geography [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]) where
\simple" artifact like cranes, quays, dry docks are e cient indicators to characterize
the cycle of evolution of the port-arsenal during a large period. This method is
used in a comparative research [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ] between Brest (France) and Mar del Plata
(Argentine) in a thesis in progress by B. Rohou (directed by S. Garlatti and S.
Laube).4 From these works, the contribution in the SemanticHPST group is to
produce a methodology and a knowledge model e cient to produce a generic
ontology where an artifact is a material object (made by human beings) associated
to a \life cycle" with at least three steps:
1 design and construction of the artifact;
2 the artifact in use;
3 the disappearance of the artifact.
        </p>
        <p>
          That \life cycle" involves the elaboration of ves categories of entities: time
entities, actors (individuals or social groups), concepts/theories, location and
artifacts. The analysis of the important ontology in the domain of cultural
heritage named CIDOC-CRM (that \provides de nitions and a formal structure for
describing the implicit and explicit concepts and relationships used in cultural
heritage documentation")5 shows that this ontology could be a rst reference
to help and build our own ontologies because some concepts and relationships
about \temporal entities" and \actors" can be reused. But if the concept of
\Thing" exists in the CIDOC-CRM, we consider that the concept of \Artifact"
and the associated relationships have to be elaborated rst from our historical
model and by considering of course the possibility of equivalent concepts in the
CIDOC-CRM. A work is in progress in Brest about this topic from concrete
examples of artifact as crane, quays and seawalls. A second step will be to examine
others methods to produce ontologies well-adapted to our HPST problems in
the domain of marine history [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ].
        </p>
        <p>This work is coupled with examples of typical requests (when and where
were positioned all the cranes in the port of Brest since 1650 until 1970? In the
4 See http://brmdp.hypotheses.org/.
5 http://www.cidoc-crm.org/.
port of Mar del Plata? Which rms were in charge of the construction of the
quays/cranes in the port of Brest since 1800 until 1900? What are the engine
power of all cranes in the world since 1850 until 1970? Etc.).
2.2</p>
      </sec>
      <sec id="sec-2-2">
        <title>Henri Poincare's correspondence</title>
        <p>
          The platform Henri Poincare papers. In 1992, the laboratory of history of
science and philosophy Archives Henri Poincare was created to promote Henri
Poincare's manuscripts and to publish his correspondence. For more than 20
years, this long-term project has produced three volumes of letters: the rst
one is devoted to the Poincare - Mittag-Le er letters [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ], the second one is on
the correspondence with physicists, chemists and engineers [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ], the third one is
with astronomers and, in particular, geodesists [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ]. Two other volumes are in
preparation, one devoted to the letters from or of mathematicians and the other
one consists of administrative and personnal correspondences.6
        </p>
        <p>The corpus consists of more than 2000 letters, 1046 sent by Henri Poincare
and 949 received by him.7 All known letters are digitalized8 and around 50%
of them are in plain text (in LATEX and XML versions). Lots of letters contain
mathematical and physical formulae. In Henri Poincare Papers website,9 the
correspondence is available. In this platform, each known letter is indexed with
Dublin Core extended metadata.10 This enables to query the corpus by e.g.</p>
        <p>Q1 = \Letters sent by Henri Poincare in 1885"</p>
        <p>Q2 = \Letters received by Eugenie Launois between 1882 and 1894"
There is also the possibility of plain text search for the letters already
transcribed.</p>
        <p>Towards more HPST-adapted search. Now, consider the following queries:
Q3 = \Letters from an astronomer"
Q4 = \Letters in reply to a letter of Mittag-Le er"
Q5 = \Letters about the n-body problem"</p>
        <p>
          Q6 = \Letters of the late xixth century"
These queries cannot be executed in the current platform. They require
additional data and knowledge:
6 This correspondence is partly online http://henripoincarepapers.univ-lorraine.fr.
7 About 50% of this letters are with scientists. Original letters come from 63 di erent
archive centers and libraries from 14 countries.
8 Due to copyright laws, some are not available online.
9 http://henripoincarepapers.univ-lorraine.fr.
10 It exists di erent projects devoted to scienti c correspondences for example the
CKCC project (http://ckcc.huygens.knaw.nl) [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ] or Mapping the Republic of
Letters (http://republico etters.stanford.edu).
{ Q3 requires to know that an individual is an astronomer, possibly using
deduction (for instance, Rodolphe Radau was a geodesist and every geodesist
is an astronomer).
{ Q4 requires to know relationships between letters (including lost letters).
{ Q5 requires semantic annotations about the content of the letters (Poincare
worked on the three-boby problem).
{ Q6 raises the problem of modeling \late xixth century": the boundaries of
interval of time are imprecise.
        </p>
        <p>The possibility to take into account such queries using semantic web
principles and technologies, are examined in the SemanticHPST consortium.
2.3</p>
      </sec>
      <sec id="sec-2-3">
        <title>The concept of energy</title>
        <p>One part of the SemanticHPST project is dedicated to the concept of energy.
Our aim is to create an ontology of energy for researchers working in the eld
of HPST as well as for science teachers.</p>
        <p>For researchers, the ontology aims at making available a methodical body
of knowledge that allows previously unseen connections to be made. For
example, correspondence between two authors or the presence of a speci c term or
concept in a text will allow researchers to put forward hypotheses regarding the
emergence of an idea or the cross-fertilization of ideas.</p>
        <p>For teachers, the ontology aims at acting as a resource, allowing educators
to nd historical information relevant to school curricula as well as ideas for
speci c activities to carry out in the classroom.</p>
        <p>The content consists of reference texts in the eld of HPST, contemporary
scienti c texts and a database of historic scienti c instruments and documents.
This content is currently being selected and developed and will be enhanced as
the research progresses.</p>
        <p>To date, the following three steps have been undertaken on the project:
{ The rst step was to identify the presumed ways the ontology will be used,
for example, the type of requests that a researcher or teacher might make in
a search. To this end, one `persona' for a researcher and one for a teacher
have been created. Analyzing the theoretical queries from these two personas
helps in the selection of a relevant body of work and is also a useful guide
for indexing.
{ The second step was to begin indexing the reference texts. Duhem, Poincare,
Mach and Meyerson have been selected for a rst approach in order to
produce keywords and common references and to outline an embryonic model.
Using the shared scienti c knowledge of the physicists involved in the project,
a sort of `cloud' of concepts related to describing energy was de ned and
classi ed. These elements led to the structure of an initial mind map.
{ Finally, based on this mind map (created with Docear), we used Protege
software to create a rst draft overview of the project. The next steps require
documenting these three steps in detail to re ne the data and then build the
ontology.</p>
        <p>During the stages of the project carried out so far, various problems have been
identi ed that must be resolved. One of the main problems concerns the modeling
of time. How can an event be modeled? Moreover, how can knowledge be modeled
in a way that avoids immobilizing the knowledge? How should knowledge be
contextualized? What approach should be adopted when modeling concerns a
concept or an object? How can a coherent and logical body of content be created
and how can its coherence be assessed? It is clear that the question of time as
well as how to approach the treatment of objects and works are issues to be
investigated in the semanticHPST project.
3</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>The SemanticHPST tools and requirements</title>
      <p>According to the three described sub-projects, the main goals of researchers in
HPST are to access and retrieve relevant resources in existing primary and
secondary sources or corpora, to produce new resources in existing corpora, to enrich
existing digital corpora or to create new ones, for answering research questions
in the history of science and technology. Existing digital corpora come from
libraries, information holdings, digital libraries or others like Gallica
(http://gallica.bnf.fr), Internet Archive (http://archive.org), Google Books
(http://books.google.com), etc., and CMS (Content Management System) (blogs, wikis, Drupal,
Omeka, etc. more generally social media tools) have been used by the
community11 and digital AHP (http://www.ahp-numerique.fr/). Some heritage and
bibliographic resources have already been described by several institutions,
associations and/or project (BNF, Gallica, British Museum, Europeana, Amsterdam
Museum, LODLAM, ...). The creation of new corpora or resources can be made
on social media tools distributed on Internet (as well as other digital corpora).</p>
      <p>The design of tools for HPST researchers has to integrate and/or aggregate
the existing heterogeneous tools and to ensure interoperability among them.
Thus, the goal is not to build a single new environment, but to design a
platform which integrates existing tools selected for their relevance according to the
practices of researchers and provide an agile architecture able to model and/or
support the processes involved in the research work and enrichment.</p>
      <p>This platform will be mainly based on the Semantic Web and Linked Data
approaches (RDF Triple Store, ontologies, OWL 2, RDFS, SPARQL, etc.).
Nevertheless, the platform will also provide access to non-semantic resources. A
network of ontologies dedicated to HPST will be designed to meet the
interoperability and open access requirements for corpora. Some existing
ontologies and standards will be reused and integrated in the ontology network, like
CIDOC-CRM, FRBRoo, FRSAD, Dublin Core, etc. and those available at LOV
(http://lov.okfn.org/dataset/lov/).</p>
      <p>In this paper, we focus our attention on the resource retrieval problem that
we can divide into two di erent aspects : advanced search in HPST corpora and
harvesting distributed corpora. The former focuses on advanced search
function11 The alambic numerique (http://alambic.hypotheses.org/4924) is based on Omeka.
alities in a single corpus. The latter studies the resource retrieval on distributed
corpora. These two aspects will be integrated.
3.1</p>
      <sec id="sec-3-1">
        <title>Advanced search in HPST corpora</title>
        <p>In order to perform advanced searches in a HPST corpus, we have to build
intelligent digital corpus: corpus with primary and secondary sources having
semantic metadata (RDF Triples) and their corresponding ontologies using a
fragment of OWL (actually, RDFS will be su cient for the following examples).
These ontologies are domain ontologies related to the corpus. A domain ontology
for Henri Poincare letters has already been designed. Finally, some tools will have
to be developed for answering some of the queries.</p>
        <p>This section presents the advanced search using the query examples Q3-Q6
introduced in Section 2.2.</p>
        <p>Q3 requires some additional data and knowledge to get satisfactory answers,
as stated in Section 2.2. In particular, if the annotation le contains the following
RDFS triples:
(letter1 isSentBy rodolphe radau)
(rodolphe radau rdf:type Geodesist)
(Geodesist rdfs:subClassOf Astronomer)
then the execution of the following SPARQL query on an engine supporting
RDFS</p>
        <p>Q3 = select ?` where f?` isSentBy ?a . ?a rdf:type Astronomerg
will return letter1.</p>
        <p>Q4, similarly, can be answered by a SPARQL engine supporting RDFS with
the following query:</p>
        <p>Q4 = select ?` where
?` isAnAnswerTo ?`2 .</p>
        <p>?`2 isSentBy mittag-leffler
It can be noticed that this query can give a letter of the corpus that answers a
lost letter: the missing letter cannot be found, but its answer can.</p>
        <p>Q5, for being executed, requires the use of annotations about the scienti c
content of the letter:</p>
        <p>Q5 = select ?` where
?` hasForTopic ?t .</p>
        <p>?t rdf:type N-body-problem
The n-body problem is a topic having sub-topics, in particular, the 3-body
problem is a problem more speci c than the n-body problem. For this reason, we have
chosen to model these two problems by two classes, the former being more
general than the latter. Therefore, a letter of the corpus about the 3-body problem
will be returned by the execution of this query.12
12 We could also have chosen to model the 3-body problem as an instance of the
nbody problem, but rst, it is more homogeneous to consider every topic as a class,</p>
        <p>Q6 can be modeled by a SPARQL query based on the assumption that \the
late xixth century" corresponds to the interval 1881 1900:</p>
        <p>Q6 = select ?` where
?` sentDuringYear ?y .</p>
        <p>filter(?y &gt;= 1881 &amp;&amp; ?y &lt;= 1900)
However, this solution is debatable: the modeling of the fuzzy period of time by
a crisp interval raises the problem of the choice of the boundaries. Indeed, some
events before 1881 or after 1900 can be considered by historians to be related to
the end of the xixth century. In order to address this issue, some approximate
search is planned. How to put this idea in practice is an ongoing work.
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Harvesting distributed corpora</title>
        <p>Harvesting distributed corpora at semantic level (according to Linked Data
principles) require to solve two di erent problems. The rst one is to queries several
triple store by means of federated queries to linked distributed sources. The
second one is to get RDF triples from social media tools.</p>
        <p>Most of social media applications are data silos. In other words, data are
unavailable on the web. Only people may have access to data, not computers.
Reuse and exchange of data among social media tools are only possible by means
of API { that is to say manually by mean of one API per tool. Some social media
tools like Drupal, Semantic media wiki may have their own triple store exposing
data to others.</p>
        <p>
          A toolkit, called SMOOPLE for Semantic Massive Open Online Pervasive
Learning Environment, has been designed to solve these two problems. It was
rstly dedicated to the technology-enhanced learning domain [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ]. The core part
of the toolkit can be reused for HPST. It ful lls the needs of researchers in
HPST, that is to say it enables us to federate distributed sources and tools.
        </p>
        <p>SMOOPLE has semantic services which are in charge of managing
incorporated semantic models, extracting and storing the data produced on social
media tools, making and answering to semantic queries against one or several
distributed sources (federated queries). The Semantic Web server (semantic
services) is based on Jena 2. When the social media tools do not have a triple
store and a SPARQL endpoint, content and corresponding semantic metadata
can be extracted on the y from social media applications, by means of plugin
(similar to sioc export) and stored in a RDF repository. Several light ontologies
(SIOC, FOAF, DC, RDF, RDFS, etc.) are used to acquire semantic metadata
automatically. It will be necessary to de ne the interlinkage among distributed
sources (triple stores) to support federated queries.</p>
        <p>second, this way, it is always possible to consider a more speci c topic, e.g., the
restricted 3-body problem for which the mass of one of the 3 bodies in considered
to be negligible.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Epistemological aspects</title>
      <p>An aim of the SemanticHPST project is to focus on the epistemological issues
raised by the development of these new tools based on semantic web. This work
in progress takes part to epistemological questions in the domain of Digital
Humanities.13 A rst series of questions concerns the modeling of knowledge, the
main step in building ontologies so that researchers can easily identify and
apprehend knowledge. Therefore the creation of e ective ontologies requires de
ning concepts and elucidating certain tacit or implicit knowledge. So the initial
questions are: How to approach these de nitions? How to ensure that indexing
does not immobilize knowledge? How can the modeling anticipate how it will
be used in order to ensure that the knowledge generated is contextualized to
avoid anachronism and misinterpretation? Moreover, the wide range of works
in the collection, including texts (manuscripts, books, letters, web pages),
multimedia documents, 3D archaeological or historical objects and media from a
variety of sources (photographs, original texts, maps, etc.), necessitate di erent
approaches. This raises the question: How to approach a photograph, a scienti c
instrument or a text and still obtain a uni ed ontology? How can the modeling
enable relationships between objects yet avoid the pitfalls described above?</p>
      <p>
        In the eld of HPST, the issue of modeling time is central and particularly
tricky. Modeling a long period of time, an event, a succession of events or events
that are juxtaposed requires making decisions that should be taken collectively.
Indeed, this emerging issue is shared by historians [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ], [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ], [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ] and should serve
to feed into theoretical discussions between researchers from di erent disciplines.
      </p>
      <p>
        A second series of questions concerns the researcher's environment, which
has signi cantly changed with the rise of digitized data. Whatever the works
considered or their origin (libraries, archives, etc.), the massive volume of data,
its diversity and location are all part of this change. Yet this radical shift is
not exclusively the result of the accumulation of a large amount of data. The
fact that data can be `analyzed as well as communicated, represented, reused {
in short, mobilized for research { in a quantity and with an ease incomparable
with previous periods' [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] is a major transformation that needs to be taken into
account. This raises new questions for researchers:
{ How does one build and de ne a body of content that is coherent and
complete? Whereas `traditional' methods created collections using identi ed,
bounded, localized archives, with the question of consistency limited in most
cases to the cross-fertilization of archives as regards the historical context,
the accessibility of multiple documents today requires a reexamination of
the very concept of a collection of works.
{ How does one evaluate a body of work; in other words, how does one
recognize its relevance?
{ In this context, the type of source and its references must be speci ed. Does
the wide range of sources used require more re ned classi cation than the
13 See thematical issue \la numerisation du patrimoine" of [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] or the issue \Le metier
d'historien a l'ere numerique : nouveaux outils, nouvelle epistemologie ?" of [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ].
standard usage of primary and secondary sources? Would a new typology be
pertinent given this broad diversity? Should the references to these sources,
particularly information concerning digital archives, lead to new codi cation
that allows, for example, multiple identi cations for the considered source,
improving its accessibility?
5
      </p>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>The aim of this proposal is to contribute to the development of the research in
the domain of digital humanities. Based on the Semantic Web principles and
technologies, the SemanticHPST group proposes new methodologies in History
and Philosophy of Science and Technology in the framework of a strong
collaboration between labs working in the area of computer science and humanities
(here HPST). The main goal is to enrich the practices of researcher and
communities in HPST as well in science and technology heritage. To deal with such a
goal, the project has to: i) Build intelligent digital corpora, that is to say corpora
with primary and secondary sources having semantic metadata and their
corresponding ontologies; ii) Design tools to access and enrich existing corpora and
to create new ones; iii) Evaluate the resulting evolution of practices in historical
science and build an epistemological viewpoint about the impact of new tools
and practices in humanities based on knowledge modeling and semantic web.</p>
      <p>Another important issue is to deal with the reuse of intelligent digital corpora.
Thus, it is necessary to build representations of the entities, people and processes
involved in producing the digital corpora. The \PROV Model Primer" from W3C
(http://www.w3.org/TR/prov-primer/) can be used to address this issue.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>V. A.</given-names>
            <surname>Ustinov</surname>
          </string-name>
          , \
          <article-title>Les calculateurs electroniques appliques a la science historique,"</article-title>
          <source>Annales. Economies</source>
          , Societes, Civilisations, vol.
          <volume>18</volume>
          , no.
          <issue>2</issue>
          , pp.
          <volume>263</volume>
          {
          <issue>294</issue>
          ,
          <year>1963</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>O.</given-names>
            <surname>Boonstra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Breure</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Doorn</surname>
          </string-name>
          , \Past,
          <source>Present and Future of Historical Information Science," Historical Information Science</source>
          , vol.
          <volume>29</volume>
          , no.
          <issue>2</issue>
          , pp.
          <volume>4</volume>
          {
          <issue>132</issue>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>M.</given-names>
            <surname>Dacos</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Mounier</surname>
          </string-name>
          , \
          <article-title>Humanites numeriques," rapport commande, Institut Francais, Ministere des A aires etrangeres</article-title>
          , Paris,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>A.</given-names>
            <surname>Meron</surname>
          </string-name>
          <article-title>~o-Pen~uela, A</article-title>
          . Ashkpour, M. van
          <string-name>
            <surname>Erp</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Mandemakers</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Breure</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Scharnhorst</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Schlobach</surname>
            , and
            <given-names>F. van Harmelen</given-names>
          </string-name>
          , \
          <article-title>Semantic technologies for historical research: A survey,"</article-title>
          <source>Semantic Web Journal</source>
          , pp.
          <volume>1</volume>
          {
          <issue>27</issue>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>O.</given-names>
            <surname>Bruneau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Grapi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Peter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Laube</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.-R.</surname>
          </string-name>
          Massa-Esteve, and T. De Vittori,
          <source>History of Science and Technology, ICT and Inquiry Based Science Teaching</source>
          . Berlin: Frank-Timme,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>O.</given-names>
            <surname>Bruneau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Laube</surname>
          </string-name>
          , and T. de Vittori, \
          <article-title>ICT and History of mathematics in the case of IBST,"</article-title>
          <source>in [5]</source>
          , pp.
          <volume>145</volume>
          {
          <issue>160</issue>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>O.</given-names>
            <surname>Bruneau</surname>
          </string-name>
          and
          <string-name>
            <given-names>S.</given-names>
            <surname>Laube</surname>
          </string-name>
          , \
          <article-title>Inquiry based Science teaching and History of Science,"</article-title>
          <source>in [5]</source>
          , pp.
          <volume>13</volume>
          {
          <issue>28</issue>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>J. M. Gilliot</surname>
            ,
            <given-names>N. C.</given-names>
          </string-name>
          <string-name>
            <surname>Pham</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Garlatti</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          <article-title>Reba, and</article-title>
          <string-name>
            <given-names>S.</given-names>
            <surname>Laube</surname>
          </string-name>
          , \Tackling Mobile &amp;
          <article-title>Pervasive Learning in IBST,"</article-title>
          <source>in [5]</source>
          , pp.
          <volume>181</volume>
          {
          <issue>201</issue>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>M.</given-names>
            <surname>Guedj</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Bachtold</surname>
          </string-name>
          , \
          <article-title>Towards a new strategy for teaching energy based on the history and philosophy of the concept of energy,"</article-title>
          <source>in [5]</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10. T. P. Hughes, \
          <article-title>The Evolution of Large Technological Systems," in The Social Construction of Technological Systems (W</article-title>
          . Bijker,
          <string-name>
            <given-names>T. P.</given-names>
            <surname>Hughes</surname>
          </string-name>
          , and
          <string-name>
            <surname>T. J</surname>
          </string-name>
          . Pinch, eds.), pp.
          <volume>51</volume>
          {
          <issue>82</issue>
          , Cambridge, Massachusetts: MIT Press,
          <year>1987</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11. S. Laube, \Les grues de l'
          <article-title>arsenal en tant que marqueurs de l'evolution scienti que et technologique du port arsenal de Brest,"</article-title>
          in [?], To be published in
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12. S. Laube, \
          <article-title>Culture materielle du port arsenal de Brest au XVIIIeme siecle : approche systemique,"</article-title>
          in [?], To be published in
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>J. Bird</surname>
          </string-name>
          ,
          <article-title>The Major Seaports of the United Kingdom</article-title>
          . London: Hutchinson,
          <year>1963</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <given-names>S.</given-names>
            <surname>Laube</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Rohou</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Garlatti</surname>
          </string-name>
          , \
          <article-title>Humanites numeriques et web semantique</article-title>
          . De l'inter^et de la
          <article-title>modelisation des connaissances en histoire des sciences et des techniques pour une histoire comparee des ports de Brest (France) et Mar del Plata (Argentine),"</article-title>
          <source>in Digital Intelligence</source>
          <year>2014</year>
          , September 17-
          <issue>19</issue>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15. V. de Boer, M. van
          <string-name>
            <surname>Rossum</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Leinenga</surname>
            , and
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Hoekstra</surname>
          </string-name>
          , \
          <article-title>Dutch ships and sailors linked data," in The Semantic Web { ISWC 2014 (P</article-title>
          . Mika,
          <string-name>
            <given-names>T.</given-names>
            <surname>Tudorache</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bernstein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Welty</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Knoblock</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Vrandecic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Groth</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Noy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Janowicz</surname>
          </string-name>
          , and C. Goble, eds.), vol.
          <volume>8796</volume>
          of Lecture Notes in Computer Science, pp.
          <volume>229</volume>
          {
          <issue>244</issue>
          , Springer International Publishing,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16. P. Nabonnand, ed.,
          <source>La correspondance entre Henri</source>
          Poincare et Go
          <article-title>sta MittagLe er</article-title>
          . Basel: Birkhauser,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <given-names>S.</given-names>
            <surname>Walter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Bolmont</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <surname>A</surname>
          </string-name>
          . Core, eds.,
          <source>La correspondance entre Henri</source>
          Poincare et les physiciens,
          <source>chimistes et ingenieurs</source>
          . Basel: Birkhauser,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <given-names>S.</given-names>
            <surname>Walter</surname>
          </string-name>
          , R. Kromer, and M. Schiavon, eds.,
          <article-title>La correspondance entre Henri Poincare avec les astronomes et les geodesiens</article-title>
          . Basel: Birkhauser,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <given-names>P.</given-names>
            <surname>Wittek</surname>
          </string-name>
          and
          <string-name>
            <given-names>W.</given-names>
            <surname>Ravenek</surname>
          </string-name>
          , \
          <article-title>Supporting the Exploration of a Corpus of 17thCentury Scholarly Correspondences by Topic Modeling," in Supporting Digital Humanities 2011: Answering the unaskable (B</article-title>
          . Maegaard, ed.),
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>J.-M. Gilliot</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Garlatti</surname>
            , I. Reba, and
            <given-names>C.</given-names>
          </string-name>
          <article-title>Pham Nguyen, \A Mobile Learning Scenario improvement for HST Inquiry Based learning,"</article-title>
          in Workshop Emerging Web Technologies,
          <article-title>Facing the Future of Education (</article-title>
          , ed.),
          <year>2012</year>
          . Workshop in conjunction with www2012 conference.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21. Documents pour l'
          <source>Histoire des Techniques</source>
          , vol.
          <volume>18</volume>
          -
          <fpage>2</fpage>
          .
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Revue</surname>
          </string-name>
          <article-title>d'histoire moderne et contemporaine</article-title>
          , vol.
          <volume>58</volume>
          -
          <fpage>4bis</fpage>
          .
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <given-names>A.</given-names>
            <surname>Neelameghan</surname>
          </string-name>
          and
          <string-name>
            <given-names>G. J.</given-names>
            <surname>Narayana</surname>
          </string-name>
          , \
          <article-title>Concept and expression of time: Cultural variations and impact on knowledge organization: PART 7: Ontology and representation of time in knowledge organization tools used in information systems1,"</article-title>
          <source>Information Studies</source>
          , vol.
          <volume>19</volume>
          , no.
          <issue>2</issue>
          , p.
          <volume>105</volume>
          {
          <issue>131</issue>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24. I.
          <string-name>
            <surname>Corda</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Bennett</surname>
            , and
            <given-names>V.</given-names>
          </string-name>
          <string-name>
            <surname>Dimitrova</surname>
          </string-name>
          , \
          <article-title>A logical model of an event ontology for exploring connections in historical domains,"</article-title>
          <source>in Workshop on Detection, Representation and Exploitation of Events in Semantic Web (Derive</source>
          <year>2011</year>
          ),
          <source>Tenth International Semantic Web Conference (ISWC)</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25. E. Hyvonen, T. Lindquist, J. Tornroos, and E. Makela, \
          <article-title>History on the semantic web as linked data{an event gazetteer and timeline for the world war i,"</article-title>
          <source>in Proceedings of CIDOC</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>