=Paper= {{Paper |id=Vol-435/paper-23 |storemode=property |title=A Secure Data Repository Integrating Huge Volumes of Digital Contents as a Semantic Web |pdfUrl=https://ceur-ws.org/Vol-435/poster11.pdf |volume=Vol-435 |authors=Norio Kobayashi,Yuko Makita,Eli Kaminuma,Shuji Kawaguchi,Yuko Yoshida,Yoshiki Mochizuki,Akihiro Matsushima,Manabu Ishii,Ryo Umetsu,Satomi Asano,Naohiko Heida,Tetsuya Sakurai,Takashi Kuromori,Kazuo Shinozaki,Tetsuro Toyoda |dblpUrl=https://dblp.org/rec/conf/swat4ls/KobayashiMKKYMM08 }} ==A Secure Data Repository Integrating Huge Volumes of Digital Contents as a Semantic Web== https://ceur-ws.org/Vol-435/poster11.pdf
  A Secure Data Repository Integrating Huge
 Volumes of Digital Contents as a Semantic Web

     Norio Kobayashi1 , Yuko Makita1 , Eli Kaminuma1 , Shuji Kawaguchi1 , Yuko
      Yoshida1 , Yoshiki Mochizuki1 , Akihiro Matsushima1 , Manabu Ishii1 , Ryo
       Umetsu1 , Satomi Asano1 , Naohiko Heida1 , Tetsuya Sakurai2 , Takashi
                 Kuromori2 , Kazuo Shinozaki2 , and Tetsuro Toyoda1
      1
          Bioinformatics And Systems Engineering division (BASE), RIKEN, JAPAN
                                  toyoda@base.riken.jp??
                      2
                        Plant Sciences Center (PSC), RIKEN, JAPAN



1         Abstract

Recent omics sciences are strongly supported by the advances in various detec-
tion technologies, such as the next generations high-throughput sequencers, high-
resolution imaging devices and remote-sensing technologies. The obtained data,
the processed data and knowledge extracted from the data are huge, heterolo-
gous, complex and interconnected, and preventing us from sharing or depositing
all of the data to the public databases. We have developed a semantic-web-based
data repository by using the Semantic Web Folders (SWF for short) technology
which is able to share large-scale omics data, experimental raw data, images,
sequences and analysed data within a closed community and to publish all or a
part of the data from the repository. SWF is designed not only for the data pub-
lishing infrastructure, but also as a collaboration tool that allows researchers to
generate a database. More concretely, SWF is designed to achieve the following
features: 1) implementing an ontology-based data structure including file folders
which enables users to write and store various kind of omics data, 2) supporting
various data input/output formats including standardized data formats, 3) im-
plementing a data viewer/editor including a meta search engine over SWF data,
and 4) providing a communication tool that allow users to collaborate with each
other to edit their data. In the current implementation of SWF, we employ a
30 TB Network Attached Strage system that stores a file folder associated with
a ontology term. Moreover, more than 100 database projects including public
ontology data and RIKEN’s original and public databases have been integrated
on SWF. SWF is available at http://database.riken.jp.




??
     To whom correspondence should be addressed.