<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Breaking the Paper Barrier and Publishing At the Speed of Thought</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Katie Chapman</string-name>
          <email>katie@pentandra.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Chris Chapman</string-name>
          <email>chris@pentandra.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Skywriting</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Pentandra Research Solutions</institution>
          ,
          <addr-line>Logan, Utah</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>We are starting to see Semantic Web technologies being used in our research papers and data. The next revolution will be publishing knowledge at the speed of thought.</p>
      </abstract>
      <kwd-group>
        <kwd>Semantic Publication</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        In September 2007, Wired Magazine included an article by Thomas Goetz
entitled \Freeing the Dark Data of Failed Scienti c Experiments" [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], in which
Goetz discussed the incredible value of dark data|data that is led away, never
to see the light of day. We believe that researchers are already well aware of
the understandings gained through research projects that led in unexpected
directions, revealed unwanted results, or simply left the researcher feeling painted
into a dead-ended corner. Such failed research e orts are powerful opportunities
to learn and often lead to important insights. Yet most of the time these
insights are not shared. According to Goetz, the reasons dark data remains dark
include storage costs, technology barriers, nancial constraints, or career
incentives that encourage dissociation from negative and inconclusive research. Goetz
also shared e orts being made at the time to change the situation, such as the
Journal of Negative Results in Biomedicine, open notebook science, and others.
After acknowledging these contributions, Goetz wrote, \These are great rst
steps. But freeing dark data should be the norm, not the exception. Once the
storage and format problems are solved, scientists will need easy ways to search
and retrieve each other's data." Think about how far we've come since 2007.
Data storage solutions abound and great strides have been made to lower the
barrier for the search and retrieval of data by the Linked Data community,
introducing outreach standards such as JSON-LD and RDFa. Yet, with all of this
progress, dark data has still been left in the dark.
      </p>
      <p>Research|all research|has value. That value is independent of any grant
awarded, tenure attained, or publisher's acceptance. The value lies in learning.
Knowledge, truth, and the pursuit of understanding something previously not
understood are the goals and purposes of research. Nowhere in any de nition
of the word research have we read that research only has value if it produces
cherry-picked outcomes that make administrators, editors, grantors, and
publishers happy. And yet, due to the undeniable bias towards \positive" results|or
what have been deemed as such|much of the rest of all research is sentenced
to a ling-cabinet death-by-obscurity. It does not have to be this way in today's
scienti c technology revolution.</p>
      <p>Incentives are the key here. Because future job prospects, tenure, and public
acclaim all tend to revolve around journal publication, why would a researcher
want to take the time to go back, write out, format, narrate, and then attempt
to publish negative results that are likely to end up in unpretentious journals or
on personal blogs that often are not recognized as of much worth by the research
community, employers, or funding agencies? That's a lot of e ort and time for
something that will essentially provide no career or nancial advantages for the
researcher.</p>
      <p>That leaves two things we can do to bring dark data to light: one, change
the incentives involved in research to better re ect the value of negative or
inconclusive research results, and two, remove the barrier of time and e ort that
keeps a researcher from sharing this dark data in the rst place. Ideally, these two
changes would occur simultaneously, but funding agencies and institutional
policies tend to be incredibly slow to change. Social expectations are often equally
or even more sluggish to evolve. So we have a choice: do we sit around and wait
until publishers and institution administrators change incentives to publish dark
data? Or, do we nd ways to make it worth sharing this dark data now?</p>
      <p>
        If the time and e ort barrier were removed from sharing noncon rmatory
research, we believe that many researchers would gladly share those results even
if there were no external incentives. Open notebook science gives us a good
starting place, but we need to overcome the problem of data deluge [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]
inherent in many of the Web 2.0 approaches and start preserving knowledge, not
just notes. Semantic Web technologies applied to that sort of \record as you
research" mentality would produce human- and machine-understandable
knowledge that could be sorted by machines and handed to researchers on a silver
platter. Imagine if the data, analyses and conclusions, as well as the thoughts
and intents of the researcher|or in short, every aspect of the research process|
were preserved in context by intuitive, domain-speci c research systems capable
of generating the underlying semantic representations automatically. The
development of open vocabularies expansive enough to preserve the entire research
process are a prerequisite to building such systems. Taking it a step further, if
that big-data amount of semantically understandable research was captured as a
researcher went along|at the speed of thought [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]|the negative or inconclusive
or downright befuddling results would be accessible without any extra e ort.
      </p>
      <p>The processes of researching and publishing would become distinct and
separate from presenting the research results to a certain audience in, for example,
a paper. If the researcher wanted to write and submit a paper based on those
noncon rmatory results, there would be nothing to prevent that. In fact, having
the entire research process preserved would make writing up such journal
articles easier anyway. Other presentation formats could be created, such as video or
interactive media, without worrying that some important parameter or intricate
concept might be left out. But the biggest game changer is that all of that dark
data would come to light, and that the paper might not always be necessary.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Goetz</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Freeing the Dark Data of Failed Scienti c Experiments</article-title>
          . Wired
          <string-name>
            <surname>Magazine</surname>
          </string-name>
          (
          <year>2007</year>
          ), http://archive.wired.com/science/discoveries/magazine/15-10/ st_essay
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>2. Open Notebook Science (Drawbacks), Wikipedia, https://en.wikipedia.org/w/ index.php?title=Open_notebook_science&amp;oldid=654594797#Drawbacks</mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Harnad</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>The Future of Scholarly Skywriting</article-title>
          . In: Scammell,
          <string-name>
            <surname>A</surname>
          </string-name>
          . (ed.)
          <article-title>\i in the Sky: Visions of the information future"</article-title>
          <source>Aslib</source>
          (
          <year>1999</year>
          ), http://users.ecs.soton.ac. uk/harnad/Papers/Harnad/harnad99.aslib.html
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>