<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Research on the Identification of breakthrough technologies driven by science</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Dan Wang</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Xiao Zhou</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pengwei Zhao</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Juan Pang</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Qiaoyang Ren</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Xidian University</institution>
          ,
          <addr-line>Xifeng 266 710126 Xi'an, Shaanxi Province</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The identification of breakthrough technologies plays a crucial role in driving technological innovation forward. The science-driven technology innovation pattern has emerged as a significant approach for identifying breakthrough technologies. This paper presents a novel framework for identifying breakthrough technologies based on a science-driven technological breakthrough pattern. The effectiveness of this framework is validated using the field of artificial intelligence as an illustrative example. This method not only assists researchers in accurately identifying the sources and development paths of technological breakthroughs but also provides important information for the formulation of future research and development policies.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Breakthrough technology</kwd>
        <kwd>Knowledge networks</kwd>
        <kwd>Link prediction</kwd>
        <kwd>Structural entropy 1</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Breakthrough innovation, characterized by its highly
revolutionary nature, plays a pivotal role in enabling
enterprises to overhaul industry chains, enhance
competitiveness, and seize prime opportunities in the
increasingly competitive global landscape [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Recent research
has highlighted the significance of the interplay between
science (S) and technology (T) in fostering potential
breakthrough technologies [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Scholars have started to
explore the complex correlation between S and T by
integrating scientific literature and patent information.
This integration has led to the identification of three
primary interaction patterns: science-driven (S-T),
technology-pull (T-S), and science-technology synergy (S&amp;T).
Notably, the science-driven technology pattern signifies
instances where technological advancements stem from
scientific discoveries, serving as a key driver of
technological innovation [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ][
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. The incorporation of scientific
insights into technological progress plays a pivotal role in
enhancing national innovation capabilities and
competitiveness [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ][
        <xref ref-type="bibr" rid="ref6">6</xref>
        ][
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
      <p>This paper adopts a fine-grained representation
approach, considering breakthrough technologies as
composed of several closely related scientific and
technological knowledge elements. To do so, this paper constructs
a breakthrough technology identification framework
based on the science-driven technology innovation
pattern. The core idea of the study is to use new science as
a signal of innovation, to deeply explore the mechanisms
and evolutionary paths through which new science leads
to technological breakthroughs, and on this basis, to
identify breakthrough technologies.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Data and Method</title>
      <p>
        The framework for identifying breakthrough
technology is shown in Figure 1. Firstly, we use papers and
patents as carriers of science and technology,
respectively. We collect data from the Web of Science (WOS)
and Incopat patent databases, using search queries
related to the research topics to download relevant
scientific papers and patents. Secondly, we focus on the
acquisition of new science, which is defined as scientific
topics that are both novel and impactful, yet have not
been integrated into existing technological systems. We
adopted Sentence-BERT (SBERT) [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] and Local Outlier
Factor (LOF) [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] to quantify the novelty of papers, while
utilizing citation counts as a metric for assessing paper
impact.
      </p>
      <p>Subsequently, we integrate new science into the
existing technological system through the construction of
a science-technology network. This network acts as a
channel for merging new scientific findings with
established technological advancements. Link prediction is
employed to uncover deep semantic links between new
science and technology. This is followed by the
application of community detection algorithms to filter
subnetworks containing new science-technology links. These
subnetworks serve as focal points for further analysis
and evaluation. Finally, the impact of these subnetworks
is evaluated using structural entropy to identify
breakthrough technologies.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Empirical analysis</title>
      <p>
        To assess the efficacy of the suggested approach, the
domain of artificial intelligence (AI) is selected as a
representative case study. Following a methodology similar
to that outlined by Tsay et al. [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] and subsequent
removal of duplicate records, a total of 236,333
publications and 29,468 patents related to AI, published
between 2014 and 2018, were identified.
      </p>
      <p>
        The science-technology network consists of 1,161
nodes and 62975 connecting edges, yielding a network
density of 0. 0935. We adopt an attribute feature-based
graph convolutional network (GCN) [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] for link
prediction in the science-technology network to discover
potential linkages between new science topics and
technological topics. After link prediction, Liu et al.'s method [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]
is used to partition the S-T revised network into 13
subnetworks. Two subnetworks that do not contain new
science topics are excluded, leaving 11 subnetworks for
further investigation.
      </p>
      <p>
        We employ the structural entropy measure
proposed by Xu et al. [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] to calculate the structural entropy
influence of each subnetwork. We utilized the median as
a threshold and identified five subnetworks above this
median as potential breakthrough technologies. The
final results were determined in conjunction with expert
opinions. Ultimately, the study identified five
breakthrough technologies. Among them, drug discovery
stands out due to its particularly significant impact. We
conducted a detailed analysis of this breakthrough
technology. Deep learning can train models using large-scale
biological data to predict the activity, toxicity, and other
properties of compounds, thereby rapidly screening
candidate drugs with potential therapeutic effects [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ].
AIdiscovered molecules were listed among the
Massachusetts Institute of Technology (MIT)'s top ten
breakthrough technologies in 2020. In recent years, drug
discovery based on deep learning algorithms has gradually
transitioned from research and development to practical
technology development. The from-scratch drug design
based on deep learning algorithms was recognized by
MIT as a breakthrough in successfully applying artificial
intelligence to the drug design process [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ].
      </p>
    </sec>
    <sec id="sec-4">
      <title>4. Discussion and Conclusion</title>
      <p>This paper proposes a framework for identifying
breakthrough technology, starting with new sciences as
an innovation signal and tracking the evolution of
technological breakthroughs stemming from them. The
primary contributions of this study can be listed as follows.
First, this study proposes a novel method for identifying
breakthrough technologies based on the innovation
pattern of science-driven technological breakthroughs. This
approach enables dynamic tracking and measurement of
the innovation process triggered by new science. Second,
it provides an in-depth characterization of the essence
and core features of new science. Furthermore, by
employing a topic-based fine-grained approach, the study
identifies breakthrough technologies, while also tracking
the dynamic interaction trajectories between new
science and technology at the semantic level.</p>
      <p>Several limitations of our proposed method require
further improvement. This paper primarily considers the
driving effect of science on technological breakthroughs.
Future research could explore the identification of
breakthrough technologies under different patterns of
science and technology interaction. Moreover, alongside
scientific influence, the commercial aspect warrants
attention.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Acknowledgements</title>
      <p>This work was supported by the General Program of
National Natural Science Foundation of China (Grant No.
72374165) .</p>
    </sec>
    <sec id="sec-6">
      <title>6. References</title>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>D. S.</given-names>
            <surname>Hain</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. L.</given-names>
            <surname>Christensen</surname>
          </string-name>
          ,
          <article-title>Capital market penalties to radical and incremental innovation</article-title>
          .
          <source>European Journal of Innovation Management</source>
          ,
          <volume>23</volume>
          (
          <issue>2</issue>
          ):
          <fpage>291</fpage>
          -
          <lpage>313</lpage>
          ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Ba</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Meng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Ma</surname>
          </string-name>
          , Y. Xia,
          <article-title>Discovering technological opportunities by identifying dynamic structure-coupling patterns and lead-lag distance between science and technology</article-title>
          .
          <source>Technological Forecasting and Social Change</source>
          ,
          <volume>200</volume>
          :
          <fpage>123147</fpage>
          ,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A. D.</given-names>
            <surname>Malva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kelchtermans</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Leten</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Veugelers</surname>
          </string-name>
          ,
          <article-title>Basic science as a prescription for breakthrough inventions in the pharmaceutical industry</article-title>
          .
          <source>The Journal of Technology Transfer</source>
          ,
          <volume>40</volume>
          :
          <fpage>670</fpage>
          -
          <lpage>695</lpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>R.</given-names>
            <surname>Veugelers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>Scientific novelty and technological impact</article-title>
          .
          <source>Research Policy</source>
          ,
          <volume>48</volume>
          (
          <issue>6</issue>
          ):
          <fpage>1362</fpage>
          -
          <lpage>1372</lpage>
          ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>J.</given-names>
            <surname>Dedrick</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. L.</given-names>
            <surname>Kraemer</surname>
          </string-name>
          ,
          <article-title>Who captures value from science-based innovation? The distribution of benefits from GMR in the hard disk drive industry</article-title>
          .
          <source>Research Policy</source>
          ,
          <volume>44</volume>
          (
          <issue>8</issue>
          ):
          <fpage>1615</fpage>
          -
          <lpage>1628</lpage>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>V.</given-names>
            <surname>Narayanamurti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. Y.</given-names>
            <surname>Tsao</surname>
          </string-name>
          ,
          <article-title>How technoscientific knowledge advances: A Bell-Labs-inspired architecture</article-title>
          .
          <source>Research Policy</source>
          ,
          <volume>53</volume>
          (
          <issue>4</issue>
          ):
          <fpage>104983</fpage>
          ,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Q.</given-names>
            <surname>Ke</surname>
          </string-name>
          ,
          <article-title>An analysis of the evolution of sciencetechnology linkage in biomedicine</article-title>
          .
          <source>Journal of Informetrics</source>
          ,
          <volume>14</volume>
          (
          <issue>4</issue>
          ):
          <fpage>101074</fpage>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>N.</given-names>
            <surname>Reimers</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Gurevych</surname>
          </string-name>
          ,
          <article-title>Sentence-bert: Sentence embeddings using siamese bert-networks</article-title>
          .
          <source>ArXiv preprint arXiv</source>
          ,
          <year>1908</year>
          :
          <volume>10084</volume>
          ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>D.</given-names>
            <surname>Jeon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Ahn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <article-title>Measuring the novelty of scientific publications: A fastText and local outlier factor approach</article-title>
          .
          <source>Journal of Informetrics</source>
          ,
          <volume>17</volume>
          (
          <issue>4</issue>
          ):
          <fpage>101450</fpage>
          ,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>M. Y.</given-names>
            <surname>Tsay</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z. W.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <article-title>Analysis of the patent cooperation network in global artificial intelligence technologies based on the assignees</article-title>
          .
          <source>World Patent Information</source>
          ,
          <volume>63</volume>
          :
          <fpage>102000</fpage>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>T. N.</given-names>
            <surname>Kipf</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Welling</surname>
          </string-name>
          ,
          <article-title>Semi-supervised classification with graph convolutional networks</article-title>
          .
          <source>In: Proceedings of ICLR 2017</source>
          (pp.
          <fpage>1</fpage>
          -
          <lpage>14</lpage>
          ).
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ott</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Goyal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Du</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Joshi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Levy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lewis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Zettlemoyer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Stoyanov</surname>
          </string-name>
          ,
          <article-title>Roberta: A robustly optimized bert pretraining approach</article-title>
          .
          <source>ArXiv preprint arXiv</source>
          ,
          <year>1907</year>
          :
          <volume>11692</volume>
          ,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>H.</given-names>
            <surname>Xu</surname>
          </string-name>
          , et al,
          <article-title>A methodology for identifying breakthrough topics using structural entropy</article-title>
          .
          <source>Information Processing &amp; Management</source>
          ,
          <volume>59</volume>
          (
          <issue>2</issue>
          ):
          <fpage>102862</fpage>
          ,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>B.</given-names>
            <surname>Bhinder</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Gilvary</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. S.</given-names>
            <surname>Madhukar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Elemento</surname>
          </string-name>
          ,
          <article-title>Artificial intelligence in cancer research and precision medicine</article-title>
          .
          <source>Cancer discovery</source>
          ,
          <volume>11</volume>
          (
          <issue>4</issue>
          ):
          <fpage>900</fpage>
          -
          <lpage>915</lpage>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Singh</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gupta</surname>
            <given-names>H</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sharma</surname>
            <given-names>P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sahi</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <source>Advances in Artificial Intelligence (AI</source>
          )
          <article-title>-assisted approaches in drug screening</article-title>
          .
          <source>Artificial Intelligence Chemistry</source>
          ,
          <volume>2</volume>
          (
          <issue>1</issue>
          ):
          <fpage>100039</fpage>
          ,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>