<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Technology Topic Evolution from the Perspective of Patent Validity1⋆</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jinzhu Zhang</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Runze Chen</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Information Management, School of Economics and Management, Nanjing University of Science and Technology</institution>
          ,
          <addr-line>Xiaolinwei Str 200, 210094, Nanjing</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In technology topic evolution analysis, current research primarily uses datasets that include both valid and invalid patents, which can affect the accuracy of assessing technology development trends. This paper categorizes patents by legal status into valid and invalid groups for separate evolution. A twodimensional evolution trajectory based on patent validity is constructed for common technology topics, providing a clearer view of which technologies are becoming more mature, and which are saturated or lagging. Experimental validation in the field of 3D printing has demonstrated the effectiveness of this approach.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;topic evolution</kwd>
        <kwd>patent validity</kwd>
        <kwd>invalid patents</kwd>
        <kwd>LDA model</kwd>
        <kwd>3D printing</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Exploring the evolution of technology topics in depth not only aids in advancing the technology
itself but also provides crucial support for sustained societal progress[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. A substantial number
of studies use patent data sources such as Derwent Innovation and USPTO to conduct research on
topic evolution [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. However, these studies primarily use datasets that include both valid and
invalid patents, without considering their validity. As a result, outdated technologies may still
influence the assessment of technological development trends. To more accurately identify
technological trends, this study classifies patents in the field of 3D printing by legal status into
valid and invalid categories [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Using an LDA model [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], it analyzes the evolution of technology
topics separately for each category. A two-dimensional coordinate map [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] based on the validity of
common technology topics is then constructed, with dynamic evolution trajectories plotted to
reveal which technologies are becoming more mature and which are saturated or lagging.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Data and method</title>
      <p>This paper utilizes patent validity information to classify and analyze the evolution of patents,
distributing technologies into four regions from the perspective of patent validity and revealing the
evolution trajectories of technology topics over time. First, patent data is divided by legal status
into valid and invalid categories. Next, an LDA model is applied separately to each category within
a shared vocabulary space to generate evolution trend charts of technology topic proportions and
identify common topics. Finally, a two-dimensional coordinate map and evolution trajectory are
constructed for the common technology topics of both categories, based on their valid and invalid
proportions, to reflect their evolution trends.</p>
      <sec id="sec-2-1">
        <title>2.1. Data division</title>
        <p>We obtained 3D printing patent data from Google Patents because it clearly includes the legal
status of patents, totaling 25,899 records from 2014 to 2020. Patents marked as "Active," "Granted,"
and "Active - Reinstated" were classified as valid (20,975 records), while those marked as
1Joint Workshop of the 2th Innovation Measurement for Scientific Communication (IMSC) in the Era of Big Data
(IMSC2024), Dec 20th, 2024, Hong Kong, China and Online
∗ Corresponding author.</p>
        <p>zhangjinzhu@njust.edu.cn (Jinzhu Zhang); chenrunze0819@163.com (Runze Chen)
0000-0001-7581-1850 (Jinzhu Zhang); 0009-0002-2317-3898 (Runze Chen)
© 2024 Copyright 2024 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
"Abandoned," "Expired - Fee Related," "Expired - Lifetime," and "Ceased" were classified as invalid
(4,924 records). We collected text information, including titles, abstracts, and claims.</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Topic detection of valid and invalid Patents based on LDA model</title>
        <p>To determine the optimal number of topics, we calculate both topic perplexity and topic coherence.
Separate LDA models were then trained in valid and invalid Patents, using a shared vocabulary
space. Identify common topics between valid and invalid patents by calculating keyword similarity.</p>
      </sec>
      <sec id="sec-2-3">
        <title>2.3. Topic positioning and evolution based on patent validity</title>
        <p>For common technology topics between valid and invalid patents, we create a two-dimensional
coordinate map based on their proportions in valid
2.4. and invalid patents. We distribute technology topics into four quadrants, as shown in
figure 1:
</p>
        <p>
          First Quadrant: Innovation Intensive Zone (High Valid Proportion, High Invalid
Proportion): These technology topics are core competencies with high innovation potential
and commercial value[
          <xref ref-type="bibr" rid="ref8">8</xref>
          ].
        </p>
        <p>
          Second Quadrant: Mature &amp; Stable Zone (High Valid Proportion, Low Invalid Proportion):
High and stable technical maturity, representing current mainstream technologies[
          <xref ref-type="bibr" rid="ref9">9</xref>
          ].
Third Quadrant: Low Activity Zone (Low Valid Proportion, Low Invalid Proportion):
Emerging or niche markets with low attention.
        </p>
        <p>Fourth Quadrant: Risk Obsolescence Zone (Low Valid Proportion, High Invalid Proportion):
High patent risk, potentially outdated or low-value technologies.</p>
        <p>Then, plot the dynamic evolution trajectory of each common technology topic over time.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Results</title>
      <sec id="sec-3-1">
        <title>3.1. Topic positioning based on patent validity</title>
        <p>In topic detection, we identified 12 valid patent topics and 7 invalid patent topics based on the
highest coherence and lower perplexity, finding 7 common topics between them. For two
categories, we separately plotted the evolution trends of technology topics based on topic
proportions (Figure 2, using invalid patent as an example). Additionally, we positioned the 7
common topics on a two-dimensional coordinate map based on their valid proportions and invalid
proportions (Figure 3, using 2014 as an example).</p>
        <p>It can be seen from figure 3 that " Metal &amp; Composite Powders" and " Digital Model Processing"
are in the first quadrant, indicating they have high innovative potential and commercial value, and
are likely to occupy an important position in the market in the future.</p>
        <p>"UV-Curable Resins," "Color &amp; Material Calibration," and "Optical Sensing &amp; Projection" are in
the third quadrant, suggesting low attention and potential as emerging or niche market
technologies.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Topic evolution based on patent validity</title>
        <p>We recorded the positions of each common technology topic in 2014, 2017, and 2020, and plotted
the dynamic evolution trajectories, as shown in figure 4.</p>
        <p>It can be observed that "Fluid &amp; Gas Flow Systems" moved from the fourth quadrant to the first,
indicating that its market competitiveness and technological activity have significantly increased,
entering a phase of intense competition, and it may experience greater market opportunities.</p>
        <p>In contrast, "Digital Model Processing" moved from the first quadrant to the third, and then to
the fourth, reflecting a significant decline in its applications and innovation activities, suggesting a
risk of obsolescence or replacement by other technologies.</p>
        <sec id="sec-3-2-1">
          <title>4. Conclusion</title>
          <p>This paper utilizes patent validity and invalidity information to reveal technology topic evolution
trends in the 3D printing field through the perspective of patent validity.</p>
        </sec>
        <sec id="sec-3-2-2">
          <title>Acknowledgements</title>
          <p>This work is supported by National Natural Science Foundation of China (Grant No. 72374103,
71974095), China Society of Indexers (Grant No. CSI24C10), and Jiangsu Provincial Federation of
Philosophy and Social Sciences (Grant No. 24SYC-023). The paper is presented at the second
Workshop on “Innovation Measurement for Scientific Communication (IMSC) in the Era of Big
Data” at 2024 ACM/IEEE Joint Conference on Digital Libraries (JCDL).</p>
        </sec>
        <sec id="sec-3-2-3">
          <title>Declaration on Generative AI</title>
          <p>During the preparation of this work, the authors used ChatGPT in order to: Improve writing style,
Grammar and spelling check. After using this tool, the authors reviewed and edited the content as
needed and take full responsibility for the publication’s content.</p>
        </sec>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <source>Core Technology Topic Identification and Evolution Analysis Based on Patent Text Mining-A Case Study of Unmanned Ship. Appl. Sci</source>
          .
          <year>2024</year>
          ,
          <volume>14</volume>
          , 4661. https://doi.org/10.3390/app14114661
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhou</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Feng</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
           et al. 
          <article-title>Mapping and comparing the technology evolution paths of scientific papers and patents: an integrated approach for forecasting technology trends</article-title>
          .
          <source>Scientometrics</source>
          (
          <year>2024</year>
          ). https://doi.org/10.1007/s11192-024-04961-0
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yi</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <article-title>An integrated approach for detecting and quantifying the topic evolutions of patent technology: a case study on graphene field</article-title>
          .
          <source>Scientometrics</source>
          <volume>126</volume>
          ,
          <fpage>6301</fpage>
          -
          <lpage>6321</lpage>
          (
          <year>2021</year>
          ). https://doi.org/10.1007/s11192-021-04000-2
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Zhang</given-names>
            <surname>Xian</surname>
          </string-name>
          , Zeng Rongqiang,
          <string-name>
            <surname>Li</surname>
            <given-names>Shuying</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Li</given-names>
            <surname>Jiahui</surname>
          </string-name>
          .
          <source>Patented Technology Evolution Path Identification Based on Multi-relation Data Fusion[J]. Library and Information Service</source>
          ,
          <year>2024</year>
          ,
          <volume>68</volume>
          (
          <issue>3</issue>
          ):
          <fpage>71</fpage>
          -
          <lpage>84</lpage>
          . https://doi.org/10.13266/j.issn.
          <volume>0252</volume>
          -
          <fpage>3116</fpage>
          .
          <year>2024</year>
          .
          <volume>03</volume>
          .007
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Fu</given-names>
            <surname>Ying</surname>
          </string-name>
          .
          <article-title>Strategy for Development and Utilization of Invalid Patent</article-title>
          . R &amp;
          <string-name>
            <surname>D management</surname>
          </string-name>
          .
          <year>2005</year>
          , (
          <volume>03</volume>
          ):
          <fpage>96</fpage>
          -
          <lpage>100</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Blei</surname>
            ,
            <given-names>David M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Andrew</surname>
            <given-names>Y.</given-names>
          </string-name>
          <string-name>
            <surname>Ng</surname>
            , and
            <given-names>Michael I. Jordan.</given-names>
          </string-name>
          "
          <source>Latent Dirichlet Allocation." Journal of Machine Learning Research</source>
          , vol.
          <volume>3</volume>
          ,
          <issue>2003</issue>
          , pp.
          <fpage>993</fpage>
          -
          <lpage>1022</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Xiaotong</given-names>
            <surname>Han</surname>
          </string-name>
          , Donghua Zhu, Ming Lei, Tugrul Daim,
          <string-name>
            <surname>R&amp;</surname>
          </string-name>
          <article-title>D trend analysis based on patent mining: An integrated use of patent applications and invalidation data</article-title>
          ,
          <source>Technological Forecasting and Social Change</source>
          , Volume
          <volume>167</volume>
          ,
          <year>2021</year>
          , 120691, ISSN 0040-1625, https://doi.org/10.1016/j.techfore.
          <year>2021</year>
          .120691
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Hall</surname>
          </string-name>
          ,
          <string-name>
            <surname>Bronwyn H</surname>
          </string-name>
          .
          <article-title>and</article-title>
          <string-name>
            <surname>Harhoff</surname>
          </string-name>
          , Dietmar,
          <source>Recent Research on the Economics of Patents (July</source>
          <year>2012</year>
          ).
          <source>Annual Review of Economics</source>
          , Vol.
          <volume>4</volume>
          , pp.
          <fpage>541</fpage>
          -
          <lpage>565</lpage>
          ,
          <year>2012</year>
          , Available at SSRN: https://ssrn.com/abstract=2139260 or http://dx.doi.org/10.1146/annureveconomics-080511-111008
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Lerner</surname>
          </string-name>
          , Josh. “
          <article-title>The Importance of Patent Scope: An Empirical Analysis</article-title>
          .
          <source>” The RAND Journal of Economics</source>
          <volume>25</volume>
          (
          <year>1994</year>
          ):
          <fpage>319</fpage>
          -
          <lpage>333</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>