=Paper=
{{Paper
|id=Vol-2957/paper7
|storemode=property
|title=Old is Not Always Gold: Early Identification of Milestone Patents Employing Network Flow Metrics
|pdfUrl=https://ceur-ws.org/Vol-2957/paper7.pdf
|volume=Vol-2957
|authors=Manajit Chakraborty,Fabio Crestani
|dblpUrl=https://dblp.org/rec/conf/swisstext/ChakrabortyC21
}}
==Old is Not Always Gold: Early Identification of Milestone Patents Employing Network Flow Metrics==
Old is Not Always Gold: Early Identification of Milestone Patents Employing Network Flow Metrics Manajit Chakraborty Fabio Crestani Faculty of Informatics Faculty of Informatics Università della Svizzera italiana Università della Svizzera italiana Lugano, Switzerland Lugano, Switzerland chakrm@usi.ch fabio.crestani@usi.ch Abstract often varies depending on the nature of the tech- nology. Previous research has already endorsed Bibliometrics has been employed previ- technological forecasting1 as an integral element ously with patents for technological fore- to stay ahead of the curve for corporations and casting. The primary challenge that tech- governments (Campbell, 1983). Acs et al. (2002) nological forecasting faces is early-stage suggested that patents provide a fairly reliable mea- identification of technologies with the po- sure of innovative activity. Identifying important tential to have a significant impact on the patents, observing their change of importance as socio-economic landscape. With this aim, captured by the variation of citation measures and we carry out an exploratory study using analyzing them can lend us new insights as to how various network-based metrics on patent innovation evolves over a period of time. This citation network to identify patents which could be beneficial for innovators and companies are possible candidates for major influence who are actively involved in producing patents. It in the immediate future. To effectively would facilitate them to take stock of the innova- uncover these patents shortly after they tion quotient of a particular technological area and are issued, we need to go beyond raw ci- help measure its growth and potential over a certain tation counts and take into account both period of time. the citation network topology and tempo- In this paper, we aim to identify influential ral information. We posit that, as with patents from different technological areas from scholarly citations, not all patent citations patent citation network using network flow algo- carry equal importance with age. This in- rithms. Identifying top patents from any particular formation is captured by dynamic network category can help companies interested in patenting flow metrics that take the effect of time to glean an overview of the important innovations on the citations into account. Identifying in their field of concern. It can also benefit govern- top patents can aid in re-ranking of search ments in deciding various policies such as funding results in a patent search. We carry out our to technological areas that have shown promise experiments on two standard collections over the last few years. We argue that while cita- of patents and present some insightful re- tion count may help us identify important patents, sults and observations based on rigorous it tends to favour patents that have been filed or analysis. granted long ago, thus providing it a longer citation 1 Introduction accumulation period. While PageRank helps to mitigate the situation to a certain extent by consid- Patent citations, namely references to prior patent ering the whole network instead of simple citation documents and the state-of-the-art included therein, count, PageRank too has been known to be biased and their frequency are also often used as indica- against recent network nodes. CiteRank (Walker tors for the technological and commercial value et al., 2007) introduces exponential penalization of of a patent and to identify “key” patents, which old nodes, thus modelling the node score such that it captures the future citation count gain. However, Copyright © 2021 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 Interna- 1 tional (CC BY 4.0) https://hbr.org/1967/03/technological-forecasting due to CiteRank’s known limitations, we propose a tions reflect the fact that either a new technology new model called Time-Attentive Ranking, which builds on an existing one or that they serve a sim- helps to capture the temporal changes and their ilar purpose. As a result, chains of citations al- effect on certain nodes. We carry out our experi- low us to trace the technological evolution, and ments on two different datasets to determine the hence patent centrality in the citation network can efficacy and effectiveness of our method against be used to score patents. In our preliminary citation baselines both qualitatively and quantitatively. We analysis, we have adopted a couple of PageRank then carry out a comparison of the top-N ranked list based approaches along with other citation metrics. of patents provided by three algorithms using Rank PageRank (Bedau et al., 2011; Bruck et al., 2016) Biased Overlap (Webber et al., 2010) and against and similar eigenvector-based metrics (Doira and a list of significant patents by Strumsky and Lobo Banerjee, 2015) has been computed on patent cita- (2015), to point out the relative changes. We posit tion networks earlier. Mariani et al. (2016) argued that top-ranked patents or the ranking criteria for on similar lines in the case of scholarly articles and the same could be employed for a ranking based proposed a re-scaled version of PageRank that dis- patent retrieval method as have been exploited by counts citations for old papers based on age. We Xue and Croft (2009) and Liao and Veeramacha- build upon this notion and perform a thorough anal- neni (2010). Our experiments are two-pronged – ysis of patent citation network in sub-categories first, we study the effect of the network metrics on and sectors and in the presence (or absence) of European patents from the MAREC dataset and patent content by employing a proposed network secondly, we employ an adapted version of a deep flow algorithm. learning model that infuses both textual content and network flow metrics on USPTO patents in 3 Methodology order to spot influential patents and validate our We employ three different network-based patent- hypothesis. level metrics for comparison: PageRank scores P , CiteRank score C and our proposed Time-Attentive 2 Related Works Rank score T . The notion of quantitative evaluation of scientific 3.1 PageRank and technological impact builds on the basic idea PageRank (Brin and Page, 1998) is a link analysis that the scientific merits of papers (Radicchi et al., algorithm and it assigns a numerical weighting to 2008), scholars (Egghe, 2006), journals (Bollen each element of a hyperlinked set of documents, et al., 2006), universities (Molinari and Molinari, such as the World Wide Web, with the purpose of 2008) and countries (Cimini et al., 2014) can be “measuring” its relative importance within the set. gauged by metrics based on the received citations. The algorithm may be applied to any collection of Bibliometrics has been employed in a variety of entities with reciprocal quotations and references. scenarios to measure and analyze citations since The numerical weight that it assigns to any given el- they provide a rich source of information. Sci- ement E is referred to as the PageRank of E. PageR- entific papers and scholarly articles have been in- ank normalizes the number of links on a document vestigated using various bibliometric tools, espe- by not counting each of them as equal. PageRank cially citations for a long period (Narin et al., 1976; can be defined as follows (Equation 1): Bakkalbasi et al., 2006). One of the early studies to measure the technological impact based on patent X pnj X pnj 1 − α citations was done by Karki (1997). He proposed a Pin+1 = α. Aij +α. + kjout N N host of technological indicators based on citations j:kjout >0 out j:kj =0 among patents. (1) where kjo ut = P Carpenter et al. (1981) and Fontana et al. (2013) l Alj is the outdegree of node compared patents associated with inventions that j, α is the teleportation parameter, and n is the received a prize and patents from a control group, iteration number. The PageRank score Pi of node i finding again evidence that “important” patents are can be interpreted as the average fraction of time more cited (the mean number of citations received spent on node i by a random walker who with was found to be about 50% higher for important probability α follows the network’s links and with patents). As argued by (Jaffe et al., 2000), cita- probability 1 − α teleports to a random node. We consider α = 0.5 throughout this paper since it is the accepted choice for citation networks (Chen N −ni , if p cites p and et al., 2007). γ i j Rn,γ (i, j) = t(pi ) = tni ≤ tn (4) 3.2 CiteRank 0, otherwise CiteRank (Walker et al., 2007) was designed specif- where γ < 1 is the retention probability given to ically for ranking papers in a citation network. Cit- attach more weight to a recent patent and decrease eRank performs a random walk on an aggregated the weight as the patent keeps ageing. The con- citation graph but initiates the walk from a recent tagion matrix can then be written as Equation 5 paper chosen with the probability that depends on (using Equation 3 and 4): its age. Authors estimated parameters of the ran- dom walk by fitting papers’ CiteRank score to the N −1 number of citations accrued by papers over some X EM N,α,γ = αi R i N (5) time period. Let us suppose M is a transfer ma- i=1 trix with elements Mij = 1/Lj if paper j cites i and 0 otherwise. The probability that a researcher and hence the score of a patent pj at the end of follows the citation links to encounter a paper is aPtime period [ti , tN ] is given by EM N (j) = defined as in Equation 2: i EM N (i, j). For our experiments we consider the best possible settings by empirically setting α = 0.1 and γ = 0.3. Elsewhere in the paper we ~ = I0 .~ C ρ +(1−α)2 M 2 .~ ρ +(1−α)M.~ ρ +... (2) refer to the EM score as T for uniformity and ease of comprehension. where I0 is an identity matrix, ρi =exp−agei /τ is the probability of initially selecting paper i, agei 4 Experimental Setup is the age of the paper and τ is the characteristic 4.1 Datasets decay time. In this paper, we consider α = 0.5 and τ = 2.6 years, as specified by Walker et al. (2007). For this study, we used two datasets: (1) European Patent (EP) collection from the MAtrixware RE- 3.3 Time-Attentive Rank search Collection (MAREC)2 and (2) US patents dataset collected by Kogan et al. (2017) that spans Our proposed model Time-Attentive Ranking is the period between 01-01-1926 and 11-02-2010. based on the notion that ‘An inventor or patentee To the best of our knowledge, there exists no study can find patents by following citations links back of a similar kind on the European Patents, which is in time from a particular patent’. The number of why we chose to work with the former collection paths that can be attenuated between patent pi and from MAREC. However, this presents a unique pj can be expressed as a contagion matrix M given challenge of finding a respective gold-standard by Equation 3: list of “milestone” patents, which is not available. Hence for this collection, we resort to a qualitative N −1 X evaluation as described in later sections. To com- MN,α = αi Ai N (3) pare our proposed approach’s performance against i=1 the state-of-the-art and perform a quantitative eval- where An is the adjacency matrix of patents cit- uation, we repeated our experiments on the USPTO ing each other for a particular year tn and α is the dataset as well. Additionally, we also performed probability of following a citation link. The more validation of our model’s performance by a deep paths there are from patent pi to pj , the higher learning technique as suggested in Chung and Sohn the likelihood that an inventor will find pj by fol- (2020) to identify a patent’s grade in determining lowing citation chains from pi , which is similar to in value. α-Centrality (Bonacich, 1987) and Katz centrality 4.2 Preprocessing (Katz, 1953) metrics. Since the existing contagion matrix does not account for time and hence weights MAREC collection We only considered granted each edge equally, the authors propose a retained patents from the ‘EP’ collection. For uniformity, 2 adjacency matrix which is given by Equation 4: http://www.ifs.tuwien.ac.at/imp/marec.shtml we removed patents that had some metadata miss- and Time-Attentive Rank score T . As mentioned ing, such as classification codes or patent citations. earlier, each patent is endorsed with several classi- We also did not consider the non-patent citations fication codes that classify them into sectors, cat- since they are out of the scope of our study. We egories, sub-categories etc. The highest level of pre-processed the data to only keep the citations classification is according to sectors (A-H). Each between patents that were issued within 1976-2008, sector consists of several categories (A61K, A61P, removing thereby the citations to patents issued be- ...), each category consists of several sub-categories fore 05-1976. Hence, we were left with a network and so on. A single patent can belong to several consisting of only EP-EP patent citations formed sectors and categories. out of 251,664 patents having 350,164 citations. Table 1: Top-5 Patents by citation count USPTO collection Unlike the well-known NBER patent data, the dataset provided by Kogan PatentID No. of Citations et al. (2017) has a vastly improved coverage. We EP0037691 125 EP0272189 121 pre-processed the data to only keep the citations EP1049021 121 between patents that were issued within the time EP0364618 121 period of 01-01-1926 and 11-02-2010, removing EP0527247 121 thereby the citations to patents issued before 01-01-1926. The resulting citation network 5.1.1 Complete Network analyzed in this paper is composed of 6,237,625 From Table 2, we can observe significant changes patents and 45,962,301 citations between them. in the ranking of the patents. The tables reveal that 5 Results and Analysis there are more recent patents granted after the year 2000 in the top-10 list ranked by Time-Attentive 5.1 MAREC Patents Ranking than that produced by either PageRank In this section, we present a qualitative of the re- or CiteRank. To be precise, the Time-Attentive sults followed by a comprehensive analysis of the ranking method includes five patents granted after same. We ran the experiments on networks that the year 2000 in the top-10 list, while for PageRank were spliced in time (yearly) over a ten year period and CiteRank, it is four out of ten for both. Of (1998-2008). The same set of experiments were course, the difference is even more pronounced as carried out on networks comprising of all patents, we go deeper in the lists, say, top-15, top-20 and so patents belonging to a certain sector and patents on, which we could not present here due to space belonging to a certain category. On studying the in- constraints. degree and out-degree of patents, we observed that For comparison, the list of top five patents on the degrees are very skewed, i.e., only a handful of the basis of citation count is presented in Table patents gets a large number of citations while most 1. One can observe that none of the three metrics of the patents in the network have less than ten cita- ranks the patents from Table 1 in their top five tions. Hence, this network follows a similar pattern list. In fact, within the top fifteen results, only to that of a scholarly article citation network (al- patent EP0272189 and EP0364618 feature in the beit with more skewness). Hence, the network-flow lists compiled according to PageRank and CiteR- algorithms that can be employed with paper cita- ank scores, while patent EP0527247 is listed by tion networks can also be adapted here. Moreover, PageRank only. The rest do not find a place in there is a strong correlation between the in-degree the top-10 of any network-based score list. This and out-degree of the patents in both collections, corroborates our initial hypothesis that is simply which implies that highly-cited patents tend to be acquiring a high citation count does not indicate cited by other highly-cited patents and to cite other the importance of a patent. highly-cited patents (Ren et al., 2018). 5.1.2 Network for a particular sector Qualitative Comparison of Top Patents : For Next, we perform the same set of experiments over an intuitive understanding of how the different individual sectors of patents. Similar to the trend network-flow metric scores affect the rank, it is shown by the complete network, for sector B which important to observe the top-ranked patents accord- has the highest number of patents, we observe from ing to the PageRank score P , CiteRank score C Table 3 that Time-Attentive Ranking features the Table 2: Top-10 patents for 2008 ranked by scores Rank PatID Title Date #Citations 1 EP0251752 Aluminum-stabilized ceria catalyst compositions and method of making 29-06-1987 111 same. 2 EP1304455 Particulate filter for purifying exhaust gases of internal combustion en- 17-10-2002 103 gines 3 EP0728435 Cyclone dust extractor 20-02-1996 87 PageRank score P 4 EP1031939 COMPOSITE IC CARD 16-11-1998 69 5 EP1261147 A method and system for simultaneous bi-directional wireless commu- 21-05-2001 78 nication between a user station and first and second base stations 6 EP0776864 Process for the aerobic biological purification of water 10-07-1996 34 7 EP1466940 Carbon fiber composite material and process for producing the same 13-04-2004 57 8 EP1400858 PHOTORESIST STRIPPER COMPOSITION 21-06-2002 28 9 EP1059092 Use of complexes among cationic liposomes and polydeoxyribonu- 08-06-1999 19 cleotides as medicaments 10 EP0534904 Imidazolylmethyl-pyridines. 21-09-1992 23 1 EP0251752 Aluminum-stabilized ceria catalyst compositions and method of making 29-06-1987 111 same. 2 EP1304455 Particulate filter for purifying exhaust gases of internal combustion en- 17-10-2002 103 gines 3 EP0728435 Cyclone dust extractor 20-02-1996 87 CiteRank score C 4 EP1031939 COMPOSITE IC CARD 16-11-1998 69 5 EP1261147 A method and system for simultaneous bi-directional wireless commu- 21-05-2001 78 nication between a user station and first and second base stations 6 EP0776864 Process for the aerobic biological purification of water 10-07-1996 34 7 EP1466940 Carbon fiber composite material and process for producing the same 13-04-2004 57 8 EP1400858 PHOTORESIST STRIPPER COMPOSITION 21-06-2002 28 9 EP1059092 Use of complexes among cationic liposomes and polydeoxyribonu- 08-06-1999 19 cleotides as medicaments 10 EP0534904 Imidazolylmethyl-pyridines. 21-09-1992 23 1 EP0251752 Aluminum-stabilized ceria catalyst compositions and method of making 29-06-1987 111 same. 2 EP1304455 Particulate filter for purifying exhaust gases of internal combustion en- 17-10-2002 103 TimeAttentiveRank score T gines 3 EP0728435 Cyclone dust extractor 20-02-1996 87 4 EP1031939 COMPOSITE IC CARD 16-11-1998 69 5 EP1835243 Evaporator with electronic circuit printed on a first side plate 26-02-2007 21 6 EP1261147 A method and system for simultaneous bi-directional wireless commu- 21-05-2001 78 nication between a user station and first and second base stations 7 EP1466940 Carbon fiber composite material and process for producing the same 13-04-2004 57 8 EP0364618 Multiple signal transmission device. 18-10-1988 121 9 EP0776864 Process for the aerobic biological purification of water 10-07-1996 57 10 EP1400858 PHOTORESIST STRIPPER COMPOSITION 21-06-2002 28 Table 3: Sector B patents of 2008 ranked Table 4: Category A61K patents of 2008 ranked Rank Patent ID Date Rank Patent ID Date 1 EP0728435 20-02-1996 1 EP0776864 10-07-1996 PageRank P PageRank P 2 EP0008860 20-07-1979 2 EP0728435 20-02-1996 3 EP0095603 07-05-1983 3 EP0071564 19-07-1982 4 EP1142619 26-09-2000 4 EP0002210 17-11-1978 5 EP0466535 18-06-1991 5 EP0447285 27-02-1991 1 EP0728435 20-02-1996 1 EP0776864 10-07-1996 CiteRank C CiteRank C 2 EP1304455 17-10-2002 2 EP0728435 20-02-1996 3 EP1142619 26-09-2000 3 EP1835243 26-02-2007 4 EP0534904 21-09-1992 4 EP1568666 22-02-2005 5 EP1731327 10-06-2005 5 EP0770375 13-09-1996 1 EP0728435 20-02-1996 1 EP0776864 10-07-1996 TimeAttentiveRank T TimeAttentiveRank T 2 EP1329412 10-10-2000 2 EP0527247 08-08-1991 3 EP1489033 05-06-2004 3 EP0364618 18-10-1988 4 EP1306147 23-10-2002 4 EP0272189 17-12-1987 5 EP1674419 21-12-2005 5 EP0728435 20-02-1996 more recent patents in their top five as compared 5.1.3 Network for a particular category to their counterparts. While it is interesting to study the complete net- work and find the most influential patents as identi- fied by Time-Attentive Rank, it does not deliver us Table 7: RBO among the full ranked Table 5: RBO@20 for 2008 Table 6: RBO@20 for A61K lists P C T P C T P C T P – P – P – C 0.4981 – C 0.3430 – C 0.8548 – T 0.3921 0.5741 – T 0.2568 0.3963 – T 0.6307 0.7270 – a lot of information. On the other hand if we limit ment can be defined as the proportion of S and T the patent citation network by categories, it could that are overlapped at depth d. Rank-biased Over- provide us some insights as to which technologies lap falls in the range [0, 1], where 0 means disjoint, have been gaining momentum in the last few years and 1 means identical. While RBO is the agree- of the patent data. The total number of categories in ment score between two indefinite lists, we are the patent database exceeds hundred. Not surpris- more concerned with the top-k elements in the lists ingly, the distribution of patents against categories and hence RBO@k provides us a better measure to is also skewed. For brevity, we present only the compare the top-ranked elements. It is imperative results for the most popular category A61K. to note that RBO > RBO@k. For our case, we From Table 4, we observe a certain peculiar- empirically consider k = 20 and p = 0.9. ity. None of the top five patents ranked by the Tables 5, 6 and 7 present the RBO confusion Time-Attentive Ranking mechanism is a post-2000 matrix. We can clearly observe a pattern here. patent. This is interesting because it implies that The overlap between CiteRank and TimeAttentive while Time-Attentive rank gives more weightage ranked lists are certainly more than the overlap to recent citations, it does not bias towards recent (agreement) between PageRank and TimeAtten- patents, thus maintaining a balance between older tive Rank, which confirms our intuition that recent and newer patents. So, the top-ranked patent in all patents receive more preference in the weighted three cases is the same indicating that EP0776864 citation measures rather than unweighted citations is indeed the most important patent in category of PageRank. A61K. 5.2 USPTO Patents Metric for comparison of ranked lists: Since our For this collection, we adopt a different approach hypothesis hinges on the ranking of patents over for carrying out our experiments. The experiments a network metric based score, it is imperative that on the MAREC patents were solely based on net- the lists generated by PageRank and CiteRank and work flow metrics, which we could not assess TimeAttentiveRank will be different in their order- quantitatively due to the lack of a standard base- ing of elements (ranks). As the lists are quite long, line. Instead, for the US patents collections, we their scores are not directly comparable, and for a compare our approach against the state-of-the-art given depth d the two lists may not even have the Re-scaled PageRank method proposed by Mari- same set of elements, we will have to resort to in- ani et al. (2019) to identify milestone patents. As definite ranking (Webber et al., 2010). To this end, a second objective, we wanted to determine the we employ rank-biased overlap (RBO) to measure value added by textual content in determining a the similarity and agreement between the two lists. patent’s worth. This objective stems from similar The RBO values for the year 2008 compared over studies on patents where it was shown that exploit- the complete list of ranked results is presented in ing the multimodal nature of patents yields better Table 7. The Rank-Biased Overlap is defined as in prediction performance (Chakraborty et al., 2020). Equation 6. For this purpose, we adapt the deep learning ap- ∞ X proach proposed by Chung and Sohn (2020). Due RBO(S, T, p) = (1 − p) pd−1 .Ad (6) to the incompatibility of NLP based approach pro- d=1 posed by Chung and Sohn (2020) and network flow where S and T are two indefinite ranked lists. p metrics-based approaches such as the one by Mari- stands for user’s persistence, which determines ani et al. (2019) and ourselves in this paper, we only how steep is the decline in weights: the smaller p, adopt the deep learning approach (DEP-net) to de- the more top-weighted is the metric. Ad , agreee- termine a patent’s grade, which is another measure of patent’s importance. As per Chung and Sohn to uncover these significant patents. The complete (2020), a patent’s quality is assigned one of three list of these patents can be found in Appendix C grades (A, B, or C) based on the average number of (Strumsky and Lobo, 2015). Presence in the of forward citations per year. The deep learning list of significant patents by Strumsky and Lobo approach is briefly summarised below: is a binary variable: a patent is either in the list or not. We can therefore study the ability of the • A patent grade (A, B, or C) is assigned based metrics to rank these outstanding patents as high as on a threshold determined by the average for- possible, in agreement with the objectives of this ward citations accrued per year by the patent. paper. While there are 175 significant patents in • Textual content (abstract and claim) from the the Strumsky-Lobo list, we restrict our analysis to patent data is extracted along with several those patents that were issued within our dataset’s other indices such as number of claims, num- temporal span and remove those that are absent in ber of inventors, number of backward cita- our dataset. This leaves us with M = 112 significant tions, number of IPCs, etc. patents. • Abstract and claims are transformed (vector- 5.2.2 Comparison against baselines ized) into word embeddings as matrices. In this section we inspect the top-ranked patents. For simplicity, we focus on the top-10 patents as • A deep neural network composed of Bi-LSTM ranked by PageRank P and Re-scaled PageRank layer is added to the CNN structure using mul- R and our Time-AttentiveRank T scores (Table tiple filters that fuses the four components (ab- 8). From Table 8, we can observe that the top-10 stract, claims, indices, network-metric score) patents by Re-scaled PageRank span a wider tem- as input to train and evaluate a patent’s quality. poral range (1942–2010) than the top-10 by PageR- Finally, we also evaluate the patent quality for ank (1942–1996), which is a direct consequence of test data. the age-bias removal. The same temporal span is It is to be noted that we add an extra component to retained by the Time-Attentive Rank as well. How- the original model proposed by Chung and Sohn ever, it is noteworthy that our proposed method can (2020), i.e., the network-metric score. Both the re- pick more (3) patents from Strumsky-Lobo’s list scaled PageRank score (R) and the Time-Attentive of significant patents. Among the ten top-ranked Rank score are fed separately as inputs to the deep patents, two are from 2010 (the last year in the neural model. To simplify things, we retained the dataset) and received only one citation. This hap- parametric setting of the neural model as proposed pens because only a few among the most recent by Chung and Sohn (2020). Finally, the features patents received citations, which results in tempo- from the abstracts, claims, indices and network- ral windows with a large fraction of patents with flow metrics are fused and used as inputs to the zero citations. Thus, within such a temporal win- fully connected layer. The loss function was cross- dow, a patent can achieve large T score thanks to entropy, and the activation function was softmax. one single citation. A possible solution for this We label this model as DEP-netPlus (as we add issue is to only include the patents whose temporal value to the DEP-net model). windows contain a certain minimal number of in- coming citations. Another observation is that both 5.2.1 Expert-selected historically significant the Re-scaled PageRank and Time-Attentive rank patents do not necessarily rank patent with grade A in a Strumsky and Lobo (2015) listed 175 patents care- higher position, so the ranking is not solely depen- fully selected “on the basis of consultation with dent on the citation count but also on the network engineers, scientists, historians, textbooks, maga- structure. zine articles, and internet searches”. The patents in the list “all started technological pathways which 5.2.3 Performance comparison against affected society, individuals and the economy in DEP-net a historically significant manner” (Strumsky and To illustrate the importance of including network- Lobo, 2015). These significant patents thus pro- flow based metric as a component, we performed vide a good “ground-truth” set of patents that can the patent grade classification as described in be used to discern the ability of different metrics (Chung and Sohn, 2020). We used the same dataset Table 8: Top-10 patents ranked by Network-metric scores. Asterisks mark the Strumsky-Lobo significant patents. Rank PatID Title Date #Citations Grade 1 4683195 Process for amplifying, detecting, and/or-cloning nucleic acid se- 28-07-1987 1956 A quences 2 4683202 Process for amplifying nucleic acid sequences 28-07-1987 2169 A 3 4237224 (*) Process for producing biologically functional molecular chimeras 02-12-1980 285 B 4 4395486 Method for the direct analysis of sickle cell anemia 26-07-1983 71 B PageRank score P 5 4723129 Bubble jet recording method and apparatus in which a heating element 02-02-1988 1962 A generates bubbles in a liquid flow path to project droplets 6 3813316 Microorganisms having multiple compatible degradative energy- 28-05-1974 16 C generating plasmids and preparation thereof 7 5536637 Method of screening for cDNA encoding novel secreted mammalian 16-06-1996 422 A proteins in yeast 8 4558413 Software version management system 10-12-1985 1956 A 9 4358535 Specific DNA probes in diagnostic microbiology 09-11-1982 436 A 10 2297691 SElectrophotography 06-10-1942 588 B 1 7764447 Optical element holding device, lens barrel, exposing device, and device 27-7-2010 1 C producing method 2 4237224 (*) Process for producing biologically functional molecular chimeras 02-12-1980 285 B Re-scaled PageRank score R 3 2297691 Electrophotography 06-10-1942 588 B 4 7749477 Carbon nanotube arrays 06-07-2010 1 C 5 7784029 Network service for modularly constructing a software defined radio 24-08-2010 1 C 6 5536637 Method of screening for cDNA encoding novel secreted mammalian 16-07-1996 422 A proteins in yeast 7 4683195 Process for amplifying, detecting, and/or-cloning nucleic acid se- 28-07-1987 1956 A quences 8 5523520 Mutant dwarfism gene of petunia 04-06-1996 1139 A 9 4395486 Method for the direct analysis of sickle cell anaemia 26-07-1983 71 B 10 4683202 Process for amplifying nucleic acid sequences 28-07-1987 2169 A 1 4683202 Process for amplifying nucleic acid sequences 28-07-1987 2169 A 2 4237224 (*) Process for producing biologically functional molecular chimeras 02-12-1980 285 B 3 2297691 Electrophotography 06-10-1942 588 B TimeAttentiveRank score T 4 D268584 (*) Personal computer 12-04-1983 3 C 5 7749477 Carbon nanotube arrays 06-07-2010 1 C 6 7784029 Network service for modularly constructing a software defined radio 24-08-2010 1 C 7 5536637 Method of screening for cDNA encoding novel secreted mammalian 16-07-1996 422 A proteins in yeast 8 5225539 (*) Using recombinant DNA to produce an altered antibody 06-07-1993 549 A 9 4683195 Process for amplifying, detecting, and/or-cloning nucleic acid se- 28-07-1987 1956 A quences 10 4395486 Method for the direct analysis of sickle cell anemia 26-07-1983 71 B Table 9: Performance matrix for DEP-netPlus. Best results are marked in bold. DEP-net DEP-netPlus Measure A grade (%) B grade (%) C grade (%) A grade (%) B grade (%) C grade (%) Precision 78.00 51.48 74.85 79.03 52.14 75.01 Recall 75.53 46.65 73.22 74.67 45.98 73.30 F-measure 76.74 48.95 74.03 76.84 49.06 74.15 of 296,933 USPTO patents pertaining to “semicon- superiority of our model in capturing not only the ductor” technology collected within the temporal “importance” of a patent but also in evaluating the span of 2000 to 2015. We carried out the same pre- patent’s grade. processing steps along with down-sampling of the data or certain classes to maintain uniformity. The 6 Conclusion and Future Work results of experiments performed with an additional In this paper, we proposed a method to proactively component, i.e., our proposed TimeAttentiveRank identify milestone patents that have been granted score to the deep learning model which we refer to in recent years. We compared the performance as DEP-netPlus are presented in Table 9. of three network-flow algorithms for this purpose From the table, we can clearly observe that the on two different datasets. On the second dataset, classification model is enhanced by the inclusion we used a deep-learning-based approach to fuse of a network flow metric that account for the net- patent content along with network flow metrics, to work effect due to citations. This also confirms the compare against state-of-the-art and discovered that our proposed approach results in better Phillip Bonacich. 1987. Power and centrality: A fam- performance both in identifying “milestone” ily of measures. American Journal of Sociology, 92(5):1170–1182. patents as well as improving the patent grade prediction. From the experimental results, we Sergey Brin and Lawrence Page. 1998. The anatomy summarily concluded that raw citation count is of a large-scale hypertextual web search engine. not enough to capture the importance of a patent Computer Networks and ISDN Systems, 30(1):107 – 117. Proceedings of the Seventh International World since it does not take into account the age of Wide Web Conference. citations. When accounted for the same using a balanced metric like Time-Attentive ranking, we Péter Bruck, István Réthy, Judit Szente, Jan Tobochnik, are guaranteed to identify potential patents that and Péter Érdi. 2016. Recognition of emerging technology trends: class-selective study of citations are likely to spur technological growth in the near in the u.s. patent citation network. Scientometrics, future. We also identified top patents per category 107(3):1465–1475. and sector, which can help in the identification Richard S. Campbell. 1983. Patent trends as a techno- of niche areas for innovation. Although patent logical forecasting tool. World Patent Information, retrieval is a recall-oriented task, these criteria 5(3):137 – 143. may also help in re-ranking the results against a keyword search for patents. Mark P. Carpenter, Francis Narin, and Patricia Woolf. 1981. Citation rates to technologically important patents. World Patent Information, 3(4):160 – 163. As part of our future work, we would like to study the importance of geographical location on Manajit Chakraborty, Seyed Ali Bahrainian, and Fabio Crestani. 2020. Forecasting patent growth by com- influential patents, such as the country they origi- bining time-series signals using covariance patterns. nated from, the citations received from other coun- In Proceedings of the Joint Conference of the Infor- tries and so on. We also plan to experiment with mation Retrieval Communities in Europe (CIRCLE the various granularity of time such as a month, 2020), Samatan, Gers, France, July 6-9, 2020, vol- ume 2621 of CEUR Workshop Proceedings. year, 5-year period, and so on. P. Chen, H. Xie, S. Maslov, and S. Redner. 2007. Acknowledgements Finding scientific gems with google’s pagerank al- gorithm. Journal of Informetrics, 1(1):8 – 15. We thank the anonymous reviewers for their valu- able comments. This work was partially supported Park Chung and So Young Sohn. 2020. Early de- by The Global Structure for Knowledge Networks tection of valuable patents using a deep learning model: Case of semiconductor industry. Technolog- project grant (No. 167326) under the SNSF Na- ical Forecasting and Social Change, 158:120146. tional Research Programme 75 “Big Data” (NRP 75). Giulio Cimini, Andrea Gabrielli, and Francesco Sylos Labini. 2014. The scientific competitiveness of na- tions. PloS one, 9(12):e113470. References Rafael A. Corre Doira and Preeta M. Banerjee. 2015. Zoltan J Acs, Luc Anselin, and Attila Varga. 2002. Measuring patent’s influence on technological evo- Patents and innovation counts as measures of re- lution: A study of knowledge spanning and subse- gional production of new knowledge. Research Pol- quent inventive activity. Research Policy, 44(2):508 icy, 31(7):1069 – 1085. – 521. Nisa Bakkalbasi, Kathleen Bauer, Janis Glover, and Leo Egghe. 2006. Theory and practise of the g-index. Lei Wang. 2006. Three options for citation tracking: Scientometrics, 69:131–152. Google scholar, scopus and web of science. Biomed- Roberto Fontana, Alessandro Nuvolari, Hiroshi ical Digital Libraries, 3(1):7. Shimizu, and Andrea Vezzulli. 2013. Reassessing Mark A Bedau, Andrew Buchanan, Devin W Chalmers, patent propensity: Evidence from a dataset of r&d C Cooper Francis, Norman H Packard, and Noah M awards, 1977-2004. Research Policy, 42(10):1780 – Pepper. 2011. Evidence in the patent record for the 1792. evolution of technology using citation and pagerank Adam B Jaffe, Manuel Trajtenberg, and Michael S Fog- statistics. In ECAL, pages 77–84. Citeseer. arty. 2000. The meaning of patent citations: Report Johan Bollen, Marko A. Rodriquez, and Herbert Van de on the nber/case-western reserve survey of patentees. Sompel. 2006. Journal status. Scientometrics, Technical report, National bureau of economic re- 69(3):669–687. search. M.M.S. Karki. 1997. Patent citation analysis: A policy evaluation of scientific activity. Computer Horizons analysis tool. World Patent Information, 19(4):269 – Cherry Hill, NJ. 272. Filippo Radicchi, Santo Fortunato, and Claudio Castel- Leo Katz. 1953. A new status index derived from so- lano. 2008. Universality of citation distributions: ciometric analysis. Psychometrika, 18(1):39–43. Toward an objective measure of scientific impact. Proceedings of the National Academy of Sciences, Leonid Kogan, Dimitris Papanikolaou, Amit Seru, and 105(45):17268–17272. Noah Stoffman. 2017. Technological Innovation, Resource Allocation, and Growth*. The Quarterly Zhuo-Ming Ren, Manuel Sebastian Mariani, Yi-Cheng Journal of Economics, 132(2):665–712. Zhang, and Matúš Medo. 2018. Randomizing grow- Wenhui Liao and Sriharsha Veeramachaneni. 2010. ing networks with a time-respecting null model. Unsupervised learning for reranking-based patent Physical Review E, 97(5):052311. retrieval. In Proceedings of the 3rd International Deborah Strumsky and José Lobo. 2015. Identifying Workshop on Patent Information Retrieval, PaIR ’10, the sources of technological novelty in the process pages 23–26, New York, NY, USA. ACM. of invention. Research Policy, 44(8):1445–1461. Manuel Sebastian Mariani, Matús̆ Medo, and François Lafond. 2019. Early identification of important Dylan Walker, Huafeng Xie, Koon-Kiu Yan, and Sergei patents: Design and validation of citation network Maslov. 2007. Ranking scientific publications using metrics. Technological Forecasting and Social a model of network traffic. Journal of Statistical Me- Change, 146:644–654. chanics: Theory and Experiment, 2007(06):P06010. Manuel Sebastian Mariani, Matúš Medo, and Yi-Cheng William Webber, Alistair Moffat, and Justin Zobel. Zhang. 2016. Identification of milestone papers 2010. A similarity measure for indefinite rankings. through time-balanced network centrality. J. Infor- ACM Trans. Inf. Syst., 28(4):20:1–20:38. metrics, 10(4):1207–1223. Xiaobing Xue and W. Bruce Croft. 2009. Automatic Jean-Francois Molinari and Alain Molinari. 2008. A query generation for patent search. In Proceedings new methodology for ranking scientific institutions. of the 18th ACM Conference on Information and Scientometrics, 75(1):163–174. Knowledge Management, CIKM ’09, pages 2037– 2040, New York, NY, USA. ACM. Francis Narin et al. 1976. Evaluative bibliometrics: The use of publication and citation analysis in the