<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Corresponding author.
†These authors contributed equally.
ekamateri@iee.ihu.gr (E. Kamateri); renukswamy.chikkamath@hm.edu (R. Chikkamath); msa@ihu.gr (M. Salampasis);
linda.andersson@artificialresearcher.com (L. Andersson); markus.endres@hm.edu (M. Endres)</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Enhancing patent retrieval using automated patent summarization⋆</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Eleni Kamateri</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Renukswamy Chikkamath</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Michail Salampasis</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Linda Andersson</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Markus Endres</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Artificial Researcher - IT GmbH</institution>
          ,
          <country country="AT">Austria</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Information and Electronic Engineering, International Hellenic University</institution>
          ,
          <addr-line>Sindos, 57400 Thessaloniki</addr-line>
          ,
          <country country="GR">Greece</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Hochschule München</institution>
          ,
          <addr-line>Loth str. 34, 80335 München</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <abstract>
        <p>Effective query formulation is a key challenge in long-document Information Retrieval (IR). This challenge is particularly acute in domain-specific contexts like patent retrieval, where documents are lengthy, linguistically complex, and encompass multiple interrelated technical topics. In this work, we present the application of recent extractive and abstractive summarization methods for generating concise, purpose-specific summaries of patent documents. We further assess the utility of these automatically generated summaries as surrogate queries across three benchmark patent datasets and compare their retrieval performance against conventional approaches that use entire patent sections.</p>
      </abstract>
      <kwd-group>
        <kwd>patent retrieval</kwd>
        <kwd>query formulation</kwd>
        <kwd>patent summarization</kwd>
        <kwd>big bird</kwd>
        <kwd>summary segment</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Drafting a representative abstract that accurately summarizes the core concepts of an invention is
an important step in the patenting process. A well-crafted abstract conveys the core concepts of an
invention, thereby enhancing both the readability and discoverability of a patent throughout its
lifecycle. For instance, integrating summaries into search snippets could reduce examiner search time
or help an inventor quickly grasp prior art. A good summary can also help a
patent professional evaluate the technical or legal scope of a patent. Furthermore, a good summary
that retains technical details and key claims could be used for downstream tasks such as prior-art search
and classification [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ].
      </p>
      <p>
        However, many human-authored patent abstracts do not summarize the invention effectively. This
shortcoming may arise from various factors, such as the urgency to submit the application, regulatory
constraints on abstract length, limited attention by inventors, and, last but not least, the intentional
vagueness often employed to avoid narrowing the scope of legal protection and reduce discoverability
in prior-art searches [
        <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
        ]. Consequently, relying directly on the patent abstract to produce a search
query for patent retrieval tasks is often ineffective.
      </p>
      <p>
        As a result, patent abstracts are often supplemented by human-selected keywords. Alternatively,
content is extracted from other sections of the patent, such as the description or claims, to produce
queries that will enhance retrieval performance [
        <xref ref-type="bibr" rid="ref5 ref6 ref7">5, 6, 7</xref>
        ]. To address this need, various methods have been
developed to automatically generate search queries from patent applications, employing simple intuitive
heuristics (e.g., the first X words), statistical techniques, language modeling, and other approaches. Some
of these methods enhance abstracts by directly incorporating content from the description and claims
sections [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Other methods aim to identify the most discriminative terms across different sections by
comparing term statistics within a given patent to those of a broader corpus, often leveraging language
modelling estimation techniques [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Query enrichment may also involve query terms extracted from
patent citations or classification codes [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
      </p>
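<p>To make the statistical flavor of these approaches concrete, the sketch below ranks a patent's terms against a background corpus with a simple smoothed log-ratio score. This is an illustrative simplification of the idea, not the exact estimator used in the cited work; the function name and smoothing scheme are our own.</p>

```python
from collections import Counter
import math

def discriminative_terms(patent_text: str, corpus_texts: list, top_k: int = 5):
    """Rank terms by how much more frequent they are in one patent
    than in a background corpus (a simple smoothed log-ratio score)."""
    patent_tf = Counter(patent_text.lower().split())
    corpus_tf = Counter()
    for doc in corpus_texts:
        corpus_tf.update(doc.lower().split())
    n_patent = sum(patent_tf.values())
    n_corpus = sum(corpus_tf.values())
    scores = {}
    for term, tf in patent_tf.items():
        p_patent = tf / n_patent
        # add-one smoothing so terms unseen in the corpus do not divide by zero
        p_corpus = (corpus_tf[term] + 1) / (n_corpus + len(patent_tf))
        scores[term] = p_patent * math.log(p_patent / p_corpus)
    return [t for t, _ in sorted(scores.items(), key=lambda x: -x[1])[:top_k]]
```

<p>Terms that are frequent in the patent but rare in the corpus (e.g., technical vocabulary) receive the highest scores, while common function words are pushed down the ranking.</p>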
      <p>
        When using LLMs, there are two basic approaches for handling large documents (such as patents) in
text retrieval and other text-related tasks. The first is document chunking, which overcomes the token limits of
LLMs. The second is document summarization, which aims to preserve the semantic flow and concepts
of a long document, yielding a reduced text representation for efficient processing. AI-generated
summaries are increasingly adopted across domains for their conciseness and informativeness. In the
patent domain, however, their use has been primarily explored in classification tasks [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], where recent
studies show that classifiers trained on generated summaries outperform those trained on original
abstracts. Beyond classification, there is a growing need for purpose-specific summaries tailored to other
downstream tasks. These summaries can take different forms depending on what the task requires (e.g.,
abstract, extended summary, or first claim), the summarization approach (e.g., abstractive or extractive),
the input source (e.g., description, claims, or brief description), and the output length (e.g., 50 to 300 words).
One promising application is the patent retrieval task, where high-quality, right-sized summaries
can serve as search queries that contain meaningful context without being so large that they overwhelm
the model.
      </p>
      <p>Building on these considerations, this study presents a pipeline for automated patent summarization.
Furthermore, it examines the effectiveness of these automatically generated summaries as search
queries for prior-art search, a critical task in global patent operations. We employ state-of-the-art
language models and semantically rich patent document segments to generate both extractive and
abstractive summaries, with the main goal of improving retrieval performance. We assess multiple input
source combinations and summarization approaches across three patent datasets, determining which
configurations produce the most informative summaries for retrieval tasks. Our results show that
AI-generated summaries, when used as queries, consistently outperform traditional strategies
that rely on full patent sections.</p>
      <p>The remainder of the paper is organized as follows: Section 2 and Section 3 detail the semantically
important segments found in the patent description and present summarization techniques. Section 4
outlines the methodology adopted in our study. In Section 5 we present the experimental results, while
in Section 6 we discuss the findings and conclude the paper.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Patent description for search</title>
      <p>
        Among the sections of a patent document, the description is consistently identified as the most
informative and valuable source for query generation [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. As the longest component, it offers a detailed
elaboration of the proposed invention, often extending to several thousand words [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. However, the
absence of a standardized structure or mandated format for organizing the patent description complicates
automatic processing. This challenge is partially mitigated by the widespread use of conventional
headings, such as Background, Summary of the Invention, Brief Description of the Drawings, and
Detailed Description of the Invention. These headings are commonly adopted by patent applicants to
organize the description into semantically coherent segments.
      </p>
      <p>
        Notably, the combination of the Background and Summary of the Invention, which are collectively
referred to as the “background summary”, has proven to be the most effective source for extracting
high-value query terms from U.S. patent documents [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. The practical importance of these segments
has led to increased interest in their standardization. In response, the USPTO has expanded its public API
to provide direct access to key description sub-sections, such as the Background and Brief Description. The
Brief Description, labeled as “brief” in the USPTO API, spans from the beginning of the description to
the end of the Summary of the Invention segment, effectively capturing what is traditionally known as
the “background summary”.
      </p>
      <p>
        Similarly, the Harvard USPTO Patent Dataset (HUPD) [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] uses the semi-structured nature of USPTO documents and
simple heuristics to extract more meaningful segments, such as the Summary of the Invention. This segment
offers a more comprehensive and informative description of the invention, substantially augmenting the
abstract’s content while improving classification performance [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ].
      </p>
      <p>Although these segments enhance patent retrieval and classification, they appear inconsistently across
patent documents. For instance, these or corresponding headings often do not exist in European
patents. This structural inconsistency further underscores the motivation for automated methods capable
of generating summary-like content (resembling the USPTO’s Summary of the Invention), particularly in
documents that lack clearly defined segments in the description. To that end, summarization techniques
play a key role in bridging this gap by producing coherent, informative summaries.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Patent summarization</title>
      <p>Extractive and abstractive summarization represent two distinct approaches to generating summaries from
text. Extractive summarization uses statistical or neural models to select the most relevant sentences
directly from the source text, preserving the original wording. In contrast, abstractive summarization uses
transformer models to generate a condensed version of the content by rephrasing or synthesizing
information using natural language generation techniques, producing more coherent and readable
summaries. Finally, hybrid approaches may combine the accuracy of extractive methods (selecting
existing text) with the fluency of abstractive methods (refining the previously extracted text).</p>
      <p>
        Typically, extractive models work by generating sentence-level embeddings, clustering these
embeddings, and selecting the sentences nearest to the cluster centroids as the most representative.
State-of-the-art models like BERT [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] and its variant SBERT [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] are widely used for this purpose.
These models effectively extract the most informative sentences without altering them, maintaining the
original phrasing and structure of the source document.
      </p>
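<p>A minimal sketch of this centroid-based extractive selection is shown below. To stay self-contained it replaces the BERT/SBERT encoder with a bag-of-words stand-in and uses a single document centroid instead of multiple clusters; with real sentence embeddings, the selection step is the same.</p>

```python
import math
import re
from collections import Counter

def embed(sentence: str) -> Counter:
    # Stand-in for a BERT/SBERT sentence encoder: a bag-of-words vector.
    return Counter(re.findall(r"[a-z]+", sentence.lower()))

def cosine(u: Counter, v: Counter) -> float:
    dot = sum(u[w] * v[w] for w in u)
    nu = math.sqrt(sum(c * c for c in u.values()))
    nv = math.sqrt(sum(c * c for c in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0

def extractive_summary(text: str, k: int = 2) -> str:
    """Select the k sentences closest to the document centroid,
    keeping their original order (single-cluster simplification)."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    vectors = [embed(s) for s in sentences]
    centroid = Counter()
    for v in vectors:
        centroid.update(v)
    ranked = sorted(range(len(sentences)), key=lambda i: -cosine(vectors[i], centroid))
    chosen = sorted(ranked[:k])  # restore document order
    return " ".join(sentences[i] for i in chosen)
```

<p>Because the selected sentences are copied verbatim, the output retains the source document's exact terminology, which matters for patent vocabulary.</p>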
      <p>
        Abstractive summarization models, on the other hand, follow an encoder-decoder architecture: they
encode the input text, generate a summary through a decoding process, and produce a fluent, often
restructured, output [
        <xref ref-type="bibr" rid="ref17 ref18">17, 18</xref>
        ]. Notable models in this category include PEGASUS [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ], T5 [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ], and
BART [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ], which have demonstrated strong performance on long-document summarization tasks.
Although GPT-based models [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] also demonstrate strong performance in abstractive summarization,
their closed-source nature and token-based pricing present practical limitations for large-scale use.
      </p>
      <p>
        In the patent domain, recent research has primarily focused on generating summaries from the
description and/or claims sections using Large Language Models (LLMs) [
        <xref ref-type="bibr" rid="ref13 ref23">13, 23</xref>
        ]. We refer to these
outputs as automated summaries, to distinguish them from the human-authored "Summary of the
Invention" segments found within the description (hereafter referred to as summary segments). A
number of previous studies have fine-tuned pre-trained language models on patent datasets. For
instance, one early work [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ] trained Seq2Seq [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ], PointGenerator [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ], and SentRewriting [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ] models
on the BIGPATENT dataset, using the description as input and the abstract as the target output.
BigBird-Pegasus, a long-sequence transformer model, was later fine-tuned on BIGPATENT for improved
summarization of patent texts [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ]. Similarly, the work in [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] adapted two versions of the T5 model
for patent data, using either the claims or the description sections to generate abstracts. Another
study reported in [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ] investigated which patent sections are most informative for generating the first
independent claim (also referred to as the first claim), using PEGASUS and PointGenerator models,
concluding that the summary segment is the most suitable input source for this task. These efforts
highlight the importance of fine-tuning and domain adaptation, as general-purpose transformer models
often struggle with the technical and legal precision required in the patent domain. Moreover, although
recent LLMs show considerable promise in processing long patent descriptions (such as GPT-4 [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ] and
Llama-3.1 [
        <xref ref-type="bibr" rid="ref31">31</xref>
        ]), their capabilities have not been extensively investigated in the context of
patent-related tasks [
        <xref ref-type="bibr" rid="ref32">32</xref>
        ]. By examining both the structural characteristics of patent documents and the current
state of research on patent summarization, we have identified several open issues that merit further
investigation to improve the quality and utility of generated summaries:
        <list list-type="order">
          <list-item>
            <p>Existing models are typically not trained on the full patent text. Due to token length constraints, typically limited to 512 or 1,024 tokens in standard models such as T5, PEGASUS, and BART, and extended to 4,096 tokens or more in models like BigBird-Pegasus, the input text often needs to be truncated, segmented, or adapted. This can hinder the model’s ability to fully capture contextual understanding.</p>
          </list-item>
          <list-item>
            <p>Evaluation commonly relies on existing abstracts as ground-truth summaries, despite their frequent shortcomings in terms of clarity, completeness, and informativeness.</p>
          </list-item>
          <list-item>
            <p>A strong semantic alignment exists between the first claim and the summary segment; however, this relationship remains underexplored in current summarization approaches.</p>
          </list-item>
        </list>
      </p>
      <p>These limitations highlight the need for summarization strategies that are specifically tailored to
the structure and practical use cases of patent documents. In particular, for retrieval tasks such as
prior-art search, where the generated summary serves as the retrieval query, it is essential to
adopt summarization approaches that leverage high-value sections, or combinations of them, to generate
summaries that attain high retrieval performance. To operationalize this approach, our study follows a
pipeline that first extracts key patent segments, then trains summarization models, and finally evaluates
the effectiveness of the automated summaries in prior-art retrieval across multiple benchmark datasets.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Methodology</title>
      <p>This study hypothesizes that generated summaries can improve the efficiency of prior-art search. To
validate this hypothesis, we designed a five-stage experimental workflow, beginning with summary
generation and ending with an evaluation of their effectiveness as retrieval queries. The performance
of these summaries as queries is compared against the sections (abstract, claims, and description)
commonly used by patent professionals to formulate queries. In the following sections, we describe the
data collections, detail each stage of the methodology, and explain the evaluation process used to
assess both the intrinsic quality of the generated summaries and their impact on retrieval performance.</p>
      <sec id="sec-4-1">
        <title>4.1. Data collections</title>
        <p>
          We utilize four patent datasets, each serving a distinct role. The first dataset, HUPD, allows the
extraction of salient sections from patent documents. The second dataset, BIGPATENT, is employed
for the intrinsic evaluation of the generated summaries, i.e., to measure the quality of the automated
summaries compared to reference summaries. Finally, the last two datasets, CLEF-IP 2013 and USPTO,
are used for extrinsic evaluation, applying summaries to prior-art search.
        </p>
        <sec id="sec-4-1-1">
          <title>4.1.1. HUPD [<xref ref-type="bibr" rid="ref13">13</xref>]</title>
          <p>The HUPD is a large-scale, structured corpus of English-language utility patent applications filed with
the USPTO between 2004 and 2018. Each JSON-formatted entry contains rich metadata, including
bibliographic details, classification codes, inventor information, and full-text fields such as abstract,
claims, background, summary, and description.</p>
        </sec>
        <sec id="sec-4-1-2">
          <title>4.1.2. BIGPATENT [<xref ref-type="bibr" rid="ref24">24</xref>]</title>
          <p>The BIGPATENT dataset is a large-scale patent summarization benchmark comprising approximately 1.3
million U.S. patent documents. It pairs the description section with its corresponding abstract, serving
as ground truth for training and fine-tuning summarization models. For our intrinsic evaluation, we
randomly sampled 1,000 patents from the 67,072 available in the test set.</p>
        </sec>
        <sec id="sec-4-1-3">
          <title>4.1.3. CLEF-IP [<xref ref-type="bibr" rid="ref33">33</xref>]</title>
          <p>The CLEF-IP collection consists of patent documents sourced from the EPO and WIPO. The English topic
set from the CLEF-IP 2013 campaign originally comprises 50 topics [<xref ref-type="bibr" rid="ref34">34</xref>]. However, because the topics
are based on patent claims rather than individual documents, there is no strict one-to-one mapping
between topics and documents. In total, these topics correspond to 37 unique documents. Due to
missing relevant documents for some topics in the indexed dataset, we further reduced the set to 24
English-language patents for our experiments. Each topic patent is associated with between 2 and 8
manually identified relevant documents, based on expert-curated citation links, making this dataset a
reliable benchmark for evaluating prior-art retrieval performance.</p>
        </sec>
        <sec id="sec-4-1-4">
          <title>4.1.4. USPTO-Explainable AI for Patent Professionals [<xref ref-type="bibr" rid="ref35">35</xref>]</title>
          <p>This USPTO dataset was released as part of a Kaggle competition aimed at advancing explainable AI in
the patent domain. Each topic patent is associated with a set of its 50 most similar patents, identified using
content similarity measures rather than citation-based relevance. From this dataset, we selected 3,343
topic patents in which semantically coherent segments were automatically detected. Unlike CLEF-IP,
which relies on citation-based ground truth, this benchmark provides an opportunity to test retrieval
performance under automated similarity-based relevance.</p>
        </sec>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Patent part extraction</title>
        <p>The first step in our pipeline focuses on identifying and extracting sections of each patent to serve as
query sources or as input sources for summarization. Specifically, we use the description and claims
sections and, when identifiable, the brief description, summary segment, and first claim.</p>
        <p>To detect these segments, we utilized the HUPD dataset, which is currently the only resource
providing annotated labels (i.e., tags) for the background and summary segments within the description
section of US patent documents. Based on these annotations, we constructed a dictionary of relevant
summary headings, which we then used as a reference to identify candidate headings in unannotated
patents. For each heading labeled as a summary heading, the subsequent content was marked as a
summary segment. Once the summary segment was identified, the brief description was also derived by
selecting the text spanning from the beginning of the description section up to the end of the summary.
Finally, the first claim was extracted using heuristic rules, specifically by identifying the first claim that
does not depend on any previous claim.</p>
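<p>The heading-dictionary matching and first-claim heuristic described above can be sketched as follows. The heading list, the all-caps heading test, and the dependency regex are illustrative assumptions; the actual dictionary is built from the HUPD annotations.</p>

```python
import re

# Illustrative heading dictionary; the paper derives this from HUPD annotations.
SUMMARY_HEADINGS = {"summary", "summary of the invention", "brief summary of the invention"}

def split_description(description: str):
    """Return (brief_description, summary_segment) via heading matching.

    The summary segment is the text under a summary-like heading; the brief
    description spans from the start of the description to the end of the summary."""
    lines = description.splitlines()
    for i, line in enumerate(lines):
        if line.strip().lower().rstrip(":") in SUMMARY_HEADINGS:
            # The summary ends at the next heading-like (all-caps) line.
            end = next((j for j in range(i + 1, len(lines))
                        if lines[j].strip() and lines[j].strip() == lines[j].strip().upper()),
                       len(lines))
            summary = "\n".join(lines[i + 1:end]).strip()
            brief = "\n".join(lines[:end]).strip()
            return brief, summary
    return description, ""

def first_independent_claim(claims: list) -> str:
    # Heuristic: the first claim that does not reference a previous claim.
    for claim in claims:
        if not re.search(r"according to claim|of claim \d+", claim, re.I):
            return claim
    return claims[0] if claims else ""
```
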
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Patent summarization</title>
        <p>
          In this phase, we employ three summarization models: BERT [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ], SBERT [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ] and BigBird-Pegasus
[
          <xref ref-type="bibr" rid="ref28">28</xref>
          ] (hereafter referred to as BigBird). BERT and SBERT are utilized for extractive summarization,
focusing on identifying the most relevant sentences from the input text. BigBird, which has been
pre-trained on the BIGPATENT dataset, serves as our primary abstractive summarization model, able to
handle long-form patent text effectively.
        </p>
        <p>For BERT and SBERT, we used the models’ default configurations. Specifically, for SBERT, we employed
the “paraphrase-MiniLM-L6-v2” model. For BigBird, we used a model fine-tuned on the BIGPATENT
dataset. Two configuration variants of the BigBird model were explored: the default configuration,
which generates relatively short summaries (typically between 50 and 100 words, depending on the
input), and a modified version, where the model’s generation parameters were adjusted (i.e., the length
penalty and minimum/maximum length settings) to produce longer summaries ranging from 250 to 300
words.</p>
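<p>In Hugging Face terms, the two BigBird variants differ only in their generation settings (length penalty and minimum/maximum length). The values below are assumptions chosen to match the reported output lengths (lengths are in tokens, roughly 1.3 tokens per word), not the exact configuration used in our experiments.</p>

```python
def generation_config(long_summaries: bool) -> dict:
    """Illustrative decoding settings for a BigBird-Pegasus checkpoint.

    All numeric values are assumptions matching the described word ranges."""
    if long_summaries:
        # Adjusted variant: push outputs toward roughly 250-300 words.
        return dict(min_length=320, max_length=420, length_penalty=2.0, num_beams=5)
    # Default variant: roughly 50-100-word summaries.
    return dict(min_length=40, max_length=160, length_penalty=0.8, num_beams=5)

# Usage with transformers (requires the checkpoint; not run here):
# summary_ids = model.generate(inputs["input_ids"], **generation_config(True))
```

<p>A length penalty above 1.0 rewards longer beams, which, together with the raised minimum length, steers the decoder toward the extended summaries used in our adjusted configuration.</p>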
        <p>In this study, the BigBird model is further fine-tuned to generate summaries using the brief description
and first claim as inputs, which are two sections identified as particularly informative within patent
documents. The target output for this fine-tuning process is the summary segment. This exploration
aims to set the foundation for future research on fine-tuning summarization models to replicate other
valuable parts of patent text, such as the extended, author-crafted summary segments found within the
description section.</p>
        <p>To achieve this, a new dataset, a subset of the HUPD dataset, was specially created. Specifically,
we extracted from the HUPD dataset 402,921 patents that have a distinct summary segment with a
length between 150 and 250 words. Then, we extracted the brief description and first claim and selected
those patents whose brief description and first claim together had a length between 700 and 800 words.
This selection criterion allows us to skip any adjustment steps on the input text during training, such as
truncation or chunking, which may negatively affect the model’s interpretation. All these steps led to a
dataset comprising 48,322 patents, which was finally used to fine-tune the BigBird model. An overview
of the summarization and training parameters is shown in Table 1.</p>
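<p>The length-based filtering that produced this fine-tuning set can be sketched as below; the field names are illustrative, not the actual HUPD record keys.</p>

```python
def word_count(text: str) -> int:
    return len(text.split())

def select_training_pairs(patents):
    """Filter HUPD-style records into (input, target) pairs for fine-tuning.

    Keeps patents whose summary segment is 150-250 words and whose combined
    brief description + first claim is 700-800 words, so no truncation or
    chunking of the input is needed during training."""
    pairs = []
    for p in patents:
        summary = p.get("summary", "")
        source = p.get("brief_description", "") + " " + p.get("first_claim", "")
        if 150 <= word_count(summary) <= 250 and 700 <= word_count(source) <= 800:
            pairs.append((source.strip(), summary))
    return pairs
```
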
      </sec>
      <sec id="sec-4-4">
        <title>4.4. Patent retrieval</title>
        <p>
          For the patent retrieval task, we use the FAISS vector database to store and retrieve semantic vectors.
Both queries and patent documents are embedded using the GTE-large-en-v1.5 model [
          <xref ref-type="bibr" rid="ref36">36</xref>
          ], hereafter
referred to as GTE, which has approximately 409 million parameters, a 1,024-dimensional embedding
size, and supports input lengths up to 8,192 tokens. This enables the generation of rich embeddings that
effectively represent full patents. The model achieved state-of-the-art performance on the Massive Text
Embedding Benchmark (MTEB) within its size category, making it well-suited for our application. To
reduce hardware complexity, we limit the input length to 3,000 tokens, which is sufficient to capture both
independent and dependent claims; these embeddings serve as the corpus embeddings. While alternative embedding
models and strategies, such as using different sections (e.g., abstracts, descriptions) or specific segments
(e.g., brief descriptions, summaries) as corpus representations, offer promising avenues for exploration,
we leave these investigations for future work.
        </p>
        <p>Since our primary goal is to assess the impact of generated summaries on prior-art retrieval, we
restrict the vector index to a subset of 200,000 patent documents drawn from our two prior-art datasets.
Retrieval performance is then compared against simpler methods that use entire patent sections or
extracted segments as queries. Given that our summarization pipeline is based on semantic models, and
our aim is to isolate the contribution of the generated summaries from the retrieval technique itself, we
focus exclusively on embedding-based retrieval rather than keyword-based approaches. The
integration of additional retrieval methods is left for future work.</p>
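<p>Conceptually, the retrieval step reduces to a nearest-neighbor search over normalized embeddings. The sketch below implements it as a brute-force inner product, which is what a FAISS IndexFlatIP over L2-normalized vectors computes; in the actual pipeline the GTE embeddings are stored in FAISS rather than a Python dict.</p>

```python
import math

def normalize(v):
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v] if n else v

def search(corpus_vectors, query_vector, top_k: int = 5):
    """Brute-force cosine search over normalized vectors, mirroring a
    FAISS IndexFlatIP over L2-normalized patent embeddings."""
    q = normalize(query_vector)
    scored = []
    for doc_id, vec in corpus_vectors.items():
        d = normalize(vec)
        scored.append((sum(a * b for a, b in zip(q, d)), doc_id))
    scored.sort(reverse=True)
    return [doc_id for _, doc_id in scored[:top_k]]
```

<p>Swapping the query vector between a full-section embedding and a summary embedding, while keeping the corpus index fixed, is exactly the comparison our experiments perform.</p>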
      </sec>
      <sec id="sec-4-5">
        <title>4.5. Intrinsic and extrinsic evaluation</title>
        <p>To evaluate the effectiveness of the generated summaries, we perform two types of evaluation: intrinsic
and extrinsic.</p>
        <p>
          For the intrinsic evaluation, the automatically generated summaries are compared against reference
summaries, either the original patent abstract or the annotated summary segment (which has been
shown to provide an extended and improved summary). This evaluation aims to determine how
accurately the generated summaries capture the key content of the patents. For this purpose, we
use ROUGE scores [
          <xref ref-type="bibr" rid="ref37">37</xref>
          ] to assess textual overlap and compute semantic similarity, calculated as the
cosine similarity between embeddings produced by Google’s BERT-for-Patent model [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ], comparing
the generated and reference summaries.
        </p>
        <p>For the extrinsic evaluation, we measure the impact of using the generated summaries as queries in a
prior-art search task. Specifically, we compare their retrieval performance against traditional query
strategies using standard IR metrics, including Mean Average Precision (MAP), Precision (P), and Recall
(R).</p>
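<p>For reference, ROUGE-1 (the simplest member of the ROUGE family) reduces to unigram-overlap precision, recall, and F1; a minimal implementation is given below. In practice an established ROUGE package is used, and the semantic score is a cosine over BERT-for-Patent embeddings rather than word counts.</p>

```python
from collections import Counter

def rouge1_f(candidate: str, reference: str) -> float:
    """Unigram-overlap ROUGE-1 F1 between a generated and a reference summary."""
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    overlap = sum((cand & ref).values())  # clipped unigram matches
    if not overlap:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)
```
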
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Results</title>
      <sec id="sec-5-1">
        <title>5.1. Evaluation of summary generation in the BIGPATENT dataset</title>
      </sec>
      <sec id="sec-5-2">
        <title>5.2. Evaluation of prior-art retrieval in the CLEF-IP 2013 dataset</title>
        <p>
          For the CLEF-IP 2013 dataset, we followed the TREC-based guidelines provided by CLEF-IP [
          <xref ref-type="bibr" rid="ref33">33</xref>
          ], using TRecTools
[
          <xref ref-type="bibr" rid="ref38">38</xref>
          ] to calculate Precision and Recall at various cut-off levels (@5, @10, and @30), as well as MAP@100.
        </p>
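<p>The metrics themselves are standard; a minimal sketch of P@k, R@k, and average precision (which TRecTools computes for us) is:</p>

```python
def precision_at_k(ranked, relevant, k):
    return sum(1 for d in ranked[:k] if d in relevant) / k

def recall_at_k(ranked, relevant, k):
    return sum(1 for d in ranked[:k] if d in relevant) / len(relevant)

def average_precision(ranked, relevant, cutoff=100):
    """AP truncated at a cutoff (MAP@100 averages this over topics).

    Denominator follows trec_eval: the number of relevant documents,
    capped at the cutoff."""
    hits, score = 0, 0.0
    for i, doc in enumerate(ranked[:cutoff], start=1):
        if doc in relevant:
            hits += 1
            score += hits / i
    return score / min(len(relevant), cutoff)
```
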
        <p>Table 3 reports the retrieval results of conventional query strategies, where entire patent sections
commonly used by professionals are employed verbatim as queries. We observe that queries formulated
using the claims text achieved the best retrieval results. This outcome is largely attributed to the fact
that the claims text was also used for generating the corpus embeddings (using GTE as the embedding
model), thereby ensuring a higher degree of semantic alignment between the query and the indexed
documents. While extracting patent segments for query formulation could offer valuable insights, it
was not feasible to implement this approach effectively, since the description sub-sections were not
consistently identifiable across all 24 topics.</p>
        <p>Table 4 presents the retrieval results when using automated summaries as query inputs. These
summaries were generated using various summarization methods and different patent sections as input
sources. The results demonstrate that queries formulated with these automated summaries consistently
outperform those based on standard patent sections. Overall, although the default summaries generated by the
BigBird model outperform the standard patent sections, they were found to be insufficient for
capturing the breadth of important patent content compared to the longer summaries generated by the
adjusted BigBird model, which achieve the best retrieval performance.</p>
        <p>Next come the SBERT-based summaries, particularly when the description text was used as input,
and the default BigBird summaries generated from the claims text. Although the BERT- and SBERT-based summaries
achieved good retrieval scores, especially when produced from the description, they often retained
much of the original text without adequately condensing it, which is crucial for overcoming the token
limitations of LLMs in text retrieval tasks.</p>
      </sec>
      <sec id="sec-5-3">
        <title>5.3. Evaluation of prior-art in the USPTO dataset</title>
        <p>In the USPTO dataset, we followed the USPTO Kaggle competition guidelines, computing MAP@50 as
the primary metric. For Precision and Recall, we applied the same cut-off levels used in the CLEF-IP
2013 evaluation (@5, @10 and @30) to ensure consistency and comparability across datasets.</p>
        <p>Table 5 presents retrieval results based on traditional patent sections commonly used as queries, such
as the abstract, claims, and description. It also includes results from using high-value segments, such as
the summary segment, brief description, and first claim, individually and in combination. Similarly,
both queries and the corpus were represented using the GTE embedding model.</p>
        <p>Interestingly, the queries formulated from high-value segments (e.g., the brief description, or the
summary segment/brief description combined with the first claim) consistently outperform those based
on conventional patent sections. This underscores the importance of targeted content selection in
enhancing retrieval effectiveness.</p>
        <p>Table 6, on the other hand, reports retrieval performance when automated summaries are used
as queries. Here, we observe that, depending on the summarization method and input, automated
summaries can significantly outperform their respective original patent sections.</p>
        <p>[Table data lost in extraction: average word counts of the query sources (109, 982 and 6,962 words) and the associated retrieval scores.]</p>
        <p>In particular, the adjusted BigBird model, which generates longer summaries of approximately 250-300
words compared to the default version, outperforms the default BigBird model in retrieval performance.
Furthermore, it achieves results that are comparable to, or slightly better than, those obtained using the
simpler query formulation techniques outlined in Table 5. Notably, this approach demonstrates strong
efficiency, as it achieves similar retrieval performance using generated summaries that are substantially
more concise than the original patent sections.</p>
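<p>The difference between the default and adjusted BigBird configurations above comes down to the generation-length settings passed at decoding time. A sketch of the parameter change follows; the token counts are illustrative of the roughly 250-300-word target, not the exact configuration used in our experiments:</p>

```python
# Decoding settings for a seq2seq summarizer such as BigBird-Pegasus.
# Lengths are in tokens and are illustrative, not the paper's exact values.
default_gen_kwargs = {
    "max_length": 256,   # default: comparatively short summaries
    "min_length": 32,
    "num_beams": 4,
}
adjusted_gen_kwargs = {
    "max_length": 512,   # adjusted: room for ~250-300-word summaries
    "min_length": 256,   # force the decoder past the short default
    "num_beams": 4,
}
# In a transformers-style pipeline this would be used roughly as:
# summary_ids = model.generate(input_ids, **adjusted_gen_kwargs)
```
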
        <p>Regarding extractive models, queries generated using SBERT summaries based on the description
text achieved the highest retrieval scores across all metrics. In contrast, queries generated from the
claims or brief description text performed worse than those using the original texts. Interestingly,
despite their strong retrieval performance, SBERT-generated summaries remain lengthy, averaging 807
words. This contrast highlights the importance of aligning summary generation with its intended
purpose, whether to enhance readability and support human assessment, or to optimize performance in
downstream tasks such as prior-art retrieval and classification.</p>
        <p>Note: *: Pre-trained BigBird (default), **: Adjusted pre-trained BigBird, FT: Fine-tuned BigBird.
[Table data lost in extraction: retrieval scores for the BERT, SBERT, and BigBird summary variants applied to the claims and description sections.]</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. Discussion and conclusion</title>
      <p>Overall, the findings of this study are promising, demonstrating that patent retrieval benefits from
using targeted patent segments (when detectable) and automated summaries as queries, compared
to relying solely on traditional sections typically employed by patent professionals, such as abstract,
descriptions or claims. Across both prior-art datasets, CLEF-IP and USPTO, automated summaries
consistently outperformed conventional query inputs. Among the summarization methods and input
configurations evaluated, the adjusted BigBird model using claims as input and the SBERT model
applied to the description section emerged as the most effective abstractive and extractive approaches,
respectively, yielding the highest retrieval performance across both datasets.</p>
      <p>Moreover, our initial experimental results support the hypothesis that summarization models can be
further adapted to produce comprehensive and contextually relevant summaries, although confirming
this will require more extensive validation. This approach presents a promising direction for future advancements
in patent summarization.</p>
      <p>
        This work was initially motivated by our participation in the European Patent Office’s (EPO) CodeFest
2024 competition [
        <xref ref-type="bibr" rid="ref39">39</xref>
        ], where it was selected as one of the top six finalists. Building on this foundation,
we aim to further advance our research on patent segmentation and summarization techniques by
evaluating their impact on patent retrieval performance across additional prior art test collections.
Additionally, exploring alternative corpus representations and integrating additional retrieval methods
will be key areas of focus in future work.
      </p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <p>This research work was supported by the Hellenic Foundation for Research and Innovation (HFRI)
under the HFRI PhD Fellowship grant (Fellowship Number: 10695).</p>
    </sec>
    <sec id="sec-8">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, the authors used GPT-4 for grammar and spelling checking, as
well as paraphrasing and rewording. After using these tools/services, the authors reviewed and edited the
content as needed and take full responsibility for the publication’s content.
The code used for running the experiments of this article will become available in a GitHub
repository.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M.</given-names>
            <surname>Lupu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Mayer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Tait</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. J.</given-names>
            <surname>Trippe</surname>
          </string-name>
          (Eds.),
          <source>Current Challenges in Patent Information Retrieval</source>
          , Springer,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Y.-H.</given-names>
            <surname>Tseng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.-J.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.-I.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <article-title>Text mining techniques for patent analysis</article-title>
          ,
          <source>Information Processing &amp; Management</source>
          <volume>43</volume>
          (
          <year>2007</year>
          )
          <fpage>1216</fpage>
          -
          <lpage>1247</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>S.</given-names>
            <surname>Bashir</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Rauber</surname>
          </string-name>
          ,
          <article-title>Improving retrievability of patents in prior-art search</article-title>
          ,
          <source>in: Proceedings of the European Conference on Information Retrieval (ECIR)</source>
          ,
          <year>2010</year>
          , pp.
          <fpage>457</fpage>
          -
          <lpage>470</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A.</given-names>
            <surname>Ali</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Tufail</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. C.</given-names>
            <surname>De Silva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. E.</given-names>
            <surname>Abas</surname>
          </string-name>
          ,
          <article-title>Innovating patent retrieval: A comprehensive review of techniques, trends, and challenges in prior art searches</article-title>
          ,
          <source>Applied System Innovation</source>
          <volume>7</volume>
          (
          <year>2024</year>
          )
          <fpage>91</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>S.</given-names>
            <surname>Cetintas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Si</surname>
          </string-name>
          ,
          <article-title>Efective query generation and postprocessing strategies for prior art patent search</article-title>
          ,
          <source>Journal of the American Society for Information Science and Technology</source>
          <volume>63</volume>
          (
          <year>2012</year>
          )
          <fpage>512</fpage>
          -
          <lpage>527</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>V.</given-names>
            <surname>Stamatis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Salampasis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Diamantaras</surname>
          </string-name>
          ,
          <article-title>A novel re-ranking architecture for patent search</article-title>
          ,
          <source>World Patent Information</source>
          <volume>78</volume>
          (
          <year>2024</year>
          )
          <fpage>102282</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>S.</given-names>
            <surname>Althammer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Hofstätter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hanbury</surname>
          </string-name>
          ,
          <article-title>Cross-domain retrieval in the legal and patent domains: a reproducibility study</article-title>
          ,
          <source>in: Proceedings of the European Conference on Information Retrieval (ECIR)</source>
          ,
          <year>2021</year>
          , pp.
          <fpage>3</fpage>
          -
          <lpage>17</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>P.</given-names>
            <surname>Mahdabi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Keikha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Gerani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Landoni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Crestani</surname>
          </string-name>
          ,
          <article-title>Building queries for prior-art search</article-title>
          ,
          <source>in: Proceedings of the Information Retrieval Facility Conference (IRFC)</source>
          ,
          <source>Multidisciplinary Information Retrieval</source>
          ,
          <year>2011</year>
          , pp.
          <fpage>3</fpage>
          -
          <lpage>15</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>P.</given-names>
            <surname>Mahdabi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Andersson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hanbury</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Crestani</surname>
          </string-name>
          ,
          <article-title>Report on the clef-ip 2011 experiments: Exploring patent summarization</article-title>
          , in: CLEF Notebook Papers/Labs/Workshop,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>R.</given-names>
            <surname>Srebrovic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Yonamine</surname>
          </string-name>
          ,
          <article-title>Leveraging the BERT Algorithm for Patents with TensorFlow and BigQuery</article-title>
          , White paper,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>EPO</surname>
          </string-name>
          ,
          <article-title>Chapter iv: The european search report</article-title>
          , https://www.epo.org/en/legal/guide-epc/2023/ga_c4_2_4.html,
          <year>2023</year>
          . [Online].
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>X.</given-names>
            <surname>Xue</surname>
          </string-name>
          , W. B.
          <string-name>
            <surname>Croft</surname>
          </string-name>
          ,
          <article-title>Transforming patents into prior-art queries</article-title>
          ,
          <source>in: Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR)</source>
          ,
          <year>2009</year>
          , pp.
          <fpage>808</fpage>
          -
          <lpage>809</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>M.</given-names>
            <surname>Suzgun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Melas-Kyriazi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Sarkar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. D.</given-names>
            <surname>Kominers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Shieber</surname>
          </string-name>
          ,
          <article-title>The harvard uspto patent dataset: A large-scale, well-structured, and multi-purpose corpus of patent applications</article-title>
          ,
          <source>Advances in Neural Information Processing Systems</source>
          <volume>36</volume>
          (
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>S. C.</given-names>
            <surname>Pujari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Mantiuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Giereth</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Strötgen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Friedrich</surname>
          </string-name>
          ,
          <article-title>Evaluating neural multi-field document representations for patent classification</article-title>
          ,
          <source>in: Proc. International Workshop on Bibliometric Enhanced Information Retrieval (BIR) co-located with ECIR</source>
          ,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>D.</given-names>
            <surname>Miller</surname>
          </string-name>
          ,
          <article-title>Leveraging bert for extractive text summarization on lectures</article-title>
          , arXiv preprint arXiv:1906.04165,
          <year>2019</year>
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>N.</given-names>
            <surname>Reimers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Gurevych</surname>
          </string-name>
          ,
          <article-title>Sentence-bert: Sentence embeddings using siamese bert-networks</article-title>
          ,
          <source>in: Proc. EMNLP-IJCNLP</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>3982</fpage>
          -
          <lpage>3992</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>S.</given-names>
            <surname>Abdul Samad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Sushma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. Bharathi</given-names>
            <surname>Mohan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Samuji</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Repakula</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. R.</given-names>
            <surname>Kothamasu</surname>
          </string-name>
          ,
          <article-title>Advancing abstractive summarization: Evaluating gpt-2, bart, t5-small, and pegasus models with baseline in rouge and bleu metrics</article-title>
          ,
          <source>in: Proc. ICICDS</source>
          ,
          <year>2024</year>
          , pp.
          <fpage>119</fpage>
          -
          <lpage>131</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>J.</given-names>
            <surname>Ding</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kolapudi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Pobbathi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Nguyen</surname>
          </string-name>
          ,
          <article-title>Quality evaluation of summarization models for patent documents</article-title>
          ,
          <source>in: Proc. QRS</source>
          ,
          <year>2023</year>
          , pp.
          <fpage>250</fpage>
          -
          <lpage>259</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Saleh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Liu</surname>
          </string-name>
          , Pegasus:
          <article-title>Pre-training with extracted gap-sentences for abstractive summarization</article-title>
          ,
          <source>in: Proc. ICML</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>11328</fpage>
          -
          <lpage>11339</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>C.</given-names>
            <surname>Raffel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Shazeer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Roberts</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Narang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Matena</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. J.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <article-title>Exploring the limits of transfer learning with a unified text-to-text transformer</article-title>
          ,
          <source>Journal of Machine Learning Research</source>
          <volume>21</volume>
          (
          <year>2020</year>
          )
          <fpage>1</fpage>
          -
          <lpage>67</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>M.</given-names>
            <surname>Lewis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Goyal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ghazvininejad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Mohamed</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Levy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Stoyanov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Zettlemoyer</surname>
          </string-name>
          , Bart:
          <article-title>Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension</article-title>
          ,
          <source>in: Proc. ACL</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>7871</fpage>
          -
          <lpage>7880</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>N. F.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hewitt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Paranjape</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Bevilacqua</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Petroni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Liang</surname>
          </string-name>
          ,
          <article-title>Lost in the middle: How language models use long contexts</article-title>
          ,
          <source>Transactions of the Association for Computational Linguistics</source>
          <volume>12</volume>
          (
          <year>2024</year>
          )
          <fpage>157</fpage>
          -
          <lpage>173</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>S.</given-names>
            <surname>Moreno</surname>
          </string-name>
          ,
          <article-title>Transformers-Based Abstractive Summarization for the Generation of Patent Claims</article-title>
          ,
          <source>Ph.D. thesis, Politecnico di Torino</source>
          ,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>E.</given-names>
            <surname>Sharma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>Bigpatent: A large-scale dataset for abstractive and coherent summarization</article-title>
          ,
          <source>in: Proc. ACL</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>2204</fpage>
          -
          <lpage>2213</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>I.</given-names>
            <surname>Sutskever</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Vinyals</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q. V.</given-names>
            <surname>Le</surname>
          </string-name>
          ,
          <article-title>Sequence to sequence learning with neural networks</article-title>
          ,
          <source>in: Advances in Neural Information Processing Systems</source>
          , volume
          <volume>27</volume>
          ,
          <year>2014</year>
          , pp.
          <fpage>3104</fpage>
          -
          <lpage>3112</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>A.</given-names>
            <surname>See</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. J.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. D.</given-names>
            <surname>Manning</surname>
          </string-name>
          ,
          <article-title>Get to the point: Summarization with pointer-generator networks</article-title>
          ,
          <source>in: Proc. ACL</source>
          ,
          <year>2017</year>
          , pp.
          <fpage>1073</fpage>
          -
          <lpage>1083</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>Y.-C.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Bansal</surname>
          </string-name>
          ,
          <article-title>Fast abstractive summarization with reinforce-selected sentence rewriting</article-title>
          ,
          <source>in: Proc. ACL</source>
          ,
          <year>2018</year>
          , pp.
          <fpage>675</fpage>
          -
          <lpage>686</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>M.</given-names>
            <surname>Zaheer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Guruganesh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. A.</given-names>
            <surname>Dubey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ainslie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Alberti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ontanon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Pham</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ravula</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Yang</surname>
          </string-name>
          , et al.,
          <article-title>BigBird: Transformers for longer sequences</article-title>
          ,
          <source>in: Advances in Neural Information Processing Systems</source>
          , volume
          <volume>33</volume>
          ,
          <year>2020</year>
          , pp.
          <fpage>17283</fpage>
          -
          <lpage>17297</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>O.</given-names>
            <surname>Tonguz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Qin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Gu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. H.</given-names>
            <surname>Moon</surname>
          </string-name>
          ,
          <article-title>Automating claim construction in patent applications: The CMUmine dataset</article-title>
          ,
          <source>in: Proc. NLLP</source>
          ,
          <year>2021</year>
          , pp.
          <fpage>205</fpage>
          -
          <lpage>209</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>J.</given-names>
            <surname>Achiam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Adler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Agarwal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Ahmad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Akkaya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. L.</given-names>
            <surname>Aleman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Almeida</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Altenschmidt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Altman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Anadkat</surname>
          </string-name>
          , et al.,
          <article-title>GPT-4 technical report</article-title>
          ,
          <source>arXiv preprint arXiv:2303.08774</source>
          ,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>A.</given-names>
            <surname>Grattafiori</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Dubey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Jauhri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Pandey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kadian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Al-Dahle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Letman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Mathur</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Schelten</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Vaughan</surname>
          </string-name>
          , et al.,
          <article-title>The Llama 3 herd of models</article-title>
          ,
          <source>arXiv preprint arXiv:2407.21783</source>
          ,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>L.</given-names>
            <surname>Jiang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. M.</given-names>
            <surname>Goetz</surname>
          </string-name>
          ,
          <article-title>Natural language processing in the patent domain: A survey</article-title>
          ,
          <source>Artificial Intelligence Review</source>
          <volume>58</volume>
          (
          <year>2025</year>
          )
          <fpage>214</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>F.</given-names>
            <surname>Piroi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lupu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hanbury</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Zenz</surname>
          </string-name>
          ,
          <article-title>CLEF-IP 2011: Retrieval in the intellectual property domain</article-title>
          ,
          <source>in: CLEF Notebook Papers/Labs/Workshop</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>F.</given-names>
            <surname>Piroi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lupu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hanbury</surname>
          </string-name>
          ,
          <article-title>Passage retrieval starting from patent claims: A clef-ip 2013 task overview</article-title>
          ,
          <source>in: Working Notes for CLEF 2013 Conference</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [35]
          <string-name>
            <given-names>S.</given-names>
            <surname>Beliveau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Cenkci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Alcantara</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Frost</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Garret</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Reddy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Dane</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Cukierski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Howard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Demkin</surname>
          </string-name>
          ,
          <article-title>USPTO - Explainable AI for patent professionals</article-title>
          ,
          <source>Kaggle</source>
          ,
          <year>2024</year>
          . Available: https://kaggle.com/competitions/uspto-explainable-ai.
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [36]
          <string-name>
            <given-names>X.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Long</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Xie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Dai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Tang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Xie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Huang</surname>
          </string-name>
          , et al.,
          <article-title>mGTE: Generalized long-context text representation and reranking models for multilingual text retrieval</article-title>
          ,
          <source>in: Proc. EMNLP Industry Track</source>
          ,
          <year>2024</year>
          , pp.
          <fpage>1393</fpage>
          -
          <lpage>1412</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          [37]
          <string-name>
            <given-names>C.-Y.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <article-title>ROUGE: A package for automatic evaluation of summaries</article-title>
          ,
          <source>in: Text Summarization Branches Out</source>
          ,
          <year>2004</year>
          , pp.
          <fpage>74</fpage>
          -
          <lpage>81</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          [38]
          <string-name>
            <given-names>J.</given-names>
            <surname>Palotti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Scells</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Zuccon</surname>
          </string-name>
          ,
          <article-title>TrecTools: An open-source Python library for information retrieval practitioners involved in TREC-like campaigns</article-title>
          ,
          <source>in: Proc. SIGIR</source>
          ,
          <year>2019</year>
          , pp.
          <fpage>1325</fpage>
          -
          <lpage>1328</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          [39]
          <string-name>
            <surname>EPO</surname>
          </string-name>
          ,
          <source>CodeFest 2024 on generative AI</source>
          ,
          <year>2024</year>
          . Available: https://www.epo.org/en/news-events/in-focus/codefest/codefest-2024-generative-ai.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>