<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Journal of Machine Learning Research</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.48550/arXiv.1810.04805</article-id>
      <title-group>
        <article-title>Semantic Statistical Model for Assessing Expert Competence in Scientific Competitions</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Vitaliy Tsyganok</string-name>
          <email>vitaliytsyganok@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yaroslav Khrolenko</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Information Technology and Implementation, IT&amp;I-2025</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Institute for Information Recording of NAS of Ukraine</institution>
          ,
          <addr-line>2 Shpaka Street, Kyiv, 03113</addr-line>
          ,
          <country country="UA">Ukraine</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2026</year>
      </pub-date>
      <volume>22</volume>
      <issue>142</issue>
      <fpage>104</fpage>
      <lpage>116</lpage>
      <abstract>
        <p>This paper addresses the problem of objectively evaluating expert competence during the formation of juries for scientific competitions. It is shown that traditional bibliometric indicators (such as the Hirsch index, number of publications, and citation counts) do not reflect the specific subject domain of a competition, particularly in cases of interdisciplinary studies. To overcome this limitation, a semantic statistical model is proposed, based on describing the competition scope through a set of concepts and its iterative expansion via co-occurrence analysis in bibliographic databases. The integral competence score is defined as a weighted sum of matches across different levels of the model. The experimental validation of the model was carried out using data from the OpenAlex aggregator, which contains over 65,000 concepts and corpora of publications across diverse scientific domains. The study demonstrated that the model is robust to reductions in sample size: even when using only 3% of the full publication corpus, the results remain close to those of the complete model (Jaccard coefficient &gt; 0.9). This indicates the possibility of reducing the volume of processed data without significant quality loss, thereby lowering computational requirements and enabling partial offloading of calculations to the aggregator side. The practical significance of the proposed approach lies in the development of automated decision-support systems for organizers of scientific competitions, conferences, and grant programs. The model enhances the objectivity and transparency of expert selection, accommodates the interdisciplinary nature of research topics, and ensures interpretability of evaluation results.</p>
      </abstract>
      <kwd-group>
        <kwd>expert competence</kwd>
        <kwd>semantic statistical model</kwd>
        <kwd>competitions of scientific papers</kwd>
        <kwd>OpenAlex</kwd>
        <kwd>conceptual units</kwd>
        <kwd>thematic relevance</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>The task of forming well-balanced expert committees for student research competitions, grant allocation, and project peer review is becoming increasingly significant. The quality of decisions is directly contingent upon the competence of the selected experts, as they determine whether the submitted works align with the current state of scientific advancement, demonstrate novelty, and possess practical relevance. Conventional approaches to expert selection primarily rely on bibliometric indicators such as the Hirsch index, publication counts, citation metrics, and academic titles. Although these indicators reflect a researcher's overall scientific productivity, they often lack alignment with the specific subject area of the competition. This limitation is particularly critical in interdisciplinary research.</p>
      <p>Emerging domains often integrate knowledge from multiple disciplines, giving rise to novel subject areas that remain insufficiently represented in traditional classification systems. Fields such as bioinformatics, quantum computing, and cognitive science exemplify this trend, as they evolve at disciplinary intersections and cannot be adequately categorized through journal hierarchies or domain-specific taxonomies alone. Consequently, the selection of experts becomes problematic: candidates may hold strong formal qualifications but insufficient specialization to assess submissions in narrow or emerging domains.</p>
      <p>These challenges highlight the need for models that combine semantic analysis of scientific texts with statistical methods for evaluating the co-occurrence of key concepts in global research discourse. Such an integrated approach enables a more precise assessment of thematic proximity between an expert's research profile and the subject matter of a competition.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Analysis of recent research</title>
      <p>
        Over the past decades, the scientific community has proposed a variety of approaches for evaluating expert competence. The most common remain formal bibliometric methods, which are based on quantitative indicators of research productivity. These include the Hirsch index, citation counts, total number of publications, as well as academic degrees and titles. Such metrics have clear advantages: they are easy to compute, well standardized, and allow for quick comparisons between researchers. However, their main drawback lies in the lack of connection to a specific subject domain. As Bornmann and Daniel [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] and Waltman and van Eck [2] have shown, formal indicators do not always correlate with actual expertise in narrow thematic fields.
      </p>
      <p>To account for the content-specific nature of research, content-oriented methods have been
developed. These are based on the analysis of keywords, abstracts, and descriptors accompanying
scientific publications. In this case, the comparison of expert profiles and competition materials is
performed by matching or measuring the frequency of recurring terms. The advantage of this
approach lies in its intuitive interpretability and applicability even with small text corpora. Its
weakness, however, stems from the ambiguity of natural language: the same concept can be
expressed by different words or phrases, and author-provided keywords are often subjective and do
not necessarily reflect the true content of the article, as emphasized by Haustein and Larivière [3].</p>
      <p>The next stage of evolution involves semantic methods that employ natural language processing
techniques to determine contextual proximity between terms. Modern vector-based models, such as
Word2Vec (Mikolov et al. [4]), GloVe, or BERT (Devlin et al. [5]), represent words and concepts in a
multidimensional space, where the distance between them corresponds to semantic similarity. This
enables comparison not only of identical terms but also of semantically related concepts. For example, if a competition topic is formulated with one set of terms, an expert whose publications use semantically close terms can still be recognized as relevant even without a direct keyword match.</p>
      <p>Statistical methods, in turn, rely on studying the patterns of co-occurrence of key concepts in
publications. A typical example is the construction of co-occurrence graphs, where vertices represent
concepts and edges reflect their joint frequency of use. Such graphs make it possible not only to
identify directly related concepts but also to study the structure of scientific knowledge at a higher
level, for instance, to detect interdisciplinary links or to form thematic clusters, as discussed by van
Eck and Waltman [6].</p>
      <p>
        The problem of assessing expert competence correlates with the task of assigning reviewers to
scientific articles. For example, Stelmakh, Shah, and Singh [7] proposed the PeerReview4All model,
which addresses the reviewer assignment problem by maximizing topical relevance between
submitted works and expert profiles. A systematic review by Zhao et al. [8] summarizes recent
methods of automatic reviewer assignment, highlighting relevance and scalability as key criteria.
Jovanovic et al. [9], in their review of reviewer assignment approaches, note that the problem of thematic competence remains central to the further development of reviewer assignment models.
Leyton-Brown, Shah, Stelmakh, and Thakur [
        <xref ref-type="bibr" rid="ref6">10</xref>
        ] describe the practical LCM system, implemented at
AAAI-35, which enables scalable reviewer paper matching based on topical scores. Similarly, Anjum
et al. [11] propose a joint topical space model for aligning articles with reviewers, demonstrating
high accuracy on datasets from computer architecture conferences.
      </p>
      <p>In recent years, aggregator services have become widely adopted, providing large-scale access to
scientific works and their associated metrics, while performing classification tasks that require the
identification of conceptual units (concepts, topics, keywords), often on the basis of semantic
similarity. Services such as Scopus, Web of Science, PubMed, or Google Scholar provide access to
massive corpora of publications, along with bibliometric indicators (citations, h-index) and altmetric
measures (social media impact, mentions). Each platform relies on its own system of conceptual
classification: Scopus employs the ASJC journal classification, Web of Science uses its hierarchical
subject categories, PubMed applies the MeSH vocabulary, while OpenAlex introduces a multilayered
taxonomy of Fields of Study. Each of these approaches has advantages and limitations. For instance,
Moed [12] notes that Scopus classification does not always reflect the actual topic of an individual
article since it is tied to journal profiles. PubMed is highly effective in the biomedical domain but not
applicable to other fields. OpenAlex, described by Priem, Piwowar, Orr, and Carbery [13], provides
the broadest coverage, but its concepts are sometimes overly general.</p>
      <p>Nevertheless, the numerical connectivity indicators calculated by aggregators provide information about the proximity of concepts within the entire system of scientific knowledge. However, the task of competence evaluation requires assessing the proximity of an expert's profile to that of a specific competition. This is particularly important when competitions address emerging research areas, which often lie at the intersections of several disciplines and lack a well-established position in the broader system of knowledge.</p>
      <p>Thus, a review of the literature shows that existing approaches can partially address the problem
of expert competence evaluation, but remain either overly formal or narrowly specialized. None of
them ensures accuracy, universality, and adaptability to interdisciplinary challenges simultaneously.
This creates a foundation for developing hybrid models that integrate semantic and statistical
features, thereby enabling a more comprehensive and relevant representation of scientific
competence.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Problem statement</title>
      <p>The object of this study is the process of evaluating expert competence in the context of forming a
jury for scientific competitions. This process directly determines the quality of the evaluation of
competition materials, since the correctness of reviewer selection affects both the objectivity and the
scientific significance of the resulting decisions.</p>
      <p>Recent research emphasizes that the key objective of reviewer assignment systems is to ensure a close topical match between reviewers and the works they evaluate. The scientific problem lies in the lack of a universal competence evaluation model that would accurately capture the topical alignment of experts with the subject matter of competitions. Such a model must account simultaneously for the high degree of specialization of research topics on the one hand, and their interdisciplinary character on the other.</p>
      <p>The aim of this work is to develop a model for evaluating the competence of jury members in scientific competitions, based on the semantic and statistical relationships between the concepts describing the competition and the concepts found in the scientific works of potential experts. The proposed approach is intended to create a universal method that incorporates the interdisciplinary nature of modern scientific domains and enables the evaluation of expert competence with respect to the specific subject domain of the competition. The model should ensure completeness, accuracy, scalability, and practical applicability, while allowing adaptation to different types of content units (keywords, topics, descriptors) with minimal modifications.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Methodology</title>
      <p>4.1. In this study, a concept is understood as a content unit with a unique identifier assigned by a scientific aggregator (such as OpenAlex, Scopus, or PubMed) as a result of the automatic classification of a publication.</p>
      <p>Concepts possess a number of properties. First, a concept is an element of a controlled vocabulary (taxonomy); for example, in OpenAlex it is an element of the Fields of Study (FoS) taxonomy. Second, a concept is not a free-form keyword provided by the author but the result of machine analysis of publication metadata and/or full text performed by the aggregator using NLP algorithms. The use of standardized concepts eliminates problems of synonymy and homonymy (for instance, the same term carrying different meanings in biology vs. electrical engineering), since each semantic notion corresponds to a unique identifier. Another advantage provided by the use of aggregator platform classifiers is their multilingual capability. For instance, OpenAlex employs NLP algorithms capable of handling various languages. When processing non-English publications, the system attempts to map local terminology onto the global FoS taxonomy, ensuring comparability of works written in different languages within a single semantic space. However, the quality of classification depends on the effectiveness of processing the specific language.</p>
      <p>A key stage in constructing a semantic statistical model for evaluating expert competence is the description of the competition as a set of concepts, i.e., content units that reflect its thematic scope. Concepts may be represented in the form of keywords, descriptors, topics, or other semantic markers that characterize scientific texts. Thus, the set of competition concepts forms a conceptual core, relative to which the subsequent evaluation of expert relevance is performed.</p>
      <p>The source for constructing such a set can be standardized classification systems or taxonomies used in scientific information aggregators. It is recommended to adopt the classifier of the aggregator from which the scientific works will subsequently be retrieved for analysis. The use of such classifiers simplifies subsequent integration with bibliographic databases.</p>
      <p>The formalization of a competition or expert review topic begins with selecting a classification
system from a scientific aggregator (for example, Fields of Study in OpenAlex, ASJC in Scopus, or
MeSH in PubMed), which provides access to a standardized vocabulary of concepts with unique
identifiers. The initial set of competition concepts is determined through programmatic retrieval via
queries through the respective API. The system returns a ranked list of standardized concepts with
relevance scores, which, after expert validation, form the final set of competition concepts.</p>
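      <p>As an illustration of this retrieval step, the following sketch filters a ranked concept list into an initial concept set. It assumes an OpenAlex-style response shape (fields "results", "id", "display_name", "relevance_score"); the helper name and the relevance cutoff are illustrative and not part of the proposed model.</p>
      <preformat><![CDATA[
```python
# Sketch: forming an initial concept set T0 from an aggregator's ranked
# concept search response (OpenAlex-style field names assumed).

def extract_initial_concepts(response, min_relevance=0.5):
    """Return (concept_id, name) pairs whose relevance passes the cutoff."""
    hits = []
    for item in response.get("results", []):
        score = item.get("relevance_score") or 0.0
        if score >= min_relevance:
            hits.append((item["id"], item["display_name"], score))
    # highest relevance first; ties broken by name for determinism
    hits.sort(key=lambda t: (-t[2], t[1]))
    return [(cid, name) for cid, name, _ in hits]

# Example with a stubbed API response:
sample = {
    "results": [
        {"id": "C154945302", "display_name": "Artificial intelligence",
         "relevance_score": 0.92},
        {"id": "C119857082", "display_name": "Machine learning",
         "relevance_score": 0.81},
        {"id": "C41008148", "display_name": "Computer science",
         "relevance_score": 0.30},
    ]
}
print(extract_initial_concepts(sample))
# → [('C154945302', 'Artificial intelligence'), ('C119857082', 'Machine learning')]
```
]]></preformat>
      <p>The resulting list would then be validated by the organizers to form the final set of competition concepts, as described above.</p>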
      <p>In cases where the organizers of a competition cannot employ an existing classifier, or when the
available system proves insufficiently detailed, the set of concepts may be formed directly through
the analysis of competition documentation, abstracts, keywords, and titles of scientific works in the
relevant field. For this purpose, methods of automatic keyword extraction, topic modeling (e.g.,
Latent Dirichlet Allocation [14]), or pre-trained language models (e.g., BERT [15], SciBERT [16]) can
be applied to generate coherent sets of concepts. The construction of a classifier, however, represents
a separate task that lies beyond the scope of this study.</p>
      <p>To construct the semantic space of a competition, viewed as a subspace of the general space of scientific work concepts, a statistical semantic approach is proposed. Within this approach, the statistical semantic proximity of concepts is defined on the basis of the frequency of their co-occurrence in the global corpus of scientific works. Using this method, the competition model is constructed as a localized fragment of the global space of scientific knowledge concepts (i.e., the complete body of scientific publications). The generalized algorithm implementing this approach consists of the following steps.</p>
      <p>Step 1. As a starting point, the set of concepts describing the competition is taken. They form the set of concepts of the initial zero level (layer): T0 = {t_n^0}, n = 1, ..., N, where N is the number of competition concepts and t_n^0 is the n-th concept of the set T0.</p>
      <p>Step 2. A sample of works W^0 = {w_i^0}, i = 1, ..., |W^0|, is then selected, each of which contains at least one competition concept: C(w_i^0) ⋂ T0 ≠ ∅, where C(w_i^0) denotes the set of concepts characterizing the work w_i^0. For each competition concept t_n^0, the set of works in which this concept occurs among their conceptual representations is determined: W_n^0 = {w_i^0 ∈ W^0 | t_n^0 ∈ C(w_i^0)}, and the number of such works is calculated: d_n^0 = |W_n^0|.</p>
      <p>Step 3. Based on the obtained set of works, a set of unique concepts is formed that are co-occurring with the competition concepts. This involves not only the absence of duplicates among the identified concepts, but also the exclusion of concepts from the previous level T0. In this way, the set of first-level (layer) concepts is constructed: T1 = {t_m^1} = (⋃ C(w_i^0)) \ T0, m = 1, ..., |T1|.</p>
      <p>Step 4. On the obtained set of works W^0, the number of works d_m^1 = |W_m^1| in which each concept t_m^1 ∈ T1 occurs is calculated, where W_m^1 = {w ∈ W^0 | t_m^1 ∈ C(w)}.</p>
      <p>Based on the frequency value of concept usage, it is possible to identify and exclude random occurrences of irrelevant concepts. The threshold frequency level for excluding a concept from consideration is determined by the specifics of the subject domain and is defined by the organizers of the competition, for example, 3% of the size of the selected set of works: s1 = s ∙ |W^0|, where s is the chosen threshold share (e.g., s = 0.03). Concepts t_m^1 whose frequency d_m^1 does not reach s1 are excluded from T1.</p>
      <p>
        The use of a 3% threshold for concept filtering is consistent with established practices in
bibliometric mapping, where similar thresholds are applied to identify statistically robust and
semantically meaningful relationships between concepts [
        <xref ref-type="bibr" rid="ref4">17</xref>
        ]. The purpose of this threshold is to eliminate random co-occurrences that do not reflect the genuine scientific discourse surrounding the topic. Empirical studies have shown that thresholds in the range of 1–5% are effective for constructing balanced and interpretable models. The selected 3% value lies in the middle of this range and, as demonstrated by our experiment, provides an optimal balance between precision and completeness of the thematic model. However, the task of determining the appropriate threshold level requires further investigation.
      </p>
      <p>Step 5. Similarly, on the basis of the set T1, the next sample of works W^1 = {w_i^1}, i = 1, ..., |W^1|, is formed, in which at least one concept from the set T1 was used, and a new set of unique concepts is obtained: T2 = {t_m^2} = (⋃ C(w_i^1)) \ (T0 ⋃ T1), m = 1, ..., |T2|. For each concept of the set T2, the frequency of its usage d_m^2 = |W_m^2| is calculated in the sample of works W^1, where W_m^2 = {w ∈ W^1 | t_m^2 ∈ C(w)}. Concepts t_m^2 for which the frequency of occurrence d_m^2 does not satisfy the threshold level s2 = s ∙ |W^1| are excluded from consideration.</p>
      <p>Step 6. In the same way, the set of concepts of the next (n+1)-th layer is formed. The process continues iteratively until the set of concepts of the next level becomes empty.</p>
      <p>Thus, the model is sequentially expanded (Fig. 1) into the multilayer structure T0, T1, ..., TM, where M is the maximum depth of the model, determined either by the criterion of exhausting new concepts or by the established threshold of co-occurrence frequency. As a result, the application of this algorithm yields a model of the competition in which concepts are grouped according to levels of semantic connectivity. The connectivity of concepts is constructed on the basis of statistics of their co-occurrence in scientific works from the global corpus of research.</p>
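      <p>The layered expansion of Steps 1–6 can be sketched over a toy corpus in which each work is represented as a set of concept identifiers. This is a minimal sketch under assumed data structures, not the production pipeline; the function and variable names are illustrative.</p>
      <preformat><![CDATA[
```python
# Sketch of the iterative layer construction: T0 is the competition core,
# each next layer collects concepts that co-occur with the current layer
# and pass the frequency threshold s (e.g., 0.03 for 3%).

from collections import Counter

def build_layers(works, core, threshold=0.03):
    """works: list[set[str]], core: set[str] -> list of concept layers."""
    layers = [set(core)]
    seen = set(core)        # concepts already assigned to some layer
    current = set(core)
    while True:
        # works containing at least one concept of the current layer
        sample = [w for w in works if w & current]
        if not sample:
            break
        counts = Counter()
        for w in sample:
            counts.update(w - seen)          # exclude all previous layers
        cutoff = threshold * len(sample)     # s * |W^k|
        nxt = {c for c, n in counts.items() if n >= cutoff}
        if not nxt:
            break
        layers.append(nxt)
        seen |= nxt
        current = nxt
    return layers

works = [
    {"ai", "ml", "nn"},
    {"ml", "stats"},
    {"nn", "gpu"},
    {"biology", "cells"},
]
print(build_layers(works, {"ai"}, threshold=0.03))
```
]]></preformat>
      <p>On this toy corpus the algorithm yields three layers ({"ai"}, then {"ml", "nn"}, then {"stats", "gpu"}) and terminates when no new concepts pass the threshold, mirroring the stopping criterion described above.</p>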
      <p>The distinctive feature of the proposed model is that it constitutes a system of organizing scientific knowledge, in which concepts are arranged according to their increasing semantic distance (layer/level number) from the conceptual core defined by the competition's thematic scope. In other words, the model implements a procedure of structural localization of general scientific knowledge with respect to the coordinate system represented by the set of competition concepts.</p>
      <sec id="sec-4-1">
        <title>4.2. Properties of the Model</title>
        <p>The proposed semantic statistical model has a number of properties that determine its suitability for
use in systems for forming juries of scientific competitions.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2.1. Completeness</title>
        <p>The model ensures the gradual coverage of the entire relevant subject domain due to its iterative construction principle. At each step, sets Tk are formed, which expand the competition core T0 with new concepts co-occurring in the corpus of scientific publications. This makes it possible to identify not only explicitly stated terms but also related concepts that frequently appear in the same contexts. As a result, the model is not limited to a narrow list of keywords but captures a broader spectrum of notions that genuinely reflect the structure of knowledge in the chosen field.</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.2.2. Universality</title>
        <p>The construction methodology is independent of the specific type of content units. Concepts may
take the form of keywords, descriptors, topics, MeSH terms, Fields of Study (FoS) from OpenAlex, or
even phrase vectors generated using transformer-based models (e.g., BERT, SciBERT). This means
that the model is easily integrated with various databases (Scopus, Web of Science, PubMed,
OpenAlex) and can be applied in biomedical research as well as in engineering or the humanities. Its
universality makes it a flexible tool for organizers of competitions with different profiles.</p>
      </sec>
      <sec id="sec-4-4">
        <title>4.2.3. Scalability</title>
        <p>The model is applicable to datasets of any size, including large-scale data. The use of statistical methods (once a relevant sample is obtained) does not require processing excessively large volumes and allows for a significant reduction in computational and storage requirements for intermediate results. Since at each iteration of the algorithm the concepts used in previous steps are excluded, the overall size of the model cannot exceed the size of the aggregator's full concept dictionary; in the OpenAlex database the size of such a dictionary is approximately 65,000 concepts across all scientific disciplines.</p>
      </sec>
      <sec id="sec-4-5">
        <title>4.2.4. Objectivity</title>
        <p>Unlike traditional evaluation methods based on bibliometric indicators (e.g., Hirsch index, number of publications), the proposed model evaluates thematic relevance directly. It reduces the impact of subjective factors, since it relies on formalized metrics such as co-occurrence frequency of concepts and distances in vector space. As a result, an expert whose research profile strongly correlates with the thematic scope of the competition receives a correspondingly higher score. This enhances the fairness of jury formation from a scientific and methodological perspective.</p>
      </sec>
      <sec id="sec-4-6">
        <title>4.2.5. Interpretability</title>
        <p>The model enables transparent explanation of results. Each competence evaluation can be detailed by indicating which of the matched concepts belong to the competition core and which belong to more distant levels. This allows organizers to justify why a particular expert is considered relevant and to compare experts based on clear, understandable criteria.</p>
      </sec>
      <sec id="sec-4-7">
        <title>4.2.6. Flexibility</title>
        <p>The algorithm allows for the adjustment of weighting coefficients α_k, which regulate the sensitivity of the model to more distant concepts. If a strict evaluation focused only on the competition core is required, rapidly decaying coefficients may be used (α_k = 1/2^k). If broader context needs to be considered, slower decay may be applied (α_k = 1/(k+1)). This makes the model adaptable to different types of competitions and subject domains.</p>
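        <p>The two decay schedules can be compared directly; this small sketch uses illustrative function names:</p>
        <preformat><![CDATA[
```python
# Two weighting schedules for level k (k = 0 is the competition core):
# geometric decay concentrates weight on the core, while harmonic decay
# retains more influence for distant layers.

def geometric_weight(k):
    return 1 / 2 ** k      # 1, 0.5, 0.25, 0.125, ...

def harmonic_weight(k):
    return 1 / (k + 1)     # 1, 0.5, 0.333..., 0.25, ...

for k in range(4):
    print(k, geometric_weight(k), harmonic_weight(k))
```
]]></preformat>
        <p>From level 2 onward the geometric schedule falls below the harmonic one, which is why it suits strict, core-focused evaluation.</p>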
      </sec>
      <sec id="sec-4-8">
        <title>4.2.7. Accuracy</title>
        <sec id="sec-4-8-1">
          <title>Theoretical accuracy (accuracy by design).</title>
          <p>The model is constructed so that the core T0 carries the maximum weight, while distant levels (T1, T2, ...) gradually lose influence due to decreasing coefficients α_k. This makes the system robust to the inclusion of loosely related concepts while preserving the contribution of thematically relevant notions. In other words, the model is theoretically accurate, provided that the competition core is adequately defined.</p>
          <p>Practical accuracy (evaluation on data).</p>
          <p>The accuracy of the model is determined by its ability to correctly identify experts whose research genuinely matches the competition topic. In practice, it depends on the quality of the initial set of concepts T0, the completeness of the expert publication corpus, and the classification system used for content units.</p>
          <p>A drawback of the proposed model can be considered the fact that, in the pursuit of resource
efficiency and effectiveness, it has lost its autonomy. It requires extensive and labor-intensive data
preprocessing. In particular, this involves assigning thematic concepts to scientific works based on
their titles, abstracts, keywords, and textual content, or constructing dictionaries of key phrases,
topics, and so forth. However, this drawback cannot be regarded as critical, since many modern
aggregators of scientific publications already perform such processing and provide the results in
open access.</p>
        </sec>
      </sec>
      <sec id="sec-4-9">
        <title>4.3. Integral Evaluation of Expert Competence</title>
        <p>The constructed semantic statistical model provides for the formation of a generalized numerical evaluation of the correspondence of an expert's scientific profile to the thematic scope of the competition. Within the framework of the problem being addressed, the scientific profile of an expert is represented as the complete list of their publications.</p>
        <p>The integral indicator of expert competence is defined as a weighted sum of matches between the expert's concepts and the concepts of the multilevel competition model.</p>
        <sec id="sec-4-9-1">
          <title>4.3.1. Model</title>
          <p>Each expert E has a set of scientific works W(E). Each work w_i ∈ W(E) is described by a set of concepts C(w_i), i = 1, ..., |W(E)|. The expert profile is represented by the set of unique concepts derived from their works: P(E) = {p_1, p_2, ..., p_V} = ⋃ C(w_i), where V is the total number of unique concepts.</p>
        </sec>
        <sec id="sec-4-9-2">
          <title>A typical choice is geometric decay:</title>
        </sec>
      </sec>
      <sec id="sec-4-10">
        <title>4.3.2. Matching Function</title>
        <p>For each level  , a matching function is defined:
where
 ( , 
)=
∑ 1 [ ∈  ( )] ∙   ( )
 ∈
(19)
(20)
P(E), it
(21)
(22)
(23)
(24)
should be emphasized that the set of unique concepts associated with an expert is derived exclusively
from the classification provided by the selected scientific aggregator and does not involve the use of
any proprietary extraction algorithms.
condition for the correct computation of the integrated evaluation score  ( ). The methodology is
transparent and reproducible, as it relies entirely on publicly available data and metadata supplied
by the aggregator.</p>
        <p>The set of competition concepts T is represented as a multilayer structure:</p>
        <sec id="sec-4-10-1">
          <title>T, which is a necessary</title>
          <p>associated with the previous level.</p>
          <p>where T0 is the competition core, and each subsequent level T includes concepts statistically
To reflect the varying significance of levels in the competition model, weighting coefficients
  were introduced, which decrease as the distance from the core increases:</p>
          <p>T0, T1, … , TM
 1 &gt;  2 &gt; ⋯ &gt;  
  =
1

2
  ( )</p>
          <p>concept ccc occurs).</p>
          <p>Thus,  ( ,</p>
          <p>) reflects the intensity of the presence of competition concepts from level 
The integral evaluation of expert competence is defined as:</p>
          <p>This indicator generalizes the matches across all levels of the model, assigning the greatest weight
to the concepts of the competition core while reducing the influence of more distant terms.</p>
        </sec>
      </sec>
      <sec id="sec-4-11">
        <title>4.3.3. Normalization</title>
        <p>A direct interpretation of the obtained evaluation results for an individual expert has little meaning without comparison to the results of other experts in the group. Therefore, it is reasonable to normalize the evaluation results, for example by the maximum score within the group of candidate experts: K̂(E) = K(E) / max_j K(E_j). This normalization allows the evaluation to be interpreted on a relative scale [0;1].</p>
      </sec>
      <sec id="sec-4-12">
        <title>4.3.4. Interpretation of Results</title>
        <p>• High values of the normalized score (≥ 0.7) indicate strong thematic relevance of the expert to the competition.</p>
        <p>• Medium values (from 0.4 to 0.7) reflect partial relevance, which may be acceptable in interdisciplinary competitions.</p>
        <p>• Low values (&lt; 0.4) indicate insufficient thematic relevance.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Experiment</title>
      <sec id="sec-5-1">
        <title>5.1. Objective and Hypothesis</title>
        <p>The objective of the experiment is to test the hypothesis that a valid semantic statistical model of a competition can be constructed from a substantially reduced sample of publications. Thus, the task of the experiment is to determine the minimal sample sizes that guarantee the stability of the model.</p>
      </sec>
      <sec id="sec-5-2">
        <title>5.2. Input Data</title>
        <p>For the experiment, data from the scientific information aggregator OpenAlex were used. The study employed:</p>
        <p>• A corpus of scientific publications in the field of «Computer Science» containing 2 million works.</p>
        <p>• Publication metadata.</p>
        <p>
          • The OpenAlex concept directory (approximately 65,000 concepts) [
          <xref ref-type="bibr" rid="ref5">18</xref>
          ].
        </p>
        <p>5.3. Experimental Scenario</p>
        <p>Initial data formation. A random set of concepts with varying degrees of specialization is selected (from different levels of the hierarchy in the OpenAlex concept taxonomy).</p>
        <p>Sampling of works. For each selected concept, a set of scientific works in which it appears is formed, and the number of such works is determined (see Sec. 4.1, Step 2 of the algorithm).</p>
        <p>Formation of co-occurring concepts. Based on the obtained set of works, the set of unique
concepts co-occurring with the studied concept is computed (see Sec. 4.1, Step 3 of the
algorithm).</p>
        <p>Frequency estimation. For each co-occurring concept, the number of works in which it
appears is counted, and its percentage relative to the sample size is calculated (see Sec. 4.1,
Step 4 of the algorithm).</p>
        <p>Noise filtering. Concepts with a frequency below 1% (scope = 0.01) are
excluded from the set to eliminate random occurrences of irrelevant terms and informational
noise.</p>
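<p>Steps 3-5 (collecting co-occurring concepts, estimating frequencies, and filtering noise) can be sketched as follows; the per-work concept lists are a hypothetical stand-in for OpenAlex work metadata:</p>

```python
from collections import Counter

def relevant_concepts(works, scope=0.01):
    """Return concepts co-occurring in at least a `scope` share of the works.

    `works` is a list of concept-id lists, one list per paper (hypothetical
    structure standing in for OpenAlex publication metadata).
    """
    counts = Counter(c for work in works for c in set(work))
    threshold = scope * len(works)
    return {c for c, n in counts.items() if n >= threshold}

# "noise_term" appears in 1 of 200 works (0.5% < 1%), so the filter drops it.
works = [["ai", "nlp"]] * 199 + [["ai", "noise_term"]]
concepts = relevant_concepts(works)
```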
        <p>Obtaining the set of relevant concepts. As a result, a subset of concepts co-occurring with
the selected concept within the publication database is formed.</p>
        <p>Artificial reduction of the sample. The number of works from Step 2 is intentionally reduced
to 30%, 3%, and 0.1% of the full size.</p>
        <p>For each reduced sample, Steps 3–6 are repeated.</p>
        <p>Evaluation of similarity of results. The sets of concepts obtained from the full and reduced
samples are compared using the Jaccard coefficient:</p>
        <p>J(A, B) = |A ⋂ B| / |A ⋃ B| (27)</p>
        <p>The Jaccard coefficient measures the similarity between sets and is defined as the size of their
intersection divided by the size of their union.</p>
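<p>Equation (27) in code, with a small worked comparison between a full and a reduced concept set (the sets are illustrative):</p>

```python
def jaccard(a, b):
    """Jaccard coefficient of two sets: |A ∩ B| / |A ∪ B|, as in Eq. (27)."""
    a, b = set(a), set(b)
    return len(a & b) / len(a | b) if (a or b) else 1.0

full = {"machine learning", "statistics", "optimization", "neural network"}
reduced = {"machine learning", "statistics", "optimization"}
similarity = jaccard(full, reduced)  # intersection 3, union 4 -> 0.75
```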
      </sec>
      <sec id="sec-5-3">
        <title>5.4. Results</title>
        <sec id="sec-5-3-1">
          <p>Table: Jaccard similarity between the concept sets obtained from the full sample and from samples reduced to 3% and 0.1% of the full corpus, for the concepts Mathematics (313644 papers in the full sample), Software engineering (30910), Artificial intelligence (369553), and Artificial neural network (130556).</p>
          <p>The analysis of the obtained data shows that the semantic
thematic scope is highly robust to reductions in sample size. Even with a substantial decrease in the
number of works (down to 3% of the full corpus), the resulting sets of concepts remain close to those
constructed using the complete dataset. Significant deviations are observed only for very small
samples (0.1%), particularly in the case of rare or highly specialized concepts.</p>
          <p>Practical implications of the obtained results include:</p>
          <p>• the possibility of significantly reducing the volume of processed data without substantial loss of model quality;</p>
          <p>• reduced computational requirements for implementing the algorithm;</p>
          <p>• potential offloading of the most resource-intensive computations to the aggregator side.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5a">
      <title>6. Discussion</title>
      <p>The competition scope may combine several different directions within one discipline, or even be
interdisciplinary. In such cases, to obtain a correct evaluation, it is necessary to involve experts from
multiple domains. This is confirmed by established practices in forming expert groups. Another
option is to subject each work to independent evaluation by several experts, followed by weighting
of their evaluations, so that the lack of competence of a particular expert is compensated by the
competences of other members of the group.</p>
      <p>From this perspective, the proposed method requires further development toward differentiating
the evaluation for each competition concept individually. This, in turn, enables the formation of
expert groups using multi-criteria optimization methods.</p>
    </sec>
    <sec id="sec-6">
      <title>7. Conclusion</title>
      <p>The developed model for evaluating the competence of jury members in scientific competitions,
based on a semantic statistical approach, provides a universal and objective method for selecting
experts, with adaptability to interdisciplinary and applied domains. The use of semantic models and
statistical methods of co-occurrence frequency analysis makes it possible to construct a
comprehensive and scalable representation of the competition's thematic scope.</p>
      <p>Integration with aggregator services provides access to large volumes of up-to-date and
structured data (scientific articles with performed terminological analysis and concept identification,
dictionaries of concepts, key phrases, and topics), which enables the practical implementation of the
method and ensures its effectiveness.</p>
      <p>The proposed algorithm accounts for both direct and indirect connections by forming a hierarchy
of semantic connectivity levels. The integral competence evaluation of an expert is based on the
intersection of concepts from their scientific works with the levels of the proposed semantic model,
reducing the contribution of concepts from distant levels in a geometric progression, thus ensuring
convergence of the evaluation measure.</p>
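<p>The level-weighted aggregation described above can be sketched as a weighted sum with geometrically decaying level weights (the decay factor q and the match counts are assumed for illustration, not fixed by the paper):</p>

```python
def integral_score(matches_per_level, q=0.5):
    """Weighted sum of concept matches across semantic-connectivity levels.

    Level 0 holds direct matches; each deeper level's contribution is
    damped by q**level (0 < q < 1), so the series converges as the
    number of levels grows.
    """
    assert 0 < q < 1, "geometric decay requires 0 < q < 1"
    return sum((q ** level) * m for level, m in enumerate(matches_per_level))

score = integral_score([4, 2, 1])  # 4*1 + 2*0.5 + 1*0.25 = 5.25
```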
      <p>The method is flexible and adaptable to different types of content units. In addition to concepts,
it can incorporate keywords, topics, and descriptors, making it compatible with diverse approaches
to structuring scientific knowledge.</p>
    </sec>
    <sec id="sec-7">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, the authors used ChatGPT-4 and Grammarly for grammar
and spelling checking. After using these tools/services, the authors reviewed and edited
the content as needed and take full responsibility for the publication's content.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>L.</given-names>
            <surname>Bornmann</surname>
          </string-name>
          and
          <string-name>
            <given-names>H.-D.</given-names>
            <surname>Daniel</surname>
          </string-name>
          ,
          <article-title>What do citation counts measure? A review of studies on citing behavior</article-title>
          ,
          <source>Journal of Documentation</source>
          , vol.
          <volume>64</volume>
          , no.
          <issue>1</issue>
          , pp.
          <fpage>45</fpage>
          <lpage>80</lpage>
          , Jan.
          <year>2008</year>
          . doi: 10.1108/00220410810844150.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <article-title>Field-normalized citation impact indicators and the choice of an appropriate counting method</article-title>
          ,
          <source>Journal of Informetrics</source>
          , vol.
          <volume>9</volume>
          , no.
          <issue>4</issue>
          , pp.
          <fpage>872</fpage>
          <lpage>894</lpage>
          , Oct.
          <year>2015</year>
          ; in Incentives and Performance: Governance of Research Organizations,
          <string-name>
            <given-names>I. M.</given-names>
            <surname>Welpe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wollersheim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ringelhan</surname>
          </string-name>
          , and M. Osterloh, Eds. Cham: Springer,
          <year>2015</year>
          , pp.
          <fpage>121</fpage>
          <lpage>139</lpage>
          . doi: 10.1007/978-3-319-09785-5_8.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>T.</given-names>
            <surname>Mikolov</surname>
          </string-name>
          et al.,
          <article-title>Efficient estimation of word representations in vector space</article-title>
          , arXiv preprint
          <source>arXiv:1301.3781</source>
          ,
          <year>2013</year>
          ; and in Proc. EMNLP-IJCNLP, Hong Kong, China,
          <year>2019</year>
          , pp.
          <fpage>3613</fpage>
          <lpage>3618</lpage>
          . doi: 10.48550/arXiv.1903.10676.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>L.</given-names>
            <surname>Waltman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. J.</given-names>
            <surname>van Eck</surname>
          </string-name>
          , and
          <string-name>
            <given-names>E. C. M.</given-names>
            <surname>Noyons</surname>
          </string-name>
          ,
          <article-title>A unified approach to mapping and clustering of bibliometric networks</article-title>
          ,
          <source>Journal of Informetrics</source>
          , vol.
          <volume>4</volume>
          , no.
          <issue>4</issue>
          , pp.
          <fpage>629</fpage>
          <lpage>635</lpage>
          ,
          <year>2010</year>
          . doi: 10.1016/j.joi.2010.07.002.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [18] OpenAlex Concepts API. [Online]. Available:
          <source>https://api.openalex.org/concepts</source>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>K.</given-names>
            <surname>Leyton-Brown</surname>
          </string-name>
          et al.,
          <article-title>…-to-match (LCM) system for reviewer assignment: Design and deployment at AAAI</article-title>
          ,
          <source>AI Magazine</source>
          , vol.
          <volume>45</volume>
          , no.
          <issue>1</issue>
          , pp.
          <fpage>50</fpage>
          <lpage>61</lpage>
          , Mar.
          <year>2024</year>
          . doi: 10.1002/aaai.12139.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <article-title>…reviewer representation towards automatic paper-…</article-title>
          , in
          <source>Proc. ACM/IEEE Joint Conf. on Digital Libraries (JCDL)</source>
          , Champaign, IL, USA,
          <year>2019</year>
          , pp.
          <fpage>185</fpage>
          <lpage>194</lpage>
          . doi: 10.1109/JCDL.2019.00037.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>