<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Assessing Impact of Method Entities in a Special Task</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Xiaole Li</string-name>
          <email>18260038108@163.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yuzhuo Wang*</string-name>
          <email>wangyz@njust.edu.cn</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Information Management, Nanjing University of Science and Technology</institution>
          ,
          <addr-line>Nanjing</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2020</year>
      </pub-date>
      <fpage>81</fpage>
      <lpage>82</lpage>
      <abstract>
        <p>Methods play an important role in the research. Identifying and analyzing entities about research methods can help scholars understand methods used in their field and accelerate the efficiency of scientific research. There are relatively few empirical analysis studies on method entities using quantitative methods. This paper takes named entity recognition (NER) as an example and evaluate the impact of method entities in this domain. This study found that conditional random field (CRF) is the most influential algorithms in NER. Deep learning algorithms have developed rapidly in the past 5 years. F-measure, precision and recall are the most widely used indices and measurements. Scholars do not pay enough attention to use tools and they prefer to use classic datasets.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>CCS CONCEPTS</title>
      <p>• Information systems →Information extraction
Named entity recognition, Impact of method entity, Full-text
context analysis</p>
    </sec>
    <sec id="sec-2">
      <title>INTRODUCTION</title>
      <p>Methods play an important role in the science and technology.
Different methods need to be used in the process of solving
specific tasks. If methods appearing in academic papers can be
marked and evaluated, the current status of the research can be
summarized to provide technical reference for beginners and
accelerate the efficiency of scientific research.</p>
      <p>
        Research methods comprise data collection techniques and data
analysis techniques [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Research methods include multiple method
entities, e.g. algorithms, tools, data sets and other entities used by
scholars in solving problems. In this paper, we take NER as an
example, and use full-text context analysis to label the method
entities used in NER-related papers and assess their impact.
      </p>
    </sec>
    <sec id="sec-3">
      <title>RELATED WORK</title>
      <p>
        For the annotation of method entities, Scholars have used content
analysis to label different method entities in academic papers.
Zhao extracted the data set [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], and Howison explored the software
entities [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. But their work did not consider the task that the
∗ Corresponding author.
method can solve. For entity evaluation, Pan assessed software’s
impact by number of citations and mentions [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and Wang
evaluated algorithms’ impact by number of papers, the total
number of references and the mentioned location [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Therefore,
this paper also uses context analysis to identify and assess the
impact of method entities.
3
      </p>
    </sec>
    <sec id="sec-4">
      <title>METHOLOGY</title>
      <p>The research framework is shown in Figure 1. In order to assess
the impact of method entities in a specific task, this paper first
obtains academic papers from a website, then annotates method
entities in the papers. Finally the impact of method entities is
assessed based on different indicators.</p>
      <p>annotate
Get academic papers from entities Method entity set
a special website
assess
impact</p>
      <sec id="sec-4-1">
        <title>Entity s impact</title>
      </sec>
      <sec id="sec-4-2">
        <title>Evaluation based on age distribution Evaluation based on the number of papers</title>
        <p>Data collection. We search papers containing ‘named entity
recognition’ or ‘extraction or identification’ in their title from
ACL Anthology (https://www.aclweb.org/anthology/). After
deleting non-English papers and literature review, we get full-text
content of 426 papers.</p>
        <p>Method entity annotation. In the study, we annotate method
entities that used by authors in academic papers, including
algorithms, tools, data sources, indices and measurements. The
entities are labeled by two senior students. Before formal
annotation, we compile the annotation specification, and 50
articles are selected randomly for the pre-annotation. We employ
Cohen’s kappa coefficient to measure the interrater reliability
(IRR) between the two students and achieve an IRR of 0.70,
which provide sufficient reliability for two coders to code all the
papers evenly. Table 1 gives the example of annotated entities.</p>
        <p>Table 1: Examples of four types of method entities</p>
        <sec id="sec-4-2-1">
          <title>Entity Entity Type Entity Sentence</title>
          <p>Conditional algorithm &amp; A Conditional Random Fields model
Random Fields model annotates the entities components.</p>
          <p>ACE 2005 data source We used ACE 2005 for our experiments.</p>
          <p>F-measure index &amp; Performance was measured with the
Fmeasurement measure score.</p>
          <p>Copyright 2020 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
CRF++ tool We used the CRF++ to …</p>
          <p>Impact assessment of method entities. Two indicators are used
to assess impact of method entities. One is number of papers: For
each entity, we count the number of papers using it, the more
papers the greater influence of the entity. Another is age
distribution: We get the publication time of the paper through the
download link and analyze the change in influence of method
entities over time.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>4 RESULTS</title>
      <p>After annotating and sorting, we get 345 data sources, 251
algorithms, 235 tools, and 73 indices and measurements.</p>
    </sec>
    <sec id="sec-6">
      <title>4.1 Evaluation based on the number of papers</title>
      <p>Table 2 displays the top 5 highly-used entities in NER papers.</p>
      <p>Table 2: Top 5 entities and the number of papers</p>
      <sec id="sec-6-1">
        <title>Data source Algorithm Tool Index &amp; &amp; model measurement</title>
        <p>CoNLL 2003(74) CRF(194) CRF++(40) F-measure(371)
Wikipedia(74) BiLSTM(72) OpenNLP(11) Precision(258)
Twitter(37) SVM(50) word2vec(11) Recall(256)
CoNLL 2002(22) ME(50) Stanford cross</p>
        <p>CoreNLP(10) validation(55)
MSRA(20) Viterbi(49) Twitter API(10) Accuracy(34)</p>
        <p>For data source, the data sets generated in classic conferences
are used repeatedly by scholars, such as CoNLL 2002 and 2003.
MSRA is the most commonly used dataset for Chinese NER. For
algorithm, traditional machine learning methods, including the
supervised learning algorithm Support Vector Machine (SVM),
statistical models Maximum Entropy (ME) and CRF get the
highest influence. Recently, the deep learning algorithm
Bidirectional Long Short-Term Memory (BiLSTM) has been widely
used. For tools, there are tools for specific algorithms, such as
CRF++, as well as NLP and machine learning tools (CoreNLP,
OpenNLP, etc.). For indicators, the use of F-measure, Precision,
and Recall occupies 80% of the total, with the greatest influence.</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>4.2 Evaluation based on age distribution</title>
      <p>Figure 2 shows the development of impact of various entities. In
general, the usage times of tools is low, algorithm is well
developed, and the top 3 indicators are quite stable.</p>
      <p>CoNLL 2003 Wikipedia Twitter CoNLL 2002 MSRA
2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 19
0
year2
CRF++
(a) Age distribution of top 5 datasets</p>
      <p>OpenNLP word2vec Stanford CoreNLP
Twitter API
2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018year92
1
0
30
10
r
e
b
m
u
n
(b) Age distribution of top 5 tools</p>
      <p>BiLSTM SVM ME</p>
      <p>X. L. Li et al.</p>
      <p>Viterbi
2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 19
0
2
year
(c) Age distribution of top 5 algorithms
F-measure Precision Recall cross validation
Accuracy
2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018yea92r
1
0
(d) Age distribution of top 5 indices and measurements
Figure 2: The age distribution of top5 in four type entities
Figure 2(a) shows that classic datasets will be used many times
in recent years and CoNLL 2003 get the most dramatic growth. In
figure 2(b), we find that tools were used commonly before 2015,
and they have declined in recent years. On the contrary, the use of
algorithms has been greatly improved after 2015(see Figure 2(c)),
indicating that scholars began to focus on the algorithm itself to
solve complex NER tasks, instead of using tools directly. After
2015, BiLSTM is increasingly used by scholars, and its influence
has been greatly improved. As shown in Figure 2(d), F-measure is
the most commonly used indicator.</p>
    </sec>
    <sec id="sec-8">
      <title>5 CONCLUSION AND FUTURE WORKS</title>
      <p>Our results show that CRF is the most influential algorithm in
NER related papers; CRF++ is the most commonly used tool;
CoNLL2003 is the most commonly used dataset; F-measure, P,
and R are widely used indicators. The most used entity is the
algorithm and model, and the least is tool.</p>
      <p>In terms of entity evaluation, there are still certain deficiencies.
In the future, the full text can be used to determine the motivation
for using entity, and the entity distribution in different sections
can be used to further analyze entity’s impact.</p>
    </sec>
    <sec id="sec-9">
      <title>ACKNOWLEDGMENTS</title>
      <p>Thanks Chen Y. to annotate the corpus.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>Heting</given-names>
            <surname>Chu</surname>
          </string-name>
          and
          <string-name>
            <given-names>Qing</given-names>
            <surname>Ke</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Research methods: What's in the name?</article-title>
          <source>Library and Information Science Research</source>
          ,
          <volume>39</volume>
          :
          <fpage>284</fpage>
          -
          <lpage>294</lpage>
          . DOI: https://doi.org/10.1016/j.lisr.
          <year>2017</year>
          .
          <volume>11</volume>
          .001.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Zhao</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yan</surname>
            <given-names>E</given-names>
          </string-name>
          , and et al.
          <year>2018</year>
          .
          <article-title>Data set mentions and citations: A content analysis of fulltext publications</article-title>
          .
          <source>JASIST</source>
          ,
          <volume>69</volume>
          (
          <issue>1</issue>
          ):
          <fpage>32</fpage>
          -
          <lpage>46</lpage>
          . DOI: https://doi.org/10.1002/asi.23919.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Howison</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bullard</surname>
            <given-names>J.</given-names>
          </string-name>
          <year>2016</year>
          .
          <article-title>Software in the scientific literature: Problems with seeing, finding, and using software mentioned in the biology literature</article-title>
          .
          <source>JASIST</source>
          ,
          <volume>67</volume>
          (
          <issue>9</issue>
          ):
          <fpage>2137</fpage>
          -
          <lpage>2155</lpage>
          . DOI: https://doi.org/10.1002/asi.23538.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Pan</surname>
            <given-names>X</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yan</surname>
            <given-names>E</given-names>
          </string-name>
          and et al.
          <year>2015</year>
          .
          <article-title>Assessing the impact of software on science: A bootstrapped learning of software entities in full-text papers</article-title>
          .
          <source>Journal of Informetrics</source>
          ,
          <volume>9</volume>
          (
          <issue>4</issue>
          ),
          <fpage>860</fpage>
          -
          <lpage>871</lpage>
          . DOI: https://doi.org/10.1016/j.joi.
          <year>2015</year>
          .
          <volume>07</volume>
          .012.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Wang</surname>
            <given-names>Y</given-names>
          </string-name>
          and
          <string-name>
            <surname>Zhang</surname>
            <given-names>C.</given-names>
          </string-name>
          <year>2018</year>
          .
          <article-title>Using Full-Text of Research Articles to Analyze Academic Impact of Algorithms</article-title>
          . In International Conference on Information. Springer International Publishing, Cham, English,
          <fpage>395</fpage>
          -
          <lpage>401</lpage>
          . DOI: https://doi.org/10.1007/978-3-
          <fpage>319</fpage>
          -78105-1_
          <fpage>43</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>