<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Case-Based Comparison of Career Trajectories</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Kedma Duarte</string-name>
          <email>kedmaduarte@gmail.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rosina O. Weber</string-name>
          <email>rosina@drexel.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Roberto C. S. Pacheco</string-name>
          <email>pacheco@egc.ufsc.br</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>College of Computing and Informatics, Drexel University</institution>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Graduate Program in Knowledge Engineering and Management, Federal University of Santa Catarina</institution>
          ,
          <country country="BR">Brazil</country>
        </aff>
      </contrib-group>
      <fpage>152</fpage>
      <lpage>161</lpage>
      <abstract>
        <p>Data generated across time may not be easily comparable in its original form thus potentially leading to results that may be perceived as unfair to some. We investigate quality assessment of scholarly researchers from their curricula vitae (CVs) for processes such as hiring, promotion, and grant funding. In previous work, we demonstrated that case-based reasoning (CBR) offers advantages as a transparent methodology to assess researcher quality. Its benefits include consistency, transparency, ability to adapt to specific purposes, and ability to provide explanation. The problem we now face is how to preprocess the data from the CVs to compare researchers whose scholarly production is achieved under different conditions, different points in time, and span different career trajectory lengths. We propose strategies to deal with these aspects of time during preprocessing of the data for case representation. We use 1,000 CVs from the Brazilian Lattes database to illustrate.</p>
      </abstract>
      <kwd-group>
        <kwd>case-based reasoning • time series • trajectory • career trajectory • curriculum vitae • normalization • recency</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        There is a growing interest in relying on high quality profiling systems to conduct data
studies to, as stated by Lane [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], “make science more scientific”. Researcher quality
assessment is a crucial task because characteristics of research metrics steer science and
technology decisions, ultimately steering progress, economics, and our way of life [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>
        Unfortunately, private organizations have started to explore this niche and are now
steering our future by offering research metrics that rely on incomplete and flawed
automatically crawled data [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. In response to this present state, a group of researchers
gathered at the 2014 International Conference on Science and Technology Indicators to
produce the Leiden Manifesto [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]—a set of 10 principles for research quality metrics
that includes attention to transparency, flexibility, and context, amongst others.
      </p>
      <p>
        At the 2016 International Conference on Science and Technology Indicators, these
authors proposed a CBR approach to manipulate profiling data for researcher quality
assessment [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Our CBR method can be tailored to specific contextual purposes to
meet some of the objective principles from the manifesto because of its consistency,
transparency, ability to adapt to specific purposes, and ability to provide explanation.
      </p>
      <p>In the proposed methodology, CBR is used to classify candidate researchers as either
fit or unfit for a purpose. Purposes are characterized by features that reflect specific
jobs or promotions. Each entails a series of references of quality such as publications
in a journal or conference that are considered more relevant than others. The
characterization of the purpose comes from the users who adopt the methodology to classify CVs
of applicants. The use of CBR in this task assumes that assessing quality ultimately
implies predicting future success.</p>
      <p>In this paper, we describe the CBR implementation, and discuss three preprocessing
steps that are required due to temporal aspects of the data. The first is a standard
normalization step so that absolute volumes of scholarly production are replaced by
relative values of productivity. This avoids the comparison of absolute numbers of
production accomplished in years when conditions are different. The second aspect is recency.
We analyze researchers’ accomplishments to assess whether more recent production is
or not more predictive of quality. The third refers to grouping the relative values of
productivity depending on the lengths of career trajectories and recency. The CBR
system, as it is implemented now, uses one aggregated data point for each attribute.
Deciding how to group this data depends on directives of the users in terms of how they
favor experience, productivity, or whether they want both to have the same emphasis.</p>
      <p>
        This paper’s intended contributions are to introduce the challenges stemming from
using temporal data from CVs to assess researcher quality with CBR, and propose
preliminary strategies to address them. We illustrate these challenges and strategies with
data from 1,000 CVs from the period 2001 to 2014 from the Brazilian Lattes database
[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. The expected value of these strategies is to address these time-related challenges in
a way that preserves transparency and enables an easy to understand substantiation.
      </p>
      <p>In the next section, we provide the background for this work, including how we
proposed to use CBR for assessing researcher quality. We also mention a few related works
in time and CBR, and in time-series prediction. In Section 3, we describe the challenges
and our proposed strategies. We lay out directions of future work in Section 4.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Background</title>
      <p>In this section, we introduce some of the concepts used in this paper. We start with
normalization, move to time-series approaches, and then discuss some aspects of
dealing with career trajectories. In the final section, we describe the CBR approach that
motivates this work.</p>
      <p>
        Normalization is a method that may be used before a classification process, required
to equalize ranges of the features from different scales, in order to obtain the same
proportion between them, making features comparable [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Several techniques have
been proposed to implement normalization (e.g., Min-Max Normalization, Linear
Scaling to Unit Range, Median Normalization, and Z-Score Normalization), and many
studies have investigated the relation between choosing the appropriated normalization
technique and improving classification accuracy (e.g., [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ][
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]). These studies
demonstrated the dependence of normalization methods in the performance of classification
accuracy.
      </p>
      <p>
        GenericPred [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] is a method for long-term time-series forecasting that addresses
chaotic behaviors such as natural phenomena strongly dependent on initial conditions,
which are many times unknown and consequently difficult to model and predict. The
results of this approach demonstrated a significant gain in accuracy over traditional
time series methods for both short and long-term predictions.
      </p>
      <p>
        Time-series using bibliometrics data have been used to discover distinguished
researchers [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Their approach is able differentiate researchers who have contributed
a significant achievement amongst those publishing a few papers over a long period.
      </p>
      <p>
        Time-series data of renal transplantation patients has been used in case-based binary
classification [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. The approach compares time series of creatinine courses using a
distance measure based on linear regression.
      </p>
      <p>
        Dynamic time warping (DTW) [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] is a distance measure to compare temporal
se-quences based on dynamic programming. DTW is much more robust than
measures based on the Euclidian distance [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] as it allows an elastic shifting of the
time axis.
2.1
      </p>
      <p>
        Career Trajectories
The terms career and trajectories are viewed as synonyms that describe the path from
entering into the job market and its following steps [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. Along the same lines, the
career of a researcher has been described as a longitudinal account of an individual’s
productivity [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. Our focus in this paper is on career trajectories from the
perspective of the productivity of researchers along their careers [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ][
        <xref ref-type="bibr" rid="ref19">19</xref>
        ].
      </p>
      <p>
        The consideration of time when studying career trajectories is important for the
reliability of indicators and rankings. Previous indicators or metrics that attempted to
define a fixed interval of years have been highly criticized [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. The purpose of
normalizing time intervals and use annual productivity when assessing researcher
quality is to make available the same transparent standards to all researchers who are
assessed.
      </p>
      <p>Our main problem is that this process must be transparent and able to
substantiate its fairness. The assessment has to clearly consider and describe
separately the biases that come from the description of purpose from the biases that
originate in learning methods. The first issue we investigate is how to demonstrate
whether an assessment can be fair when quality assessment is case-based, which
requires comparison between researchers whose career trajectories span different
intervals.
2.2</p>
      <p>
        Purpose-Oriented Case-Based Researcher Quality Assessment
The purpose-oriented CBR approach classifies researchers as fit or unfit for the
purpose of a target process (Fig. 1) such as hiring or promotion [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. This method
supports the Leiden Manifesto [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] to incorporate purpose in research metrics aligned
with the con-cept that quality means fitness for purpose [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ].
      </p>
      <p>A purpose-oriented approach requires users to input the purpose as a set of
standards or examples. For instance, for a target process to hire a researcher for the
federal uni-versity of Rio de Janeiro to work with the Zika virus, publications in local
conferences
where geographic issues are the focus may be considered of high importance when
as-sessing quality of an applicant. Users can also indicate examples of fit and unfit
re-searchers, which can then be used for weight learning.</p>
      <p>The first parameter to be captured for a purpose p is the target interval of interest
N, where n ∈ N is the year in question within the interval of years N that are to be included
in the data from candidates to be considered for a given purpose. Years y of
importance are y1 = Initial year, and yn = Final year.</p>
      <p>Input</p>
      <p>Classifier</p>
      <p>Output: Classified CV
Unclassified
Researcher</p>
      <p>CV data
Classified
researcher</p>
      <p>CV</p>
      <p>Classified
researcher</p>
      <p>CV
Fit for a purpose</p>
      <p>Unfit for a purpose
∑
.
, 1
(1)
We define weights , … , ,  0,1 , for each purpose p.
The local similarity measure between attribute aj, and aj’, used in the data in this article
is defined by:
,
1
,
|a a
|
(2)
0,
is the maximum distance between and ’ .</p>
      <p>This way the case-based classification of fit or unfit is not assessing similarity
between time-series but between flat cases with weights in each attribute stemming from
the characteristics of the purpose p.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Challenges and Proposed Strategies</title>
      <p>We use a simple example to illustrate the challenges in preprocessing data to populate
cases for case-based researcher quality assessment from CV data. Suppose we plan to
use our case-based quality assessment approach to classify applicants as either fit or
unfit for a given purpose. One of the parameters for a job is the target interval of interest
N, which delimits the years that are considered relevant to include in the examination
of candidates. For example, a job opening seeking social media experts would probably
not include accomplishments from candidates that predate the existence of social
media. For data in Table 1, the target interval of interest is five years. Each line in Table 1
refers to the volume of one type of accomplishment (e.g., journal articles in one field
and reputation) produced by job applicants (i.e., researchers).</p>
      <p>Suppose the data in Table 1 on the left columns designated by aijn reflect all the items
produced by each applicant. The maximum number of accomplishments varies each
year. Only in Year 1 and Year 5, a maximum of five accomplishments was produced.
In Year 3 however the maximum produced was three. We contend that there are
external factors that may have contributed to higher and lower levels of productivity. One
common example is a reduction of participation in conferences in periods of economic
depression. We therefore normalize these absolute values and convert them into
productivity rates, using Equation 3.</p>
      <p>The results of the normalization are laid out in right columns under āijn. They show
how one same absolute value (e.g., when i = 3) can represent the maximum productivity
in Year 3 and 60% in Year 1. For example, if using absolute values, production of
applicant in third row would be considered inferior to the applicant in the second row
in Year 2 whereas relative values make them equal. This simple step is easy to
describe to a broad audience and does not depend on characterization of the purpose.
3.2</p>
      <p>Recency
The challenge with respect to recency stems from the notion that recent data may be
perceived as more current and therefore more relevant in time series classification. This
possible perception may lead to claims of injustice and therefore we need to establish a
way to assess whether or not recent data is more influential. Note that in our proposed
approach, the target process may dictate the importance of recent accomplishments.
Assessing how influential recent data is would be required for implementations when
the target process is neutral about recency.</p>
      <p>Given our assumption that assessing quality implies predicting future success, it is
consistent to interpret that data is influential or relevant when it is predictive. To do
this, we take the target interval of interest N and set the last year aside as actual to
provide outcome classes. The intuition is that if a given year’s data has cases that
correctly predict the actual year then this year’s data are predictive and hence influential.</p>
      <p>
        To demonstrate this proposed analysis, we start from a hypothetical purpose,
namely, a job opening that seeks a researcher who is a successful collaborator. This
hypothetical purpose was captured using rules that assigned more importance to
publications and funded projects achieved in collaboration than to solo authored
accomplishments. The data where we applied these rules to determine who was fit or unfit for a
collaborative job was selected from the Brazilian Lattes database [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. For this reason,
some of the parameters used to create rules reflected that local culture. These data and
weights were described in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>
        For the analysis we now describe, we use weights learned in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. The data we used
in this analysis is new. We started from the entire Lattes database that retains around 4
million CVs. From these, we selected 212,000 CVs of researchers with completed
doctoral degrees. In order to work with dense data, we kept only researchers who were
continuously productive from the target interval of interest that we defined from 2001
to 2014, resulting in 50,000 CVs. We kept CVs from researchers with a growing
absolute number of accomplishments to eliminate researchers with periods of inactivity.
This resulted in 20,000 CVs. For the analysis we show in this paper, we used a randomly
selected sample of 1,000 CVs.
      </p>
      <p>
        For the target interval of interest from 2001 to 2014, we set aside 2014 as actual to
provide outcome classes. Our goal is to assess how predictive the data from years 2001
to 2013 are. For each year, we use our case-based implementation with leave-one-out
cross validation (LOOCV) [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] to predict whether each researcher would be classified
as fit or unfit for the collaborative hypothetical purpose above described. We compute
for each researcher whether the classification using each year is correct (i.e., true
positive, true negative) or incorrect (i.e., false positive, false negative).
true positives
true negatives
false positives
false negatives
      </p>
      <p>Table 2 shows the average accuracy (AA) for all 1,000 researchers using their data
from each year in the first row. The second and third rows present respectively accuracy
of fit (i.e., ratio of true positives) and accuracy of unfit (i.e., ratio of true negatives).</p>
      <p>These results in Table 2 are difficult to interpret because we do not know if the
averages include the same or different researchers. To better understand these results, Fig.
2 plots true positives, true negatives, false positives, and false negatives. Positives are
represented with continuous lines, and negatives are represented with dots. Light color
is for true and dark for negative. A consistent trend would have true positives and true
negatives in a direction opposite to false positives and false negatives. This would mean
that accuracy increased because, for example, the true positives increased because of a
reduction of false negatives. If accuracy increased in more recent years, then we would
have to increase the relative relevance of these years when aggregating these values.
The lines in Fig. 2 do not show consistent results. Our conclusion is thus that there is
no consistent trend supporting the interpretation that recent data is more or less
predictive than older data. Hence, ∀ āijn, N= {2001,..., 2013}, gn = 1. Values for gn are used
in the next step when values are aggregated.
3.3</p>
      <p>Career Trajectories</p>
      <p>A researcher ri  R has a career trajectory CT that reflects the researcher’s years of
activity. For aggregating researchers’ production, we need:</p>
      <p>CTi = y
Max (CT i) = ymax
Min (CT i) = ymin</p>
    </sec>
    <sec id="sec-4">
      <title>Concluding Remarks and Next Steps</title>
      <p>This paper introduces time-related challenges faced when implementing CBR for
researcher quality assessment. We propose a standard normalization to compare
productivity instead of absolute volume of accomplishments, strategies to aggregate
production across different career trajectories, and an analysis of predictiveness to address
recency.</p>
      <p>Given that there is no consensus on how many years should be used to assess
researcher quality, we propose to use predictiveness of data within a target process
context as a proxy to how influential it should be. We showed an illustrative example where
data did not reveal variations in its level of predictiveness.</p>
      <p>This work is very preliminary. The next step is to study different datasets to
determine how to assess predictiveness and how to compute a measure of recency for when
data reveals consistent trends.</p>
      <p>
        The approach proposed in this paper aims to enhance the case-based researcher
quality assessment proposed in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] by adding weights within a target time interval when
results of the recency assessment determine that more recent data is more or less
predictive of the future and therefore should be considered more relevant.
      </p>
      <p>Given that normalization strategies interfere with classification accuracy, we need
to experiment with various purpose scenarios and normalization strategies to assess
which have both high accuracy and acceptable substantiation. Along these lines, we
will investigate DTW particularly when comparing career trajectories of different
lengths.</p>
      <p>This paper does not detail how the characterization of a purpose may be captured,
which can be through examples, conditions, and a combination of these. We also limit
the presentation to binary classification and do not discuss how to produce a ranking of
the applicants. These are both topics for future work.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgements</title>
      <p>Authors thank the STELA Institute, particularly Rudger Taxweiler for his help
collecting data. First author is supported by Brazilian’s Goiás Research Foundation
(FAPEG) and University of the State of Goiás (UEG) under agreement number
201310267000099. Authors also thank the suggestions from the reviewers.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Lane</surname>
          </string-name>
          , J.:
          <article-title>Let's make science metrics more scientific</article-title>
          .
          <source>Nature</source>
          <volume>464</volume>
          ,
          <fpage>488</fpage>
          -
          <lpage>489</lpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Katz</surname>
            ,
            <given-names>J.S.</given-names>
          </string-name>
          :
          <string-name>
            <surname>Scale-Independent</surname>
            <given-names>Measures</given-names>
          </string-name>
          :
          <article-title>Theory and Practice</article-title>
          .
          <source>In 17th International Conference on Science and Technology Indicators</source>
          . Montreal, Canada,
          <fpage>1</fpage>
          -
          <lpage>19</lpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Van</surname>
            <given-names>Noorden</given-names>
          </string-name>
          ,
          <string-name>
            <surname>R.</surname>
          </string-name>
          :
          <article-title>Metrics: A profusion of measures</article-title>
          .
          <source>Nature</source>
          <volume>465</volume>
          (
          <issue>7300</issue>
          ),
          <fpage>864</fpage>
          -
          <lpage>866</lpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Hicks</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Wouters</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Waltman</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>De Rijcke</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rafols</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Bibliometrics: The Leiden Manifesto for research metrics</article-title>
          .
          <source>Nature</source>
          ,
          <volume>520</volume>
          ,
          <fpage>429</fpage>
          -
          <lpage>431</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Duarte</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weber</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pacheco</surname>
          </string-name>
          , R.C.S.:
          <article-title>Purpose-oriented metrics to assess re-searcher quality</article-title>
          .
          <source>In 21st International Conference on Science and Technology Indicators</source>
          (
          <article-title>STI2016): Peripheries, frontiers and beyond</article-title>
          . València, Spain,
          <fpage>1312</fpage>
          -
          <lpage>1314</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Pacheco</surname>
            ,
            <given-names>R. C. S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kern</surname>
            ,
            <given-names>V. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Salm</surname>
            <given-names>Jr</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>J. F.</given-names>
            ,
            <surname>Packer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. L.</given-names>
            ,
            <surname>Murasaki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Amaral</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Santos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.D.</given-names>
            ,
            <surname>Cabezas</surname>
          </string-name>
          <string-name>
            <surname>B.</surname>
          </string-name>
          ,
          <string-name>
            <surname>A. R.</surname>
          </string-name>
          :
          <article-title>Toward CERIF-ScienTI cooperation and interoperability</article-title>
          . In:
          <string-name>
            <given-names>A.G.S.</given-names>
            <surname>Asserson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. J.</given-names>
            <surname>Simons</surname>
          </string-name>
          (Eds.) 8th
          <source>International Conference on Current Research Information Systems</source>
          . Leuven: Leuven University Press.
          <fpage>179</fpage>
          -
          <lpage>188</lpage>
          (
          <year>2006</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Singh</surname>
            ,
            <given-names>B. K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Verma</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thoke</surname>
            ,
            <given-names>A. S.:</given-names>
          </string-name>
          <article-title>Investigations on Impact of Feature Normalization Techniques on Classifier's Performance in Breast Tumor Classification</article-title>
          .
          <source>International Journal of Computer Applications</source>
          ,
          <volume>116</volume>
          (
          <issue>19</issue>
          ),
          <fpage>11</fpage>
          -
          <lpage>15</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Jayalakshmi</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Santhakumaran</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Statistical Normalization and Back Propagation for Classification</article-title>
          .
          <source>International Journal of Computer Theory and Engineering</source>
          ,
          <volume>3</volume>
          (
          <issue>1</issue>
          ),
          <fpage>1</fpage>
          -
          <lpage>5</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Golestani</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gras</surname>
          </string-name>
          , R.:
          <article-title>Can we predict the unpredictable? Scientific reports</article-title>
          ,
          <volume>4</volume>
          ,
          <issue>6834</issue>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Kawamura</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yamashita</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Matsumura</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <source>Research Activity Classification based on Time Series Bibliometrics. In 21st International Conference on Science and Technology Indicators</source>
          (
          <article-title>STI2016): Peripheries, frontiers and beyond</article-title>
          . Valencia, Spain,
          <fpage>1456</fpage>
          -
          <lpage>1460</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Schlaefer</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schröter</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fritsche</surname>
            ,
            <given-names>L.:</given-names>
          </string-name>
          <article-title>A case-based approach for the classification of medical time series</article-title>
          .
          <source>In International Symposium on Medical Data Analysis</source>
          . Springer Berlin Heidelberg. 258-
          <fpage>263</fpage>
          (
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Myers</surname>
            <given-names>CS</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rabiner</surname>
            <given-names>LR</given-names>
          </string-name>
          :
          <article-title>A comparative study of several dynamic time-warping algorithms for connected word recognition</article-title>
          .
          <source>The Bell System Technical Journal</source>
          <volume>60</volume>
          (
          <issue>7</issue>
          ):
          <fpage>1389</fpage>
          -
          <lpage>1409</lpage>
          (
          <year>1981</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Al-Naymat</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chawla</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Taheri</surname>
          </string-name>
          , J.:
          <string-name>
            <surname>Sparse</surname>
            <given-names>DTW</given-names>
          </string-name>
          :
          <article-title>a novel approach to speed up dynamic time warping</article-title>
          .
          <source>In Proceedings of the Eighth Australasian Data Mining Conference. Australian Computer Society</source>
          , Inc.,
          <volume>101</volume>
          ,
          <fpage>117</fpage>
          -
          <lpage>127</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Juran</surname>
            ,
            <given-names>J. M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Godfrey</surname>
            ,
            <given-names>A. B.</given-names>
          </string-name>
          :
          <article-title>Juran's Quality Handbook</article-title>
          . New York:
          <string-name>
            <surname>McGraw-Hill</surname>
          </string-name>
          .
          <article-title>5th Edi-tion</article-title>
          . (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Duarte</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weber</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pacheco</surname>
          </string-name>
          , R. C. S.:
          <article-title>Conceptual data model for research collaborators</article-title>
          .
          <source>In CIKI: VI International Conference on Knowledge and Innovation</source>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Valenduc</surname>
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vendramin</surname>
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pedaci</surname>
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Piersanti</surname>
            <given-names>M.:</given-names>
          </string-name>
          <article-title>Changing careers and trajectories. How individuals cope with organisational change and restructuring, WORKS re-port, HIVAK</article-title>
          . U. Leuven, Leuven. (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Dietz</surname>
            ,
            <given-names>J. S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chompalov</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bozeman</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lane</surname>
            ,
            <given-names>E. O. N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Park</surname>
          </string-name>
          , J.:
          <article-title>Using the curriculum vita to study the career paths of scientists and engineers: An exploratory assessment</article-title>
          .
          <source>Scientometrics</source>
          ,
          <volume>49</volume>
          (
          <issue>3</issue>
          ),
          <fpage>419</fpage>
          -
          <lpage>442</lpage>
          (
          <year>2000</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Bozeman</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>The impact of research collaboration on scientific productivity</article-title>
          .
          <source>Social studies of science</source>
          .
          <volume>35</volume>
          (
          <issue>5</issue>
          ),
          <fpage>673</fpage>
          -
          <lpage>702</lpage>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Unger</surname>
            ,
            <given-names>D. D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rumrill</surname>
            ,
            <given-names>P. D.:</given-names>
          </string-name>
          <article-title>An Assessment of Publication Productivity in Career Development and Transition for Exceptional Individuals 1978-2012</article-title>
          .
          <article-title>Career Development and Transition for Exceptional Individuals</article-title>
          ,
          <volume>36</volume>
          (
          <issue>1</issue>
          ),
          <fpage>25</fpage>
          -
          <lpage>30</lpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Stuart</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Metrics for an increasingly complicated information ecosystem</article-title>
          .
          <source>Online Information Review</source>
          ,
          <volume>39</volume>
          (
          <issue>6</issue>
          ),
          <fpage>848</fpage>
          -
          <lpage>854</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Stone</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Cross-validatory choice and assessment of statistical predictions</article-title>
          .
          <source>Journal of the Royal Statistical Society. Series B (Methodological)</source>
          ,
          <fpage>111</fpage>
          -
          <lpage>147</lpage>
          (
          <year>1974</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>