<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Answers in the World Values Survey</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Arsen Matej Golubovikj</string-name>
          <email>matej.golubovik@gmail.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Branko Kavšek</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marko Tkalčič</string-name>
          <email>marko.tkalcic@famnit.upr.si</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Koper</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Slovenia</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Jožef Stefan Institute, Department for Artificial Intelligence</institution>
          ,
          <addr-line>Jamova 39, SI-1000 Ljubljana</addr-line>
          ,
          <country country="SI">Slovenia</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of Primorska, Faculty of Mathematics</institution>
          ,
          <addr-line>Natural Sciences and Information Technologies, Glagoljaška 8</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>Questionnaire surveys are useful for many areas of science, in particular social sciences. Such surveys are often the prime means of gathering data directly from participants, however, they are prone to missing data, which could be caused by many reasons: (i) an error by survey administrators, (ii) participants not responding to certain questions, (iii) acts of nature and, (iv) etc. In order to keep the full survey sample, researchers must often use imputation to deal with the missing data problem. Methods for imputation can sometimes ofer reasonable estimates for the missing data, however, in the case of the survey: (i) imputation can add high noise to the data, (ii) imputation becomes unreliable when more than 40% of the data is missing. This work attempts to address these issues by evaluating if the usage of matrix completion methods stemming from collaborative filtering (CF) in recommender systems can yield more accurate imputations of survey data. The rationale for the usage of these methods is (i) the similarity between the problem framing, methods and data representation used in CF and survey imputation; (ii) the efectiveness of CF-based methods in recommender systems. We use data from the World Values Survey, a valuable dataset in social science of high volume and veracity, to compare (i) one simple approach to imputation, (ii) two established imputation approaches (iii) two CF matrix completion techniques. The results show that our chosen CF matrix completion techniques perform overall comparable, but not better than existing imputation techniques for the case of survey imputation. The matrix completion techniques, however, might prove useful in niche situations, such as in the imputation of non-ordinal question answers. The right technique for imputation often depends on the problem, these results beckon the consideration of CF-based techniques in future research on survey imputation.</p>
      </abstract>
      <kwd-group>
        <kwd>imputation</kwd>
        <kwd>survey</kwd>
        <kwd>matrix completion</kwd>
        <kwd>collaborative filtering</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        For many areas of science, in particular social sciences, questionnaires are an essential tool for
gathering data. The process of collecting data through questionnaires, called a survey [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], has
advantages, such as getting data directly from the participants, but also downsides, such as
missing data values [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. There are many reasons why data might be missing, (i) the survey
administrator/s made an error [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], (ii) participants might not answer all questions i.e. item
non-response [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], (iii) other reasons, e.g. acts of nature. No matter their cause, missing values
in questionnaire-acquired data must be dealt with before researchers can make inferences from
the data [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>
        A common approach to dealing with missing values is to delete all entries which contain
them [
        <xref ref-type="bibr" rid="ref10 ref17">10</xref>
        ]. The advantage of this deletion is its simplicity1[0], however, it forces the researcher
to operate on a partial dataset, which might produce misleading results1[0]. To operate on the
whole data, missing values must often be imputed i.e. filled in with replacement values. Often
used techniques for imputation in surveys include: (i) simple imputation12[], which replaces
missing data in a variable with its average or most frequent value (ii) hot-deck imputation12[],
which exploits the similarities between entries in the data to find suitable replacements (iii)
model based approaches [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], which model each variable based on the available data and fill in
missing values using the model for each variable.
      </p>
      <p>
        Existing imputation techniques have advantages, such as, allowing the user to operate on the
full data, however, they can have the issues of: (i) introduction of high noise to the dat1a0[]
and, (ii) in the survey case, inefectiveness when more than 40% of the data is missing1[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] (high
missingness). In our work, we address these issues by evaluating if the usage of alternative
imputation methods that are commonly used in recommender systems (RS), can yield more
accurate imputations of missing values, both in the case of low and high missingness. The
rationale for the usage of these methods is (i) the similarity of the problem framing between
questionnaires and RS, and (ii) the efectiveness of these methods in recommender systems.
      </p>
      <p>These similarities in problem framing, are most noticeable in collaborative filtering (CF) for
recommender systems. CF operates on a user-to-item ratings matrix that stores the opinion of
human users about given items, usually expressed as a scalar value called a rating (ex. 1-5 Likert
scale, where 1 is a very negative and 5 is a very positive ratting). Due to the large volume of
items in such systems, users are usually familiar with only a fraction of the items, consequently,
much of the entries in the ratings matrix are empty 1[], i.e. missing. The recommendation
is then done by filling these missing entries using solely data from this matrix, through a
process called matrix completion 1[], items with high predicted ratings, i.e. opinion, are then
recommended to the user. If we represent the questionnaire data as a matrix, where rows
represent participants, and columns represent questions, the problem of filling missing data is
now similar to the problem of matrix completion.</p>
      <p>
        This paper focuses on the comparison between matrix completion techniques and classical
survey imputation techniques, in the task of filling in missing answers in the World Values
Survey [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] - a highly valued dataset in the field of social science [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ].
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        A number of studies have utilized matrix completion and collaborative filtering outside of the
ifeld of recommender systems. Some of the fields which have used these techniques include
medicine[
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], bioinformatics[
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], image processing[5], infrastructure[
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] and security[
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. Many
of these fields find favorable results in the use of collaborative filtering for their specific problems,
especially when large amounts of data is missing. Moreover, the specific works of Saha et al.1[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]
and Li et al.[
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] have successfully utilized matrix completion in the imputation of DNA and
highway trafic-related data, respectfully.
      </p>
      <p>Two works examine the use of matrix completion in a broad imputation scenario: (i) Wang et al.
produce an ensemble-based imputation method, which includes an item-to-item collaborative
technique in the ensemble, they show that their ensemble method outperforms k-nearest
neighbors (KNN) imputation, on common datasets from the UCI (University of California Irvine)
data repository, however, do not evaluate the performance of the item to item collaborative
technique on its own; (ii) Chi and Li3[] examines the use of low-rank matrix completion for the
general role of imputation, they use synthetic data to show that low-rank matrix completion
techniques can operate under the statistical assumptions for missing data, utilized in imputation.</p>
      <p>
        In the case of survey imputation, the use of matrix completion is also highlighted in some
cases. Vozalis et. al.[17] test the usage of a user-based collaborative filtering technique in
the imputation of a small transportation survey consisting of univariate question answers on
the Likert (1-5) scale. They report a MAE (Mean Absolute Error) of 0.846 for this technique
when imputing data with 20% missing answers. Similarly, Oliveira et al.1[
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] compare matrix
factorization and item-to-item collaborative filtering techniques for the purpose of predicting
univariate Likert scale questionnaire responses in a large company survey. They find that, on
20% missing data, these techniques can distinguish between a positive and negative response
with an Area Under the Curve (AUC) score of at least 0.80 on the given data.
      </p>
      <p>Although there has been research using matrix completion on survey data, to the best of our
knowledge, there have been no attempts to compare the efectiveness of matrix completion
techniques and classical survey imputation techniques. In this work, we fill this gap by directly
comparing both approaches on the scenario of World Values Survey data.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Data Overview</title>
      <p>For the purpose of comparing the efectiveness of matrix completion and classical imputation
techniques in the case of missing survey data, we utilize data from the World Values Survey
(WVS). The WVS is an international research program devoted to the scientific and academic
study of social, political, economic, religious, and cultural values of people in the worl6d].[ In
our testing, we use a subset of the data from the WVS’s 7th wave (7th iteration) of the survey,
conducted across 57 countries in the years 2017-2021. This subset used in our testing contains
the answers of 84638 participants to 274 survey questions, covering topics such as: (i) ethical
values (ii) social values and perceptions (iii) political values, (iv) stances on various social and
political questions, (v) etc.</p>
      <p>The questions used in the WVS are closed questions, meaning that the participants respond
using a list of provided answers rather than articulating the answers themselves. Responses are
recorded as a number which denotes the participant’s choice from the list. The ranges of the
numbers used to record responses in the WVS are from 1 to the number of answers, e.g. for
ifve answers the answer range is 1 to 5. Among the questions in our subset, we find 8 question
answer ranges: ”1-2”, ”1-3”, ”1-4”, ”1-5”, ”1-7”, ”1-8”, ”1-10” and ”1-11”. Based on their range,
the questions answers in our subset can be divided into three categories: (i) Dichotomous
questions with binary (e.g. Yes/No) answers, questions in the ”1-2” range fall into this category,
(ii) Nominal-Polytomous - questions with a set of more than two answers with no inherent
ordering, in our subset questions on the ”1-3” range fall into this category, and (iii)
OrdinalPolytomous, questions with a set of more than two answers which in themselves contain an
ordering, in our subset all other ranges (”1-4” and up) fall into this category.</p>
      <p>For reasons mentioned in the methodology section of this paper, we also retain data on the
participant’s country of origin in our testing subset.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Methodology</title>
      <p>The flow of our methodology from the data preparation step to the final comparisons between
approaches is presented in Figure1.</p>
      <p>The data utilized in our testing is described in Section 3. Among the three types of survey
question answers, observed in section 3, i.e. (i) Dichotomous, (ii) Nominal-Polytomous, and
(iii) Ordinal-Polytomous questions, we find two imputation tasks, namely, a regression task
and a classification task. Ordinal-polytomous answers are handled using regression, while
classification is used to handle answers to dichotomous and nominal-polytomous questions.</p>
      <p>In both tasks, we compare the efectiveness of matrix completion and classical imputation
approaches on the testing data for the specific task. The approaches remain the same in both
tasks, only they are adjusted to fit the problem (classification or regression).</p>
      <p>Three classical imputation approaches are considered: (i) simple imputation, which serves
as a baseline, it imputes the mean value in the regression case and the mode value in the
classification case; (ii) k-nearest neighbors (KNN) imputation, a hot-deck approach, in the
regression case it uses weighted mean resolution to impute from the neighborhood, while mode
resolution is used in the classification case; (iii) model based imputation, which performs initial
simple imputation then imputes utilizing one regressor per feature, it uses linear regression
with initial mean imputation for the regression task and a bayesian ridge regressor with initial
mode imputation in the classification task.</p>
      <p>The classical imputation approaches are compared to two matrix completion techniques: (i)
item-to-item CF, which, similarly to KNN, uses weighted mean resolution among similar items
in the regression task and mode resolution among similar items in the classification case; (ii)
non-negative matrix factorization, refined by a Decision Tree regressor and classifier in the
regression and classification tasks respectively.</p>
      <p>The non-negative matrix factorization is refined by a decision tree in the following way. Let
 be a column of the original matrix and let’ be the estimation of in the resulting matrix
from matrix factorization, for each pair of and ’ we use the available data in to train a
Decision Tree which predicts from ’ and use this model to predict the remaining missing
answers in from ’ .</p>
      <p>To compare the five approaches described above, for each task, we simulate varying degrees
of missingness in the data, from 10% to 50%, and evaluate their performance in imputing the data.
For regressors, we evaluate Mean Absolute Error (MAE) and Mean Squared error (MSE), while
classifiers are evaluated with their accuracy, precision, recall, and F1 scores. The simulation and
evaluation is done through an augmented cross-validation technique. A comparison between
ordinary and augmented cross-validation is given in Figu2re.</p>
      <p>
        To cater our imputation to the data, hence producing more robust results, we perform all
imputations per country separately. We impute per country since, if the survey data contains
clusters, such as those born of demographics, better imputation results are achieved if data is
imputed for each cluster separately [
        <xref ref-type="bibr" rid="ref15 ref8">8, 15</xref>
        ], moreover, in international surveys, an often taken
Classification
      </p>
      <p>Dichotomous and
Nominal-Polytomous</p>
      <p>Questions
Regression</p>
      <p>Ordinal-Polytomous</p>
      <p>Questions
and efective approach is imputing answers for each country separately [18].</p>
    </sec>
    <sec id="sec-5">
      <title>5. Results</title>
      <p>5.1.</p>
      <sec id="sec-5-1">
        <title>Regression</title>
        <p>ifnally across all magnitudes of missingness tested (from 10% to 50%). For the task of regression,
all values are scaled from to the 1 to 3 scale.</p>
        <p>We keep this scale in our final results to
make sense
of average error in the</p>
        <p>multivariate scenario. The best-performing imputer is marked in bold.</p>
        <sec id="sec-5-1-1">
          <title>Metric per % MAE MSE</title>
        </sec>
        <sec id="sec-5-1-2">
          <title>Matrix</title>
        </sec>
        <sec id="sec-5-1-3">
          <title>Factorization (w/ Decision Tree) 0.4120 0.3713</title>
        </sec>
      </sec>
      <sec id="sec-5-2">
        <title>5.2. Classification</title>
        <p>The results for Accuracy, F1 Score, Precision, and Recall in the classification task are given in
Table 2. Similarly, as in the regression case, the evaluation statistics presented are the average
across all questions that fall under this task, as well as over all countries and magnitudes of
missingness tested. The best-performing technique for each evaluation statistic is marked in
bold.</p>
        <p>From Table 2 we can see that the mode per country is a powerful predictor in the case of
the classification task. This implies that the data is unbalanced, hence, the F1 score, Precision,
and Recall are better indicators of the performance in this imputation task. Since the F1 score
is a balanced measure between Precision and Recall, we will use it as the prime metric for
comparison in the case of classification.</p>
        <sec id="sec-5-2-1">
          <title>Metric per %</title>
        </sec>
        <sec id="sec-5-2-2">
          <title>Accuracy F1</title>
        </sec>
        <sec id="sec-5-2-3">
          <title>Precision</title>
        </sec>
        <sec id="sec-5-2-4">
          <title>Recall</title>
        </sec>
        <sec id="sec-5-2-5">
          <title>Matrix</title>
        </sec>
        <sec id="sec-5-2-6">
          <title>Factorization (w/ Decision Tree) 0.6940</title>
        </sec>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. Discussion and Conclusion</title>
      <p>The results show that our chosen CF matrix completion techniques perform overall comparable,
but not better than existing imputation techniques for the case of survey imputation. The matrix
completion techniques, however, might prove useful in niche situations highlighted in the
results. Item-to-item collaborative filtering performs comparable to the KNN technique in both
imputation tasks, only failing to match it on high ratios of missing data in the classification case.
On the other hand, item-to-item fails to compare to model-based imputation in the regression,
however, performs better than it in the classification task. Moreover, the results show that the
matrix factorization technique ofers poor performance in terms of MSE in the regression case,
failing to match both existing imputation techniques, however, in the case of classification it
outperforms all techniques tested with its F1 performance on unbalanced data.</p>
      <p>In comparison with our related work, we achieve similar results to Vozalis et. al1.7[] for MAE
in terms of matrix completion, his MAE of 0.846 on univariate 1 to 5 data is comparable to our
0.40 MAE on the scale of 1 to 3, achieved under multivariate data. This raises the question of
whether the scale afects the matrix completion techniques, collaborative filtering techniques in
recommender systems usually operate on ratings all on the same scale. Can alterations of these
techniques to fit multivariate data, be more beneficial in future work in survey imputation?</p>
      <p>We also note that the nature of the data might afect the results, for example, the
modelbased imputer performs initial mean imputation before building its models, therefore the high
performance of the model-based imputer in the regression task may be due to the power of
mean imputation in our data. Future work might compare matrix completion and classical
imputation techniques on a larger range of survey data.</p>
      <p>Future work on this subject should also consider these techniques in diferent scenarios, as
well as, examine the efects that these techniques have on the statistical inference. Moreover,
our study included only simple techniques for matrix completion, CF techniques are vast and
varied, and other techniques might succeed where we have failed, the considerations of such
techniques in the study of imputation may also prove fruitful in future work.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>C. C.</given-names>
            <surname>Aggarwal</surname>
          </string-name>
          .
          <source>Recommender Systems</source>
          . Springer International Publishing,
          <year>2016i</year>
          .
          <source>sbn: 978-3-319-29657-9</source>
          . doi:
          <volume>10</volume>
          .1007/978-3-
          <fpage>319</fpage>
          -29659-3.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>S.</given-names>
            <surname>Buuren</surname>
          </string-name>
          .
          <article-title>Flexible Imputation of Missing Data</article-title>
          . 2nd ed. New York: Chapman and Hall/CRC,
          <year>2018</year>
          . isbn:
          <fpage>978</fpage>
          -0-
          <fpage>429</fpage>
          -49225-9. doi:
          <volume>10</volume>
          .1201/9780429492259.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>E. C.</given-names>
            <surname>Chi</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Li</surname>
          </string-name>
          . “
          <article-title>Matrix completion from a computational statistics perspective”</article-title>
          .
          <source>In: WIREs Comp Stat 11.5</source>
          (
          <year>2019</year>
          ). issn:
          <fpage>1939</fpage>
          -
          <lpage>5108</lpage>
          ,
          <fpage>1939</fpage>
          -
          <lpage>0068</lpage>
          . doi:
          <volume>10</volume>
          .1002/wics.1469.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>R. M.</given-names>
            <surname>Groves</surname>
          </string-name>
          , ed. Survey methodology. 2nd ed. Wiley series in survey methodology. Wiley,
          <year>2009</year>
          . 461 pp.
          <source>isbn: 978-0-470-46546-2.</source>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>N. Gupta</surname>
            <given-names>K.</given-names>
          </string-name>
          and Goyal and
          <string-name>
            <given-names>H.</given-names>
            <surname>Khatter</surname>
          </string-name>
          . “
          <article-title>Optimal reduction of noise in image processing using collaborative inpainting filtering with Pillar K-Mean clustering”</article-title>
          .
          <source>In:The Imaging Science Journal 67.2</source>
          (
          <issue>2019</issue>
          ), pp.
          <fpage>100</fpage>
          -
          <lpage>114</lpage>
          . issn:
          <fpage>1368</fpage>
          -
          <lpage>2199</lpage>
          ,
          <fpage>1743</fpage>
          -
          <lpage>131X</lpage>
          . doi:
          <volume>10</volume>
          .1080/13682199.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>C.</given-names>
            <surname>Haerpfer</surname>
          </string-name>
          et al.
          <source>World Values Survey Wave</source>
          <volume>7</volume>
          (
          <fpage>2017</fpage>
          <string-name>
            <surname>-2022) Cross-National</surname>
          </string-name>
          Data-Set.
          <article-title>In collab. with K. Kizilova</article-title>
          et al.
          <source>Version Number: 4.0</source>
          .0 Type: dataset.
          <year>2022</year>
          .doi:
          <volume>10</volume>
          .14281/18241.18.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>F.</given-names>
            <surname>Hao</surname>
          </string-name>
          and
          <string-name>
            <given-names>R. H.</given-names>
            <surname>Blair</surname>
          </string-name>
          . “
          <article-title>A comparative study: Classification vs. user-based collaborative ifltering for clinical prediction”</article-title>
          .
          <source>In: BMC Medical Research Methodology 16.1</source>
          (
          <year>2016</year>
          ). issn:
          <fpage>1471</fpage>
          -
          <lpage>2288</lpage>
          . doi:
          <volume>10</volume>
          .1186/s12874-016-0261-9.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>N.</given-names>
            <surname>Karmitsa</surname>
          </string-name>
          et al. “
          <article-title>Missing Value Imputation via Clusterwise Linear Regression”</article-title>
          .
          <source>InI E:EE Transactions on Knowledge and Data Engineering</source>
          <volume>34</volume>
          .4 (
          <year>2022</year>
          ).
          <source>Conference Name: IEEE Transactions on Knowledge and Data Engineering</source>
          , pp.
          <fpage>1889</fpage>
          -
          <lpage>190i1s</lpage>
          .sn:
          <fpage>1558</fpage>
          -
          <lpage>2191</lpage>
          . doi:
          <volume>10</volume>
          .1109/TKDE.
          <year>2020</year>
          .
          <volume>3001694</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>L.</given-names>
            <surname>Li</surname>
          </string-name>
          et al. “
          <article-title>Missing value imputation for trafic-related time series data based on a multiview learning method”</article-title>
          .
          <source>In:IEEE Transactions on Intelligent Transportation Systems 20.8</source>
          (
          <issue>2019</issue>
          ), pp.
          <fpage>2933</fpage>
          -
          <lpage>2943</lpage>
          . issn:
          <fpage>1524</fpage>
          -
          <lpage>9050</lpage>
          . doi:
          <volume>10</volume>
          .1109/TITS.
          <year>2018</year>
          .
          <volume>2869768</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>R. J. A.</given-names>
            <surname>Little</surname>
          </string-name>
          and
          <string-name>
            <given-names>D. B.</given-names>
            <surname>Rubin</surname>
          </string-name>
          .
          <article-title>Statistical Analysis with Missing Data</article-title>
          .
          <year>3rd</year>
          . John Wiley &amp; Sons,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>S. G.</given-names>
            <surname>Ludeke</surname>
          </string-name>
          and
          <string-name>
            <surname>E. G. Larsen.</surname>
          </string-name>
          “
          <article-title>Problems with the Big Five assessment in the World Values Survey”</article-title>
          .
          <source>In: Personality and Individual Diferences</source>
          <volume>112</volume>
          (
          <year>2017</year>
          ), pp.
          <fpage>103</fpage>
          -
          <lpage>105</lpage>
          . issn:
          <fpage>0191</fpage>
          -
          <lpage>8869</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.paid.
          <year>2017</year>
          .
          <volume>02</volume>
          .042.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>A.</given-names>
            <surname>Mirzaei</surname>
          </string-name>
          et al. “
          <article-title>Missing data in surveys: Key concepts, approaches, and applications”</article-title>
          .
          <source>In: Research in Social and Administrative Pharmacy 18.2</source>
          (
          <issue>2022</issue>
          ), pp.
          <fpage>2308</fpage>
          -
          <lpage>2316</lpage>
          . issn:
          <volume>15517411</volume>
          . doi:
          <volume>10</volume>
          .1016/j.sapharm.
          <year>2021</year>
          .
          <volume>03</volume>
          .009.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>R. M. Rodríguez</surname>
          </string-name>
          et al. “
          <article-title>Using collaborative filtering for dealing with missing values in nuclear safeguards evaluation”</article-title>
          .
          <source>In:International Journal of Uncertainty, Fuzziness and Knowlege-Based Systems 18.4</source>
          (
          <issue>2010</issue>
          ), pp.
          <fpage>431</fpage>
          -
          <lpage>449</lpage>
          . issn:
          <fpage>0218</fpage>
          -
          <lpage>4885</lpage>
          . doi:
          <volume>10</volume>
          . 1142 / S0218488510006635.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>S.</given-names>
            <surname>Saha</surname>
          </string-name>
          et al. “
          <article-title>Missing value imputation in DNA microarray gene expression data: A comparative study of an improved collaborative filtering method with decision tree based approach”</article-title>
          .
          <source>In: International Journal of Computational Science and Engineering 18.2</source>
          (
          <issue>2019</issue>
          ), pp.
          <fpage>130</fpage>
          -
          <lpage>139</lpage>
          . issn:
          <fpage>1742</fpage>
          -
          <lpage>7185</lpage>
          . doi:
          <volume>10</volume>
          .1504/IJCSE.
          <year>2019</year>
          .
          <volume>097954</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>J.</given-names>
            <surname>Shao</surname>
          </string-name>
          and
          <string-name>
            <given-names>H.</given-names>
            <surname>Wang</surname>
          </string-name>
          . “
          <article-title>Sample Correlation Coeficients Based on Survey Data Under Regression Imputation”</article-title>
          .
          <source>In:Journal of the American Statistical Association</source>
          <volume>97</volume>
          .458 (
          <year>2002</year>
          ), pp.
          <fpage>544</fpage>
          -
          <lpage>552</lpage>
          . doi:
          <volume>10</volume>
          .1198/016214502760047078.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16] [17] [18]
          <string-name>
            <given-names>G. N.</given-names>
            <surname>Singh</surname>
          </string-name>
          et al. “
          <article-title>Some imputation methods for missing data in sample surveys”</article-title>
          .
          <source>In: Hacettepe Journal of Mathematics and Statistics 45.6</source>
          (
          <issue>2016</issue>
          ), pp.
          <fpage>1865</fpage>
          -
          <lpage>1880</lpage>
          . issn:
          <fpage>2651</fpage>
          -
          <lpage>477X</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <source>doi: 10</source>
          .15672/HJMS.20159714095.
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <given-names>M.</given-names>
            <surname>Vozalis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Basbas</surname>
          </string-name>
          ,
          <string-name>
            <surname>and I. Politis.</surname>
          </string-name>
          “
          <article-title>Applying Collaborative Filtering Techniques In Transportation Surveys”</article-title>
          .
          <source>In:1st International Conference on Engineering and Applied Sciences Optimization</source>
          .
          <year>2014</year>
          , pp.
          <fpage>1630</fpage>
          -
          <lpage>1638</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <given-names>M.</given-names>
            <surname>Weber</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Denk</surname>
          </string-name>
          .
          <source>Imputation of Cross-Country Time Series: Techniques and Evaluation</source>
          .
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>