=Paper=
{{Paper
|id=Vol-1964/CS1
|storemode=property
|title=Automated IQ Estimation from Writing Samples
|pdfUrl=https://ceur-ws.org/Vol-1964/CS1.pdf
|volume=Vol-1964
|authors=Austin Hendrix,Roman Yampolskiy
|dblpUrl=https://dblp.org/rec/conf/maics/HendrixY17
}}
==Automated IQ Estimation from Writing Samples==
Austin Hendrix and Roman Yampolskiy MAICS 2017 pp. 3–7 Automated IQ Estimation from Writing Samples Austin Hendrix, Roman Yampolskiy University of Louisville Louisville, KY 40208 austin.hendrix@louisville.edu, roman.yampolskiy@louisville.edu Abstract demonstrates certain writing trends are independent of the authors’ language and are therefore likely stronger The primary focus of this research is to introduce a candidates for comparing authors that write in different method of measuring an individual’s IQ by analyzing languages. the vocabulary in said individual’s writing. In this paper, we show that the ratio of SAT words in a As there is no true scientific measurement that is dataset of writing samples is roughly normally currently used to quantify someone’s intelligence, many distributed, though with an obvious left skew. We go different measurements have been used. Intelligence tests on to show a method that can be used to calculate an have often been a common way to determine an individual’s individual’s IQ with this ratio and provide samples intelligence relative to others. There have been many with measured accuracy. The conclusion suggests negative and controversial opinions on these tests, yet ways to increase accuracy in order to further develop experts still agree on their overall usefulness (Snyderman, the research along with applications of doing so. Rothman; 1989). Further studies have shown that a standard intelligence test provides the best single, reliable predicator 1 Introduction of academic aptitude (Bullerdieck, 1985). One popular example of standard intelligence tests measures an Stylometry is the statistical analysis of differences in individual’s Intelligence Quotient (IQ). The assumption literature between authors (Franking, 1988). As early as behind this system of measurement is that if a large sample 1880, the study of stylometry has been used as a method of of IQs are mapped together, the distribution will be normal. authorship identification on disputed texts. With the It has been shown that there are issues with the structure and development of computers and automation techniques, quality of the standard IQ test (Lawler, 1977). Still, the IQ sylometric analysis has become easier. An early example of test can be a useful way for individuals to compare software defined stylometry was used to identify the author intelligence. For this paper, we will act under the of the disputed papers amongst the “Federalist Papers” assumption that an individual’s IQ score relates directly to (Tweedie, Singh, Holmes; 1996). This work demonstrated their true intelligence level. that stylometric analysis using automation is, at least in this application, able to draw similar conclusions about the This preliminary research project is focused on authorship of these papers as previous work on the subject. exploring whether an individual’s IQ can be determined by In recent years, stylometry has taken on a broad range of using software defined stylometry. The novelty of this applications. More specifically, stylometry has been used in process is that it is not centered around author identification. the identification of chat bots (Ali, Hindi, Yampolskiy; Instead, stylometry will be used to determine the relative 2011). Further research was done to show that when a chat writing quality of a known author. The process will involve bot changes behavior over time, the stylometry approach analyzing an attribute of a known author’s writing to becomes more difficult (Ali, Schaeffer, Yampolskiy; 2012). determine said author’s IQ. There are multiple attributes of In addition, it has been demonstrated that stylometric author writing that are potential candidates for this application. For identification processes can be used on a single author that the beginning of this research, we will focus on the is capable of writing in multiple languages. (Ali, individual in question’s vocabulary. Other research has been Yampolskiy, 2014). This is significant in that it done to discuss other attributes with possible merit. These 3 Automated IQ Estimation from Writing Samples pp. 3–7 attributes include, but are not limited to, word-length, Now that we have a clearly defined a process for calculating syllables, sentence-length, and distribution of parts of the CWR of a sample, we need to execute this software on a speech (Holmes, 1994). large dataset. An ideal dataset would consist of writing samples by many randomly selected individuals. Along with 2 Collegiate Word Ratio this, each writing sample would represent each individual’s average writing ability. As such a dataset was not available To determine an individual’s IQ based on their vocabulary, to the authors of this project, another source had to be a quantitative way to measure the quality of their vocabulary found. is necessary. For the purposes of this project, we will define a “Collegiate Word” as a word the SAT considers a part of The Common Crawl is a corpus containing raw strong vocabulary usage.1 The College Word Ratio (CWR), web page data, extracted metadata and text extractions.2 The which we will refer to through this paper, is therefore text extractions from this corpus contain the raw text taken defined as: directly from websites. We are acting under the assumptions that the text extractions are all written by humans and likely Collegiate Word Ratio = Collegiate Word Count contain that individual’s average writing. To help increase / Total Word Count the accuracy of results under this assumption, only samples with more than 100 words were used. After collecting a The CWR of each sample will be measured by software and large number of samples from the Common Crawl corpus, then compared to the rest of the samples to determine its each sample’s CWR was stored and mapped onto a relative quality by use of a distribution. A pseudo-code for distribution (Figure 2). The distribution is fairly normal, calculating the CWR of a sample is shown in Figure 1. though there is a slight skew to the left. This implies that on for SampleWord in Sample: a large number of samples, the distribution of CWR is fairly for CollegiateWord in CollegiateWordList: normal and resembles the distribution of IQs. if SampleWord == CollegiateWord: CollegiateWordCount++ CollegiateWordRatio = (CollegiateWordCount / SampleWordCount) Figure 2: Calculating CWR Pseudo-code 0 0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 0.18 0.2 Collegiate Word Ratio Figure 1: Collegiate Word Ratio Distribution 1 The full list of words used for this project can be found at 2 www.freevocabulary.com. https://aws.amazon.com/public-datasets/common-crawl/ 4 Austin Hendrix and Roman Yampolskiy MAICS 2017 pp. 3–7 55 65 75 85 95 105 115 125 135 145 155 0 0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 0.18 CWR Curve IQ Curve Figure 3: CWR VS IQ Distribution 3 Determining IQ from CWR mean of both distributions. For the IQ curve, these are fixed values. The mean IQ value of all individuals is said to be We have shown the distribution of CWR is fairly normal, 100 and the standard deviation of all IQ values is said to be and now we will demonstrate the process of using CWR to 15. For our data set, the mean CWR is 0.074759005 and the calculate an individual’s IQ. A graph showing these two standard deviation is 0.031552108. distributions overlaid is located below (Figure 3). Using these values and an induvial data point’s The IQ curve shown is the ideal expected IQ CWR, a corresponding IQ score can be calculated. distribution. It is perfectly normal with a mean of 100. The Performing this calculation involves finding the z-score of CWR distribution, though skewed slightly left, is mapped the data point. This is done by the following: very closely to the IQ distribution for the second and third positive standard deviation from the mean. For the purposes Z-Score = (CWR Data Point – CWR Mean) / of this analysis, we will assume that this indicates the CWR CWR Standard Deviation in this area will map onto its corresponding IQ. This will result in a certain amount of error when calculating IQ from CWR. Nevertheless, the distributions are close enough that the process should give a good estimation of an individual’s IQ. To begin the process of transferring between the two curves, we need to know the standard deviation and Sample World Sample Collegiate Word Sample Expected Measured % Length Count CWR IQ IQ Error 752 94 0.1250 153 123.88 19.03 412 51 0.1238 130 123.31 5.15 136 22 0.1618 141 141.36 0.26 3279 433 0.1321 129 127.24 1.36 Table 1: Predicted VS Measured IQ 5 Automated IQ Estimation from Writing Samples pp. 3–7 This z-score represents the number of standard deviations, This research paper is intended to be purely positive or negative, that the data point is away from the preliminary and simply introduce the concept and one mean. Since we know the standard deviation and mean of all possible implementation of using an individual’s vocabulary IQ scores, the corresponding IQ can be calculated as to determine IQ levels. To further develop this research, the follows: authors suggest a larger dataset be used to create a more accurate distribution. In addition, a more reliable dataset is Corresponding IQ = (Z-Score * IQ Standard Deviation) necessary to test the accuracy of these methods. For the + IQ Mean strongest possible results, self-reported IQ scores should not 4 Testing IQ Estimation Software be used. Ideally, the next stage in research will include an IQ test along with a specific writing prompt on which to run Now that a sample of writing can be used to determine the our software. Lastly, there is likely merit in exploring the IQ of an individual from their CWR, we must determine if analysis of the other attributes of writing that are mentioned the IQ is accurate. The process of doing this is at the introduction to this piece. It is possible that one or all straightforward, though difficult to accomplish. For it to be of these attributes may provide a better avenue for reliably said that CWR can be used to calculate an calculating an individual’s intelligence level. individual’s IQ, we must find multiple individuals with a known IQ and access to writing that is their own. The The ability to analyze the intelligence of pseudo-code for the software used to map the CWR of individuals is a very useful tool. It has been shown in samples on to a corresponding IQ is shown in Figure 4. previous research that numerous factors influence whether an intellectually gifted child will ultimately lead a Using social media contacts, we located several successful life (Tomlinson-Keasey, Little; 1990). Earlier individuals willing to give their IQ and a sample of their identification of these children, through application of this research, has the potential to allow these children to be Sample_Z_Score = (CWR_Sample – CWR_Mean) / guided down a positive path that will lead to their personal CWR_Standard_Deviation success. In addition, this research could play a role in Sample_IQ = (Sample_Z_Score * evaluating the abilities of persons currently prominent in the IQ_Standard_Deviation) + IQ_Mean political and scientific realms. Nevertheless, further research must be done in this area of study before anything truly Figure 4: Calculating IQ Pseudo-code conclusive can be said. writing for the purposes of testing our software. It should 6 References be noted that there is no external verification that these individuals gave an accurate IQ, but these samples are a Ali, Nawaf; Hindi, Musa; Yampolskiy, Roman. (2011). good starting point for testing the reliability of this software. Evaluation of authorship attribution software on a The data collected from these samples is shown in Table 1. Chat bot corpus. Information, Communication and Regardless of the large error in the first sample, the Automation Technologies (ICAT), 2011 XXIII accuracy of the rest of the samples provide support for this International Symposium on, IEEE. approach for calculating IQ. Ali, Nawaf; Schaeffer, Derek; Yampolskiy, Roman. (2012). 5 Conclusions and Future Work Linguistic Profiling and Behavioral Drift in Chat Bots. MAICS, 27-30. Though our first sample produced a result with a moderate error, there still seems to be merit to looking further into this Ali, Nawaf; Yampolskiy, Roman. (2014). BLN-Gram-TF- methodology. It should be noted that the samples used were ITF as a Language Independent Feature for approximately 2 standard deviations above the mean. Authorship Identification and Paragraph Similarity. Further sampling should include data on both ends of the 9th Cyber and Information Science research curve. There may not ultimately be a cause effect Conference, Oak Ridge, Tennessee. relationship between intelligence level and vocabulary Bullerdieck, K. Kelly McK. (1985). Considerations in usage, but this research does indicate the two are correlated. Defining the Gifted. The normality of the distribution of CWR may be http://journals.sagepub.com/doi/abs/10.1177/10762 significant in other applications, and should be noted 1758500800607 regardless of the final merits of this approach to calculating intelligence. 6 Austin Hendrix and Roman Yampolskiy MAICS 2017 pp. 3–7 Franking, Holly. (1988). Stylometry: A statistical method Snyderman, M., & Rothman, S. (1987). Survey of expert for determining authorship, textual integrity, and opinion on intelligence and aptitude testing. chronology. University of Kansas, ProQuest American Psychologist, 42(2), 137-144. Dissertations Publishing. Tomlinson-Keasey, Carol; Little, Todd D. (1990). Holmes, D. (1994). Authorship Attribution. Computers and Predicting educational attainment, occupational the Humanities, 28(2), 87-106. achievement, intellectual skill, and personal http://www.jstor.org/stable/30200315 adjustment among gifted men and women. Journal of Educational Psychology, vol. 82(3), pp. 442- Lawler, James. (1977). IQ: Biological Fact or 455. Methodological Construct? Science & Society, vol. 41, no. 2, pp. 208–218. Tweedie, F., Singh, S., & Holmes, D. (1996). Neural www.jstor.org/stable/40402014. Network Applications in Stylometry: The "Federalist Papers" Computers and the Humanities, 30(1), 1-10. Retrieved from http://www.jstor.org/stable/30204514 7