<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Personality Mining from Biographical Data with the &quot;Adjectival Marker &quot; Technique</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Shivani</forename><surname>Poddar</surname></persName>
							<email>shivani.poddar92@gmail.com</email>
							<affiliation key="aff0">
								<orgName type="department">Center for Exact Humanities</orgName>
								<orgName type="institution">IIIT Hyderabad</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Venumadhav</forename><surname>Kattagoni</surname></persName>
							<email>venumadhav.kattagoni@gmail.com</email>
							<affiliation key="aff0">
								<orgName type="department">Center for Exact Humanities</orgName>
								<orgName type="institution">IIIT Hyderabad</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Navjyoti</forename><surname>Singh</surname></persName>
							<email>singh.navjyoti@gmail.com</email>
							<affiliation key="aff0">
								<orgName type="department">Center for Exact Humanities</orgName>
								<orgName type="institution">IIIT Hyderabad</orgName>
							</affiliation>
						</author>
						<title level="a" type="main">Personality Mining from Biographical Data with the &quot;Adjectival Marker &quot; Technique</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">E44FA4D21319CDD83657B8091C791AAD</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T17:12+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Social Computing</term>
					<term>Psychology</term>
					<term>User Personality Determination</term>
					<term>Natural Language Processing</term>
					<term>Machine Learning</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The last decade has witnessed significant work in personality mining from lexical cues in social media data. Not much work has yet been undertaken in extracting these lexical cues from biographical data populating social media. Most of this work involves a large crowd of researchers leveraging dictionary-based approaches such as LIWC (which primarily focus on function words). By means of this paper we intend to introduce a novel method of personality mining from social media data called "Adjectival-marker Technique". This method involves extracting lexical features from descriptive texts (e.g. biographical data) to train a learning model, so as to predict the respective personality traits of the subject. Conceptually, it draws heavily from the last 78 years of work in lexical psychology and the Big Five personality test. However, it is not only a computational variant of the primordial theories of lexical psychology, but is also competent in conferring a substantial accuracy of personality prediction, matching that obtained by psychometric tests. In this study, we propose a variant of the Lexical Hypothesis from psychology. This modified hypothesis is validated by the computational results of personality prediction achieved by the Adjectival Marker Technique discussed below. The paper also discusses some insights illustrating the coherence of people's judgments about the subject's personality (virtual personality). The average accuracy (i.e. matching that achieved by psychometric tests for Big 5) for prediction approximated to Extraversion -82.82% Agreeableness -89.62%, Conscientiousness -92.48% and Imaginativeness/Intellect -81.67%.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>1. Introduction 1.1. Motivation Social Media has become the most abundantly used means of communicating and propagating information online. Most information here is extensively descriptive of the users who channel themselves through it. It is not only the user who gives away information about himself <ref type="bibr" target="#b9">(Goldbeck et al, 2011)</ref>, but also his peers <ref type="bibr">(Staiano et al, 2012)</ref>. This paper mainly unravels how the latter approach is nearly an absolutely accurate predictor of certain personality traits. The judgements of not only peers but of people who know us remotely over time can be an important window into solving the labyrinth of our personalities. The future of social media will witness individuals choosing workplaces, friends, books, movies, products etc, in synchrony with their own personalities. The tomorrow of the advertising industry will witness a transformation from "spammers" to "personalized suggestors". This has also been cited in various discussions wherein advertisers are advised to study personalities instead of demographics (documented in the paper personalized persuasion, <ref type="bibr" target="#b14">(Jacob, 2012)</ref>). The aforementioned applications are just a tip of the iceberg. Relationships have been discovered between personality and psychological disorders, job performance <ref type="bibr" target="#b6">(Digman et al, 1990)</ref> and satisfaction <ref type="bibr" target="#b17">(John et al, 1990)</ref>, and even romantic success. An extremely dynamic field of study which also benefits from the research in the area of Human Computer Interaction (HCI) is interface design. Many interface designing projects revolve around modelling interfaces based on people's personality oriented preferences. This study, thus, aims to contribute to bridge this gap between biographical data and personality research. We also attempt to expedite the process of personality prediction, making it more automated instead of relying heavily on psychometric tests written by the subject.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.2.">The Big Five Personality Model</head><p>There have been several personality models (The Big Three, The Big Five, The Alternative Five, etc.) that claim to encapsulate the traits that need to be summoned so as to effectively predict user personalities from social media data. However, out of all these models, the most robust and tested model, which has been consistent for the last few decades, is the Big Five (Big5) model <ref type="bibr" target="#b11">(Goldberg et al, 1992)</ref>. This personality model, being one of the most supported in lexical psychology research, stood out as being most resilient to carry out research of biographical social media resources <ref type="bibr" target="#b31">(Saucier et al, 1996)</ref>. Another one of the instrumental personality theories that has spawned the landscape of personality models is the set proposed by Carl Jung (Myers-Briggs Type Indicator (MBTI), Socionics, <ref type="bibr">Kiersey et al, 1921)</ref>. Following the paucity of data (for evaluating our model) available for personality determination via reliable psychometric tests for the Big 5 model, we decided to refer to a publicly published research dataset,<ref type="foot" target="#foot_0">1</ref> that abundantly provided us with the MBTI personalities for people. So as to bridge this gap between the MBTI (for personalities which needed to be used for evaluation) and Big 5 (the personalities which were being predicted by our model), <ref type="bibr" target="#b3">(Capraro et al (2002)</ref>, <ref type="bibr" target="#b8">Furnham et al (1996)</ref>, <ref type="bibr" target="#b23">McCrae et al (1989)</ref>) we used correlations shown in Tables <ref type="table" target="#tab_1">1 and 2</ref>. Thus, one of the major motivations of this paper is also to draw the most effective traits (namely: Extraversion, Agreeableness, Conscientiousness and Imaginativeness) from the intersection of these two instrumental paradigms of personality qualifiers. Hence, in scope of this study, the traits we predict are Extraversion, Agreeableness, Conscientiousness and Imaginativeness/Intellect.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.3.">Motivation for using Biographical Data</head><p>This research builds on the confluence of two major domains, the primordial theories of the lexical hypothesis and the recent computational techniques of data modeling. Allport's personality trait names <ref type="bibr" target="#b0">(Allport et al, 1936)</ref> lead to the creation of Goldberg's adjective marker <ref type="bibr" target="#b11">(Goldberg et al, 1992)</ref> and have ignited various studies. <ref type="bibr" target="#b10">Goldberg et al (1990)</ref>, <ref type="bibr" target="#b6">Digman et al (1990)</ref>, <ref type="bibr" target="#b17">John et al (1990)</ref>, <ref type="bibr" target="#b26">Ostendoff et al (1990)</ref> built on the same foundation. All of these converge at a single point that cites a "descriptive", "adjectival" lexicon to be the key into a person's personality. Social media today is littered with biographical or descriptive content of its over 1.4 billion users. Tapping this reservoir of content by the principles and techniques discussed below, the paper aims at unveiling a substantial part of this personality descriptive content.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.4.">Proposed modification in the "Lexical Hypothesis</head><p>of Psychology" The theories of psychology were influenced by various revolutionary concepts, for instance, "trait" -a theoretical construct which describes a basic dimension of a person's personality <ref type="bibr">(Allport, 1937)</ref>. The idea of trait gave birth to the "Lexical Hypothesis of Psychology". The initial direction of this paper was solely governed by this exact hypothesis (worked upon by <ref type="bibr">Klages, 1926</ref><ref type="bibr">Klages, /1932;;</ref><ref type="bibr" target="#b4">Cattell, 1943;</ref><ref type="bibr" target="#b24">Norman, 1963;</ref><ref type="bibr">Goldberg, 1982)</ref> -"Those individual differences that are most salient and socially relevant in people's lives will eventually become encoded into their language; the more important such a difference, the more likely is it to become expressed as a single word."</p><p>The Lexical Hypothesis has been used in its entirety in author's personality prediction systems, like the one for Greek Language described by <ref type="bibr" target="#b19">Kermanidis et al, (2012)</ref>. Motivated by the same inspiration, we too expected to extract author's personality traits from the text they wrote. This involved mobilizing huge datasets of web blogs and essays and extracting "names" from them to determine the author's personalities. However, by the course of our study, we found out that this was not as effective as the initial hypothesis proposed <ref type="bibr">(Goldberg et al, 1982)</ref>. The average accuracy of the initial experimentation was less than 50%, which was as good as a randomly predicted personality set. Thus, we propose a modification of the Lexical Hypothesis in psychology which suggests that the personality of a person is predicted based on cumulative judgements of various authors about him/her. These judgements are indicative of the respective traits of the person described along the lines of the Big5 personality Model. The "Adjectival Marker " Technique helps us unravel these judgements, and is derived from the adjectival markers of Big5 personality traits as discussed by <ref type="bibr" target="#b31">Goldberg &amp; Saucier (1996)</ref>. Thus, the modified Lexical Hypothesis of Psychology proposed and verified in this paper is as follows:</p><p>"Those individual differences that are most salient and socially relevant in people's lives will eventually (over time) become encoded into their language as well as that of people who describe them (via the knowledge they have of them, these people could be peers, associates, friends, family members, followers etc.); the more important such a difference, the more likely is it to become expressed as a single word".</p><p>The "Adjectival Marker Technique" introduced in this paper is most accurate when it is used to analyze the personality of the subject who the social media resource is descriptive of and not the author himself. We also inferred an interesting observation that suggested that the views of different people describing the subject are coherent amongst themselves and also with the results of the psychometric tests.</p><p>The average accuracy of the traits, based on the proposed hypothesis, for a series of data spread temporally and spatially (as compared to the results obtained by psychometric tests) in social media came out as discussed below.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.5.">Structure of the Paper</head><p>We begin by presenting a brief background on the Lexical Psychology theories of personality determination and related work on personality in conjuction with social media in Section 2. We then present our dataset in Section 3 &amp; 4 and methodology for analyzing, quantifying and modelling biographical data content for 574 personalities in Section 5. The study proceeds on to describe the adjectival features used along with the machine learning techniques for classification and demonstrate significant improvements that the model was able to achieve over baseline classification on each personality factor. In subsequent sections, the paper presents the results in Section 6 and analysis of the study, and discusses the methods we incorporated which were instrumental in escalating the accuracy of the model for each of the traits discussed earlier in Section 7. We finally wrap up the paper with brief discussions about the future work, sparked by this study in Section 8.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Related Work</head><p>The last few years have witnessed a considerable escalation in studies which are directed at mining user personalities from social media data. Various means of evaluation have been used by the above researchers, ranging from accuracy to AUC (Area Under the Curve) values so as to establish relative accuracies of models against each other. The above have been discussed and captured very effectively by <ref type="bibr" target="#b5">Celli et al, (2013)</ref>. One important observation which comes to surface while analyzing relevant literature is that, none of the studies so far have exploited the primordial lexical hypothesis and 'adjectival traits' suggested by <ref type="bibr" target="#b31">Saucier et al, (1996)</ref>. Our work presented in this paper carves a very different niche for itself by computing this very approach of personality adjectives, compressing the last 80 years of psychological research in the lexical front and merging it with the latest computational techniques. This confluence has yielded encouraging results, predicting traits matching those predicted by a psychometric test.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Datasets</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Biographical Data Mobilization</head><p>The data collected as a part of this study was by means of a Python-based crawler. We first used a simple web crawler to get a list of web-pages with the name of the respective "person" as the argument keyword to the crawler. These web pages were then filtered based on their meta-tags. To boost true positives, we only considered the pages which specified their content as "biographical" in the meta-tag descriptors. This resulted in mobilization of few Wikipedia resources, blog mentions and majorly some very descriptive biographical websites. We then manually cleaned the noisy data to assure entity disambiguation and irrelevant  </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">Personality Traits Data</head><p>The Jungian Personality functions of 574 personalities were extracted from the resource for eventual evaluation.<ref type="foot" target="#foot_1">2</ref> Since this was one of the most authentic reserves we found consisting of personality listings (so as to evaluate the ones our model predicts) we found it the most effective to be used for evaluating our own model. The "Adjectival Markers" that the paper is based on (as described below) are a proven indicator to reflect the Big5 traits of personality. Thus, to evaluate our computed predictive model via personalities for the respective subjects by an exclusively listed source, we scaled the Jungian Typology type to the closest traits of the Big5 using correlation factors as shown in Table 2 <ref type="bibr" target="#b12">(Hall et al. 2009</ref><ref type="bibr" target="#b3">, Capraro et al. 2002</ref><ref type="bibr" target="#b8">, Furnham et al. 1996</ref><ref type="bibr">, Mc-Crae et al. 1989)</ref>. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.3.">Adjectival Marker Training Set</head><p>The adjectives mined from the biographical data were refined to extract the adjectival markers i.e. specific adjectives descriptive of the subject of the biographical data. These adjectival markers were used as features in the final LASSO logistic regression model. The adjectival markers extracted are based on the work of <ref type="bibr" target="#b31">Saucier &amp; Goldberg, (1996)</ref>. 3 provides the factor loadings of few of the 435 adjectives <ref type="bibr" target="#b31">(Saucier et al, 1996)</ref> on each of the five factors as discussed in their work. The order reflects the relative size (variance) of the factors (e.g. Factor II is the highest), and the sign reflects the relative size of the item subsets at each pole of the factor (e.g. the negative pole of Factor IV has more items). We have, as a part of our study, condensed this table to solely indicate whether or not the trait is descriptive of a particular trait, so as to achieve a binary matrix for them (for the respective 4 of the Big 5 traits mentioned above). The binary equivalent for Table <ref type="table">3</ref> is shown in Table <ref type="table" target="#tab_3">4</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Biographical Data</head><p>Biographical data was mined for 574 personalities from online resources as discussed in the former Section 3.1. This data was divided into 2 categories. Testing data and Training data. Users with no substantial data (&gt;100 words were discarded for the analysis as of now). The data mining undertaken for acquiring these datasets is spread across various social media resources including Wikipedia articles, blog posts, social Q &amp; A sites and community media sites (sharing biographical book excerpts, for building datasets of word count &gt;10,000)</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1.">Training Data</head><p>The training data set, used to mine adjectival markers, comprised of biographic data content of 283 personalities. The word count of the dataset ranged from 500 -10,000 words.</p><p>The ratio of the number of adjectives to the total number of words in the dataset ranged from 0 to 0.005. This data content was mined by means of a Python-based web crawler, which parsed biographic websites, Wikipedia, and social media mentions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2.">Biographical Testing Data</head><p>The testing dataset comprised of biographic data content of a different set of 291 personalities than the ones used for training. These were mined from the social media reserves like Wikimedia articles, blog posts about the respective personalities, social Q&amp; A sites etc. The word count and the number of adjectives to the total number of words ratio ranged from 100 10,000 words and 0.0001 to 0.003 respectively. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Adjectives</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Methodology</head><p>The training data (283 users) was mined for adjectival markers according to Saucier's adjectival marker list <ref type="bibr" target="#b31">(Saucier et al, 1996)</ref>. Personality traits and their adjectival markers were represented as a sparse User-Trait Adjective Matrix for each of the 4 adjectival traits to be predicted. The entries of the respective Trait (say T) matrix were set to 1 if there existed an adjectival marker in the user's descriptive biographical data and 0 if the respective adjectival marker was not there. Thus, each personality trait was contained in a matrix wherein the Row of the matrix M, consisted of adjectival-features and the corresponding column entry consisted of the User-trait. The matrix entity M ij was a binary number which was 1 if the adjectival marker in the i th row indicated the presence of the trait T in the personality of the subject contained in the j th column of the Matrix M.</p><p>To predict the binary score of a given personality feature, we then performed a LASSO logistic regression <ref type="bibr">(Tibshirani et al., 1996</ref><ref type="bibr">, Meier et al., 2008)</ref> analysis in Weka <ref type="bibr" target="#b12">(Hall et al., 2009)</ref>. A variety of regression algorithms were tested, each with a 10-fold cross-validation with 10 iterations. The best result out of all algorithms was using a binary classifier with Lasso regression (with 10 fold cross validation  Since there was only single source where traits of major personalities are classified (i.e. celebritytypes.com) we used it to evaluate our model. We used the remaining 291 personalities for evaluation of the model. The testing biographical data was mined for adjectival trait markers and their respective traits were predicted. The results of this evaluation have been discussed elaborately in the next section. Figure <ref type="figure" target="#fig_2">2</ref>, which can be found above, is also illustrative of the procedure define above.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">Results</head><p>The results by the above illustrated method are elaborated in this section. The average accuracies compared to the personalities obtained via psychometric tests (discussed in more detail in the following section) for considered four of the Big 5 traits were: Extraversion -82.82% Agreeableness -89.62%, Conscientiousness -92.48% and Imaginativeness/Intellect -81.67%. These readings do not necessarily demonstrate the prediction accuracy of the innate personality of a person but match that predicted by the psychometric tests with the given accuracies. They are also in league with few other techniques predicting the same for instance, the work of <ref type="bibr" target="#b13">Iacobelli et al, (2011)</ref> attempted to decipher the personalities of bloggers has an average personality prediction accuracy of around 62.5%. Thus, this paper proposes a technique which illustrates mainfold elevation in the overall accuracy of personality prediction (as indicated by psychometric tests) via social media.</p><p>Figure <ref type="figure">3</ref>: Average accuracy percentage of the personality traits by adjectival marker analysis</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7.">Discussion</head><p>The results obtained illustrate that this method is competent for predicting the personalities of a person in coherence with other people's judgments about him/her. It gives substantial accuracies in the prediction of a person's personality matching with those obtained via psychometric tests.</p><p>As an essential part of this study, we have also attempted to capture the variation in accuracy with the change in various factors, namely, word count of the corpus, and the ratio of the number of adjectives to the total number of words.<ref type="foot" target="#foot_2">3</ref> These are mainly intended to explore a threshold for word count and the adjective distribution (for the given Few collective observations can be drawn from the gathered data. As indicated in Figure <ref type="figure" target="#fig_3">4</ref>, the accuracy in predicting the traits increases with an increase in the data word count. We also compared the accuracy results in predicting the respective traits on the basis of varying distribution of adjectives in the training dataset (Figure <ref type="figure" target="#fig_4">5</ref>). The accuracy in predicting the traits is relatively low when the ratio of the AC/TWC is low and increases with a subsequent increase in the AC/TWC ratio.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7.2.">Agreeableness</head><p>The accuracy in predicting Agreeableness is relatively low (73.33%) for data with word count &lt; 5000 words, and escalates up to 99.11% for big data reserves (&gt;20,000 words). We also compared the accuracy results of predicting "Agreeableness" on the basis of varying distribution of adjectives in the training dataset.</p><p>The prediction of the "Agreeableness" trait is relatively low when the ratio of the adjectival count versus total word count is low. It illustrates an accuracy of 84.00% when the ratio is less than 0.001, improving to 94.18% when the ratio is between 0.001-0.002. Finally it escalates to 95.62% when increased to be greater than 0.003 (Figure <ref type="figure" target="#fig_4">5</ref>). As expected there is a consistent increase in accuracy with increase in word count and the ratio AC/TWC.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7.3.">Conscientiousness</head><p>The accuracy in predicting Conscientiousness varies from 86.66% when the word count of the data reserves is less than 5000 words, and subsequently increases with the increase in the number of words as shown in Figure <ref type="figure" target="#fig_3">4</ref>. We also varied the adjective distribution with the word count so as to obtain respective accuracies for the same model. It varies from an accuracy of 88.00% when the ratio is less than 0.001, improving to 93.60% when the ratio is between 0.001-0.002, and finally to 95.44% when increased to be greater than 0.003 (Figure <ref type="figure" target="#fig_4">5</ref>). As expected there is a consistent increase in accuracy with increase in word count and the ratio AC/TWC. The accuracy in predicting Imaginativeness varies from 93.33% at wordcount lower than 5000 words, and goes upto 99.88% for big data reserves (Figure <ref type="figure" target="#fig_3">4</ref>). The peaks observed in the variation of accuracy for "Imaginative" trait over the distribution of adjectives (AC/TWC) range from 85.71% accuracy for AC/TWC = 0.001, 90.69% accuracy for AC/TWC = 0.002 and finally 98.42% for AC/TWC &gt;= 0.003 (Figure <ref type="figure" target="#fig_4">5</ref>).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7.5.">Extraversion</head><p>The accuracy varies from 97.70% for word count &lt; 5000 words and subsequently increases to 99.88% as shown in Figure <ref type="figure" target="#fig_3">4</ref>. The accuracy of this trait varied from 85.71% for AC/TWC = 0.001 and went on to increase upto 99.68% for AC/TWC = 0.002 and then 99.70% for AC/TWC &gt;= 0.003.</p><p>The correlations for each of word count with accuracy and AC/TWC with accuracy for each of the above mentioned coefficient implies that for "Adjectival Markers" these are highly correlated to one another. This can also be validated by the graph in Figure <ref type="figure" target="#fig_5">6</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="8.">Conclusion &amp; Future Work</head><p>By means of this study we propose a simpler yet effective method to facilitate personality extraction of people in social media. In order to achieve this we have also reworked some perennial theories of Lexical Psychology and modified them with the newer concepts of machine learning models. This technique brings about a wave of novelty in the wide spread lexical concepts and techniques used to achieve user personality understanding in biographical data reserves. It is a significant contribution in the field of Computer Human interaction, since it is not just based on the modern model training techniques of artificial intelligence, but also finds solid ground in the foundational theories of human psychology. One major drawback of this study is that, it is (as of now) most optimized and accurate when tested on bigger data samples. This research is thus intended to pave way for extrapolating itself to smaller data reserves and microblogs. We intend to apply the same technique on not just adjectives but various other parts of speech (POS) in the near future. There are various studies which discuss the role of a person's personality in the development of diseases <ref type="bibr" target="#b7">(Friedman et al, 1987)</ref>. Thus, another goal that this research aims to achieve is that in the very near future it would be able facilitate personality analysis for a wide range of people with varied handicaps which render them incapable of self-analysis in order to effectively predict their personalities. Statistics say that 11% of children 4-17 years of age (6.4 million) <ref type="bibr" target="#b7">(Friedman et al, 1987)</ref> in the United States itself have been diagnosed with Attention-Deficit / Hyperactivity disorder (the number increasing by 3% this year). With valuable feedback from friends and family this model can help designing better technology for them and various other such people. Building upon this research and extending it to cover other POS would enable us to predict personalities from scanty as well as large datasets with good accuracy. The vision of this research is to train our next generation computers to not only understand people in terms of their choices, but the innate personalities which lead them to make those choices (leading to smart suggestive advertising systems etc). The future work of this research will also include combining this technique with pre-existing ones (e.g. LIWC, etc.) so as to increase the personality prediction accuracy to match that achieved by psychometric tests. We also intend to work on a lexical personality ontology, which analyzes the relationship of personality (both direct and indirect) with the various parts of speech (POS) i.e. extending it from being solely adjectival markers to various other POS. We would soon be graduating from solely Big5 trait prediction to evolving various mental states which can be predicted from the abundant lexical resources available online. Thus graduating the singly dimensioned Big5 model to a multi-dimensional graphical ontology tree of a person. </p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head></head><label></label><figDesc>mentions. The same has been illustrated by means of Figure 1.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure</head><label></label><figDesc>Figure 1: Data Mobilization</figDesc><graphic coords="3,304.87,102.90,255.13,220.66" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: Descriptive of the methodology</figDesc><graphic coords="5,52.16,214.06,510.25,132.17" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Figure 4 :</head><label>4</label><figDesc>Figure 4: Accuracy variation over word count of testing data</figDesc><graphic coords="6,52.16,69.17,510.24,400.33" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Figure 5 :</head><label>5</label><figDesc>Figure 5: Accuracy variation over adjective distribution (AC/TWC) in testing dataset</figDesc><graphic coords="7,52.16,69.17,510.23,401.06" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_5"><head>Figure 6 :</head><label>6</label><figDesc>Figure 6: Variation of correlation coefficient based on distribution of adjectives in testing dataset</figDesc><graphic coords="8,56.20,453.95,226.76,181.61" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2 :</head><label>2</label><figDesc>Table 1 shows the supporting notations of the personality systems.</figDesc><table><row><cell>Big5/ Global5</cell><cell>Jung/MBTI/Kiersey</cell><cell>Strength of</cell></row><row><cell></cell><cell></cell><cell>Correlation</cell></row><row><cell>Extraversion</cell><cell>Introvert/Extrovert</cell><cell>High</cell></row><row><cell>Emotional Stability</cell><cell>Feeling/Thinking</cell><cell>Very Low</cell></row><row><cell>Conscientiousness</cell><cell>Judging/Percieving</cell><cell>High</cell></row><row><cell>Accommodation /</cell><cell></cell><cell></cell></row><row><cell>Agreeableness</cell><cell>Feeling/Thinking</cell><cell>Medium</cell></row><row><cell>Intellect</cell><cell>Sensing/Intuition</cell><cell>Medium-High</cell></row><row><cell cols="3">Table 1: Notations for Personality Models</cell></row><row><cell cols="3">As illustrated, 4 final personality traits were scaled (each</cell></row><row><cell cols="3">of which had medium to high correlation with the MBTI</cell></row></table><note>Correlations between Personality traits types) namely -Agreeableness (Accommodation -A/E), Extraversion (R/S), Conscientiousness (Orderliness -O/U) and Intellect (N/I).</note></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 4 :</head><label>4</label><figDesc>Adjectival Marker samples for various traits. Samples with values &gt; 0 in the Saucier Goldberg table have been given a binary count of 1, while those lower than 0 have been given 0. (*Decimal Values taken from<ref type="bibr" target="#b31">Saucier et al (1996)</ref>).</figDesc><table><row><cell>Adjectives</cell><cell cols="2">Agreeableness</cell><cell cols="2">Conscientiousness</cell><cell cols="2">Extraversion</cell><cell cols="2">Imaginative</cell></row><row><cell></cell><cell cols="8">Decimal* Binary Decimal* Binary Decimal* Binary Decimal* Binary</cell></row><row><cell>Sympathetic</cell><cell>0.62</cell><cell>1</cell><cell>-0.05</cell><cell>0</cell><cell>0.02</cell><cell>1</cell><cell>0.03</cell><cell>1</cell></row><row><cell>Kind</cell><cell>0.60</cell><cell>1</cell><cell>0.06</cell><cell>1</cell><cell>0.07</cell><cell>1</cell><cell>0.00</cell><cell>0</cell></row><row><cell>Sensitive</cell><cell>0.46</cell><cell>1</cell><cell>0.00</cell><cell>0</cell><cell>-0.10</cell><cell>0</cell><cell>0.23</cell><cell>1</cell></row><row><cell>Rude</cell><cell>-0.50</cell><cell>0</cell><cell>-0.15</cell><cell>0</cell><cell>0.08</cell><cell>0</cell><cell>0.06</cell><cell>0</cell></row><row><cell>Adventurous</cell><cell>0.00</cell><cell>0</cell><cell>-0.04</cell><cell>0</cell><cell>0.38</cell><cell>1</cell><cell>0.10</cell><cell>1</cell></row></table><note>). Using the LASSO Technique ensured that there was no overfitting because of extra adjectival features for certain</note></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">The dataset can be found at http://www.celebritytypes.com.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">http://www.celebritytypes.com, wherein extensive cognitive functions have been used to derive the psychology of the given personalities.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2">Please note that the accuracies discussed here are the accuracy of the prediction as evaluated by the results via psychometric tests for Big 5 and should not be confused with accuracies used for predicting the baseline of the universal personality of a person.</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Traitnames. A psycho-lexical study</title>
		<author>
			<persName><forename type="first">References</forename><surname>Allport</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Psychological Monographs</title>
		<imprint>
			<biblScope unit="volume">47</biblScope>
			<date type="published" when="1936">1936</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Lexical Predictors of Personality Type</title>
		<author>
			<persName><forename type="first">S</forename><surname>Argamon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Dhawle</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Koppel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">W</forename><surname>Pennebaker</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of Joint Annual Meeting of the Interface and the Classification Society of North America</title>
				<meeting>Joint Annual Meeting of the Interface and the Classification Society of North America</meeting>
		<imprint>
			<publisher>Hogrefe &amp; Huber</publisher>
			<date type="published" when="2000">2005. 2000</date>
		</imprint>
	</monogr>
	<note>The Big Five personality factors: The psycholexical approach to personality</note>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<author>
			<persName><forename type="first">I</forename><surname>Briggs</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P B Gifts</forename><surname>Myers</surname></persName>
		</author>
		<author>
			<persName><surname>Differing</surname></persName>
		</author>
		<title level="m">Understanding personality type</title>
				<meeting><address><addrLine>, CA</addrLine></address></meeting>
		<imprint>
			<publisher>Mountain View</publisher>
			<date type="published" when="1980">1980</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Myers-Briggs Type Indicator Score Reliability Across: Studies a Meta-Analytic Reliability Generalization Study</title>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">M</forename><surname>Capraro</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Educational and Psychological Measurement</title>
				<imprint>
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">The description of personality: basic traits resolved into clusters</title>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">B</forename><surname>Cattell</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">The Journal of Abnormal and Social Psychology</title>
		<imprint>
			<biblScope unit="volume">38</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="476" to="506" />
			<date type="published" when="1943">1943</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Relationships between Personality and Interactions in Facebook</title>
		<author>
			<persName><forename type="first">F</forename><surname>Celli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Polonio</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Social Networking: Recent Trends, Emerging Issues and Future Outlook</title>
				<imprint>
			<publisher>Nova Science Publishers</publisher>
			<date type="published" when="2013">2013</date>
			<biblScope unit="page" from="41" to="54" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Personality structure: Emergence of the five-factor model</title>
		<author>
			<persName><forename type="first">J</forename><surname>Digman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Annual review of psychology</title>
		<imprint>
			<biblScope unit="volume">4</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="417" to="440" />
			<date type="published" when="1990">1990</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">The disease-prone personality: A meta-analytic view of the construct</title>
		<author>
			<persName><forename type="first">Howard</forename><forename type="middle">S</forename><surname>Friedman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Booth-Kewley, Stephanie American Psychologist</title>
		<imprint>
			<biblScope unit="volume">42</biblScope>
			<biblScope unit="issue">6</biblScope>
			<biblScope unit="page" from="539" to="555" />
			<date type="published" when="1987">1987</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">The big five versus the big four: the relationship between the Myers-Briggs Type Indicator (MBTI) and NEO-PI five factor model of personality</title>
		<author>
			<persName><forename type="first">A</forename><surname>Furnham</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Personality and Individual Differences</title>
				<imprint>
			<date type="published" when="1996">1996</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Predicting Personality with Social Media</title>
		<author>
			<persName><forename type="first">J</forename><surname>Goldbeck</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Robles</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Turner</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the annual conference extended abstracts on Human factors in computing systems</title>
				<meeting>the annual conference extended abstracts on Human factors in computing systems</meeting>
		<imprint>
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">An alternative description of personality: The Big-Five factor structure</title>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">R</forename><surname>Goldberg</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Personality and Social Psychology</title>
		<imprint>
			<biblScope unit="volume">59</biblScope>
			<biblScope unit="issue">6</biblScope>
			<biblScope unit="page" from="1216" to="1229" />
			<date type="published" when="1990">1990</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">The Development of Markers for the Big Five Factor Structure</title>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">R</forename><surname>Goldberg</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Psychological Assessment</title>
		<imprint>
			<biblScope unit="volume">4</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="26" to="42" />
			<date type="published" when="1992">1992</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">The WEKA data mining software. An update</title>
		<author>
			<persName><forename type="first">M</forename><surname>Hall</surname></persName>
		</author>
		<author>
			<persName><surname>Frank</surname></persName>
		</author>
		<author>
			<persName><surname>Holmes</surname></persName>
		</author>
		<author>
			<persName><surname>Pfahringer</surname></persName>
		</author>
		<author>
			<persName><surname>Reutemann</surname></persName>
		</author>
		<author>
			<persName><surname>Witten</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ACM SIGKDD Explorations Newsletter</title>
		<imprint>
			<biblScope unit="volume">11</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="10" to="18" />
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Large scale personality classification of bloggers</title>
		<author>
			<persName><forename type="first">F</forename><surname>Iacobelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A J</forename><surname>Gill</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Nowson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Oberlander</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="s">Lecture Notes in Computer Science</title>
		<imprint>
			<date type="published" when="2011">2011. 6975</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Personalized Persuasion : Tailoring Persuasive Appeals to Recipients Personality Traits</title>
		<author>
			<persName><forename type="first">Sonia</forename><forename type="middle">K</forename><surname>Jacob B Hirsh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Galen</forename><forename type="middle">V</forename><surname>Kang</surname></persName>
		</author>
		<author>
			<persName><surname>Bodenhausen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Psychological Science</title>
		<imprint>
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Friends dont Lie -Inferring Personality Traits from Social Network Structure</title>
		<author>
			<persName><forename type="first">S</forename><surname>Jacopo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Bruno</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Nadav</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Fabio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Nicu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Alex</forename><forename type="middle">P</forename></persName>
		</author>
		<idno>-1224-0/12/09</idno>
	</analytic>
	<monogr>
		<title level="m">Proceedings of UbiComp</title>
				<meeting>UbiComp<address><addrLine>Pittsburgh, USA</addrLine></address></meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2012-09-05">2012. 5 Sep -8 Sep</date>
			<biblScope unit="page" from="978" to="979" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Operational efficiency and the growth of short-term memory span</title>
		<author>
			<persName><forename type="first">G</forename><surname>Jill</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Midian</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Experimental Child Psychology</title>
		<imprint>
			<biblScope unit="volume">33</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="386" to="404" />
			<date type="published" when="1982">1982</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">The Big Fivefactor taxonomy: Dimensions of personality in the natural language and in questionnaires</title>
		<author>
			<persName><forename type="first">O</forename><forename type="middle">E</forename><surname>John</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Handbook of personality theory and research</title>
				<editor>
			<persName><surname>Barth</surname></persName>
		</editor>
		<meeting><address><addrLine>New York; Lcipzig</addrLine></address></meeting>
		<imprint>
			<publisher>Karl Jung</publisher>
			<date type="published" when="1921">1990. 1921. 1926</date>
			<biblScope unit="page" from="66" to="100" />
		</imprint>
	</monogr>
	<note>Die Grundlagen der Chat-akterkunde. 111e science of character</note>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">The group lasso for logistic regression</title>
		<author>
			<persName><forename type="first">M</forename><surname>Lukas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sara</forename><surname>Van De Geer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Peter</forename><forename type="middle">B</forename></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">J. R. Statist Soc. Eidgenssische Technische</title>
		<imprint>
			<biblScope unit="volume">70</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="53" to="71" />
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Mining Authors Personality Traits from Modern Greek Spontaneous Text</title>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">L</forename><surname>Kermanidis</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">4th International Workshop on Corpora for Research on Emotion Sentiment &amp; Social Signals, in conjunction with LREC12</title>
				<imprint>
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">A corpus for author and personality prediction from text</title>
		<author>
			<persName><forename type="first">K</forename><surname>Luyckx</surname></persName>
		</author>
		<author>
			<persName><surname>Daelemans</surname></persName>
		</author>
		<author>
			<persName><surname>Personae</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of LREC-2008, the Sixth International Language Resources and Evaluation Conference</title>
				<meeting>LREC-2008, the Sixth International Language Resources and Evaluation Conference</meeting>
		<imprint>
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Personality Generation for Dialogue</title>
		<author>
			<persName><forename type="first">F</forename><surname>Mairesse</surname></persName>
		</author>
		<author>
			<persName><surname>Walker</surname></persName>
		</author>
		<author>
			<persName><surname>Personage</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics ACL</title>
				<meeting>the 45th Annual Meeting of the Association for Computational Linguistics ACL</meeting>
		<imprint>
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Using Linguistic Cues for the Automatic Recognition of Personality in Conversation and Text</title>
		<author>
			<persName><forename type="first">F</forename><surname>Mairesse</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M A</forename><surname>Walker</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">R</forename><surname>Mehl</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Moore</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Artificial intelligence Research</title>
		<imprint>
			<biblScope unit="volume">30</biblScope>
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">Reinterpreting the Myers-Briggs Type Indicator From the Perspective of the Five-Factor Model of Personality</title>
		<author>
			<persName><forename type="first">R</forename><surname>Mccrae</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Costa</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">National Center on Birth Defects and Developmental Disabilities Division of Human Development and Disabilities</title>
				<meeting><address><addrLine>USA</addrLine></address></meeting>
		<imprint>
			<date type="published" when="1989">1989</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">Toward an adequate taxonomy of personality attributes: Replicated factor structure in peer nomination personality ratings</title>
		<author>
			<persName><forename type="first">Warren</forename><forename type="middle">T</forename><surname>Norman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">The Journal of Abnormal and Social Psychology</title>
		<imprint>
			<biblScope unit="volume">66</biblScope>
			<biblScope unit="issue">6</biblScope>
			<biblScope unit="page" from="574" to="583" />
			<date type="published" when="1963">1963</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">Whose thumb is it anyway? classifying author personality from weblog text</title>
		<author>
			<persName><forename type="first">J</forename><surname>Oberlander</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Nowson</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 44th Annual Meeting of the Association for Computational Linguistics ACL</title>
				<meeting>the 44th Annual Meeting of the Association for Computational Linguistics ACL</meeting>
		<imprint>
			<date type="published" when="2006">2006</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<monogr>
		<author>
			<persName><forename type="first">E</forename><surname>Ostendoff</surname></persName>
		</author>
		<title level="m">prache und Personlichkeitsstruktur: Zur Validitat des Funf-Faktoren-Modeiis der Personlichkeit [Language and personality structure: On the validity of the five-factor model of personality</title>
				<imprint>
			<date type="published" when="1990">1990</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b27">
	<monogr>
		<title level="m">Federal Republic of</title>
				<meeting><address><addrLine>Germany</addrLine></address></meeting>
		<imprint>
			<publisher>Roderer Verlag</publisher>
		</imprint>
		<respStmt>
			<orgName>Regensburg</orgName>
		</respStmt>
	</monogr>
</biblStruct>

<biblStruct xml:id="b28">
	<analytic>
		<title level="a" type="main">Linguistic styles: Language use as an individual difference</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">W</forename><surname>Pennebaker</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>King</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Personality and Social Psychology</title>
		<imprint>
			<biblScope unit="volume">77</biblScope>
			<date type="published" when="1999">1999</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b29">
	<analytic>
		<title level="a" type="main">Our Twitter Proles, Our Selves: Predicting Personality with Twitter</title>
		<author>
			<persName><forename type="first">D</forename><surname>Quercia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Kosinski</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Stillwell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Crowcroft</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of SocialCom</title>
				<meeting>SocialCom</meeting>
		<imprint>
			<date type="published" when="2011">2011</date>
			<biblScope unit="page" from="180" to="185" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b30">
	<analytic>
		<title level="a" type="main">Regression Shrinkage and Selection via the Lasso</title>
		<author>
			<persName><forename type="first">T</forename><surname>Robert</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of the Royal Statistical Society. Series B (Methodological)</title>
		<imprint>
			<biblScope unit="volume">58</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="267" to="288" />
			<date type="published" when="1996">1996</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b31">
	<analytic>
		<title level="a" type="main">Evidence for the Big Five in analysis of familiar English Personality adjectives European</title>
		<author>
			<persName><forename type="first">G</forename><surname>Saucier</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">R</forename><surname>Goldberg</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Personality</title>
		<imprint>
			<biblScope unit="volume">10</biblScope>
			<biblScope unit="page" from="61" to="77" />
			<date type="published" when="1996">1996</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b32">
	<analytic>
		<title level="a" type="main">The Psychological Meaning of Words: LIWC and Computerized Text Analysis Methods</title>
		<author>
			<persName><forename type="first">James</forename><forename type="middle">W</forename><surname>Yla R Tausczik</surname></persName>
		</author>
		<author>
			<persName><surname>Pennebaker</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Language and Social Psychology</title>
		<imprint>
			<biblScope unit="volume">29</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="24" to="54" />
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
