<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Methodology for Studying Linguistic and Cultural Change in China, 1900-1950</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Spencer Dean Stewart</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>School of Information Studies, Purdue University</institution>
          ,
          <country country="US">U.S.A</country>
        </aff>
      </contrib-group>
      <fpage>575</fpage>
      <lpage>588</lpage>
      <abstract>
        <p>This paper presents a quantitative approach to studying linguistic and cultural change in China during the first half of the twentieth century, a period that remains understudied in computational humanities research. The dramatic changes in Chinese language and culture during this time call for greater reflection on the tools and methods used for text analysis. This preliminary study ofers a framework for analyzing Chinese texts from the late nineteenth and twentieth centuries, demonstrating how established methods such as word counts and word embeddings can provide new historical insights into the complex negotiations between Western modernity and Chinese cultural discourse.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;text analysis</kwd>
        <kwd>word embeddings</kwd>
        <kwd>China</kwd>
        <kwd>conceptual history</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        is well established in humanities research, the dramatic shifts in language and culture described
above demand a closer reflection on how such changes should inform our approaches to textual
analysis. At a practical level, these shifts require several additional steps in text preprocessing
before the corpus is ready for analysis. This paper is therefore divided into two main sections.
The first provides a brief introduction to diferent textual datasets for studying modern China,
an overview of the corpus that will be used in this paper, and a discussion of how the text was
prepared for text analysis. Part two shows how diferent computational methods can be applied
to these texts to analyze the evolution of Chinese language and culture. It draws specifically
from the framework oftranslingual practice to explore issues of scale and agency in an evolving
cultural and linguistic landscape. [
        <xref ref-type="bibr" rid="ref14">13</xref>
        ]
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Data and Text Preprossesing</title>
      <p>
        The digitization of Republican China (1912-1949) periodicals, newspapers, and books is still in
its infancy, although there has been great progress in recent years. The existing projects have
largely been shaped by specific research questions or institutional interests. One such project
is The Database for the Study of Modern Chinese Thought and Literature (1830-1930)中國近
現代思想及文學史, currently hosted at National Chengchi University in Taiwan. The team,
led by Dr. Cheng Wen-huei, has produced excellent work on the conceptual and intellectual
histories of diferent ideas in late Qing and Republican China. [
        <xref ref-type="bibr" rid="ref26">25</xref>
        ] [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] A more recent efort to
create a corpus of Republican-era periodicals has been made by the Shanghai Library to digitize
and produce full-text editions of their vast collections of periodicals, with a focus on literature,
iflm, and red materials. They have partnered with the University of Chicago’s Textual Optics
Lab to make portions of the dataset available for analysis through the platform PhiloLogic.
[
        <xref ref-type="bibr" rid="ref22">21</xref>
        ] Another recent initiative is spearheaded by historian Christian Henriot who has compiled
a diverse collection of Modern Chinese texts and made them available for analysis through the
platform HistText. [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]
      </p>
      <p>
        For this preliminary analysis, I have chosen to focus the study on a single periodical of
significant cultural significance: Eastern Miscellany (東方雜誌). Eastern Miscellany was the flagship
periodical published by Commercial Press from January 1904 to December 1948. While it was
just one of over 20,000 periodicals published during the Republican era, Eastern Miscellan’y
s influence and broad scope make it a useful corpus for research. Historian Ted Hunters, for
example, has described Commercial Press as“the most important publisher in China during
the first half of the twentieth century, ”with Eastern Miscellany serving as “a key organ of
intellectual opinion throughout the forty-some years of its existence”. [
        <xref ref-type="bibr" rid="ref12">11</xref>
        ] During its print
run, Eastern Miscellany published on a range of topics such as contemporary events, politics,
economics, literature, science, and technology. In total, this corpus includes over 27 thousand
individual articles, 5 thousand unique authors, and 96 million total lexical items (see Figure
1). While any dataset is unavoidably imperfect,Eastern Miscellany provides an eclectic
collection of texts that serves as a useful starting point for future computational humanities research
utilizing Republican-era newspapers and periodicals.
      </p>
      <p>
        The past decade has seen increased attention to the asymmetries involved in multilingual
computational text analysis. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ][
        <xref ref-type="bibr" rid="ref25">24</xref>
        ] Studies in this realm can be technical in nature, especially
when it comes to working with non-Latin languages and scripts. Existing tools such as optical
character recognition (OCR) or word segmentation simply don’t work as well for languages
like Arabic, Japanese, or Chinese. Methods and tools for working with modern Chinese texts
have improved considerably in recent years, yet many of these tools are trained to process texts
from the last few decades. There are fewer resources and tools for working with textual data
from the late nineteenth and early twentieth centuries.
      </p>
      <p>
        The biggest obstacle for working with Chinese texts is word segmentation. After assessing
the handful of available tools, we found that Jieba performed best if: (1) we standardized the
numerous character variants 異體字; and (2) converted traditional characters to their
simpliifed equivalent. Character variants are Chinese characters that have multiple distinct graphical
representations. These variations were commonly used in commercial printing into the
twentieth century. The faithful rendering of character variants in the digitization process means
that tools such as Jieba are unable to recognize and properly segment common variants.
Subsequently, we compiled a list of the 50 most frequently used character variants in our dataset
and created a dictionary to convert them into their standardized form. Secondly, we found that
while Jieba is capable of processing traditional characters, it performed better when working
with simplified Chinese. Therefore, the entire corpus was first converted into simplified
Chinese characters prior to conducting text analysis. [
        <xref ref-type="bibr" rid="ref22">21</xref>
        ] While such preprocessing steps are not
appropriate for all research tasks, they were nevertheless useful in providing a more consistent
and standardized corpus to trace cultural change over time.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Scaling Translingual Practice</title>
      <p>In her 1995 book, Translingual Practice: Literature, National Culture, and Translated Modernity
– China, 1900-1937, Lydia H. Liu argued that scholars need to move beyond conventional
inlfuence studies, which might explicitly or implicitly assume a level of universal equivalence
when translating concepts and ideas between two languages such as English and Chinese. She
instead advocated for studying how new words enter the host language, gain legitimacy, and
are creatively employed in popular discourse. Without ignoring the impact that Western
modernization discourse had on languages and cultures around the world, this framework allows
scholars to more fully explore the intervention of local actors in the process of translating
modernity. The idea of translingual practice can shape the way we approach computational
text analysis. Rather than simply counting words to measure influence, for example, we need
to adopt other methods such as word embeddings to understand how local actors engaged with
these new words and concepts. In the case of China, we should avoid assuming equivalence
between the imposed language and its Chinese translation. Instead of ending our analysis at
the point of adoption, we need to further investigate how Chinese writers debated and engaged
with these national and global discourses.</p>
      <p>This section employs diferent methods to study China’s linguistic and cultural
transformation during the first half of the twentieth century. The first section uses word counts to
understand the nature of linguistic change and the word forms that had the greatest impact
on Chinese discourse. The second section applies word embeddings to measure cultural and
linguistic drift. The final section examines the concept of ’economy’ as a case study for how
these methods can be used to provide new insights into cultural and linguistic change.</p>
      <sec id="sec-3-1">
        <title>3.1. Counting Linguistic and Cultural Change</title>
        <p>The easiest way to study the impact of foreign ideas and words on the Chinese language is
through word counts. Counting occurrences of foreign words in the pages oEfastern
Miscellany can provide insights into the nature of language contact and how new concepts shaped
the way Chinese writers discussed a variety of topics, from politics and culture to science and
law. For this analysis, I compiled the lists of words found in the appendices oTfranslingual
Practice, a list that includes around 1,800 diferent neologisms, loanwords, and transliterations.
Figure 2 shows the frequency (count divided by lexical units per year) that these words
appeared annually in the pages ofEastern Miscellany. It shows that there was a quick increase in
frequency during the first two decades of the twentieth century, which stabilized by the 1920s
and 1930s (with some exceptions). By the 1920s, these words were on average appearing one
to two times per sentence. The most common words included concepts such as government
(政府), politics (政治), economy (經濟), and society (社會). They also included ideas about the
world (世界) and the international (國際); citizenship (國民), nationality (民族), and voting
(選舉); capital (資本), industry (工業), and production (生產); law (法律) and science (科學);
revolution (革命) and the military (軍事, 海軍). While many of these conceptions existed in</p>
        <p>China prior to the twentieth century, much of the language that writers were using to convey
ideas about politics, nationalism, economy, law, science, and military was changing.</p>
        <p>The act of translating a foreign word into Chinese didn’t always lead to its adoption in
written discourse. The success or failure of adoption speaks to the nature of language contact
during this time. Out of the list of foreign words derived from these appendices, around five
hundred don’t appear at all in the pages ofEastern Miscellany, and half appeared fewer than 50
times. There is a strong relationship between rates of adoption and the mode of introduction.
Liu divided newly introduced terms into several diferent categories. The first and earliest of
these were neologisms derived from Missionary-Chinese texts (Appendix A; n=184). These
included a variety of terms such as bread 麵(包), newspaper (報紙), physics (物理) and
university (大學). The next series of words arrived in China by way of Japan. First among these
were terms coined by Japanese translators who used Chinese characters to translate European
words that were later introduced into China (Appendix B; n=489). These included words such
as biology (生物學), comic books (漫畫), debt (債務), factory (工廠), international law (國際
公法), the natural sciences (自然科學), and voting (投票). A second, smaller group of words,
consisted ofkanji that arrived in China without necessarily involving European languages
(Appendix C; n=51), such as rickshaw (人力車) and religion (宗教).</p>
        <p>The third and most important type wasreturn graphic loans. These were Chinese-character
會議; 委員; 雜誌; 法律; 選舉; 鐵路; 學校; 帝國
國際; 政策; 銀行; 民族; 工業; 財政; 目的; 承認
宗教; 表現; 內容; 手續; 方針; 集團; 距離; 目標
政府; 社會; 主義; 經濟; 方面; 政治; 世界; 關係
馬克; 蘇維埃; 鴉片; 基督教
蘇維埃; 盧布; 烏拉
compounds with established meanings in classical Chinese, which Japanese translators adapted
to correspond to modern European words (Appendix D; n=234). Some of these can be viewed
as extensions of existing words whose meaning in some way resembled the foreign equivalent.
For example, the character pair geming (革命) traditionally referred to dynastic transition, but
in the late nineteenth and twentieth centuries it came to meanrevolution. At other times,
translators used existing character pairs to introduce entirely diferent ideas. The character
pair jiantao (檢討), discussed more below, went from being a title in the Hanlin academy to
meaning something approximatingexamination or self-criticism. The final and largest number
of new terms consisted of transliterations from English, French, and German (Appendix F;
n=761), and transliterations from Russian (Appendix G; n=78). Some of these transliterations
are recognizable today, such as logic 邏(輯), microphone (麥克風), or Darwinism (達爾文主
義), but most have fallen out of use (e.g., communist康門尼斯特 and syrup 舍利別).</p>
        <p>Based on total counts, one might assume that transliterations were the most common and
possibly most influential form of language contact. However, frequency counts based on these
categorizations confirms what existing scholarship has said about the importance of Japanese
translators in shaping Chinese written discourse. As shown in Figure 3, the most frequently
appearing words were return graphic loans (Appendix D). The most common of these included
words recognizable today, such as government 政(府), society (社會), economy (經濟),
politics (政治), and world (世界). These loanwords, which at times were just extensions of existing
words, were apparently more easily and more readily adopted by Chinese writers. Moreover, if
we look at frequency per word rather than frequency by appendix, we find that return graphic
loans were nearly three times more likely to appear than words derived from
MissionaryChinese texts (Appendix A). Terms coined by Japanese translators using Chinese characters
(Appendix B) also increased, but not as frequently as return graphic loans. Perhaps most
interesting were that transliterations, despite constituting the largest total number of overall
neologisms, ranked low on the list compared to other appendices. There are clear examples
of transliterations from this period that became localized (see opium鴉片, cofee 咖啡, logic
邏輯 or Soviet 蘇維埃), but most transliterations weren’t adopted. Overall, return graphic
loans were more than 46 times more likely to appear in the pages ofEastern Miscellany than
transliterations (see Table 1).</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Measuring Change</title>
        <p>Word frequencies can only tell us so much about Sino-Foreign interactions. A
moreChinacentered approach doesn’t require that we dismiss the impact of the West in shaping modern
China; rather, it emphasizes the active participation of Chinese actors in this process.4[] Is it
possible to computationally capture the agency of Chinese writers in translating modernity?
Can we adopt a form of distant reading informed by the concept of translingual practice?</p>
        <p>
          Temporal word embeddings are particularly well suited for studying history, especially when
it comes to understanding linguistic, cultural, and conceptual change. Word embeddings build
on J.R. Firth’s concept that “you shall know a word by the company it keeps”.We can
conclude that words such as dog and bone or cat and mouse are related given that they often appear
near each other in a sentence. Word embeddings take this logic one step further by looking
for words that share the same co-occurring terms, thereby clustering words that have similar
semantic meanings. This might include synonyms or words that share specific conceptual
relationships such as animals, countries, or automobile companies. One way to think about this
method is that word embedding models place words within a multidimensional space where
the distance between individual words represents how closely they are conceptually or
linguistically related. Previous scholarship has demonstrated the utility of this method in measuring
cultural and linguistic change over time. [
          <xref ref-type="bibr" rid="ref11">10</xref>
          ][
          <xref ref-type="bibr" rid="ref9">8</xref>
          ] Scholars have used temporal word
embeddings to propose laws of semantic change, 9[] uncover racial inequality in postwar U.S. fiction,
[
          <xref ref-type="bibr" rid="ref19">18</xref>
          ] discover the invisible labor of women’s editorial work,1[
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] highlight representations of
race and ethnicity under the Japanese empire, [
          <xref ref-type="bibr" rid="ref15">14</xref>
          ] and reconstruct networks of influence in
abolitionist writings. [
          <xref ref-type="bibr" rid="ref20">19</xref>
          ]
        </p>
        <p>
          Here I use temporal word embeddings to provide qualitative insights into the larger
systematic changes measured above. To do this, I created separate word embedding models for
distinct periods of time. I experimented with diferent periodizations, including those that
conformed to conventional political events, but for simplicity, I divided the corpus by decade (i.e.,
1904-1909; 1910-1919; 1920-1929; 1930-1939; 1940-1949). After aligning the models, [
          <xref ref-type="bibr" rid="ref10">9</xref>
          ] I then
measured (using cosine similarity scores) how a given word was or was not changing over time
.
        </p>
        <p>A useful example is the character pair jiantao 檢討, which underwent significant change
during the first half of the twentieth century. As mentioned above, jiantao originally referred to
a title/name associated with the Hanlin academy. In Japan, this two-character compound was
used to translate the concepts of examination and self-criticism.Jiantao as examination was
then introduced into China in the early twentieth century. Figure 4 provides a two-dimensional
representation of the character pair as its meaning changed in the pages ofEastern Miscellany.
Here we find it retaining its original meaning during the first decade of the twentieth century
and gradually shifting to take on the meaning of examination by the 1940s.</p>
        <p>
          Jiantao is just one example of how word embeddings can be useful for quantifying how
return graphic loans transformed existing written language. Using cosine similarity scores,
we can further isolate words that experienced a noticeable shift during the first half of the
twentieth century. As shown in Table 2, such examples include the character pairyouji 游
擊, which evolved from its traditional usage as a military position to refer tgouerrilla warfare.
Jingji 經濟, as discussed more below, once referred broadly to political governance before it
was used to translate economy. The charactersfengjian 封建 were used to translate the idea of
feudal and feudalism, despite the fact that this character pair was long used by famous thinkers
for around two thousand years to refer to the granting (feng) of land to establish (jian) vassals.
[
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] Finally, the characters geming 革命 once referred to dynastic transition, but in the late
nineteenth and twentieth centuries took on the more active meaning of revolution.
        </p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Economy in the Popular Press</title>
        <p>The mapping of the word economy onto the Chinese character pairjingji (經濟) involved
complicated interactions between China, Japan, and the West. As Tianyu Feng has shown,jingji
was a term that originally meant governance for the people. It included aspects that would
later be identified as economy, such as finance, trade, and transportation. But even in the late
Qing, these aspects were subsumed under the umbrella of politics. In many ways, this was
comparable to how the word economy was used in Western discourse prior to the eighteenth
century. It therefore makes sense that early Japanese writers originally chose to translate it as
jingji (or keizai). However, the suitability of this translation began to be called into question
as European thinkers and classical economists constructed an idea of economy and economics
that was distinct from politics and political economy, to instead focus more narrowly on
production, distribution, and consumption.6[]</p>
        <p>
          The evolving meaning of economy in the West posed problems for Chinese translators in the
late nineteenth century. Influential figures such as Liang Qichao and Yan Fu avoided jingji as
a translation for economy, instead preferring other translations such asfuguoce 富國策
(strategy of enriching the country),cailixue 財理學 (financial management), licaixue 理財學 (wealth
administration),zisheng 資生 (welfare or wellbeing), shengjixue 生計學 (study of livelihood),
and jixue 計學 (study of counting), along with several diferent transliterations (e.g., 葉科諾
密, 愛康諾米, and 依康老米). Most Japanese writers continued to use the termjingji/keizai to
translate economy, with some Chinese commentators suggesting that these writers
mistranslated the word by mistaking two Chinese characters sharing the same pronunciation. But over
time, the influence of jingji won out as prominent political figures such as Sun Yat-sen helped
to solidify it as the accepted translation of economy.6[][
          <xref ref-type="bibr" rid="ref23">22</xref>
          ]
        </p>
        <p>Word frequencies reveal the contested nature of diferent translations during the first decade
of the twentieth century, along with the eventual rise ofjingji as the preferred translation.
The only real contenders tojingji in the pages of Eastern Miscellany were jixue (roughly 145
appearances) and licaixue (around 68 appearances), with both of these surpassingjingji xue 經
濟學 (economics) for parts of the first two decades of the twentieth century. In the 1900s and
1910s, writers used these diferent translations to appeal to the authority of economists (jixue
jia 計學家), propose the establishment of centers for economic studyj(ixue guan 計學館), and
provide introductions to economic theories and axioms. However, by the early 1910s, the term
jingji and jingji xue to refer to the economy and the study of economics, respectively, gained
prominence over these alternatives. Overall, in the pages ofEastern Miscellany, the term jingji
xue (economics) appeared around 2,900 times andjingji (economy) over 58,000 times.</p>
        <p>
          Amidst debates regarding the proper translation for economy, Chinese writers explored the
meaning of jingji as a concept and its role in Chinese politics and society. Chinese thinkers
had long written broadly about what would come to be called the economy in the twentieth
century. As Margareta Zanasi shows, the commercial revolution of the mid-1500s led Chinese
thinkers to adopt pro-market and pro-consumption ideas. By the 1800s, just as the West was
beginning to embrace these same ideas, Chinese ofÏcials and intellectuals were faced with new
internal and external circumstances that led them to instead promot“ea hybrid form of market
economy with ad hoc government intervention”.This system“more closely resembled forms
of developmental state and was intended to solve the China-specific threats of scarcity and,
after the mid-1800s, of imperialist interference.” [
          <xref ref-type="bibr" rid="ref29">28</xref>
          ] The language that they used to discuss the
economy was therefore changing just as Chinese economic thinkers moved away from
emphasizing stability to stressing the importance of economic growth for nation building. According
to Zanasi, the Chinese economic crisis“came to be framed in terms of the struggle of the
Chinese race and civilization for the survival of the fittest.”[
          <xref ref-type="bibr" rid="ref29">28</xref>
          ]
        </p>
        <p>A distant reading of the termjingji in the pages of Eastern Miscellany provides several
interesting insights into the nature of economic discourse within China. First, the idea of economy
moved from a more abstract notion during the first decade of the twentieth century to focus on
concrete industries and sectors. As shown in Table 3, terms closely associated wit hjingji
during the first decade were relatively abstract, suggesting that the economy was understood more
as a concept and phenomenon. The place of social Darwinism within this discourse is also
suggested by its close association withevolution. Diferent articles drew from a social Darwinian
framework to discuss social and economic progress in China, including the (im)possibility of
eliminating socioeconomic inequality, the advancement of commercial interests, and details of
monetary policies. The idea ofrise and fall in relation to the economy also existed in the first
two decades of the twentieth century. At times, it was used to discuss the rise and fall of the
national economy (國民經濟之盛衰). At other times, it referred to trade, industry, companies,
and even urban centers. But overall, the general trend by the 1910s pointed in the direction of
a more concrete understanding of what constituted the economy. Commerce, industry,
international interactions, finance and trade, along with the creation of businesses (e.g., 經營締造)
were closely aligned with this new conceptualization of what constitutedjingji.</p>
        <p>
          A second observation points to an industrial and production bias within popular economic
discourse. Zanasi has argued that although economic commentators recognized that
consumption was not inherently bad, their focus was often on how the overall economy failed to move
forward in“purchasing power and consumption habits”.[29] This was caused by gaps in
levels of development within China, where urban populations in Shanghai could enjoy luxurious
lifestyles while rural populations lacked basic necessities. At first glance, Table 3 suggests that
writers in the pages of Eastern Miscellany shared this concern as jingji was closely associated
with industry and industrial production (e.g.,產業, 工業, and 實業). Even in these
vernacular discussions of economy aimed at an urban audience, appearances ofproduction (生產,
n=25,858) occurred nearly four times more often than consumption (消費, n=6,528), and over
ten thousand more times thancommerce (商業, n=14,088). References to the industrial sector
(工業, n=25,945) also appeared nearly twice as often as agriculture ( 農業, n=13,819). Cotton
provides an interesting example where agriculture and industry met as the cotton industry
became more closely associated withjingji in the 1930s and 1940s. Situated between the rural
economy and urban industry, cotton was an important strategic sector promoted by the
government in the 1930s and into the 1940s. [
          <xref ref-type="bibr" rid="ref16">15</xref>
          ] Discussions of the rural economy, agriculture,
and consumption were frequent enough. But industrial production was clearly the priority for
much of this time.
        </p>
        <p>
          A third finding relates to the occurrence of the word control (統制) in the 1920s, the
appearance of which serves as a cautionary tale regarding the need to balance distant and close
reading. The idea of a control economy in Republican China is most closely associated with
the 1930s and 1940s when the government promoted Fascist models of command economy.
[29] Its appearance in the 1920s is therefore notable and at first glance might suggest
popular discussions of this form of economic governance prior to its implementation in the 1930s.
Upon closer reading, we find that the correlation between control and economy in the 1920s
derives from a handful of articles describing foreign economies such as Germany and Japan,
along with warnings about the potential dangers of government control of industry. Rather
than foreshadowing developments in the 1930s, the appearance ofcontrol in the 1920s more
accurately represents how some writers were responding to global economic discourse and
expressing reservations about government oversight. The relationship between economy and
control beginning in the 1930s is better understood through the appearance of cotton棉( 業) in
the 1930s as Eastern Miscellany occasionally reported on the Cotton Control Commission棉(
業統制委員會), an organization established by the Nationalist government in 1934 as a way
to directly control the circulation of raw cotton and the production of cotton goods2.9[][
          <xref ref-type="bibr" rid="ref21">20</xref>
          ]
The appearance of control in the 1920s therefore illustrates the need to balance close reading
with distant reading. It also demonstrates how Chinese writers were actively observing and
commenting on global economic afairs.
        </p>
        <p>This preliminary conceptual history ofjingji as found in the pages of Eastern Miscellany
demonstrates how Chinese writers were influenced by and contributed to Western economic
discourse. They debated the appropriateness of diferent translations and adjusted their
writings in accordance with changing local and global developments. Distant reading provides
one way to look at this process. The arrival of thousands of new words and ideas clearly
transformed the Chinese language. At the same time, the significance of these concepts and their
meaning for the Chinese condition were constantly being renegotiated.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusion</title>
      <p>China’s linguistic and cultural transformation that began in the late nineteenth century serves
as a valuable case study for applying computational methods to historical data. These language
changes require several additional steps in the text preprocessing stage to create a standardized
corpus capable of more accurately capturing cultural change. The scale of this transformation
also provides opportunities for computational research. As shown in this paper, word
frequencies shed light on the nature of language contact in China during the first half of the twentieth
century as foreign words were introduced from the West via Japan, with new concepts often
being mapped onto older words. Word embeddings shift our attention towards how these
words were used within Chinese written discourse, including the repurposing of its linguistic
past to present a more international and modern present. Through a combination of distant
and close reading of the term economy j(ingji), this paper also demonstrates how Chinese
writers responded to global discourse. Chinese writers introduced, negotiated, rejected, debated,
and finally employed new concepts in creative and interesting ways. This study suggests that
computational humanities research, when combined with close attention to historical and
cultural context, can deepen our understanding of the interactions between method, language,
translation, and culture.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>B.</given-names>
            <surname>Baptiste</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Armand</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Henriot</surname>
          </string-name>
          . “
          <article-title>HistText: An Application for leveraging largescale historical textbases”</article-title>
          .
          <source>In: Journal of Data Mining &amp; Digital Humanities</source>
          (
          <year>2023</year>
          ). doi:
          <volume>10</volume>
          .46298/jdmdh.11756.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Carrai</surname>
          </string-name>
          . “
          <article-title>Learning Western Techniques of Empire: Republican China and the New Legal Framework for Managing Tibet”</article-title>
          .
          <source>In:Leiden Journal of International Law 30.4</source>
          (
          <issue>2017</issue>
          ), pp.
          <fpage>801</fpage>
          -
          <lpage>824</lpage>
          . doi:
          <volume>10</volume>
          .1017/s0922156517000371.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>J.</given-names>
            <surname>Chen</surname>
          </string-name>
          and
          <string-name>
            <given-names>L. H.</given-names>
            <surname>Tsui</surname>
          </string-name>
          . “
          <article-title>Debating and Developing Digital Humanities in China: New or Old?” In: Global Debates in the Digital Humanities</article-title>
          . Ed. by
          <string-name>
            <given-names>D.</given-names>
            <surname>Fiormonte</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Chaudhuri</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Ricaurte</surname>
          </string-name>
          . Minneapolis, MN: University of Minnesota Press,
          <year>2022</year>
          , pp.
          <fpage>71</fpage>
          -
          <lpage>86</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>P. A.</given-names>
            <surname>Cohen</surname>
          </string-name>
          .
          <article-title>Discovering History in China: American Historical Writing on the Recent Chinese Past</article-title>
          . New York, NY: Columbia University Press,
          <year>1985</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>P.</given-names>
            <surname>Duara</surname>
          </string-name>
          .
          <article-title>Rescuing History from the Nation: Questioning Narratives of Modern China</article-title>
          . Chicago: University of Chicago Press,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>T.</given-names>
            <surname>Feng</surname>
          </string-name>
          .
          <article-title>The Cultural History of the Chinese Concepts Fengjian (Feudalism) and Jingji (Economy)</article-title>
          .
          <source>Singapore: Palgrave Macmillan Singapore</source>
          ,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>D.</given-names>
            <surname>Fiormonte</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Chaudhuri</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Ricaurte</surname>
          </string-name>
          .
          <article-title>Global Debates in the Digital Humanities</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Minneapolis</surname>
          </string-name>
          , MN: University of Minnesota Press,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>J.</given-names>
            <surname>Guldi</surname>
          </string-name>
          .
          <article-title>The Dangerous Art of Text Mining: A Methodology for Digital History</article-title>
          . Cambridge: Cambridge University Press,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>W. L.</given-names>
            <surname>Hamilton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Leskovec</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Jurafsky</surname>
          </string-name>
          . “
          <article-title>Diachronic Word Embeddings Reveal Statistical Laws of Semantic Change”</article-title>
          .
          <source>In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics</source>
          .
          <year>2016</year>
          , pp.
          <fpage>1489</fpage>
          -
          <lpage>1501</lpage>
          . doi:
          <volume>10</volume>
          .18653/v1/
          <fpage>P16</fpage>
          -1141.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>R.</given-names>
            <surname>Heuser</surname>
          </string-name>
          . “Computing Koselleck: Modelling Semantic Revolutions,
          <fpage>1720</fpage>
          --
          <lpage>1960</lpage>
          ”. In:
          <article-title>Explorations in the Digital History of Ideas: New Methods and Computational Approaches</article-title>
          . Ed. by P. de Bolla. Cambridge: Cambridge University Press,
          <year>2023</year>
          , pp.
          <fpage>256</fpage>
          -
          <lpage>285</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>T.</given-names>
            <surname>Hunters</surname>
          </string-name>
          . “Culture, Capital, and
          <article-title>the Temptations of the Imagined Market: The Case of the Commercial Press”</article-title>
          . In: Beyond the May Fourth Paradigm: In Search of Chinese Modernity. Ed. by
          <string-name>
            <surname>K.-W. Chow</surname>
            , T.-k. Hon, H.-y. Ip, and
            <given-names>D. C.</given-names>
          </string-name>
          <string-name>
            <surname>Price</surname>
          </string-name>
          .
          <source>Lexington Books</source>
          ,
          <year>2008</year>
          , pp.
          <fpage>27</fpage>
          -
          <lpage>45</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>L. F.</given-names>
            <surname>Klein</surname>
          </string-name>
          . “
          <article-title>Dimensions of Scale: Invisible Labor, Editorial Work, and the Future of Quantitative Literary Studies”</article-title>
          .
          <source>In:Pmla 135.1</source>
          (
          <issue>2020</issue>
          ), pp.
          <fpage>23</fpage>
          -
          <lpage>39</lpage>
          . doi:
          <volume>10</volume>
          .1632/pmla.
          <year>2020</year>
          .
          <volume>135</volume>
          .1 .23.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>L.</given-names>
            <surname>Liu</surname>
          </string-name>
          . Translingual Practice: Literature,
          <string-name>
            <given-names>National</given-names>
            <surname>Culture</surname>
          </string-name>
          , and Translated ModernityChina,
          <fpage>1900</fpage>
          -
          <lpage>1937</lpage>
          . Stanford, CA: Stanford University Press,
          <year>1991</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>H.</given-names>
            <surname>Long</surname>
          </string-name>
          .
          <article-title>Value in Numbers: Reading Japanese Literature in a Global Information Age</article-title>
          . New York, NY: Columbia University Press,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>K.</given-names>
            <surname>Pomeranz</surname>
          </string-name>
          .
          <article-title>The Making of a Hinterland: State, Society</article-title>
          , and Econmy in Inland North China,
          <fpage>1853</fpage>
          -
          <lpage>1937</lpage>
          . Berkeley, CA: University of California Press,
          <year>1993</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>C. A.</given-names>
            <surname>Reed</surname>
          </string-name>
          . “
          <article-title>Gutenberg and Modern Chinese Print Culture: The State of the Discipline II”</article-title>
          .
          <source>In: Book History</source>
          <volume>10</volume>
          (
          <year>2007</year>
          ), pp.
          <fpage>291</fpage>
          -
          <lpage>315</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>C. A.</given-names>
            <surname>Reed</surname>
          </string-name>
          . Gutenberg in Shanghai: Chinese Print Capitalism,
          <fpage>1876</fpage>
          -
          <lpage>1937</lpage>
          . Honolulu, HI: University of Hawaii Press,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>R. J.</given-names>
            <surname>So</surname>
          </string-name>
          .
          <article-title>Redlining Culture: A Data History of Racial Inequality</article-title>
          and
          <string-name>
            <given-names>Postwar</given-names>
            <surname>Fiction</surname>
          </string-name>
          . New York, NY: Columbia University Press,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>S.</given-names>
            <surname>Soni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. F.</given-names>
            <surname>Klein</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Eisenstein</surname>
          </string-name>
          . “
          <article-title>Abolitionist Networks: Modeling Language Change in Nineteenth-Century Activist Newspapers”</article-title>
          .
          <source>In:Journal of Cultural Analytics 6.1</source>
          (
          <issue>2021</issue>
          ), pp.
          <fpage>1</fpage>
          -
          <lpage>43</lpage>
          . doi:
          <volume>10</volume>
          .22148/001c.
          <fpage>18841</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>S.</given-names>
            <surname>Stewart</surname>
          </string-name>
          . “
          <string-name>
            <given-names>A Community</given-names>
            <surname>Crop: Cotton</surname>
          </string-name>
          , Science, and Extension in Interwar Shandong,
          <fpage>1918</fpage>
          -
          <lpage>1937</lpage>
          ”.
          <source>In: Twentieth-Century China 46.2</source>
          (
          <issue>2021</issue>
          ), pp.
          <fpage>199</fpage>
          -
          <lpage>219</lpage>
          . doi:
          <volume>10</volume>
          .1353/tcc.
          <year>2021</year>
          .
          <volume>0015</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>S.</given-names>
            <surname>Stewart</surname>
          </string-name>
          , 朱吟清, 吴佩臻, and 赵薇. “比较文学研究与数字基础设施: 以建设中 的民国时期期刊语料库 (1918
          <article-title>-1949), 基于 PhiloLogic4 为例”</article-title>
          .
          <source>In: Journal of Digital Humanities 数字人文</source>
          (
          <year>2020</year>
          ), pp.
          <fpage>175</fpage>
          -
          <lpage>182</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>H.</given-names>
            <surname>Tian</surname>
          </string-name>
          . “Difering Translations, Contested Meanings:
          <article-title>A Motor for the 1911 Revolution in China?” In: A Global Conceptual History of Asia</article-title>
          . Ed. by H.
          <source>Schulz-Forberg. London: Pickering &amp; Chatto</source>
          ,
          <year>2014</year>
          , pp.
          <fpage>43</fpage>
          -
          <lpage>60</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>J.</given-names>
            <surname>Tsu</surname>
          </string-name>
          .
          <article-title>Kingdom of Characters: The Language Revolution that Made China Modern</article-title>
          . New York, NY: Penguin Random House,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>L.</given-names>
            <surname>Viola</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Spence</surname>
          </string-name>
          .
          <source>Multilingual Digital Humanities. Routledge</source>
          ,
          <year>2023</year>
          . doi: https://do i.
          <source>org/10</source>
          .4324/9781003393696.
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>Q.</given-names>
            <surname>Weiyun</surname>
          </string-name>
          and
          <string-name>
            <given-names>Z.</given-names>
            <surname>Wenhui</surname>
          </string-name>
          . “
          <article-title>Zouxiang xin shijie: Shuzi renwen shiye xia Zhongguo jindai 'shijie' gainian de xingcheng yu yanbian走向新世界: 数字人文视野下中国近代 “世 界” 概念的形成与演变”</article-title>
          .
          <source>In: Journal of Nanjing University</source>
          <volume>5</volume>
          (
          <year>2020</year>
          ), pp.
          <fpage>889</fpage>
          -
          <lpage>906</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>J.</given-names>
            <surname>Weng</surname>
          </string-name>
          . “
          <article-title>What is Mandarin? The Social Project of Language Standardization in Early Republican China”</article-title>
          .
          <source>In: Journal of Asian Studies 77.3</source>
          (
          <issue>2018</issue>
          ), pp.
          <fpage>611</fpage>
          -
          <lpage>633</lpage>
          . doi:
          <volume>10</volume>
          .1017/s0 021911818000487.
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>S.</given-names>
            <surname>Xie</surname>
          </string-name>
          . “
          <article-title>Translating Modernity Towards Translating China”</article-title>
          .
          <source>InT:ranslating China</source>
          . Ed. by
          <string-name>
            <given-names>X.</given-names>
            <surname>Luo</surname>
          </string-name>
          and
          <string-name>
            <given-names>H.</given-names>
            <surname>Yuanjian</surname>
          </string-name>
          . Bristol: Multilingual Matters,
          <year>2009</year>
          , pp.
          <fpage>135</fpage>
          -
          <lpage>156</lpage>
          . doi:
          <volume>10</volume>
          .21832 /
          <fpage>9781847691880</fpage>
          -
          <lpage>011</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [28] [29]
          <string-name>
            <given-names>M.</given-names>
            <surname>Zanasi</surname>
          </string-name>
          . Economic Thought in Modern China: Market and Consumption, c.
          <fpage>1500</fpage>
          -
          <lpage>1937</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          Cambridge: Cambridge University Press,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          <string-name>
            <given-names>M.</given-names>
            <surname>Zanasi</surname>
          </string-name>
          .
          <article-title>Saving the Nation: Economic Modernity in Republican China</article-title>
          . Chicago: University of Chicago Press,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>P.</given-names>
            <surname>Zarrow</surname>
          </string-name>
          .
          <source>After Empire: The Conceptual Transformation of the Chinese State</source>
          ,
          <fpage>1885</fpage>
          -
          <lpage>1924</lpage>
          . Stanford, CA: Stanford University Press,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhong</surname>
          </string-name>
          .
          <source>Chinese Grammatology: Script Revolution and Literary Modernity</source>
          ,
          <fpage>1916</fpage>
          -
          <lpage>1958</lpage>
          . New York, NY: Columbia University Press,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>