<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>M. Wevers and T. Smits. “The Visual Digital Turn: Using Neural Networks to Study
Historical Images”. In: Digital Scholarship in the Humanities</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Fabian Ofert</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Peter Bell</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Friedrich Alexander University Erlangen-Nuremberg</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of California</institution>
          ,
          <addr-line>Santa Barbara</addr-line>
          ,
          <country country="US">U.S.A</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2020</year>
      </pub-date>
      <volume>1</volume>
      <issue>2020</issue>
      <fpage>18</fpage>
      <lpage>20</lpage>
      <abstract>
        <p>While generative machine learning has recently attracted a significant amount of attention in the computer science community, its potential for the digital humanities has so far not been fully evaluated. In this paper, we examine generative adversarial networks, a state-of-the art generative machine learning technique. We argue that GANs can be particularly useful in digital art history, where they can be employed to facilitate the exploration of the semantic structure of large image corpora. Moreover, we posit that the foundational statistical distinction between discriminative and generative approaches ofers an alternative critical perspective on machine learning in the digital humanities context. If “all models are wrong, some are useful”, as the often-cited passage reads, we argue that, in case of the digital humanities, the most useful-wrong models are generative.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;machine learning</kwd>
        <kwd>generative models</kwd>
        <kwd>data augmentation</kwd>
        <kwd>digital art history</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        As a potential demonstration of this approach, this paper attempts to make both a
theoretical and preliminary practical contribution to the integration of machine learning into digital
humanities research. Concretely, we explore a subfield of machine learning which we believe to
have great practical and critical potential, but which so far has not been studied in the digital
humanities context. Recent research [
        <xref ref-type="bibr" rid="ref1 ref17 ref24">1, 17, 34, 24</xref>
        ] has shown the huge potential of adopting
experimental machine learning methods for digital humanities research in particular. What is
missing from these and similar investigations, however, is the subfield of generative machine
learning.
      </p>
      <p>
        On the theoretical level, we investigate the epistemological implications of generative
machine learning techniques in the context of digital humanities research. Despite a seemingly
infinite variety of highly specific machine learning applications, all machine learning is
automated statistical modeling. The rules of statistics, even if they are applied to web-scale corpora
of complex, high-dimensional data, stand as unifying principles behind all machine learning
approaches. While this is obvious to computer science practitioners, it is often disregarded
when machine learning is discussed in the humanities context, mirroring an often-diagnosed
epistemological split [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] between computer scientists and engineers and digital humanities
scholars. We argue that, somewhat counter-intuitively, statistical notions ofer an alternative
critical perspective on machine learning in the humanities context, and we suggest that the
foundational statistical distinction between discriminative and generative approaches [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ] can
be utilized to guide the further development of the computational humanities. In other words,
if “all models are wrong, some are useful”, as the often-cited passage by George Box reads
[5], we argue that, in case of the computational humanities, the most useful-wrong models are
generative models.
      </p>
      <p>
        On the practical level, we explore the potential of generative machine learning techniques in
the visual domain, targeting applications in digital art history. Concretely, we examine
generative adversarial networks [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], a state-of-the art image-based generative machine learning
technique, in regard to their potential for data augmentation, i.e. the production of “realistic”
additional data from an image corpus. As visual data augmentation methods using GANs have
recently been applied successfully to a number of complex tasks in the sciences (e.g.
Ravanbakhsh et al. [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ]) we propose that they can also be of practical use in the digital humanities
context, particularly in digital art history, where data is often particularly scarce.
      </p>
      <p>We conclude by arguing that much more research is needed regarding both the practical
potential and theoretical implications of generative machine learning in the digital and
computational humanities.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Generative machine learning</title>
      <p>
        For the purpose of this paper, we formally define generative machine learning as one of two
possible statistical approaches – discriminative and generative – to modeling real-world data
[
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]. A toy example best demonstrates the diference between these two approaches in
statistical terms. To distinguish between two kinds of objects, say apples and oranges, based on
a dataset of labeled images of both, we can imagine two possible strategies of classification.
We can design a model that learns the most salient diference between apples and oranges
from the dataset. A good candidate for a distinctive visual feature would be color: apples are
recently established DFG research cluster “The Digital Image”, one could argue that this suggests the beginning
of a “critical turn” in the field.
usually red; oranges are usually orange. The model then uses these most salient features to
classify new, unseen samples. This is the discriminative approach. The generative approach,
however, learns the complete distribution of visual features for both apples and oranges. Apples
come in diferent shades of red, yellow, and green, oranges come in diferent shades of orange.
The model then classifies new, unseen samples by comparing their visual feature distribution
to the visual feature distribution for apples and the visual feature distribution for oranges.
In other words: the discriminative approach attempts to model a decision boundary between
classes (it literally learns “where to draw the line” between apples and oranges), while the
generative approach attempts to model the actual distribution of each class. The generative
approach essentially asks: what is the most likely source of the signal we are seeing, while the
discriminative approach simply looks for a way to distinguish one signal from the other and
does not take the source into account.
      </p>
      <p>Formally, whereas the discriminative approach attempts to learn the conditional probability
distribution p(y|x), i.e. the probability of y given x, the generative approach attempts to learn
the joint probability distribution p(x, y), i.e. the probability of x, y to appear together. While
the discriminative approach models p(y|x) directly and thus allows us to find the most likely
class y for any given set of features x, the generative approach models p(y) (the so called
class priors, i.e. the distribution of labels) and both p(x|y = apple) and p(x|y = orange). By
applying Bayes’ rule, we can then derive the posterior distribution on y given x:
p (y|x) =</p>
      <p>p (x|y)
p (y) p (x)</p>
      <p>
        Note that the generative approach is unsupervised (as there is one “model” for each class
and the label of each class thus becomes irrelevant), even though it can be transformed into
a supervised approach with the steps described above. In other words: the discriminative
approach is a sparse approach to modeling the world – only what is needed from the world
is taken into account – and the generative approach is a dense approach – an attempt to
model the world as it is. More importantly, while a sparse approach can tell us something
about the abstract properties of our data, a dense approach can tell us something about the
concrete properties of our data. While both approaches can thus solve classification problems,
the discriminative approach is obviously simpler. As resources are limited, we usually would
like to avoid learning things that are not pertinent to the problem. To distinguish apples from
oranges, for instance, learning the shape of both seems irrelevant. As Vapnik [
        <xref ref-type="bibr" rid="ref33">33</xref>
        ] puts it: “one
should solve the [classification] problem directly and never solve a more general problem as an
intermediate step [such as modeling p(y|x)].”
      </p>
      <p>Taking the generative route, however, comes with one specific benefit: it enables us to
synthesize data, to produce new likely data points for each of the classes we are modeling. For
instance, by sampling the distribution p(x|y = apple) we would get a set of features x0, ..., xn
describing one possible manifestation of “apple”. This is impossible with the discriminative
approach, because a discriminative model has only learned the diference between apples and
oranges, and not what apples and oranges are (in terms of their features).</p>
    </sec>
    <sec id="sec-3">
      <title>3. Generative digital humanities</title>
      <p>
        The digital humanities have often been broadly criticized for the mere use of quantitative
methods, as eloquently summarized in Ted Underwood’s blog post “It looks like you’re writing
an argument against data in literary study…” [
        <xref ref-type="bibr" rid="ref32">32</xref>
        ]. While some of these critiques from the
early days of the field still resonate, and general critiques of quantitative methods occasionally
reappear with force (as recently in Claire Bishop’s critique of digital art history, Bishop [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]),
generally, a consensus has grown that such a blanket rejection has no grounding in the reality of
digital humanities work. The focus of critique, thus, has shifted to more elaborate discussions
of theory building with quantitative methods [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ], and the shortcomings of specific statistical
tools in the analysis of cultural data.
      </p>
      <p>
        One of the most powerful recent critiques is Nan Z. Da’s article “The Computational Case
against Computational Literary Studies” [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Da writes: “all the things that appear in
[Computational Literary Studies]—network analysis, digital mapping, linear and nonlinear
regressions, topic modeling, topology, entropy—are just fancier ways of talking about word frequency
changes.” Based on a formal distinction between discriminative and generative methods, as
outlined above, we can see, however, how some of these methods are not the same. In fact,
linear regression and topic modeling fall on opposite ends of the spectrum between
discriminative and generative approaches, and the prevalence of latent Dirichlet allocation (LDA), also
known as (one variation of) topic modeling, in computational literary studies and in the digital
humanities in general points to an even stronger claim: the digital humanities “intuitively”
choose generative over discriminative approaches because they are more aligned to humanities
data.
      </p>
      <p>Why do the digital humanities gravitate towards generative approaches? Because generative
approaches mitigate, at least in part the alienation, the general inadequacy of quantitative
methods vis-a-vis cultural artifacts. Quantitative methods, obviously, can never fully represent
cultural artifacts. Precisely, both the sampling of cultural artifacts into data, and the modeling
of this data are reductive. In the domain of modeling, however, generative approaches stay
as close to the material as possible, while discriminative approaches essentially “ignore” the
material for the sake of classification. In other words, generative approaches, while not being
able to mitigate the problems introduced by sampling, can mitigate the problems introduced
by modeling within the realm of what can be modeled.</p>
      <p>
        Regardless, many of the problems discussed in Da’s article stand with or without generative
machine learning. Shoddy hypothesis building or the lack thereof, intentional or unintentional
over- or misinterpretation of the empirical evidence quantitative methods can ofer, or
toobroad applications of narrow technical concepts are problematic irrespective of the kind of
model involved. Hence, a focus on generative approaches does not “solve” or even explicitly
address these issues. Generative approaches do not magically produce a “self-reflexive account
of what the model has sought to measure and the limitations of its ability to produce such a
measurement”, as Richard Jean So writes [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ]. On the contrary, generative methods tend to
actually, implicitly encourage the problematic “exploratory” approach that became a central
argument in the discussion following Da’s article3 [
        <xref ref-type="bibr" rid="ref31 ref7">31, 7</xref>
        ].
      </p>
      <p>
        Moreover: famously, the existence of “raw data” is an illusion [
        <xref ref-type="bibr" rid="ref10 ref12 ref21">12, 21, 10</xref>
        ], and the existence
of “neutral algorithms” even more so [
        <xref ref-type="bibr" rid="ref3 ref6 ref9">3, 6, 9</xref>
        ]. Thus, if we propose that generative methods
“stay as close to the material as possible”, we do not imply the absence of subjective guidance
through the design or selection of algorithms and datasets. Indeed, it is not only dataset bias
that shapes machine learning models, but inductive biases induced by pragmatic architectural
decisions often further entangle subjective and machinic perspectives [
        <xref ref-type="bibr" rid="ref11 ref24">24, 11</xref>
        ].
      </p>
      <p>What a critical distinction between generative and discriminative approaches ofers,
regard3One could argue that this is an efect of the reduced interpretability of generative methods.
less, is a prospective path through current and future experimental work in computer science,
where the digital humanities, we argue, need to critically consider the distinction between
generative and discriminative methods in the evaluation of new, experimental tools and methods
– all while keeping in mind that the maximum benefit of all machine learning models is a
“useful-wrong” model in the sense of Box [5], i.e. a model that stays “reasonably” close to the
material.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Generative digital art history</title>
      <p>
        In the following, we sketch such a path for digital art history. As Leonardo Impett has pointed
out [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] the computer vision problems that art history relates to are almost exclusively
searchrelated, i.e. they are classification problems. What if digital art history would start focusing on
generative methods instead? Would a closer relation to the material also establish itself in the
domain of images? Generative methods are already implicitly employed in the neural network
based clustering of images, which has become increasingly more popular in digital art history
in recent years [34]. When embeddings are utilized, a learned sub-system of a classifier is
repurposed exactly for its generative properties, which is also why recent research [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] suggests
building explicitly generative systems for the specific purpose of clustering. In the realm of
explicitly generative systems, then, we argue that generative adversarial networks can become
for the visual domain what LDA became for the text domain: an instrument of unsupervised
exploration for large-scale corpora.
      </p>
      <p>
        Generative adversarial networks, first introduced by Goodfellow et al. [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] leverage game
theory to model the probability distribution of a corpus by means of a minimax game between
two deep convolutional neural networks. Efectively, generative adversarial networks define a
noise distribution pz which is mapped to data space via G (z; θg) where G is a “generator”, an
“inverted” convolutional neural network with parameters θg that “expands” an input variable
into an image, rather than “compressing” an image into a classification probability. G is
trained in conjunction with a “discriminator”, a second deep convolutional neural network
D (z; θd) that outputs a single scalar. D (x) represents the probability that x came from
the data rather than G. Note that the whole system, not just the “generator”, realizes the
generative approach, as the whole system is needed to model p(x|y = 0). Also note that the
system efectively learns a compression: a high-dimensional data space with dimensions &gt; z is
compressed to be reproducible from a data space with dimensions z.
      </p>
      <p>
        The original paper by Goodfellow demonstrates the potential of generative adversarial
networks to synthesize images in particular by synthesizing new handwritten digits from the
MNIST dataset. The MNIST dataset, however, has a resolution of 28x28 pixels, i.e. several
orders of magnitude below standard photo resolutions. Scaling up the approach proved difficult,
and while a lot of efort was made to go beyond marginal resolutions, progress was slow (for
machine learning) until very recently, when StyleGAN [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ], a generative adversarial network
that implemented several significant optimization tricks to mitigate some of the limitations of
generative adversarial networks, was introduced. Current-generation models like StyleGAN2
[
        <xref ref-type="bibr" rid="ref20">20</xref>
        ], which presents another improvement over the original StyleGAN, are able to produce
extremely realistic samples from large image corpora, samples that – the reader should keep
this in mind – are not part of any training corpus, but share the decisive features of the images
in a corpus.
      </p>
      <p>Such samples, to the humanist, feel uncanny. GANs obviously learn “something” (maybe
everything?) about a corpus, and GAN samples “tick all the boxes” at the first glance. At the
same time, GANs seem almost useless. What knowledge is there to gain from a model that
essentially learns to recreate approximations to what exists, and nothing about what exists?
Interestingly, for a long time and despite impressive early results, the utility of generative
adversarial networks was not entirely clear in the computer science community either. And while
today there are obvious applications in digital image processing (inpainting, superresolution,
image-to-image translation, style transfer etc.) and manipulation (deep fakes etc.), the
epistemological qualities of GANs, i.e. their role in scientific (and, we argue, humanist) processes of
discovery, are still not fully explored.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Generative data augmentation</title>
      <p>
        In the meantime, however, the targeted generation of images with GANs has been improved
significantly. Bau et al. [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] had already shown in 2018 that GANs learn less entangled
representations than comparable CNNs. More recently, approaches have been found that allow
the unsupervised identification of meaningful hyper-directions in GAN latent spaces, with an
approach called GANspace [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] currently providing the most efficient method 4. In other words,
if recent experimental approaches are taken into account, latent spaces become accessible to
exploration.
      </p>
      <p>Why is this relevant to the potential exploitation of generative methods in the visual domain?
An important epistemological aspect of GANs is the continuity of the latent spaces that can
be produced. GAN latent spaces are, for better or worse, “filled to the brink” with images.
This also means, for each sample there exist millions of other samples that look almost like
this sample, except for tiny details. It also means that, between each two samples we can
ifnd theoretically infinite “intermediate” samples, hybrid images that combine aspects of both
of the samples between which they are positioned. Digital humanities corpora, on the other
hand, visual or otherwise, always exist as discrete collections of samples. Concretely, in an art
historical corpus, there is no “intermediate” image between, for instance, a Titian Marriage
at Cana and a Last Supper, or between the latter and a Last Supper of Titian. However, if
we train a GAN on this corpus, such an “intermediate” image suddenly comes into existence.
Simply put we argue that GANs can reintroduce a certain continuity to a corpus that allows
to study the discreteness of the corpus itself.</p>
      <p>
        In the following, we present a proof of concept for this approach. In a first step, we train
a generative adversarial network, following the StyleGAN architecture, on two art historical
corpora. First, an iconographic corpus of 20,000 “adoration” scenes. This corpus contains
images referencing the adoration of the Christ Child; in particular the Adoration of the Child
(by Mary and Joseph), the Adoration of the Shepherds, and the Adoration of the Magi. The
second corpus contains 50,000 images from the Museum of Modern Art, New York, online
collection. The hypothesis, here, is that a GAN, by means of compression, would learn the
most salient diferences between images in a corpus. In the case of the adoration corpus in
particular, which is drawn from several centuries of visual culture, these diferences would likely
not only relate to the number and arrangement of people and objects, but also the style and
medium of the works. In a second step, we then analyze the most salient hyper-directions in
the learned latent space with the help of the GANspace method. Importantly, GANspace is
4In July 2020, another, conceptually diferent, approach was published [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ] that shows even more promising
results. Unfortunately, we were unable to test it on our data before the submission deadline of this work.
an unsupervised method, i.e. no labeling is involved.
      </p>
      <p>What we find is that, indeed both semantic and syntactic hyper-directions in latent space
emerge. For the adoration corpus (fig. 2), an interpretation of these directions suggests that,
among other things, they represent syntactic concepts such as “painting or object” (C1),
“precious carving or non finito sculpture” (C4), “pencil or colored drawing” (C7), “sharp or blurry
outlines” (C9) and semantic concepts such as “zooming into a scene” (C3) and “number of
people in a scene” (C5, C6). For the MoMA corpus (fig. 3), unsurprisingly for a corpus
composed of mainly abstract art, we find mostly syntactic concepts such as “figurative or abstract”
(C0, C13), “organic or technical” (C3), “drawing or painting” (C1), “textural or graphical”
(C8).</p>
      <p>It is important to point out that, due to the proof-of-concept nature of the above
experiment, significant caveats apply. Obviously, our results are exploratory and trivial in the exact
sense criticized by Da. At this point, they do not expand our knowledge about either the
concrete corpus or about a potential iconography represented by it but simply confirm our
preconceived ideas about both. Moreover, comparable results could have likely been achieved
by more established methods, like clustering based on CNN features, or a principal component
analysis in pixel space. Finally, GAN latent spaces, of course, are imaginary spaces. They are
reconstructions of the defining features of a corpus, and exploring such spaces is not the same
thing as exploring the corpus itself.</p>
      <p>This imaginary quality, however, that is deeply problematic in any other application of
generative methods (for instance, in the sciences), can precisely be of use in the digital humanities
context. Here, GAN samples are not mistaken for valid information generated from nothing,
as in so many recent examples, but can be understood as an additional means to ask questions
about the information we do have, about the corpus at hand.</p>
      <p>
        While our concrete results thus remain preliminary, we argue that they point towards a
significant potential of generative methods in the visual domain. By reintroducing continuity
to a corpus of discrete images, we are forced to precisely quantify the semantic thresholds that
support its discreteness. Discrete concepts are transformed into continuous variables. What,
exactly, defines a certain iconography? How far in any direction (in the literal sense of latent
space hyper-directions) can we veer of until an image that is clearly recognizable as belonging
to a certain iconographic tradition stops being recognizable as such? A synthetic grammar of
art emerges that is not historical like Riegl’s [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ] but diachronic and multimodal. In a sense,
if generative approaches automatically stay close to the material, using GANs means staying
closer to the material than actually possible by augmenting it.
      </p>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusion</title>
      <p>In this paper, we have demonstrated how the statistical distinction between generative and
discriminative approaches can inform the methodological discourse in the digital humanities
and can be understood as a starting point for a deep technical exploration in the computational
humanities. We have argued that generative methods, while they are not immune against many
of the general problems pointed out in recent discussions on the methodological grounding of
digital humanities work, can mitigate some of the problems introduced on the level of data
modeling. Moreover, we have proposed that, while computational literary studies and related
sub-disciplines of the digital humanities have already implicitly embraced generative methods,
the visual digital humanities lack equivalent tools.</p>
      <p>We have also suggested to explore generative adversarial networks as a potential generative
approach in digital art history and have documented a proof-of-concept approach utilizing
StyleGAN and the GANspace algorithm to identify meaningful directions in the latent spaces
of two GANs trained on art historical corpora. Based on the results from this experiment we
have argued that, other than in scientific uses of GANs, the imaginary nature of GAN images
allows for the emergence of a synthetic, continuous “replacement” corpus that, exactly by
means of its continuity, can serve to delineate the semantic thresholds that define a collection
of images.</p>
      <p>Pragmatically, future research will have to show if GANs are the right tool for this purpose, or
if other networks like variational autoencoders need to be considered. More importantly, future
research will have to empirically verify the hypothesis that the synthetic corpora produced by
GANs and related methods are interpretable enough to serve as a means to evaluate semantic
(e.g. iconographic) concepts in specific art-historical corpora, or if more traditional methods
remain the more viable approach for the time being.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>T.</given-names>
            <surname>Arnold</surname>
          </string-name>
          and
          <string-name>
            <given-names>L.</given-names>
            <surname>Tilton</surname>
          </string-name>
          . “Distant Viewing:
          <article-title>Analyzing Large Visual Corpora”</article-title>
          .
          <source>In: Digital Scholarship in the Humanities</source>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>D.</given-names>
            <surname>Bau</surname>
          </string-name>
          et al. “
          <article-title>GAN Dissection: Visualizing and Understanding Generative Adversarial Networks”</article-title>
          . In: arXiv preprint arXiv:
          <year>1811</year>
          .
          <volume>10597</volume>
          (
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>R.</given-names>
            <surname>Benjamin</surname>
          </string-name>
          .
          <article-title>Race after Technology: Abolitionist Tools for the New Jim Code</article-title>
          . John Wiley &amp; Sons,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>C.</given-names>
            <surname>Bishop</surname>
          </string-name>
          . “
          <article-title>Against Digital Art History”</article-title>
          .
          <source>In: International Journal for Digital Art History</source>
          <volume>3</volume>
          (
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <given-names>G. E. P.</given-names>
            <surname>Box</surname>
          </string-name>
          . “
          <article-title>Robustness in the Strategy of Scientific Model Building”</article-title>
          . In: Robustness in Statistics. Academic Press,
          <year>1979</year>
          , pp.
          <fpage>201</fpage>
          -
          <lpage>236</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>J.</given-names>
            <surname>Buolamwini</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Gebru</surname>
          </string-name>
          . “Gender Shades:
          <article-title>Intersectional Accuracy Disparities in Commercial Gender Classification”</article-title>
          .
          <source>In: Conference on Fairness, Accountability and Transparency</source>
          .
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>N. Z.</given-names>
            <surname>Da</surname>
          </string-name>
          . “Critical Response III.
          <string-name>
            <surname>On</surname>
            <given-names>EDA</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Complexity</surname>
          </string-name>
          , and
          <article-title>Redundancy: A Response to Underwood and Weatherby”</article-title>
          .
          <source>In: Critical Inquiry 46.4</source>
          (
          <issue>2020</issue>
          ), pp.
          <fpage>913</fpage>
          -
          <lpage>924</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>N. Z.</given-names>
            <surname>Da</surname>
          </string-name>
          . “
          <article-title>The Computational Case against Computational Literary Studies”</article-title>
          .
          <source>In: Critical Inquiry 45.3</source>
          (
          <issue>2019</issue>
          ), pp.
          <fpage>601</fpage>
          -
          <lpage>639</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>T. de Vries</surname>
          </string-name>
          et al. “
          <article-title>Does Object Recognition Work for Everyone?</article-title>
          ”
          <source>In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops</source>
          .
          <year>2019</year>
          , pp.
          <fpage>52</fpage>
          -
          <lpage>59</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>J. E. Dobson. Critical</given-names>
            <surname>Digital</surname>
          </string-name>
          <article-title>Humanities: The Search for a Methodology</article-title>
          . University of Illinois Press,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>R.</given-names>
            <surname>Geirhos</surname>
          </string-name>
          et al. “
          <article-title>Shortcut Learning in Deep Neural Networks”</article-title>
          . In: arXiv preprint arXiv:
          <year>2004</year>
          .
          <volume>07780</volume>
          (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>L.</given-names>
            <surname>Gitelman</surname>
          </string-name>
          .
          <source>Raw Data Is An Oxymoron</source>
          . Cambridge, MA: MIT Press,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>I. Goodfellow</surname>
          </string-name>
          et al. “
          <article-title>Generative Adversarial Nets”</article-title>
          .
          <source>In: Advances in Neural Information Processing Systems</source>
          .
          <year>2014</year>
          , pp.
          <fpage>2672</fpage>
          -
          <lpage>2680</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Graving</surname>
          </string-name>
          and
          <string-name>
            <given-names>I. D.</given-names>
            <surname>Couzing</surname>
          </string-name>
          . “
          <string-name>
            <surname>VAE-SNE</surname>
          </string-name>
          :
          <article-title>A Deep Generative Model for Simultaneous Dimensionality Reduction and Clustering”</article-title>
          . In: bioRxiv preprint (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>E.</given-names>
            <surname>Härkönen</surname>
          </string-name>
          et al. “
          <article-title>GANSpace: Discovering Interpretable GAN Controls”</article-title>
          . In: arXiv preprint arXiv:
          <year>2004</year>
          .
          <volume>02546</volume>
          (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>L.</given-names>
            <surname>Impett</surname>
          </string-name>
          . Open Problems in Computer Vision. Friedrich Alexander University ErlangenNuremberg, Mar.
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>L.</given-names>
            <surname>Impett</surname>
          </string-name>
          and
          <string-name>
            <given-names>F.</given-names>
            <surname>Moretti</surname>
          </string-name>
          . “Totentanz.
          <article-title>Operationalizing Aby Warburg's Pathosformeln”</article-title>
          .
          <source>In: New Left Review</source>
          <volume>107</volume>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>F.</given-names>
            <surname>Jannidis</surname>
          </string-name>
          .
          <article-title>No Results or Wrong? Methodological Challenges in Computational Literary Studies</article-title>
          . University of Leipzig,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>T.</given-names>
            <surname>Karras</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Laine</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Aila</surname>
          </string-name>
          .
          <article-title>“A Style-Based Generator Architecture for Generative Adversarial Networks”</article-title>
          . In: arXiv preprint arXiv:
          <year>1812</year>
          .
          <volume>04948</volume>
          (
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>T.</given-names>
            <surname>Karras</surname>
          </string-name>
          et al. “
          <article-title>Analyzing and Improving the Image Quality of StyleGAN”</article-title>
          . In: arXiv preprint arXiv:
          <year>1912</year>
          .
          <volume>04958</volume>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>B.</given-names>
            <surname>Latour</surname>
          </string-name>
          . “
          <article-title>Circulating Reference: Sampling the Soil in the Amazon Forest”</article-title>
          .
          <source>In: Pandora's Hope: Essays on the Reality of Science Studies</source>
          . Cambridge, MA: Harvard University Press,
          <year>1999</year>
          , pp.
          <fpage>24</fpage>
          -
          <lpage>79</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <surname>G. Mercuriali.</surname>
          </string-name>
          “
          <article-title>Digital Art History and the Computational Imagination”</article-title>
          .
          <source>In: International Journal for Digital Art History</source>
          <volume>3</volume>
          (
          <year>2019</year>
          ), p.
          <fpage>141</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>A. Y.</given-names>
            <surname>Ng</surname>
          </string-name>
          and
          <string-name>
            <surname>M. I. Jordan.</surname>
          </string-name>
          “
          <article-title>On Discriminative vs. Generative Classifiers: A Comparison of Logistic Regression and Naive Bayes”</article-title>
          .
          <source>In: Advances in Neural Information Processing Systems</source>
          .
          <year>2002</year>
          , pp.
          <fpage>841</fpage>
          -
          <lpage>848</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>F.</given-names>
            <surname>Ofert</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Bell</surname>
          </string-name>
          . “
          <article-title>Perceptual Bias and Technical Meta-Images. Critical Machine Vision as a Humanities Challenge”</article-title>
          . In: AI &amp;
          <string-name>
            <surname>Society</surname>
          </string-name>
          (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>M.</given-names>
            <surname>Piotrowski</surname>
          </string-name>
          . “
          <article-title>Ain't No Way Around It: Why We Need to Be Clear About What We Mean by “Digital Humanities””</article-title>
          . In: (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>S.</given-names>
            <surname>Ravanbakhsh</surname>
          </string-name>
          et al. “
          <article-title>Enabling Dark Energy Science with Deep Generative Models of Galaxy Images”</article-title>
          .
          <source>In: Thirty-First AAAI Conference on Artificial Intelligence</source>
          .
          <year>2017</year>
          , pp.
          <fpage>1488</fpage>
          -
          <lpage>1494</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>A.</given-names>
            <surname>Riegl</surname>
          </string-name>
          .
          <article-title>Historical Grammar of the Visual Arts</article-title>
          . New York, NY: Zone Books,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>Y.</given-names>
            <surname>Shen</surname>
          </string-name>
          and
          <string-name>
            <given-names>B.</given-names>
            <surname>Zhou</surname>
          </string-name>
          . “
          <article-title>Closed-Form Factorization of Latent Semantics in GANs”</article-title>
          . In: arXiv preprint arXiv:
          <year>2007</year>
          .
          <volume>06600</volume>
          (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>R. J.</given-names>
            <surname>So</surname>
          </string-name>
          . “
          <article-title>All Models Are Wrong”</article-title>
          .
          <source>In: PMLA 132.3</source>
          (
          <issue>2017</issue>
          ), pp.
          <fpage>668</fpage>
          -
          <lpage>673</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>N.</given-names>
            <surname>Tahmasebi</surname>
          </string-name>
          and
          <string-name>
            <given-names>S.</given-names>
            <surname>Hengchen</surname>
          </string-name>
          . “
          <article-title>The Strengths and Pitfalls of Large-Scale Text Mining for Literary Studies”</article-title>
          .
          <source>In: Samlaren: Tidskrift för Svensk Litteraturvetenskaplig Forskning</source>
          <volume>140</volume>
          (
          <year>2019</year>
          ), pp.
          <fpage>198</fpage>
          -
          <lpage>227</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>T.</given-names>
            <surname>Underwood</surname>
          </string-name>
          . “
          <article-title>Critical Response II. The Theoretical Divide Driving Debates about Computation”</article-title>
          .
          <source>In: Critical Inquiry 46.4</source>
          (
          <issue>2020</issue>
          ), pp.
          <fpage>900</fpage>
          -
          <lpage>912</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>T.</given-names>
            <surname>Underwood</surname>
          </string-name>
          .
          <article-title>It Looks like You're Writing an Argument against Data in Literary Study …</article-title>
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>V.</given-names>
            <surname>Vapnik</surname>
          </string-name>
          .
          <article-title>Statistical Learning Theory</article-title>
          . New York, NY: Wiley,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>