<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Computational Humanities Research Conference, November</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>'Psyché' as a Rosetta Stone? Assessing Collaborative Authorship in the French 17th Century Theatre</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Florian Cafiero</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jean-Baptiste Camps</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Université PSL</institution>
          ,
          <addr-line>65 rue de Richelieu, 75002 Paris</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2021</year>
      </pub-date>
      <volume>1</volume>
      <issue>4</issue>
      <fpage>7</fpage>
      <lpage>19</lpage>
      <abstract>
        <p>During the 17th century, a significant number of collaborations emerged between playwrights, among which authors as famous as Pierre Corneille, Thomas Corneille or Molière, as well as Philippe Quinault or Jean Donneau de visé. The actual division of labour between authors can sometimes be deduced from historical documents, but is most of the time uncertain. In this paper, we try to address this question by using the information we got from one specific instance of collaboration: Psyché (1671). We first try to assess the accuracy of the notice to the reader of the printed edition of the play, where each author's involvement is clearly claimed, using machine learning and “rolling stylometry” methodology. We then use the optimal parameters already applied to this play to analyse other collaborative works of the time, in particular cases of potential collaboration between Thomas Corneille and Jean Donneau de Visé in Circé and L'Inconnu.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;authorship attribution</kwd>
        <kwd>French literature</kwd>
        <kwd>17th century</kwd>
        <kwd>rolling stylometry</kwd>
        <kwd>collaborative authorship</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Yet, the task raises a number of difficulties. First, style change detection is not a completely
solved problem, and even recent competitions [30] have shown that precisely determining style
breaches, i.e. places where the authorship switches in a collaborative text, was still a complex
task.</p>
      <p>
        The task is all the more complicated for French 17th century plays, known as “théâtre
classique”. The 17th century is a time when the notion of authorship settles in France [
        <xref ref-type="bibr" rid="ref23">31</xref>
        ].
Yet, as stated by Quiel, the need for originality or the fear of being too derivative were not yet
a major worry for the authors of the time:
      </p>
      <p>French playwrights do not refrain from imitating or adapting whichever text is likely
to be presented on a stage, sometimes without even bothering to make significant
additions or to add characteristics from their own literary style [23].1</p>
      <p>
        Strongly codified, building their plots on the same Spanish, Italian, Greek or Latin models,
these plays are often very homogeneous, which makes it more difficult to properly attribute
texts . In particular, similarities induced by the literary genre or subgenre can be as strong as
similarities induced by the authors’ idiolect [
        <xref ref-type="bibr" rid="ref22 ref3 ref4">27, 3, 4</xref>
        ].
      </p>
      <p>
        Imitation could even go to the point of piracy or plagiarism, such as attempting to steal
another author play, or led to disputes about the true source of a story. The polygraph and
literary “entrepreneur” Donneau de Visé [28] – famous later on as the as the powerful founder
and editor of the monthly Mercure Galant, a collective literary periodical that published a
mix of recycled manuscript or printed pieces, reader contributions (poems, etc.) and his
own original material – was familiar of such practices at least in his early career. He more
or less started his career by trying to steal Sganarelle ou le cocu imaginaire from Molière:
before Molière could publish himself his own play, Donneau published in 1660, with the help
of the printer Jean Ribou, both a pirate edition of Molière’s Sganarelle, in which he added
his commentaries and that he went to the point of dedicating to Molière himself (!), and a
plagiarised play, La Cocue imaginaire, where he reversed masculine and feminine roles [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. He
also had an important dispute with Quinault around the Mère Coquette, two plays with this
name being published, in 1665 by Quinault and 1666 by Donneau.
      </p>
      <p>
        A final challenge is related to the diverse potential nature of collaborative writing.
According to Pennebaker and Ireland [
        <xref ref-type="bibr" rid="ref20">21</xref>
        ], three main hypotheses can be made on the result of
collaborative writing: the “Just-like-another-member-of-the-team hypothesis” were
collaborative writing can be distributed in portions successively attributable to the idiolect of one of
the authors; “The average person hypothesis” were the resulting style is an average of the
authors’ idiolects; and finally the “synergy hypothesis” were the contact situation and
interactions between diferent indivual idiolects create a resulting singular style, diferent of each
individual one. This last hypothesis tends to be verified in famous cases such as the Lennon
and McCartney collaboration or the one between Hamilton and Madison.
      </p>
      <p>As we well see, suspected collaborative writing cases in 17th century French theatre – even
though we might suspect them on declarative grounds to fall mainly in the first
“Just-likeanother-member-of-the-team hypothesis” – also presents clue of a division of the work on
diferent authorial levels, for instance content versus form, narrative versus versification.</p>
      <p>To help us address the various collaborative authorship problems raised by the writings of
this century, we thus try to work on one of the best documented collaboration of the time:
Psyché.</p>
      <p>1All translations are our own.</p>
      <sec id="sec-1-1">
        <title>1.2. ‘Psyché’ and its notice ‘to the reader’</title>
        <p>
          Psyché is a tragedy-ballet in five acts, written in free verse, and created in 1671, during the
very long festivities following the peace of Aix-la-Chapelle in 1668. It originates in Louis XIV’s
desire to give a new show in the “Salle des Machines” of the Tuileries Palace. Built in 1660
by renowned architect Louis Le Vau, this theatre took its name from the machinery designed
by Gaspare, Carlo and Lodovico Vigarani, allowing for spectacular efects and complex set
changes [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ]. The acoustics of this theatre were poor. It was thus abandoned, not being used
since 1662. But its large capacity, and the existing impressive sets from Cavalli’s opera Ercole
Amante, would have drawn the French king to commission a new play specifically designed for
this place.
        </p>
        <p>In 1758, Lagrange-Chancel reported in the preface to his own Orphée that several authors
would have proposed a project for this occasion.</p>
        <p>
          The late King having resolved to give to all his court one of these great
celebrations in which he liked to have a rest from his works, wanted to take advice from
Racine, Quinault, and Molière, which, among the the great geniuses of this century,
he regarded as the most capable of contributing, by their talent, to the
magnificence of his pleasures. To that efect, he asked them to pick a subject for which
they could use an excellent decor representing the underworld, kept safe and sound
in the furniture storage unit. Racine proposed the subject of Orphée; Quinault,
the abduction of Proserpine, which he subsequently turned into one of his most
beautiful operas; and Molière, with the help of the great Corneille, championed the
subject of Psyché, which prevailed over the two others [
          <xref ref-type="bibr" rid="ref18">17</xref>
          ].
        </p>
        <p>Through the correspondence of the Vigaranis, we know that the decision to give this Psyché
at the Salle des Machines was made only a few weeks before it was played. In a letter written
on December 12th, 1670, Vigarani explained that they were “preparing a great show, to be
performed for the Epiphany at the Tuileries theatre” [24], The imminence of the deadline forced
everyone to rush to get the work done. As stated in a letter written on December 15th, 1670,
“Carlo is very busy because of the show prepared for the Epiphany. He is very tired. He is
doing his best to please the King, but he doubts he will be strong enough to continue.”</p>
        <p>The lack of time apparently had consequences on Molière’s ability to finish the play. This
led to a singularity: an official account of each author’s implication. In a notice from the
publisher “au lecteur” (to the reader), we find this explanation on how the work is supposed
to have been divided:</p>
        <p>This work is not written by a single hand. Mr Quinault wrote all the poetry
of the parts set to music, except the Italian Complaint. Mr de Molière wrote the
outline of the play, set its arrangement - he focused more on the beauties and the
pump of the show than on its strict observance of the rules. Regarding versification,
he did not get the time to execute it in its entirety. Carnival was approaching, and
the insisting demands of the King, who wanted to entertain himself several time
before Lent, forced him into accepting some assistance. Thus, only verses from the
Prologue, the First Act, the first scene of the Second Act, and the first scene from
the third Act are his work. Mr. Corneille used two weeks to versify the rest; and
this way, His Majesty’s orders were satisfied in time [ 20].</p>
        <p>Does this notice to the reader seem plausible? The situation where the King’s urgent
demands change the author’s agenda was not unprecedented. For instance, when in 1664, Louis
XIV commissioned a new play with ballet to Molière for the “Plaisirs de l’île enchantée”
festivity in Versailles, the latter could not finish in time the versification of his play. As stated
in its first edition [ 19], “an order from the King, who pressed this matter, forced [the author]
to finish all the rest in prose.” Only thirty percent of La Princesse d’Elide is thus in verse, the
rest being in prose.</p>
        <p>
          Stylistic studies also seem to confirm the plausibility of this notice. Psyché is one of the
rare examples of mixed verses by their authors. Before that, Pierre Corneille had only used it
once, in Agésilas in 1666. The same goes for Molière, who also mixed various types of verses in
his Amphitryon in 1668. And the way the two authors use mixed verses is diferent [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]. While
Molière did not hesitate to use heptasyllables in Amphitryon, Corneille never used a single one
of them in Agésilas. This distinction is still observable in Psyché: in the part attributed to
Molière, we find 37 heptasyllables - which fits the proportion observed in Amphitryon; in the
part attributed to Corneille, we do not find any heptasyllable.
        </p>
        <p>
          Without taking this notice “to the reader” for granted, we can consider that it gives
potentially truthful information regarding the play, that we will first try to disprove or verify.
1.3. Hollywood in the 17th century: Special efects, music and the “ Pièces à
machines”
Psyché raises a few specific concerns because of its genre. This play is indeed a rare instance
of pièce à machines, a subgenre very much in favour between the 1650s and the 1670s, but
of which only 15 plays or so have been composed [
          <xref ref-type="bibr" rid="ref24">33, 32</xref>
          ]. Changes of scenery in each act,
lfying characters, raging seas, thunderous blows… the “ pièces à machines” make the most of
the machinery available at the time, to propose spectacular shows to the audience, including
passages set to music. The first model of the genre probably is Andromède (1650) by Pierre
Corneille [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ].
        </p>
        <p>
          But other authors followed him, sometimes collaborating to produce that kind of plays, like
Thomas Corneille (Pierre Corneille’s younger brother) and Jean Donneau de Visé. Amongst
the most impressive shows of the time, the first collaboration between the two authors, Circé
(1675), encouraged them to work together for other works. This play was thus quickly followed
by another collaboration: L’Inconnu (1675). It seems that Donneau de Visé’s implication in
this play could be very significant. It draws its inspiration from Donneau de Visé’s own
tenth short story [
          <xref ref-type="bibr" rid="ref19">18</xref>
          ] in Les Nouvelles Galantes, Comiques et Tragiques (1669). In Thomas
Corneille’s obituary notice, Donneau de Visé himself claimed he played an important part in
the writing:
to make progress, I wrote the whole play in prose, and while I was writing the
prose of the second act, he was transforming the prose of the first into verse; and as
prose is easier than verse, I had the time to write those of the entertainments, and
especially the dialogue of Love and Friendship, which did not displease the public
[
          <xref ref-type="bibr" rid="ref7">7</xref>
          ].
        </p>
        <p>Yet, despite these claims, the only name cited in the “Privilège du roi” of the first printed
edition is Thomas Corneille. Why omitting to mention Donneau de Visé? We know from
the Registre de La Grange that Donneau de Visé was payed fees for the writing of the play,
and he received the same amount for it as Thomas Corneille. It is thus historically extremely
unlikely that Donneau de Visé would not have contributed to the play. But did he overstate
his implication in the writing of the play? Or were there only “commercial reasons” to avoid
mentioning his name - Thomas Corneille being more respected as a playwright than he was?</p>
        <p>Thus, using the parameters found optimal in our benchmark phase, and then tested on
Psyché, we will try to work on Circé and L’Inconnu, two pièces à machines also written in
verse and allegedly written collaboratively.</p>
        <p>The scarcity of “pièces à machines” however makes it challenging to build a stylometric
approach only on a subgenre-specific approach. We thus compare two methods in this paper:
a genre-specific approach, on a small dataset, and a cross-genre approach, on a considerably
larger dataset (see appendix A).</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>2. Materials and methods</title>
      <sec id="sec-2-1">
        <title>2.1. Choice of plays and dataset</title>
        <p>
          To investigate these difficulties, we built two diferent analytic setups, with two very diferent
philosophies:
a genre-specific approach , in which we built a training corpus including one play from the
same subgenre for each of the three involved authors, as well as two control authors –
one play by Boyer, and two plays for Quinault, as his plays were significantly shorter (see
appendix A.1.1). We then benchmarked diferent sample lengths, using a leave-one-out
approach. We lacked sufficient data to include Thomas Corneille in this setup, because
the two available plays that could fit the definition, Circé and L’Inconnu are suspected
to be collaborations with Donneau de Visé that we want to analyse later on.
a cross-genre approach in which we took all available single-author verse or mixed plays
containing more than 400 verses from each candidate in the Théâtre classique corpus
[
          <xref ref-type="bibr" rid="ref13">12</xref>
          ]. In order to recreate the conditions of the actual analysis on the Pièces à machines,
that is to evaluate the performance in a cross-genre setup on a given specific genre or
sub-genre not represented in the training set, we set apart a subgroup of heroic comedies
as an unseen test set on which to benchmark the models (see appendix A.2.1 and A.2.2).
For the mixed plays, we retained them only if there were more than 400 verses to be
extracted.
        </p>
        <p>For each case studied, the set of candidate authors is known. We thus use an authorship
attribution rather than an authorship verification setup.</p>
        <p>
          In terms of features, after suppressing editorial punctuation and lowercasing the texts, we
extract character 3-grams, a standard choice in authorship attribution [
          <xref ref-type="bibr" rid="ref21">14, 26</xref>
          ].
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Calibration</title>
        <p>
          The chosen size of the sample can be seen as a trade-of between accuracy and granularity: the
smaller the samples are, the better the “resolution” of the analysis and the ability to locate
precise stylistic breaks or identify limited shifts in hands [10]; the bigger they are, the more
statistically reliable is our computation, with optima often observed in the 2500-5000 words
range [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]. To find a good compromise, we experimented with a variety of lengths, from 10 to
300 verses, an upper limit close to the size of an act (Table 1). Each time, we normalised the
data by using z-scores for variables and applied Euclidean vector-length normalisation (i.e.,
L2 normalisation) to texts [
          <xref ref-type="bibr" rid="ref12">11</xref>
          ] and trained a linear Support Vector Classifier (SVC) model,
using a Python sklearn pipeline (see Code section). These results show that a first peak in all
metrics is reached at length 150 verses for the smaller genre-specific corpus, with perfect scores,
as well as for the larger cross-genre corpus when considering the F1 score (with F1= 0.92).
        </p>
        <p>Detailed scores for the retained sample length on the cross-genre corpus show that Boyer is
the best recognised, while precision for Donneau de Visé and Molière is better (the model is
never wrong in attributing plays to them) than recall (the model misses a few samples), while
the opposite is true for both Corneille brothers (Table 2).</p>
        <p>Once the sample size of 150 verses chosen, we train a final SVM model for each setup, on
the complete training set, and then, following rolling stylometry methods, we apply this model
to every successive portion of length n, with a step of 1 (and so, an overlap of n − 1 between
two successive portions, e.g., verses 1-150, 2-151, 3-152…). We then extract the classification,
and plot the decision function for each classifier.</p>
        <p>Like with many geometrical methods, this type of analysis rests on the representation of
texts or samples as points in a high dimensional space, based on the frequency of the selected
features. In our case, the frequency of each type of character 3-grams are used as a coordinate
on one axis of an high-dimensional space.</p>
        <p>A support vector machine computes a hyperplane in this high-dimensional space, in order
to achieve the best separation between two sets of dots (i.e., text from author 1, text from
all other authors). Intuitively, a good separation is achieved by the hyperplane that has the
largest distance to the nearest training data points of any class (called functional margin),
since in general the larger the margin the lower the generalisation error of the classifier.</p>
        <p>The decision function tells us how close each sample is to the hyperplane separating each
class. A negative value means that the sample is outside, a positive, inside. The higher the
score, the deeper inside the class is located a dot, which can be interpreted as a strength of
the authorial markers or an increase in the confidence of the classifier.</p>
        <p>By monitoring the decision function of all candidate authors, we can see when a portion of
the text is getting closer to the style of an individual author. When the value of the decision
function for one author gets high, while the values for all others remain low, it is easy to
attribute this portion to this candidate. Yet, in the case where all decision function would
decrease or remain low simultaneously, the status of the portion is hard to assess, and could
be alternatively attributed to a synergy between several candidates, to the intervention of
an author outside the set or could be the results of some kind of noise, in particular generic
discrepancies between the training set and the portion being assessed.</p>
        <p>Each sample’s horizontal placement on the graph is determined by its median point, in
verses (fig. 1). For instance, the score attributed to the sample ranging from the 400th to the
550th verse will be placed at the 475th verse. This implies a small but significant distortion
regarding the placement of each score in the timeline of each play. It must be taken into
account when reading the graph.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Results</title>
      <p>3.1. Psyché by P. Corneille, Molière and Quinault
Both setups provide globally consistent results. The end of the prologue, most of the first act,
and the beginnings of the second and the third acts are attributed to Molière. Molière’s imprint
then seems to gradually fade away, while Pierre Corneille’s contribution appears constantly
growing throughout the play. This result seems solid, and even conservative regarding Molière’s
share of the work. When using the cross-genre setting, the precision for Molière is 100 % while
the recall is lower, at 0.75 % (Table 2). On the opposite, the recall for Pierre Corneille is perfect
(100 %), while the precision is lower (86 %). This setting should thus underestimate Molière’s
participation in the play. Yet, even this computation confirms that the statements made in
the notice from the publisher to the reader regarding him are globally accurate. A small but
interesting diference between our analysis and the notice concerns the spike of Corneille’s
decision function at the end of the first act. According to it, it could be possible that Corneille
had a hand in finishing the first act, under a fashion comparable to that of the other acts
(begun by Molière, ended by him) but in much smaller proportions, considering the brevity of
scenes 4 to 6 of the first act (forty verses in total).</p>
      <p>Performance on Quinault is also satisfying, even if his interventions can be sometimes quite
short: his alleged part in the prologue barely exceeds 50 verses, his “second intermède” is less
than 20 verses long etc. His intervention in the prologue is a bit masked by the 150 verses
window. Yet, we can see that the decision function for Quinault is high at the very beginning.
The finale (“ Cinquième intermède”) and the 60-verse long “troisième intermède” at the end
of the third act are quite visible, and attributed to Quinault, which is consistent with the
notice to the reader. Even the very short “second intermède” is detected by both methods,
and especially clearly in the genre-specific approach. 2
3.2. ‘Circé’ by T. Corneille and J. Donneau de Visé
If we consider the results of the cross-genre setup on ‘Circé’ (fig. 2) – the only setup with
training material for Thomas –, the major implication of Thomas Corneille in the writing of
this play is self-evident. Donneau de Visé seems to rise during a small passage of the prologue,
somewhere around the third scene or perhaps the “Prologue de la musique et de la comédie”
(accounting for the blurriness due to window size), a part of sung dialog, which would be
consistent with some of his claims. Yet, no other passage emerges that would make it even
seem plausible that he partly versified them.</p>
      <p>The spike of Quinault at act 2, scene 7 matches also a sung dialog (“dialogue de Sylvie et
de Tircis, qui se chante”). Quinault obviously knew how to write passages to be sung and
collaborated with Thomas Corneille in other occasions. Did Donneau de Visé have another
try at plagiarism, after having experimented with such practices earlier in his career? Or did
Donneau or Thomas Corneille ask a small amount of help to a colleague for a specific passage?
Without further analysis, it is difficult to answer, yet it is to be noted that the value of the
decision function only barely crosses 0 (implying positive class membership) on one single
point. This could be an artefact due to generic attractions, because Quinault produced an
important number of musical texts, represented in the training material. It is in any case
deserving of further investigation before drawing any firm conclusion.
3.3. ‘L’Inconnu’ by T. Corneille and J. Donneau de Visé
When applying the same method on L’Inconnu (fig. 3), here again, Donneau de Visé’s
implication is hard to assess, while Thomas Corneille’s style seems easily recognised. Small passages
for which he claims authorship could well be too difficult to detect which such windows - but
appeared in Psyché. Here, the decision function seems barely afected. It is to be noted that
2It is to be noted that we removed the “Premier intermède”, written in Italian (by Lully), for obvious
reasons.
the value of the decision function for Thomas collapses below 0 on a few occasions, especially
during the long dialogues at the end of act 1 and act 2. This could also receive an interpretation
based on generic discrepancies between the training set and this part of the text.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Discussion</title>
      <p>On Psyché the results of the rolling analysis verifies very closely the self-declaration of the
notice. In comparison, we seem only to identify a non declared limited intervention of Corneille
at the end of the first act.</p>
      <p>
        Our results seem to confirm that Donneau de Visé’s contribution to the final writing of the
plays he co-signed with Thomas Corneille was quite scarce. This of course does not mean
that his contribution to those plays was non-existent. Stylometric analyses such as the ones
performed here mostly detect the style of the person who actually wrote the last version of
the sentences. Thomas Corneille for instance versified Molière’s famous comedy in prose, Dom
Juan, after his death and stylometry attributes the play to Thomas Corneille without a blink
[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], while Molière arguably contributed quite a bit to the final result… Donneau de Visé could
have given a lot of insights about the intrigue, written large passages in prose etc. But for now,
his contribution strictly to the verses seem even scarcer that what he claimed after Thomas
Corneille’s death.
      </p>
      <p>
        In broader terms, the clues gathered here seem to point mainly towards the first case of
collaborative writing described by Pennebaker and Ireland [
        <xref ref-type="bibr" rid="ref20">21</xref>
        ], the
“Just-like-another-memberof-the-team hypothesis”: each portion is mostly attributable to the individual style of a single
author, i.e., the one responsible for the final form of the text, not for its content or for previous
formulations (in particular in the case of –necessarily heavy – transpositions between prose and
verse ). Yet, this would be deserving of further research to improve still our understanding of
authorial collaborations during the Grand Siècle and beyond. For now, it remains impossible
to say if some points of the texts, where the decision function for all candidate authors collapse,
can be attributed to synergies instead of, for instance, generic disturbances (see the case of the
Inconnu).
      </p>
    </sec>
    <sec id="sec-5">
      <title>5. Further research</title>
      <p>Further research are still needed to confirm and extend the results on Circé and L’Inconnu,
and more generally on collaborative writing during this age. Increasing the size of the training
set, especially for Donneau de Visé could be a first lead. In particular, we were not able to
secure access to usable digital text of plays such as Les Amours de Vénus et d’Adonis (1670)
or Amours de Bacchus et d’Ariane (1672), two pièces à machine he wrote alone during the
same decade as his collaborations with Thomas Corneille. Running an efficient OCR and
postcorrection of those texts, already digitised by the Bibliothèque nationale de France, should thus
be an important next step.</p>
      <p>In terms of analysed features, we could extend our work to account for metrical features
[22], accounting for instance for verse length in syllables, and the sequence of such lengths in
the “vers libres” parts. We could also try to cross stylistic with thematic features, to further
investigate contribution to the plot by opposition to contributions to the versification.</p>
      <p>The Quinault spike in Circé could also make us think that generic attractions are still an
issue in our study: having written numerous opera librettos, Quinault could be a designated
candidate for whatever looks like a sung passage to our SVM model. We should thus check for
possible imbalances in the training corpus. First experiments however seem to show that our
results stay even when downsizing Quinault’s sung passage in the training set.</p>
      <p>Finally, we could extend our process to collaborations in prose - which were quite numerous
in the théâtre classique in general, and which also occurred in a pièce à machines such as La
Devineresse by Thomas Corneille and Jean Donneau de Visé.</p>
    </sec>
    <sec id="sec-6">
      <title>Code and data availability</title>
      <p>Code and datasets are available at 10.5281/zenodo.5517801.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <p>We thank Pr. Georges Forestier for his input on Psyche and the subgenre of “pièces à
machines”, Thibault Clérice for fruitful discussions on machine-learning and stylometry, and
anonymous reviewers for their careful reading and insightful suggestions. Errors remain our
own.</p>
      <p>Molière. Psiché: tragédie-ballet, par I.B.P. Molière. 1671. url: https://gallica.bnf.fr/ark:
/12148/bpt6k70160j/f7.item.
[14]
[15]
[16]
[19]
[20]
[22] P. Plecháč. “Relative Contributions of Shakespeare and Fletcher in Henry VIII: An
Analysis Based on Most Frequent Words and Most Frequent Rhythmic Patterns”. In: Digital
Scholarship in the Humanities (2019).
[23] F. G. Quiel. “Comedia palaciega et tragi-comédie française au XVIIe siècle: adresse au
lecteur et transfert littéraire dans la Cassandre de l’abbé Boisrobert”. In: Revista de
Lenguas Modernas (2010).
[24]</p>
      <p>G. Rouchès. Inventaire des lettres et papiers manuscrits de Gaspare, Carlo et Lodovico
Vigarani conservés aux Archives d’État de Modène, 1634-1684. Champion, 1913.
[25] J. Rybicki, D. Hoover, and M. Kestemont. “Collaborative Authorship: Conrad, Ford and
Rolling Delta”. In: Literary and Linguistic Computing 29.3 (2014), pp. 422–431.
ference of the North American chapter of the association for computational linguistics:
Human language technologies. 2015, pp. 93–102.
Form in French Classical Theater”. In: Digital Humanities 2013: Conference Abstracts.
2013, pp. 383–86.
isbn.978-2-406-09572-9.</p>
      <p>and Process in Texts (2018), p. 116.
[28] C. Schuwey. Un entrepreneur des lettres au XVIIe siècle: Donneau de Visé, de Molière au</p>
      <p>Mercure galant. Lire le XVIIe siècle 69. Paris: Classiques Garnier, 2020. doi: 10.15122/
[29] H. Sun and M. Jin. “Collaborative Writing of ‘Otome no minato’”. In: Structure, Function</p>
      <p>M. Tschuggnall, E. Stamatatos, B. Verhoeven, W. Daelemans, G. Specht, B. Stein, and
M. Potthast. “Overview of the Author Identification Task at PAN-2017: Style Breach
Detection and Author Clustering”. In: CLEF (Working Notes). 2017.
licla%5C%5F0992-5279%5C%5F2002%5C%5Fhos%5C%5F1%5C%5F1%5C%5F1825.
[33] H. Visentin. “Le théâtre à machines: Succès majeur pour un genre mineur”. In:
Littératures classiques 51.1 (2004), pp. 205–222.</p>
    </sec>
    <sec id="sec-8">
      <title>A. Plays used as training</title>
      <sec id="sec-8-1">
        <title>A.1. Genre-specific setup</title>
        <p>A.1.1. Train (with leave-one-out test)
author title
Boyer, Claude LES AMOURS DE JUPITER ET DE SÉMÉLÉ, TRAGÉDIE
Corneille, Pierre LA CONQUÊTE DE LA TOISON D’OR, TRAGÉDIE
Donneau de Visé, Jean LES AMOURS DU SOLEIL, PASTORALE.
Molière Amphitryon, Comédie
Quinault, Philippe THÉSÉE, TRAGÉDIE
Quinault, Philippe CADMUS et HERMIONE, TRAGÉDIE</p>
      </sec>
      <sec id="sec-8-2">
        <title>A.2. Cross-genre setup</title>
        <p>A.2.1. Train
author
Boyer, Claude
Boyer, Claude
Boyer, Claude
Boyer, Claude
Boyer, Claude
Boyer, Claude
Boyer, Claude
Boyer, Claude
Boyer, Claude
Boyer, Claude
Boyer, Claude
Boyer, Claude
Corneille, Pierre
author
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Pierre
Corneille, Thomas
Corneille, Thomas
Corneille, Thomas
Corneille, Thomas
Corneille, Thomas
Corneille, Thomas
Corneille, Thomas
Corneille, Thomas
Corneille, Thomas
Corneille, Thomas
Corneille, Thomas
Corneille, Thomas
Corneille, Thomas
Corneille, Thomas
Corneille, Thomas
Corneille, Thomas
Corneille, Thomas
Corneille, Thomas
Corneille, Thomas
Corneille, Thomas
Corneille, Thomas
Corneille, Thomas
Corneille, Thomas
Corneille, Thomas
Corneille, Thomas
Donneau de visé, Jean
Donneau de visé, Jean
Donneau de visé, Jean
Donneau de visé, Jean
Donneau de visé, Jean
Donneau de visé, Jean
Donneau de visé, Jean
Molière
Molière
Molière
Molière
Molière
Molière
Molière
Molière
Molière
Molière
Molière
Molière
Quinault, Philippe
Quinault, Philippe
Quinault, Philippe
Quinault, Philippe
n. words
16439
18913
18273
18226
18001
18313
16383
19191
18411
19614
18327
17623
15921
15897
20284
18730
19189
18583
19229
20532
19119
19334
15076
18642
19085
20041
18885
20353
17188
18771
19451
18803
22779
19310
19823
20384
18737
19730
13758
20702
19698
17025
20697
20512
19482
18651
20223
21978
20794
19230
20419
9659
19903
18121
19953
21509
21246
21267
18863
19804
20898
6278
8203
7413
11179
11907
6123
17603
19021
19377
12161
21708
9607
19135
6328
19590
10951
6877
21088
4733
6811
9181
6840
title
LA COMÉDIE SANS COMÉDIE, COMÉDIE
LES COUPS DE L’AMOUR ET DE LA FORTUNE, TRAGI-COMÉDIE
LE DOCTEUR DE VERRE, COMÉDIE
LE FANTOME AMOUREUX, TRAGI-COMÉDIE
LES FÊTES DE L’AMOUR ET DE BACCHUS, PASTORALE
LA GÉNÉREUSE INGRATITUDE, TRAGI-COMÉDIE PASTORALE
ISIS, TRAGÉDIE en MUSIQUE
LA MÈRE COQUETTE ou LES AMANTS BROUILLÉS, COMÉDIE
PERSÉE, TRAGÉDIE
PROSERPINE, TRAGÉDIE
ROLAND, TRAGÉDIE EN MUSIQUE
STRATONICE, TRAGI-COMÉDIE
LE TEMPLE DE LA PAIX, BALLET
THÉSÉE, TRAGÉDIE
n. words
18625
18884
20996
17723
19181
17932</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>N.</given-names>
            <surname>Akiyama</surname>
          </string-name>
          . “
          <article-title>Corneille et ses pièces à machines”</article-title>
          .
          <source>In: Dix-septième siècle 3</source>
          (
          <year>2010</year>
          ), pp.
          <fpage>403</fpage>
          -
          <lpage>417</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>R.</given-names>
            <surname>Bray</surname>
          </string-name>
          . “L'
          <article-title>Introduction des vers mêlés sur la scène classique”</article-title>
          .
          <source>In: Pmla 66.4</source>
          (
          <issue>1951</issue>
          ), pp.
          <fpage>456</fpage>
          -
          <lpage>484</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>F.</given-names>
            <surname>Cafiero</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.-B.</given-names>
            <surname>Camps</surname>
          </string-name>
          . “
          <article-title>Why Molière most likely did write his Plays”</article-title>
          .
          <source>In: Science advances 5.11</source>
          (
          <year>2019</year>
          ),
          <year>eaax5489</year>
          . doi:
          <volume>10</volume>
          .1126/sciadv.aax5489.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>F.</given-names>
            <surname>Cafiero</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.-B. Camps</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Gabay</surname>
            , and
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Puren</surname>
          </string-name>
          . “
          <article-title>La naissance du style: auteur vs genre aux XVIIe et XIXe siècles”</article-title>
          .
          <source>In: Humanistica</source>
          <year>2020</year>
          .
          <year>2020</year>
          . url: https://hal.archivesouvertes.fr/hal-02577853/.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>K. van Dalen-Oskam</surname>
          </string-name>
          and
          <string-name>
            <surname>J. Van Zundert.</surname>
          </string-name>
          “
          <article-title>Delta for Middle Dutch: Author and Copyist Distinction in Walewein”</article-title>
          .
          <source>In: Literary and Linguistic Computing 22.3</source>
          (
          <issue>2007</issue>
          ), pp.
          <fpage>345</fpage>
          -
          <lpage>362</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>M. De Pure</surname>
          </string-name>
          .
          <article-title>Idée des spectacles anciens et nouveaux</article-title>
          .
          <source>Minkof</source>
          ,
          <volume>1668</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>J.</given-names>
            <surname>Donneau</surname>
          </string-name>
          de Visé. “[Notice nécrologique]”.
          <source>In: Mercure galant (1710)</source>
          , pp.
          <fpage>270</fpage>
          -
          <lpage>299</lpage>
          . url: https://gallica.bnf.fr/ark:/12148/bpt6k6351123z/f276.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>J.</given-names>
            <surname>Donneau de Visé</surname>
          </string-name>
          .
          <article-title>Préface à Sganarelle, ou le Cocu imaginaire</article-title>
          . Ed. by
          <string-name>
            <given-names>G.</given-names>
            <surname>Forestier</surname>
          </string-name>
          and
          <string-name>
            <given-names>C.</given-names>
            <surname>Fournial</surname>
          </string-name>
          . Paris: Jean Ribou,
          <volume>1660</volume>
          . url: http://idt.huma-num.fr/notice.php?id=
          <fpage>317</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>M.</given-names>
            <surname>Eder</surname>
          </string-name>
          . “
          <article-title>Does Size matter? Authorship Attribution, Small Samples, Big Problem”</article-title>
          .
          <source>In: Literary and Linguistic Computing 30.2</source>
          (
          <issue>2015</issue>
          ), pp.
          <fpage>167</fpage>
          -
          <lpage>182</lpage>
          . doi:
          <volume>10</volume>
          .1093/llc/fqt066.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>url: https://academic.oup.com/dsh/article/30/2/167/390738.</mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <given-names>M.</given-names>
            <surname>Eder</surname>
          </string-name>
          . “
          <article-title>Rolling Stylometry”</article-title>
          .
          <source>In: Digital Scholarship in the Humanities 31.3</source>
          (
          <issue>2016</issue>
          ), pp.
          <fpage>457</fpage>
          -
          <lpage>469</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>S.</given-names>
            <surname>Evert</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Proisl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Jannidis</surname>
          </string-name>
          , I. Reger,
          <string-name>
            <given-names>S.</given-names>
            <surname>Pielström</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Schöch</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Vitt</surname>
          </string-name>
          . “
          <article-title>Understanding and Explaining Delta Measures for Authorship Attribution”</article-title>
          .
          <source>In: Digital Scholarship in the Humanities 32.suppl_2</source>
          (
          <issue>2017</issue>
          ), pp.
          <fpage>ii4</fpage>
          -
          <lpage>ii16</lpage>
          . doi:
          <volume>10</volume>
          .1093/llc/fqx023. url: https://academic.oup.com/dsh/article/32/suppl%5C%5F2/ii4/3865676.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>P.</given-names>
            <surname>Fièvre</surname>
          </string-name>
          . Théâtre classique.
          <year>2007</year>
          . url: http://www.theatre-classique.fr/.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>A. A.</given-names>
            <surname>Gladwin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Lavin</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D. M.</given-names>
            <surname>Look</surname>
          </string-name>
          . “
          <article-title>Stylometry and Collaborative Authorship: Eddy, Lovecraft, and 'The Loved Dead'”</article-title>
          .
          <source>In: Digital Scholarship in the Humanities 32.1</source>
          (
          <issue>2017</issue>
          ), pp.
          <fpage>123</fpage>
          -
          <lpage>140</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <given-names>M.</given-names>
            <surname>Kestemont</surname>
          </string-name>
          . “
          <article-title>Function Words in Authorship Attribution: From Black Magic to Theory?”</article-title>
          <source>In: Proceedings of the 3rd Workshop on Computational Linguistics for Literature (CLFL)</source>
          .
          <year>2014</year>
          , pp.
          <fpage>59</fpage>
          -
          <lpage>66</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <given-names>M.</given-names>
            <surname>Kestemont</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Moens</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Deploige</surname>
          </string-name>
          . “
          <article-title>Collaborative Authorship in the Twelfth Century: A Stylometric Study of Hildegard of Bingen and Guibert of Gembloux”</article-title>
          .
          <source>In: Digital Scholarship in the Humanities 30.2</source>
          (
          <issue>2015</issue>
          ), pp.
          <fpage>199</fpage>
          -
          <lpage>224</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <given-names>M.</given-names>
            <surname>Kestemont</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Stover</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Koppel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Karsdorp</surname>
          </string-name>
          , and
          <string-name>
            <given-names>W.</given-names>
            <surname>Daelemans</surname>
          </string-name>
          . “
          <article-title>Authenticating the Writings of Julius Caesar”</article-title>
          .
          <source>In: Expert Systems with Applications</source>
          <volume>63</volume>
          (
          <year>2016</year>
          ), pp.
          <fpage>86</fpage>
          -
          <lpage>96</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [17]
          <string-name>
            <surname>F.-J. ( Lagrange-Chancel. Œuvres de Monsieur de Lagrange</surname>
          </string-name>
          <article-title>Chancel revues et corrigées par lui-même</article-title>
          .
          <volume>1758</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>P.</given-names>
            <surname>Mélèse</surname>
          </string-name>
          .
          <article-title>Un homme de lettres au temps du grand roi</article-title>
          , Donneau de Visé:
          <article-title>fondateur du Mercure galant</article-title>
          .
          <source>Librairie Droz</source>
          ,
          <year>1936</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>J. W.</given-names>
            <surname>Pennebaker</surname>
          </string-name>
          and
          <string-name>
            <surname>M. E. Ireland.</surname>
          </string-name>
          “
          <article-title>Using Literature to Understand Authors: The Case for Computerized Text Analysis”</article-title>
          .
          <source>In: Scientific Study of Literature 1.1</source>
          (
          <issue>2011</issue>
          ), pp.
          <fpage>34</fpage>
          -
          <lpage>48</lpage>
          . doi:
          <volume>10</volume>
          .1075/ssol.1.1.04pen. url: https://www.jbe-platform.com/content/journals/10. 1075/ssol.1.1.04pen.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>U.</given-names>
            <surname>Sapkota</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bethard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Montes</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Solorio</surname>
          </string-name>
          . “
          <article-title>Not all Character N-Grams are Created Equal: A Study in Authorship Attribution”</article-title>
          .
          <source>In: Proceedings of the 2015 con-</source>
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>C.</given-names>
            <surname>Schöch</surname>
          </string-name>
          . “
          <article-title>Fine-Tuning our Stylometric Tools: Investigating Authorship</article-title>
          , Genre, and
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>A.</given-names>
            <surname>Viala</surname>
          </string-name>
          . Naissance de l'écrivain. Minuit,
          <year>1985</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>H.</given-names>
            <surname>Visentin</surname>
          </string-name>
          . “
          <article-title>La tragédie à machines ou l'art d'un théâtre bien ajusté”</article-title>
          . In: Littératures classiques, hors-série,
          <year>2002</year>
          .
          <article-title>Mythe et histoire dans le théâtre classique</article-title>
          .
          <source>Hommage à Christian Delmas</source>
          (
          <year>2002</year>
          ). doi:
          <volume>10</volume>
          .3406/licla.
          <year>2002</year>
          .
          <year>1825</year>
          . url: https://www.persee.fr/doc/
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>