<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>The Xeno-canto collection and its relation to sound recognition and classification</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Willem-Pier Vellinga</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Robert Planque</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Stichting Xeno-canto voor natuurgeluiden (Xeno-canto Foundation)</institution>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper discusses distinguishing characteristics of the Xeno-canto bird sound collection. The main aim is to indicate the relation between automated recognition of bird sounds (or feature recognition in digital recordings more generally) and curating large bioacoustics collections. Not only do large collections make it easier to design robust algorithmic approaches to automated species classifiers, those same algorithms should also become useful in determining the actual content of the collections.</p>
      </abstract>
      <kwd-group>
        <kwd>LifeCLEF2015</kwd>
        <kwd>BirdCLEF2015</kwd>
        <kwd>Xeno-canto</kwd>
        <kwd>bird sounds</kwd>
        <kwd>automated recognition</kwd>
        <kwd>citizen science</kwd>
        <kwd>data mining</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        For the past two years the BirdCLEF challenge [
        <xref ref-type="bibr" rid="ref1 ref2">1,2</xref>
        ], part of the LifeCLEF
workshops [
        <xref ref-type="bibr" rid="ref3 ref4">3,4</xref>
        ], has been based on sounds from Xeno-canto. Xeno-canto (XC)
aims to popularise bird sound recording, to improve accessibility of bird sounds,
and to increase knowledge of bird sounds. It tries to achieve these aims by
facilitating and curating a collaborative, shared, global bird sound collection on
www.xeno-canto.org. The collection was initiated by the authors in 2005 [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
When XC started out it was mainly a project to aid identification of small
collections of bird sounds made by the authors in tropical forests in Peru and
Ecuador. Identifying species by sound using the means available at the time,
mostly commercial cassette tapes or CDs with up to a hundred recordings, was
cumbersome and many sounds were simply not available. (For a discussion see
[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]).
      </p>
      <p>
        Sjoerd Mayer's "Birds of Bolivia" CD-ROMs [
        <xref ref-type="bibr" rid="ref7 ref8">7,8</xref>
        ] were an inspiration. They
increased the number of sounds available and species represented by an order of
magnitude, made navigation of the sounds much easier, mapped locations, and
identified background species on a recording. Mayer also engaged the birding
community by welcoming and crediting contributions of sounds by birders and
by publishing corrections of errors on his website.
      </p>
      <p>The authors essentially took these concepts a step further, and designed and
constructed an interface to a non-commercial, open database situated on the
world wide web. A number of guiding principles were formulated that
distinguished XC from other sound collections at the time:</p>
      <list list-type="bullet">
        <list-item>
          <p>Anyone with web access is invited to upload sounds. XC does not refuse
recordings. Contributors can share any bird sound they find interesting,
provided it is below a fixed maximum size (initially 1 MB, now 10 MB) and
provided a required minimum set of metadata is given: species, recordist
name, location name, country, recording date, time of day, elevation, and
sound type(s). This system certainly has drawbacks: a considerable fraction
of the recordings is short, of dodgy quality, or both. Still, such recordings
may be useful. They may represent poorly known locations or vocalisations,
or may simply contribute to the sample size of individual species. Also, in
the context of automated species identification algorithms, it is clear that
any real-life deployment of such an algorithm would have to deal with poor
quality recordings as well.</p>
        </list-item>
        <list-item>
          <p>The recordings uploaded to XC are shared. Re-use of the recordings is
intended, for purposes that are in line with the aims of XC, such as
downloading to personal collections, embedding sounds in educational or personal web
sites, use for scientific research, etcetera. The Creative Commons licenses
(http://creativecommons.org/) offer a useful framework. After
consultation with the community it was decided to settle for CC-BY-ND-NC
(attribution, no-derivatives, non-commercial) licenses. Since this is in fact a rather
restrictive license, nowadays one can also choose CC-BY-NC-SA (SA stands
for share-alike) and CC-BY-SA licenses that allow more liberal re-use. In all
cases attribution of the author/contributor on republication is mandatory.
For discussion of the limits of the other terms, see the Creative Commons
website. The XC website code is written in free, open source software. It
is based on a standard LAMP (Linux, Apache, MySQL, PHP) set-up, with
some additional software written to show sonograms, implement mapping,
and so on.</p>
        </list-item>
        <list-item>
          <p>Anyone can contribute to the collection in some way. Apart from sharing
recordings, people may contribute expertise on identification, set
identification challenges, offer experience with equipment, write articles on-site, or
just comment on recording achievements.</p>
        </list-item>
        <list-item>
          <p>Anyone can challenge an identification (ID) on the site. The vast majority of
recordings have been identified correctly to species by the recordist, but
errors are inevitable. When challenged, the recording is set aside and does not
appear in search results until the ID is resolved by the community. This is
usually done in an open discussion on the forum. If the ID is agreed upon, the
recording is put back into the collection by the administrators. The
administrators therefore have the role of arbiters, rather than authorities, and in
fact there are no designated authorities that decide on species identification.
This is one of the more uncommon features of Xeno-canto, and in this sense
it differs from other well-known community projects on natural history, such
as eBird (ebird.org) and Observado (waarneming.nl / observado.org).</p>
        </list-item>
      </list>
      <p>At present, May 2015, the XC collection contains some 243,000 recordings
from over 9,300 bird species, shared by more than 2,400 contributors from all
over the world. In the rest of this paper, the development and current status of
the XC collection are illustrated and a few points relevant to its relation with
automatic sound classification and recognition are discussed.</p>
    </sec>
    <sec id="sec-2">
      <title>Characteristics of the collection</title>
      <p>The growth of XC is illustrated in Figures 1 and 2, by plotting the number of
recordings and the number of contributors over time. Two things are noteworthy.
Firstly, the data for the initial period is incomplete, since the uploading dates
were initially not recorded. Secondly, there are pronounced seasonal effects, most
obvious in the number of contributors. These points are remedied to some extent
in subsequent figures by plotting versus the number of recordings instead of
versus time.</p>
      <sec id="sec-2-1">
        <title>Contributors</title>
        <p>Both the number of recordings and the number of contributors grow at
increasing rates. Remarkably, plotting the number of recordings versus the number of
contributors shows that they have consistently increased at approximately the
same rate. See Figure 3. This leads to a more or less constant average number of
recordings per contributor, which turns out to be about 100. However, it should
be noted that the distribution of recordings per contributor is very broad and
skewed. At this moment 298 contributors have contributed more than 100 recordings,
and many more contributors, around 2100, have contributed fewer than 100 recordings.</p>
        <p>The three largest contributions each comprise more than 10000 recordings, more
than 100 times the average; 729 contributors contributed 1 recording, 100 times
less than the average. The Zipf-like plots in Figure 4 serve to characterise the
distribution at various stages during the development of XC.</p>
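        <p>As a minimal sketch of how such a rank-ordered (Zipf-like) view and the summary figures quoted above can be derived, the following Python fragment works on a plain list holding one contributor name per recording. The function names and the toy data are illustrative assumptions and are not taken from the XC code base.</p>
        <preformat preformat-type="code"><![CDATA[
```python
from collections import Counter


def rank_size(contributor_per_recording):
    """Recording counts per contributor, sorted from largest to smallest."""
    counts = Counter(contributor_per_recording)
    return sorted(counts.values(), reverse=True)


def summarise(sizes, threshold=100):
    """Mean contribution size, number of contributors above the threshold,
    and number of contributors with exactly one recording."""
    mean = sum(sizes) / len(sizes)
    above = sum(1 for s in sizes if s > threshold)
    singles = sum(1 for s in sizes if s == 1)
    return mean, above, singles


# Toy data (hypothetical): one contributor label per recording.
toy = ["a"] * 6 + ["b"] * 3 + ["c"] * 2 + ["d"]
sizes = rank_size(toy)                      # rank-ordered contribution sizes
mean, above, singles = summarise(sizes, threshold=2)

# Rounded totals quoted in the text: recordings divided by contributors.
average_from_totals = 243_000 / 2_400       # roughly 100 per contributor
```
]]></preformat>
        <p>Plotting the sorted sizes against their rank on logarithmic axes gives the kind of Zipf-like plot referred to above; the mean alone hides the skew that the rank plot makes visible.</p>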
      </sec>
      <sec id="sec-2-2">
        <title>Species</title>
        <p>
          When a sound is uploaded a set of metadata is required, among which the name
of the species. Specifying the subspecies is optional. The taxonomy of the site
was initially based on the taxonomy in Neotropical Birds [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]. Other regions
were added over time (North America, Africa, Asia, Europe and Australasia)
using other local taxonomies, which led to problems with species occurring in
several regions. In 2011 the global IOC (International Ornithological Council)
taxonomy was adopted for all recordings and XC currently uses version 4.1 [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ].
The constant revision of taxonomy at the species level means that the species
assignment of the recordings needs to be updated frequently. This task falls to
the team of administrators. Splits can be problematic, since the subspecific taxon
to which a recording belongs may not be indicated (see below).
        </p>
        <p>IOC 4.1 recognises 10,518 extant species and 150 extinct species; to this list
XC has added 16 additional recently described or as yet undescribed species.
About 9330 are represented in XC at this moment. To the best of our knowledge this
constitutes the largest number of species in any public collection of bird sounds.
(There is at least one private collection that has more species, but it includes
sounds of all species from XC.)</p>
        <p>The growth of the number of species may provide a clue about the moment
of completion of the collection at the species level. Figure 5 shows the species
accumulation curve up to May 2015, together with a randomised accumulation
curve and a fit used for extrapolation. The randomised version is based on a
random draw from all recordings present in XC. Clearly the two curves differ
significantly. This is caused by the fact that XC started out with only Neotropical
species, and that other world areas were added later. The randomised species
accumulation curve does not take that into account. The two curves are seen to
meet up after about 170000 recordings, well after XC went global.</p>
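        <p>The difference between the two curves can be illustrated with a small Python sketch. It assumes that the upload history is available as a list of species labels in upload order, and it interprets the random draw as a random re-ordering of the same recordings, averaged over several shuffles. Both the function names and the toy history are hypothetical, and this is not necessarily the procedure used to produce Figure 5.</p>
        <preformat preformat-type="code"><![CDATA[
```python
import random


def accumulation_curve(species_sequence):
    """Cumulative number of distinct species after each recording."""
    seen = set()
    curve = []
    for species in species_sequence:
        seen.add(species)
        curve.append(len(seen))
    return curve


def randomised_curve(species_sequence, n_shuffles=100, seed=0):
    """Mean accumulation curve over random orderings of the same recordings."""
    rng = random.Random(seed)
    shuffled = list(species_sequence)
    total = [0] * len(shuffled)
    for _ in range(n_shuffles):
        rng.shuffle(shuffled)
        for i, value in enumerate(accumulation_curve(shuffled)):
            total[i] += value
    return [t / n_shuffles for t in total]


# Toy upload history (hypothetical): species A and B dominate early uploads,
# species C, D and E only appear once other regions are added.
uploads = ["A", "A", "B", "A", "B", "C", "D", "C", "E", "D"]
observed = accumulation_curve(uploads)
randomised = randomised_curve(uploads)
```
]]></preformat>
        <p>In the toy history the observed curve lags behind the randomised one at first and both end at the same total, which mirrors the behaviour described above for a collection that started in one region and went global later.</p>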
        <p>For any number of reasons (abundance and size range, vocal (in)activity,
accessibility of the range of the species, and accessibility of the site, to name just
four) the recordings are not evenly distributed across the species. The current
expectation value for the number of recordings per species is around 20. However,
some 20 species have attracted over 500 recordings, while around 1200 are still
waiting to be uploaded. The distribution plotted in Zipf fashion is shown in
Figure 6; probability densities are shown in Figure 7.</p>
        <p>The species abundance curves can be extrapolated into the future by making
assumptions on the probability that species that are not represented at this time
will be uploaded. A reasonable fit is achieved by assuming that the probability of
a new species being uploaded is 1/3 of that of a species with 1 recording in XC,
with the ratios between probabilities of species already represented remaining
equal. An extrapolation based on that assumption is shown in Figure 8. Of
course the extrapolation follows the randomised species abundance curve very
well. The extrapolation is shown up to 900,000 recordings, at which point it is
still about 600 species shy of the total number of species. The precise number
will depend on the assumptions made, but it seems reasonable to assume that
completion at the species level will take a multiple of the number of recordings
present at this moment.</p>
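        <p>A simplified version of this kind of extrapolation can be sketched as follows. The sketch assumes that every future upload is an independent draw, that a represented species is drawn with a weight equal to its current number of recordings, and that each unrepresented species has one third of the weight of a species with a single recording. The function name and its parameters are hypothetical, and under these simplifications the outcome need not coincide with the curve in Figure 8.</p>
        <preformat preformat-type="code"><![CDATA[
```python
def expected_species(n_recordings, n_represented, n_unseen,
                     extra_recordings, unseen_weight=1 / 3):
    """Expected number of species represented after extra_recordings uploads.

    Assumptions (illustrative only): uploads are independent draws with fixed
    weights; a represented species has weight equal to its current recording
    count, so all represented species together weigh n_recordings; each
    unseen species has weight unseen_weight.
    """
    total_weight = n_recordings + unseen_weight * n_unseen
    p_unseen = unseen_weight / total_weight            # per unseen species
    still_missing = n_unseen * (1 - p_unseen) ** extra_recordings
    return n_represented + n_unseen - still_missing


# Rounded figures from the text: 243,000 recordings, about 9,330 species
# represented and around 1,200 species still waiting to be uploaded.
for extra in (0, 250_000, 657_000):
    print(extra, round(expected_species(243_000, 9_330, 1_200, extra)))
```
]]></preformat>
        <p>Because the per-species probability of a first upload is small, the number of missing species decays slowly in this sketch, which is in line with the expectation that completion at the species level will require a multiple of the present number of recordings.</p>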
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Linking bird song databases and automated species recognition</title>
      <p>The BirdCLEF workshop requires entrants to identify recordings from the XC
collection to species level based on the species level identification provided by
the XC community. It is worthwhile to have another look at the species level
IDs in XC. For a number of reasons the ID to species level, even if correct, may
be misleading.</p>
      <list list-type="bullet">
        <list-item>
          <p><italic>Presence of unnamed background species.</italic> Although recordists are asked
to mention the background species present in the recordings, not all recordists
do so. On average 2 species are identified per recording, but it is certain that
many more could be identified. Interestingly, the presence of named
background species helps humans to identify a sound of interest (as the authors
know from personal experience), but this does not seem to lead to a higher
rate of identification in the algorithmic identifications [
          <xref ref-type="bibr" rid="ref1 ref2">1,2</xref>
          ].</p>
        </list-item>
        <list-item>
          <p><italic>Hidden diversity.</italic> The IOC 4.1 species list not only recognises 10668 species,
but also identifies another 20976 subspecies for 5093 of these species, bringing the
total number of taxa to 26551. On XC about 9330 species and 9140 additional
subspecies have been identified. This does not mean that 18470 taxa are
represented. It is likely that some recordings represent subspecies that have
not yet been named on XC, which means that the currently recognised number is an
underestimate. But it is also likely that in some cases the taxa represented
by recordings without subspecific ID are in fact already named, which would
lead to an overestimate. Of the 9300 species present on XC, 4484 are
monotypic. The 9100 subspecies therefore belong to about 4900 species, adding
at least 4200 taxa. The maximum number of named taxa represented is
therefore 18400 and the minimum number 13500. An estimate based on the
number of species present, (9300/10668) × 26551, would lead to about 23000
taxa present at this time. Based on this estimate it seems likely that a
considerable number of taxa remains to be named on XC. At the same time this
also means that the species category may represent considerable taxonomic
diversity. It is to be expected that such diversity hidden within species on XC
is reflected in the sounds, since many subspecies are known to have distinct
vocalizations [
          <xref ref-type="bibr" rid="ref11 ref12">11,12</xref>
          ].</p>
        </list-item>
      </list>
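      <p>The bounds and the estimate for the number of taxa follow from a few lines of arithmetic, written out here as a small Python check. All figures are the rounded ones quoted in the text; the variable names are only illustrative.</p>
      <preformat preformat-type="code"><![CDATA[
```python
# IOC 4.1 figures quoted in the text.
ioc_species = 10668
ioc_polytypic = 5093        # species for which subspecies are listed
ioc_subspecies = 20976
# Monotypic species count once, polytypic species count per subspecies.
ioc_taxa = (ioc_species - ioc_polytypic) + ioc_subspecies

# Rounded XC figures quoted in the text.
xc_species = 9300
xc_subspecies = 9100
xc_polytypic = 4900         # species to which the named subspecies belong

# Upper bound: every named subspecies counts in addition to every species.
max_taxa = xc_species + xc_subspecies
# Lower bound: the first subspecies of each polytypic species adds nothing new.
min_taxa = xc_species + (xc_subspecies - xc_polytypic)
# Estimate: scale the IOC taxon count by the fraction of species present.
scaled_taxa = xc_species / ioc_species * ioc_taxa

print(ioc_taxa, max_taxa, min_taxa, round(scaled_taxa))
```
]]></preformat>
      <p>The scaled estimate lies above the upper bound for named taxa, which is the basis for the statement that a considerable number of taxa remains to be named on XC.</p>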
      <p>Other contributions to the diversity of sounds which do not necessarily align
with subspecies categories are geographical dialects, such as in Yellowhammer
(Emberiza citrinella), and the size of the vocabulary of a species, such as in
Common Nightingale (Luscinia megarhynchos). Little quantitative information
is available on the extent of dialect formation and the size of the vocabulary
across the range of the overwhelming majority of the 10518 species of birds.
Apparently the effect of such diversity at the species level on the results of
automatic recognition has not been quantified yet. Intuitively, given a set of training
data, one would expect a species that shows little diversity to be recognised more
faithfully than a species that shows a lot of variability.</p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] it was concluded that the recognition algorithms worked better
on average for species with more recordings in the training set. It would be
interesting to look for correlations with the number of subspecies recognised, or
the known size of vocabulary.
      </p>
    </sec>
    <sec id="sec-4">
      <title>Conclusion</title>
      <p>The results from the 2014 and 2015 BirdCLEF challenges offer an interesting
perspective on the use of automated algorithmic techniques on the one hand,
and large accessible public archives of sound data on the other.</p>
      <p>At present, the focus in the challenges lies squarely in the field of automated
recognition, and understandably so. The large Xeno-canto database has been
the basis of the challenges, which give the first general insights into automated
feature extraction and classification to species level for general vocalizations.
The species set included in the latest 2015 edition spans 1000 species with a
huge range of different types of bird songs and calls. The BirdCLEF paper in
this volume contributes to our understanding of which techniques excel at this type
of challenge.</p>
      <p>We would welcome a second application of the algorithms, however, one that
would allow a deeper insight into the variety of vocalizations actually
represented in archives such as Xeno-canto. There is great potential for collaborative
projects, in which estimates would be computed of a number of statistics.
Examples include (a) estimates of repertoire sizes in song birds (or other taxa); (b)
discovery of subspecies with different vocal signatures; (c) the ability to extract
a small representative sample of different vocalizations for focal species, or
focal localities. We hope that we may attract the computer science community to
work with us to start to address these types of challenges.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1. Goeau, H.,
          <string-name>
            <surname>Glotin</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vellinga</surname>
            ,
            <given-names>W.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Planque</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Joly</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>LifeCLEF Bird Identification Task 2014</article-title>
          .
          <source>In: Proceedings of CLEF</source>
          <year>2014</year>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2. Goeau, H.,
          <string-name>
            <surname>Glotin</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vellinga</surname>
            ,
            <given-names>W.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Planque</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rauber</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Joly</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>LifeCLEF Bird Identification Task 2015</article-title>
          . In: CLEF working notes
          <year>2015</year>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Joly</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , Muller, H., Goeau, H.,
          <string-name>
            <surname>Glotin</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Spampinato</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rauber</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bonnet</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vellinga</surname>
            ,
            <given-names>W.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fisher</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Planque</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>LifeCLEF 2014: multimedia life species identification</article-title>
          .
          <source>In: Proceedings of CLEF</source>
          <year>2014</year>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Joly</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , Muller, H., Goeau, H.,
          <string-name>
            <surname>Glotin</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rauber</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bonnet</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vellinga</surname>
            ,
            <given-names>W.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fisher</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Planque</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>LifeCLEF 2015: multimedia life species identification challenges</article-title>
          . In: Cappellato, L., Ferro, N., Jones, G., San Juan, E. (eds.)
          <source>CLEF 2015 Labs and Workshops, Notebook Papers</source>
          . CEUR Workshop Proceedings (CEUR-WS.org), ISSN 1613-0073, http://ceur-ws.org/Vol-1391/ (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Vellinga</surname>
            ,
            <given-names>W.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Planque</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          : http://tinyurl.com/xcstart05 (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Moore</surname>
            ,
            <given-names>J.V.</given-names>
          </string-name>
          :
          <article-title>Ecuador's avifauna: the state of knowledge and availability of sound recordings</article-title>
          .
          <source>Cotinga</source>
          <volume>29</volume>
          ,
          <fpage>19</fpage>
          –
          <lpage>21</lpage>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Mayer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <source>Bird sounds of Bolivia / Sonidos de aves de Bolivia, 1.0</source>
          . CD-ROM.
          <publisher-name>Bird Songs International</publisher-name>
          ,
          <publisher-loc>Westernieland, The Netherlands</publisher-loc>
          (
          <year>1996</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Mayer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <source>Bird sounds of Bolivia / Sonidos de aves de Bolivia, 2.0</source>
          . CD-ROM.
          <publisher-name>Bird Songs International</publisher-name>
          ,
          <publisher-loc>Westernieland, The Netherlands</publisher-loc>
          (
          <year>2000</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Stotz</surname>
            ,
            <given-names>D.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fitzpatrick</surname>
            ,
            <given-names>J.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parker</surname>
            <suffix>III</suffix>
            ,
            <given-names>T.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Moskovits</surname>
            ,
            <given-names>D.K.</given-names>
          </string-name>
          :
          <source>Neotropical Birds</source>
          . University of Chicago Press (
          <year>1996</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Gill</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Donsker</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <source>IOC World Bird Names v4.1</source>
          . Available at www.worldbirdnames.org. CC-BY 3.0
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Kroodsma</surname>
            ,
            <given-names>D.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Miller</surname>
            ,
            <given-names>E.H.</given-names>
          </string-name>
          (eds.):
          <article-title>Ecology and evolution of acoustic communication in birds</article-title>
          . Comstock Publishing Associates (
          <year>1996</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Marler</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Slabbekoorn</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          :
          <article-title>Nature's Music</article-title>
          . Elsevier Academic Press (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>