<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Glasgow, UK, April</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>LifeCLEF: Multimedia Life Species Identification</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Alexis Joly</string-name>
          <email>alexis.joly@inria.fr</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hervé Glotin</string-name>
          <email>glotin@univ-tln.fr</email>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pierre Bonnet</string-name>
          <email>pierre.bonnet@cirad.fr</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Henning Müller</string-name>
          <email>Henning.Mueller@hevs.ch</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Concetto Spampinato</string-name>
          <email>cspampin@dieei.unict.it</email>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Willem-Pier Vellinga</string-name>
          <email>wp@xeno-canto.org</email>
          <xref ref-type="aff" rid="aff7">7</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hervé Goëau</string-name>
          <email>herve.goeau@inria.fr</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andreas Rauber</string-name>
          <email>rauber@ifs.tuwien.ac.at</email>
          <xref ref-type="aff" rid="aff6">6</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bob Fisher</string-name>
          <email>rbf@inf.ed.ac.uk</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>CIRAD</institution>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Edinburgh Univ.</institution>
          ,
          <country country="UK">UK</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>HES-SO</institution>
          ,
          <country country="CH">Switzerland</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>INRIA</institution>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>IUF &amp; Univ. de Toulon</institution>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff5">
          <label>5</label>
          <institution>University of Catania</institution>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff6">
          <label>6</label>
          <institution>Vienna Univ. of Tech.</institution>
          ,
          <country country="AT">Austria</country>
        </aff>
        <aff id="aff7">
          <label>7</label>
          <institution>Xeno-canto foundation</institution>
          ,
          <addr-line>The</addr-line>
          ,
          <country country="NL">Netherlands</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2014</year>
      </pub-date>
      <volume>1</volume>
      <issue>2014</issue>
      <fpage>7</fpage>
      <lpage>13</lpage>
      <abstract>
        <p>Building accurate knowledge of the identity, the geographic distribution and the evolution of living species is essential for a sustainable development of humanity as well as for biodiversity conservation. In this context, using multimedia identi cation tools is considered as one of the most promising solution to help bridging the taxonomic gap. With the recent advances in digital devices/equipment, network bandwidth and information storage capacities, the production of multimedia big data has indeed become an easy task. In parallel, the emergence of citizen sciences and social networking tools has fostered the creation of large and structured communities of nature observers (e.g. eBird, Xeno-canto, Tela Botanica, etc.) that have started to produce outstanding collections of multimedia records. Unfortunately, the performance of the state-of-the-art multimedia analysis techniques on such data is still not well understood and is far from reaching the real world's requirements in terms of identi cation tools. The LifeCLEF lab proposes to evaluate these challenges around 3 tasks related to multimedia information retrieval and ne-grained classi cation problems in 3 living worlds. Each task is based on large and real-world data and the measured challenges are de ned in collaboration with biologists and environmental stakeholders in order to re ect realistic usage scenarios.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>1.1</p>
    </sec>
    <sec id="sec-2">
      <title>LifeCLEF LAB OVERVIEW</title>
    </sec>
    <sec id="sec-3">
      <title>Motivations</title>
      <p>
        Building accurate knowledge of the identity, the geographic
distribution and the evolution of living species is essential for
a sustainable development of humanity as well as for
biodiversity conservation. Unfortunately, such basic information
is often only partially available for professional stakeholders,
teachers, scientists and citizens, and more often incomplete
for ecosystems that possess the highest diversity, such as
tropical regions. A noticeable cause and consequence of this
sparse knowledge is that identifying living plants or animals
is usually impossible for the general public, and often a
difcult task for professionals, such as farmers, sh farmers or
foresters, and even also for the naturalists and specialists
themselves. This taxonomic gap [
        <xref ref-type="bibr" rid="ref32">32</xref>
        ] was actually identi ed
as one of the main ecological challenges to be solved during
the Rio's United Nations Conference in 1992.
      </p>
      <p>
        In this context, using multimedia identi cation tools is
considered as one of the most promising solution to help
bridging the taxonomic gap [
        <xref ref-type="bibr" rid="ref1 ref11 ref18 ref23 ref28 ref30 ref31 ref8">23, 11, 8, 31, 28, 1, 30, 18</xref>
        ].
With the recent advances in digital devices, network
bandwidth and information storage capacities, the production of
multimedia data has indeed become an easy task. In
parallel, the emergence of citizen sciences and social networking
tools has fostered the creation of large and structured
communities of nature observers (e.g. eBird1, Xeno-canto2, Tela
Botanica3, etc.) that have started to produce outstanding
collections of multimedia records. Unfortunately, the
performance of the state-of-the-art multimedia analysis techniques
on such data is still not well understood and are far from
reaching the real world's requirements in terms of
identication tools [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. Most existing studies or available tools
typically identify a few tens or hundreds of species with
moderate accuracy whereas they should be scaled-up to take one,
two or three orders of magnitude more, in terms of number
of species (the total number of living species on earth is
estimated to be around 10K for birds, 30K for shes, 300K for
plants and more than 1.2M for invertebrates [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]).
1.2
      </p>
    </sec>
    <sec id="sec-4">
      <title>Evaluated Tasks</title>
      <p>
        The LifeCLEF lab proposes to evaluate these challenges in
the continuity of the image-based plant identi cation task
[
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] that was run within ImageCLEF lab during the last
three years with an increasing number of participants. It
however radically enlarges the evaluated challenge towards
multimodal data by (i) considering birds and sh in addition
to plants (ii) considering audio and video contents in
addition to images (iii) scaling-up the evaluation data to
hundreds of thousands of life media records and thousands of
living species. More concretely, the lab is organized around
      </p>
      <sec id="sec-4-1">
        <title>1http://ebird.org/</title>
        <p>2http://www.xeno-canto.org/
3http://www.tela-botanica.org/
three tasks:</p>
        <sec id="sec-4-1-1">
          <title>PlantCLEF: an image-based plant identi cation task</title>
        </sec>
        <sec id="sec-4-1-2">
          <title>BirdCLEF: an audio-based bird identi cation task</title>
        </sec>
        <sec id="sec-4-1-3">
          <title>FishCLEF: a video-based sh identi ca</title>
          <p>tion task
As described in more detail in the following sections, each
task is based on big and real-world data and the measured
challenges are de ned in collaboration with biologists and
environmental stakeholders so as to re ect realistic usage
scenarios. For this pilot year, the three tasks are mainly
concerned with species identi cation, i.e., helping users to
retrieve the taxonomic name of an observed living plant or
animal. Taxonomic names are actually the primary key to
organize life species and to access all available information
about them either on the web, or in herbariums, in scienti c
literature, books or magazines, etc. Identifying the taxon
observed in a given multimedia record and aligning its name
with a taxonomic reference is therefore a key step before any
other indexing or information retrieval task. More focused
or complex challenges (such as detecting species duplicates
or ambiguous species) could be evaluated in coming years.</p>
          <p>The three tasks are primarily focused on content-based
approaches (i.e. on the automatic analyses of the audio
and visual signals) rather than on interactive information
retrieval approaches involving textual or graphical
morphological attributes. The content-based approach to life species
identi cation has several advantages. It is rst
intrinsically language-independent and solves many of the
multilingual issues related to the use of classical text-based
morphological keys that are strongly language dependent and
understandable only by few experts in the world.
Furthermore, an expert of one region or a speci c taxonomic group
does not necessarily know the vocabulary dedicated to
another group of living organisms. A content-based approach
can then be much more easily generalizable to new oras
or faunas contrary to knowledge-based approaches that
require building complex models manually (ontologies with
rich descriptions, graphical illustrations of morphological
attributes, etc.). On the other hand, LifeCLEF lab is
inherently cross-modal through the presence of contextual and
social data associated to the visual and audio contents. This
includes geo-tags or location names, time information,
author names, collaborative ratings or comments, vernacular
names (common names of plants or animals), organ or
picture type tags, etc. The rules regarding the use of these
meta-data in the evaluated identi cation methods will be
speci ed in the description of each task. Overall, these rules
are always designed so as to re ect real possible usage
scenarios while o ering the largest diversity in the a ordable
approaches.
1.3</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Expected Outcomes</title>
      <p>The main expected outcomes of LifeCLEF evaluation
campaign are the following:
give a snapshot of the performances of state-of-the-art
multimedia techniques towards building real-world life
species identi cation systems
provide large and original data sets of biological records,
and then allow comparison of multimedia-based
identi cation techniques
boost research and innovation on this topic in the next
few years and encourage multimedia researchers to work
on trans-disciplinary challenges involving ecological and
environmental data
foster technological ports from one domain to another
and exchanges between the di erent communities
(information retrieval, computer vision, bio-accoustic,
machine learning, etc.)
promote citizen science and nature observation as a
way to describe, analyse and preserve biodiversity
2.
2.1</p>
    </sec>
    <sec id="sec-6">
      <title>TASK1: PlantCLEF</title>
    </sec>
    <sec id="sec-7">
      <title>Context</title>
      <p>
        Content-based image retrieval approaches are nowadays
considered to be one of the most promising solution to help
bridge the botanical taxonomic gap, as discussed in [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]
or [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] for instance. We therefore see an increasing
interest in this trans-disciplinary challenge in the multimedia
community (e.g. in [
        <xref ref-type="bibr" rid="ref16 ref17 ref21 ref24 ref4 ref9">16, 9, 21, 24, 17, 4</xref>
        ]). Beyond the
raw identi cation performances achievable by
state-of-theart computer vision algorithms, the visual search approach
o ers much more e cient and interactive ways of
browsing large oras than standard eld guides or online web
catalogs. Smartphone applications relying on such
imagebased identi cation services are particularly promising for
setting-up massive ecological monitoring systems, involving
hundreds of thousands of contributors at a very low cost.
The rst noticeable progress in this way was achieved by
the US consortium at the origin of LeafSnap4. This
popular iPhone application allows a fair identi cation of 185
common American plant species by simply shooting a cut
leaf on a uniform background (see [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] for more details). A
step beyond was achieved recently by the Pl@ntNet project
[
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] which released a cross-platform application (iPhone [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ],
android5 and web 6) allowing (i) to query the system with
pictures of plants in their natural environment and (ii) to
contribute to the dataset thanks to a collaborative data
validation work ow involving Tela Botanica7 (i.e. the largest
botanical social network in Europe).
      </p>
      <p>As promising as these applications are, their performances
are however still far from the requirements of a real-world
social-based ecological surveillance scenario. Allowing the
mass of citizens to produce accurate plant observations
requires to equip them with much more accurate identi cation
tools. Measuring and boosting the performances of
contentbased identi cation tools is therefore crucial. This was
precisely the goal of the ImageCLEF8 plant identi cation task
organized since 2011 in the context of the worldwide
evaluation forum CLEF9. In 2011, 2012 and 2013 respectively 8, 10</p>
      <sec id="sec-7-1">
        <title>4http://leafsnap.com/</title>
        <p>
          5https://play.google.com/store/apps/details?id=org.plantnethl=fr
6http://identify.plantnet-project.org/
7http://www.tela-botanica.org/
8http://www.imageclef.org/
9http://www.clef-initiative.eu/
and 12 international research groups did cross the nish line
of this large collaborative evaluation by benchmarking their
images-based plant identi cation systems (see [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ], [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] and
[
          <xref ref-type="bibr" rid="ref19">19</xref>
          ] for more details). Data mobilised during these 3 rst
years can be consulted at the following url10, geographic
distribution of theses botanical records can be seen on Figure
1.
        </p>
        <p>Contrary to previous evaluations reported in the
literature, the key objective was to build a realistic task closer to
real-world conditions (di erent users, cameras, areas,
periods of the year, individual plants, etc.). This was initially
achieved through a citizen science initiative initiated 4 years
ago in the context of the Pl@ntNet project in order to boost
the image production of Tela Botanica social network. The
evaluation data was enriched each year with the new
contributions and progressively diversi ed with other input feeds
(Annotation and cleaning of older data, contributions made
through Pl@ntNet mobile applications). The plant task of
LifeCLEF 2014 is directly in the continuity of this e ort.
Main novelties compared to the last years are the following:
(i) an explicit multi-image query scenario (ii) the supply of
user ratings on image quality in the meta-data (iii) a new
type of view called "Branch" additionally to the 6
previous ones (iv) basically more species (about 500 which is an
important step towards covering the entire ora of a given
region).
2.2</p>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>Dataset</title>
      <p>
        More precisely, PlantCLEF 2014 dataset is composed of
60,962 pictures belonging to 19,504 observations of 500 species
of trees, herbs and ferns living in a European region centered
around France. This data was collected by 1608 distinct
contributors. Each picture belongs to one and only one of the 7
types of view reported in the meta-data (entire plant, fruit,
leaf, ower, stem, branch, leaf scan) and is associated with
a single plant observation identi er allowing to link it with
the other pictures of the same individual plant (observed the
same day by the same person). It is noticeable that most
10http://publish.plantnet-project.org/project/plantclef
image-based identi cation methods and evaluation data
proposed in the past were so far based on leaf images (e.g. in
[
        <xref ref-type="bibr" rid="ref22 ref5 ref9">22, 5, 9</xref>
        ] or in the more recent methods evaluated in [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]).
Only few of them were focused on ower's images as in [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ]
or [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Leaves are far from being the only discriminant visual
key between species but, due to their shape and size, they
have the advantage to be easily observed, captured and
described. More diverse parts of the plants however have to be
considered for accurate identi cation. As an example, the
6 species depicted in Figure 2 share the same French
common name of "laurier" even though they belong to di erent
taxonomic groups (4 families, 6 genera).
      </p>
      <p>The main reason is that these shrubs, often used in hedges,
share leaves with more or less the same-sized elliptic shape.
Identifying a laurel can be very di cult for a novice by just
observing leaves, while it is undisputably easier with owers.
Beyond identi cation performances, the use of leaves alone
has also some practical and botanical limitations. Leaves
are not visible all over the year for a large fraction of plant
species. Deciduous species, distributed from temperate to
tropical regions, can't be identi ed by the use of their leaves
over di erent periods of the year. Leaves can be absent
(ie. lea ess species), too young or too much degraded (by
pathogen or insect attacks), to be exploited e ciently.
Moreover, leaves of many species are intrinsically not enough
informative or very di cult to capture (needles of pines, thin
leaves of grasses, huge leaves of palms, ...).</p>
      <p>Another originality of PlantCLEF dataset is that its social
nature makes it closer to the conditions of a real-world
identi cation scenario: (i) images of the same species are coming
from distinct plants living in distinct areas (ii) pictures are
taken by di erent users that might not used the same
protocol to acquire the images (iii) pictures are taken at di erent
periods in the year. Each image of the dataset is
associated with contextual meta-data (author, date, locality name,
plant id) and social data (user ratings on image quality,
collaboratively validated taxon names, vernacular names)
provided in a structured xml le. The gps geo-localization and
the device settings are available only for some of the images.
Table 3 gives some examples of pictures with decreasing
averaged users ratings for the di erent types of views. Note
that the users of the specialized social network creating these
ratings (Tela Botanica) are explicitely asked to rate the
images according to their plant identi cation ability and their
accordance to the pre-de ned acquisition protocol for each
view type. This is not an aesthetic or general interest
judgement as in most social image sharing sites.
2.3</p>
    </sec>
    <sec id="sec-9">
      <title>Task Description</title>
      <p>The task will be evaluated as a plant species retrieval task
based on multi-image plant observations queries. The goal
is to retrieve the correct plant species among the top results
of a ranked list of species returned by the evaluated
system. Contrary to previous plant identi cation benchmarks,
queries are not de ned as single images but as plant
observations, meaning a set of one to several images depicting
the same individual plant, observed by the same person,
the same day. Each image of a query observation is
associated with a single view type (entire plant, branch, leaf,
fruit, ower, stem or leaf scan) and with contextual
metadata (data, location, author). Each participating group is
allowed to submit up to 4 runs built from di erent methods.
Semi-supervised and interactive approaches (particularly for
segmenting parts of the plant from the background), are
allowed but will be compared independently from fully
automatic methods. Any human assistance in the processing of
the test queries has therefore to be signaled in the submitted
runs meta-data.</p>
      <p>In practice, the whole PlantCLEF dataset is split in two
parts, one for training (and/or indexing) and one for
testing. The test set was built by randomly choosing 1/3 of the
observations of each species whereas the remaining
observations were kept in the reference training set. The xml les
containing the meta-data of the query images were purged
so as to erase the taxon name (the ground truth), the
vernacular name (common name of the plant) and the image
quality ratings (that would not be available at query stage
in a real-world mobile application). Meta-data of the
observations in the training set are kept unaltered.</p>
      <p>The metric used to evaluate the submitted runs will be a
score related to the rank of the correct species in the
returned list. Each query observation will be attributed with
a score between 0 and 1 re ecting equal to the inverse of the
rank of the correct species (equal to 1 if the correct species
is the top-1 decreasing quickly while the rank of the correct
species increases). An average score will then be computed
across all plant observation queries. A simple mean on all
plant observation test would however introduce some bias.
Indeed, we remind that the PlantCLEF dataset was built
in a collaborative manner. So that few contributors might
have provided much more observations and pictures than
many other contributors who provided few. Since we want
to evaluate the ability of a system to provide the correct
answers to all users, we rather measure the mean of the average
classi cation rate per author. Finally, our primary metric is
de ned as the following average classi cation score S:</p>
      <p>U
S = 1 X</p>
      <p>1 XPu 1
U u=1 Pu p=1 Nu;p
su;p
where U is the number of users, Pu the number of
individual plants observed by the u-th user, Nu;p the number of
pictures of the p-th plant observation of the u-th user, su;p
is the score between 1 and 0 equals to the inverse of the rank
of the correct species.
3.
3.1</p>
    </sec>
    <sec id="sec-10">
      <title>TASK2: BirdCLEF</title>
    </sec>
    <sec id="sec-11">
      <title>Context</title>
      <p>
        The bird and the plant identi cation tasks share similar
usage scenarios. The general public as well as
professionals like park rangers, ecology consultants, and of course, the
ornithologists themselves might actually be users of an
automated bird identifying system, typically in the context of
wider initiatives related to ecological surveillance or
biodiversity conservation. Using audio records rather than bird
pictures is justi ed by current practices [
        <xref ref-type="bibr" rid="ref30 ref31 ref7 ref8">8, 31, 30, 7</xref>
        ]. Birds
are actually not easy to photograph as they are most of the
time hidden, perched high in a tree or frightened by human
presence, and they can y very quickly, whereas audio calls
and songs have proved to be easier to collect and very
discriminant.
      </p>
      <p>
        Only three noticeable previous initiatives on bird species
identi cation based on their songs or calls in the context of
worldwide evaluation took place, in 2013. The rst one was
the ICML4B bird challenge joint to the international
Conference on Machine Learning in Atlanta, June 2013. It was
initiated by the SABIOD MASTODONS CNRS group11,
the university of Toulon and the National Natural History
Museum of Paris [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. It included 35 species, and 76
participants submitted their 400 runs on the Kaggle interface. The
second challenge was conducted by F. Brigs at MLSP 2013
workshop, with 15 species, and 79 participants in August
2013. The third challenge, and biggest in 2013, was
organised by University of Toulon, SABIOD and Biotope, with 80
species from the Provence, France. More than thirty teams
participated, reaching 92% of average AUC. The
description of the ICML4B best systems are given into the on-line
book [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], including for some of them reference to some useful
scripts.
      </p>
      <p>In collaboration with the organizers of these previous
challenges, BirdCLEF 2014 goes one step further by (i) signi
cantly increasing the species number by almost an order of
magnitude (ii) working on real-world social data built from
hundreds of recordists (iii) moving to a more usage-driven
and system-oriented benchmark by allowing the use of
metadata and de ning information retrieval oriented metrics.
Over11http://sabiod.org
all, the task is expected to be much more di cult than
previous benchmarks because of the higher confusion risk between
the classes, the higher background noise and the higher
diversity in the acquisition conditions (devices, recordists
uses, contexts diversity, etc.). It will therefore probably
produce substantially lower scores and o er a better progression
margin towards building real-world generalist identi cation
tools.
3.2</p>
    </sec>
    <sec id="sec-12">
      <title>Dataset</title>
      <p>The training and test data of the bird task is composed
by audio recordings collected by Xeno-canto (XC)12.
Xenocanto is a web-based community of bird sound recordists
worldwide with about 1500 active contributors that have
already collected more than 150,000 recordings of about 9000
species. Nearly 500 species from Brazilian forests are used
in the BirdCLEF dataset, representing the 500 species of
that region with the highest number of recordings, totalling
about 14,000 recordings produced by hundreds of users.
Figure 4 illustrates the geographical distribution of the dataset
samples.</p>
      <p>
        To avoid any bias in the evaluation related to the used
audio devices, each audio le has been normalized to a constant
bandwidth of 44.1 kHz and coded over 16 bits in wav mono
format (the right channel is selected by default). The
conversion from the original Xeno-canto data set was done using
mpeg, sox and matlab scripts. The optimized 16 Mel
Filter Cepstrum Coe cients for bird identi cation (according
to an extended benchmark [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]) have been computed with
their rst and second temporal derivatives on the whole set.
They were used in the best systems run in ICML4B and
NIPS4B challenges.
      </p>
      <p>Audio records are associated with various meta-data
including the species of the most active singing bird, the species
of the other birds audible in the background, the type of
sound (call, song, alarm, ight, etc.), the date and location
of the observations (from which rich statistics on species
distribution can be derived), some textual comments of the
authors, multilingual common names and collaborative quality
ratings. All of them were produced collaboratively by
Xenocanto community.
3.3</p>
      <p>Participants are asked to determine the species of the most
active singing birds in each query le. The background noise
can be used as any other meta-data, but it is forbidden to
correlate the test set of the challenge with the original
annotated Xeno-canto data base (or with any external content
as many of them are circulating on the web). More precisely
and similarly to the plant task, the whole BirdCLEF dataset
has been split in two parts, one for training (and/or
indexing) and one for testing. The test set was built by randomly
choosing 1/3 of the observations of each species whereas the
remaining observations were kept in the reference training
set. Recordings of the same species done by the same
person the same day are considered as being part of the same
observation and cannot be split across the test and training
set. The xml les containing the meta-data of the query
recordings were purged so as to erase the taxon name (the
ground truth), the vernacular name (common name of the
bird) and the collaborative quality ratings (that would not
be available at query stage in a real-world mobile
application). Meta-data of the recordings in the training set are
kept unaltered.</p>
      <p>The groups participating to the task will be asked to
produce up to 4 runs containing a ranked list of the most
probable species for each query records of the test set. Each
species will have to be associated with a normalized score in
the range [0; 1] re ecting the likelihood that this species is
singing in the sample. The primary metric used to
compare the runs will be the Mean Average Precision
averaged across all queries. Additionally, to allow easy
comparisons with the previous Kaggle ICML4B and NIPS4B
benchmarks, the AUC under the ROC curve will be
computed for each species, and averaged over all species.
4.
4.1</p>
    </sec>
    <sec id="sec-13">
      <title>TASK3: FishCLEF</title>
    </sec>
    <sec id="sec-14">
      <title>Context</title>
      <p>
        Underwater video monitoring has been widely used in
recent years for marine video surveillance, as opposed to
human manned photography or net-casting methods, since it
does not in uence sh behavior and provides a large amount
of material at the same time. However, it is impractical for
humans to manually analyze the massive quantity of video
data daily generated, because it requires much time and
concentration and it is also error prone. Automatic sh identi
cation in videos is therefore of crucial importance, in order to
estimate sh existence and quantity [
        <xref ref-type="bibr" rid="ref26 ref28 ref29">29, 28, 26</xref>
        ]. Moreover,
it would help supporting marine biologists to understand
the natural underwater environment, promote its
preservation, and study behaviors and interactions between marine
animals that are part of it. Beyond this, video-based sh
species identi cation nds applications in many other
contexts: from education (e.g. primary/high schools) to the
entertainment industry (e.g. in aquarium).
      </p>
      <p>To the best of our knowledge, this is the rst worldwide
initiative on automatic image and video based sh species
identi cation.
4.2</p>
    </sec>
    <sec id="sec-15">
      <title>Dataset</title>
      <p>The underwater video dataset used for FishCLEF, is
derived from the Fish4Knowledge13 video repository, which
contains about 700k 10-minute video clips that were taken
in the past ve years to monitor Taiwan coral reefs. The
Taiwan area is particularly interesting for studying the marine
ecosystem, as it holds one of the largest sh biodiversities of
the world with more than 3000 di erent sh species whose
taxonomy is available at 14. The dataset contains videos
recorded from sunrise to sunset showing several
phenomena, e.g. murky water, algae on camera lens, etc., which
makes the sh identi cation task more complex. Each video
has a resolution of 320x240 with 8 fps and comes with some
additional metadata including date and localization of the
recordings. Figure 5 shows 4 snapshots of 4 cameras
monitoring the coral reef by Taiwan's Kenting site and it
illustrates the complexity of automatic sh detection and
recognition in real-life settings.</p>
      <p>
        More speci cally, the FishCLEF dataset consists of about
3000 videos with several thousands of detected sh. The
sh detections were obtained by processing such
underwater videos with video analysis tools [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ] and then manually
labeled using the system in [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ].
4.3
      </p>
    </sec>
    <sec id="sec-16">
      <title>Task Description</title>
      <p>The dataset for the video-based sh identi cation task
will be released in two times: the participants will rst have
access to the training set and a few months later, they will
be provided with the testing set. The goal is to
automatically detect sh and its species. The task comprises three
sub-tasks: 1) identifying moving objects in videos by either
background modeling or object detection methods, 2)
detecting sh instances in video frames and then 3) identifying
species (taken from a subset of the most seen sh species)
of sh detected in video frames.</p>
      <p>Participants could decide to compete for only one subtask
or all subtasks. Although tasks 2 and 3 are based on still
images, participants are invited to exploit motion information
extracted from videos to support their strategies.</p>
      <p>As scoring functions, the authors are asked to produce:
ROC curves for sub-task one. In particular, precision,
recall and F-measures measured when comparing, on
a pixel basis, the ground truth binary masks and the
13www. sh4knowledge.eu
14http:// shdb.sinica.edu.tw/
output masks of the object detection methods are
required;
Recall for sh detection in still images as a function of
bounding box overlap percentage: a detection is
considered true positive if the PASCAL score between it
and the corresponding object in the ground truth is
over 0.7;
Average recall and recall per sh species for the sh
recognition subtask.</p>
      <p>The participants to the above tasks will be asked to
produce several runs containing a list of detected sh together
with their species (only for subtask 3). When dealing sh
species identi cation, a ranked list of the most probable
species (and the related likelihood values) for each detected
sh must be provided.</p>
    </sec>
    <sec id="sec-17">
      <title>5. SCHEDULE AND PERSPECTIVES</title>
      <p>LifeCLEF 2014 registrations opened in December 2013
and will close at the end of April 2014. At the time of
writing, already 61 research groups registered to at least one of
the three task and this number will continue growing. As
in any evaluation campaign, many of the registered groups
won't cross the nish line be submitting o cial runs but this
re ects at least their interest in LifeCLEF data and the
related challenges. The schedule of the ongoing and remaining
steps of LifeCLEF 2014 campaign is the following:</p>
    </sec>
    <sec id="sec-18">
      <title>ADDITIONAL AUTHORS</title>
      <p>Robert Planque (Xeno-Canto, The Netherlands, email:
r.planque@vu.nl)</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <source>[1] MAED '12: Proceedings of the 1st ACM International Workshop on Multimedia Analysis for Ecological Data</source>
          , New York, NY, USA,
          <year>2012</year>
          . ACM.
          <volume>433127</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <source>[2] Proc. of the rst workshop on Machine Learning for Bioacoustics</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A.</given-names>
            <surname>Angelova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Zhu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wong</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Shpecht</surname>
          </string-name>
          .
          <article-title>Development and deployment of a large-scale ower recognition mobile app</article-title>
          ,
          <year>December 2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>E.</given-names>
            <surname>Aptoula</surname>
          </string-name>
          and
          <string-name>
            <given-names>B.</given-names>
            <surname>Yanikoglu</surname>
          </string-name>
          .
          <article-title>Morphological features for leaf based plant recognition</article-title>
          .
          <source>In Proc. IEEE Int. Conf. Image Process</source>
          .,
          <string-name>
            <surname>Melbourne</surname>
          </string-name>
          , Australia, page
          <volume>7</volume>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>A. R.</given-names>
            <surname>Backes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Casanova</surname>
          </string-name>
          , and
          <string-name>
            <given-names>O. M.</given-names>
            <surname>Bruno</surname>
          </string-name>
          .
          <article-title>Plant leaf identi cation based on volumetric fractal dimension</article-title>
          .
          <source>International Journal of Pattern Recognition and Arti cial Intelligence</source>
          ,
          <volume>23</volume>
          (
          <issue>6</issue>
          ):
          <volume>1145</volume>
          {
          <fpage>1160</fpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>H.-T. C. Baillie</surname>
            ,
            <given-names>J.E.M.</given-names>
          </string-name>
          and
          <string-name>
            <given-names>S.</given-names>
            <surname>Stuart</surname>
          </string-name>
          .
          <article-title>2004 iucn red list of threatened species. a global species assessment</article-title>
          . IUCN, Gland, Switzerland and Cambridge, UK,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>F.</given-names>
            <surname>Briggs</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Lakshminarayanan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Neal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X. Z.</given-names>
            <surname>Fern</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Raich</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. J.</given-names>
            <surname>Hadley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. S.</given-names>
            <surname>Hadley</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M. G.</given-names>
            <surname>Betts</surname>
          </string-name>
          .
          <article-title>Acoustic classi cation of multiple simultaneous bird species: A multi-instance multi-label approach</article-title>
          .
          <source>The Journal of the Acoustical Society of America</source>
          ,
          <volume>131</volume>
          :
          <fpage>4640</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>J.</given-names>
            <surname>Cai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Ee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Pham</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Roe</surname>
          </string-name>
          , and
          <string-name>
            <surname>J. Zhang.</surname>
          </string-name>
          <article-title>Sensor network for the monitoring of ecosystem: Bird species recognition</article-title>
          .
          <source>In Intelligent Sensors, Sensor Networks and Information</source>
          ,
          <year>2007</year>
          .
          <source>ISSNIP</source>
          <year>2007</year>
          . 3rd International Conference on, pages
          <volume>293</volume>
          {
          <fpage>298</fpage>
          ,
          <string-name>
            <surname>Dec</surname>
          </string-name>
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>G.</given-names>
            <surname>Cerutti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Tougne</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Vacavant</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Coquin</surname>
          </string-name>
          .
          <article-title>A parametric active polygon for leaf segmentation and shape estimation</article-title>
          .
          <source>In International Symposium on Visual Computing</source>
          , pages
          <volume>202</volume>
          {
          <fpage>213</fpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>O.</given-names>
            <surname>Dufour</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Artieres</surname>
          </string-name>
          ,
          <string-name>
            <surname>H. GLOTIN</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Giraudet</surname>
          </string-name>
          .
          <article-title>Clusterized mel lter cepstral coe cients and support vector machines for bird song iden cation</article-title>
          .
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>K. J.</given-names>
            <surname>Gaston and M. A. O'Neill</surname>
          </string-name>
          .
          <source>Automated species identi cation: why not? Philosophical Transactions of the Royal Society of London. Series B: Biological Sciences</source>
          ,
          <volume>359</volume>
          (
          <issue>1444</issue>
          ):
          <volume>655</volume>
          {
          <fpage>667</fpage>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>H.</given-names>
            <surname>Glotin</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Sueur</surname>
          </string-name>
          .
          <article-title>Overview of the rst international challenge on bird classi cation</article-title>
          .
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>H.</given-names>
            <surname>Go</surname>
          </string-name>
          eau,
          <string-name>
            <given-names>P.</given-names>
            <surname>Bonnet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Joly</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Bakic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Barbe</surname>
          </string-name>
          , I. Yahiaoui,
          <string-name>
            <given-names>S.</given-names>
            <surname>Selmi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Carre</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Barthelemy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Boujemaa</surname>
          </string-name>
          , et al.
          <article-title>Pl@ ntnet mobile app</article-title>
          .
          <source>In Proceedings of the 21st ACM international conference on Multimedia</source>
          , pages
          <volume>423</volume>
          {
          <fpage>424</fpage>
          . ACM,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>H.</given-names>
            <surname>Go</surname>
          </string-name>
          eau,
          <string-name>
            <given-names>P.</given-names>
            <surname>Bonnet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Joly</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Boujemaa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Barthelemy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.-F.</given-names>
            <surname>Molino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Birnbaum</surname>
          </string-name>
          , E. Mouysset, and
          <string-name>
            <given-names>M.</given-names>
            <surname>Picard</surname>
          </string-name>
          .
          <article-title>The ImageCLEF 2011 plant images classi cation task</article-title>
          .
          <source>In CLEF working notes</source>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>H.</given-names>
            <surname>Go</surname>
          </string-name>
          eau,
          <string-name>
            <given-names>P.</given-names>
            <surname>Bonnet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Joly</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Yahiaoui</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Barthelemy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Boujemaa</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.-F.</given-names>
            <surname>Molino</surname>
          </string-name>
          .
          <article-title>The imageclef 2012 plant identi cation task</article-title>
          .
          <source>In CLEF working notes</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>H.</given-names>
            <surname>Go</surname>
          </string-name>
          <article-title>eau, A</article-title>
          . Joly,
          <string-name>
            <given-names>S.</given-names>
            <surname>Selmi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Bonnet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Mouysset</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Joyeux</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.-F.</given-names>
            <surname>Molino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Birnbaum</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Bathelemy</surname>
          </string-name>
          , and
          <string-name>
            <given-names>N.</given-names>
            <surname>Boujemaa</surname>
          </string-name>
          .
          <article-title>Visual-based plant species identi cation from crowdsourced data</article-title>
          .
          <source>In ACM conference on Multimedia</source>
          , pages
          <volume>813</volume>
          {
          <fpage>814</fpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>A.</given-names>
            <surname>Hazra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Deb</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kundu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Hazra</surname>
          </string-name>
          , et al.
          <article-title>Shape oriented feature selection for tomato plant identi cation</article-title>
          .
          <source>International Journal of Computer Applications Technology and Research</source>
          ,
          <volume>2</volume>
          (
          <issue>4</issue>
          ):449{meta,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>A.</given-names>
            <surname>Joly</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Goeau</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Bonnet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Bakic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Barbe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Selmi</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Yahiaoui</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Carre</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Mouysset</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.-F.</given-names>
            <surname>Molino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Boujemaa</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Barthelemy</surname>
          </string-name>
          .
          <article-title>Interactive plant identi cation based on social image data</article-title>
          .
          <source>Ecological Informatics</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>A.</given-names>
            <surname>Joly</surname>
          </string-name>
          , H. Goeau,
          <string-name>
            <given-names>P.</given-names>
            <surname>Bonnet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Bakic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.-F.</given-names>
            <surname>Molino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Barthelemy</surname>
          </string-name>
          , and
          <string-name>
            <given-names>N.</given-names>
            <surname>Boujemaa</surname>
          </string-name>
          .
          <article-title>The Imageclef Plant Identi cation Task 2013</article-title>
          . In International workshop on
          <article-title>Multimedia analysis for ecological data</article-title>
          , Barcelone, Espagne, Oct.
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>I.</given-names>
            <surname>Kavasidis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Palazzo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Salvo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Giordano</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Spampinato</surname>
          </string-name>
          .
          <article-title>An innovative web-based collaborative platform for video annotation</article-title>
          .
          <source>Multimedia Tools and Applications</source>
          , pages
          <volume>1</volume>
          {
          <fpage>20</fpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>H.</given-names>
            <surname>Kebapci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Yanikoglu</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.</given-names>
            <surname>Unal</surname>
          </string-name>
          .
          <article-title>Plant image retrieval using color, shape and texture features</article-title>
          .
          <source>The Computer Journal</source>
          ,
          <volume>54</volume>
          (
          <issue>9</issue>
          ):
          <volume>1475</volume>
          {
          <fpage>1490</fpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>N.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. N.</given-names>
            <surname>Belhumeur</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Biswas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. W.</given-names>
            <surname>Jacobs</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. J.</given-names>
            <surname>Kress</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I. C.</given-names>
            <surname>Lopez</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J. V. B.</given-names>
            <surname>Soares</surname>
          </string-name>
          . Leafsnap:
          <article-title>A computer vision system for automatic plant species identi cation</article-title>
          .
          <source>In European Conference on Computer Vision</source>
          , pages
          <volume>502</volume>
          {
          <fpage>516</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <surname>D.-J. Lee</surname>
            ,
            <given-names>R. B.</given-names>
          </string-name>
          <string-name>
            <surname>Schoenberger</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Shiozawa</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          <string-name>
            <surname>Xu</surname>
            , and
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Zhan</surname>
          </string-name>
          .
          <article-title>Contour matching for a sh recognition and migration-monitoring system</article-title>
          .
          <source>In Optics East</source>
          , pages
          <volume>37</volume>
          {
          <fpage>48</fpage>
          .
          <source>International Society for Optics and Photonics</source>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>S.</given-names>
            <surname>Mouine</surname>
          </string-name>
          ,
          <string-name>
            <surname>I.</surname>
          </string-name>
          <article-title>Yahiaoui, and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Verroust-Blondet</surname>
          </string-name>
          .
          <article-title>Advanced shape context for plant species identi cation using leaf image retrieval</article-title>
          .
          <source>In ACM International Conference on Multimedia Retrieval</source>
          , pages
          <volume>49</volume>
          :1{
          <issue>49</issue>
          :
          <fpage>8</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <surname>M.-E. Nilsback</surname>
            and
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Zisserman</surname>
          </string-name>
          .
          <article-title>Automated ower classi cation over a large number of classes</article-title>
          .
          <source>In Indian Conference on Computer Vision, Graphics and Image Processing</source>
          , pages
          <volume>722</volume>
          {
          <fpage>729</fpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>M. R.</given-names>
            <surname>Shortis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ravanbakskh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Shaifat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. S.</given-names>
            <surname>Harvey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Mian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. W.</given-names>
            <surname>Seager</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. F.</given-names>
            <surname>Culverhouse</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. E.</given-names>
            <surname>Cline</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D. R.</given-names>
            <surname>Edgington</surname>
          </string-name>
          .
          <article-title>A review of techniques for the identi cation and measurement of sh in underwater stereo-video image sequences</article-title>
          .
          <source>In SPIE Optical Metrology</source>
          <year>2013</year>
          , pages
          <fpage>87910G</fpage>
          {
          <fpage>87910G</fpage>
          .
          <source>International Society for Optics and Photonics</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>C.</given-names>
            <surname>Spampinato</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Beauxis-Aussalet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Palazzo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Beyan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ossenbruggen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Boom</surname>
          </string-name>
          , and
          <string-name>
            <given-names>X.</given-names>
            <surname>Huang</surname>
          </string-name>
          .
          <article-title>A rule-based event detection system for real-life underwater domain</article-title>
          .
          <source>Machine Vision and Applications</source>
          ,
          <volume>25</volume>
          (
          <issue>1</issue>
          ):
          <volume>99</volume>
          {
          <fpage>117</fpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>C.</given-names>
            <surname>Spampinato</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.-H.</given-names>
            <surname>Chen-Burger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Nadarajan</surname>
          </string-name>
          , and R. B. Fisher.
          <article-title>Detecting, tracking and counting sh in low quality unconstrained underwater videos</article-title>
          .
          <source>In VISAPP (2)</source>
          , pages
          <fpage>514</fpage>
          {
          <fpage>519</fpage>
          .
          <string-name>
            <surname>Citeseer</surname>
          </string-name>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>C.</given-names>
            <surname>Spampinato</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Giordano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. Di</given-names>
            <surname>Salvo</surname>
          </string-name>
          , Y.
          <string-name>
            <surname>-H. J. Chen-Burger</surname>
            , R. B. Fisher, and
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Nadarajan</surname>
          </string-name>
          .
          <article-title>Automatic sh classi cation for underwater species behavior understanding</article-title>
          .
          <source>In Proceedings of ACM ARTEMIS</source>
          <year>2010</year>
          , pages
          <fpage>45</fpage>
          {
          <fpage>50</fpage>
          . ACM,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>M.</given-names>
            <surname>Towsey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Planitz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Nantes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wimmer</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Roe</surname>
          </string-name>
          .
          <article-title>A toolbox for animal call recognition</article-title>
          .
          <source>Bioacoustics</source>
          ,
          <volume>21</volume>
          (
          <issue>2</issue>
          ):
          <volume>107</volume>
          {
          <fpage>125</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>V. M.</given-names>
            <surname>Trifa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. N.</given-names>
            <surname>Kirschel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. E.</given-names>
            <surname>Taylor</surname>
          </string-name>
          , and
          <string-name>
            <given-names>E. E.</given-names>
            <surname>Vallejo</surname>
          </string-name>
          .
          <article-title>Automated species recognition of antbirds in a mexican rainforest using hidden markov models</article-title>
          .
          <source>The Journal of the Acoustical Society of America</source>
          ,
          <volume>123</volume>
          :
          <fpage>2424</fpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>Q. D.</given-names>
            <surname>Wheeler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. H.</given-names>
            <surname>Raven</surname>
          </string-name>
          , and
          <string-name>
            <given-names>E. O.</given-names>
            <surname>Wilson</surname>
          </string-name>
          .
          <source>Taxonomy: Impediment or expedient? Science</source>
          ,
          <volume>303</volume>
          (
          <issue>5656</issue>
          ):
          <fpage>285</fpage>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>