=Paper= {{Paper |id=Vol-1391/165-CR |storemode=property |title=A MARFCLEF Approach to LifeCLEF 2015 Tasks |pdfUrl=https://ceur-ws.org/Vol-1391/165-CR.pdf |volume=Vol-1391 |dblpUrl=https://dblp.org/rec/conf/clef/Mokhov15 }} ==A MARFCLEF Approach to LifeCLEF 2015 Tasks== https://ceur-ws.org/Vol-1391/165-CR.pdf
      A MARFCLEF Approach to LifeCLEF 2015
                   Tasks

                                 Serguei A. Mokhov

                      Concordia University, Montreal, Canada,
                            mokhov@cse.concordia.ca



       Abstract. We make the first use of MARF of fast signal-processing
       and related techniques for LifeCLEF 2015 identification tasks. We build
       an application based on a pattern recognition pipeline implemented in
       an open-source Modular A* Recognition Framework (MARF). MARF is
       also the name of the team in this submission. For that purpose to test
       and select among available algorithm a set of suitable algorithms. This is
       the first implementation of the application we call MARFCLEFApp tested on
       a very small subset of algorithms available. The approach covers Bird-,
       Plant-, and FishCLEF tasks. It was expected the bird task would be the
       best for the presented approach given MARF’s original intent for audio
       recognition. However, lack of enough run-time it turned out to be the
       worst one and is under the investigation. Processing FishCLEF however
       yield the best of the three tasks, which was expected to be the worst.
       Team MARF’s results for FishCLEF were the 2nd team after with the
       Run 1 being the best of the three.


1     Introduction
1.1    Introduction to MARF
Modular Audio Recognition Framework (MARF) is an open-source collection of
pattern recognition APIs and their implementation for unsupervised and super-
vised machine learning and classification written in Java [6]. One of its design
purposes is to act as a testbed to try out common and novel algorithms found in
literature and industry for sample loading, preprocessing, feature extraction, and
training and classification tasks. One of the main goals and design approaches
of MARF is to provide scientists with a tool for comparison of the algorithms in
a homogeneous environment and allowing the dynamic module selection (from
the implemented modules) based on the configuration options supplied by ap-
plications. Over the course of several years MARF accumulated a fair number
of implementations for each of the pipeline stages allowing reasonably compre-
hensive comparative studies of the algorithms combinations, and studying their
combined behavior and other properties when used for various pattern recog-
nition tasks. MARF is also designed to be very configurable while keeping the
generality and some sane default settings to “run-off-the-shelf” well. MARF and
its derivatives, and applications were also used beyond audio processing tasks
due to the generality of the design and implementation in [5,7] and other works.
                     Fig. 1. MARF’s Pattern Recognition Pipeline


   The methodology behind MARFCLEFApp builds on the successes and failures of
the previous similar applications used for different tasks, such as MARFCAT [9],
HEp2IdentApp and MARFIIFApp and others were the source of inspiration for all
three tasks. MARF [16] is the core framework behind them all.


1.2    Common Index File Format

All tasks’ data are annotated by a unified XML index format, so technically
any task’s metadata can be encoded in it. The input index format is otherwise
inherited from MARFCAT [15,13]. This format looks like the below.


        
                
                
                
                
                
                
       
       
               
                       Bourne shell script text executable
                       
               
               
                       

                         
                         
                         
                         
                         
                
        
        
                
                         WAV
                         
                
                
                         

                         
                         
                         
                         
                         
                
        
        
                
                       ASCII English text
                       
               
               
                       

  13075
  23546
  3
  Flower
  4742
  Rosaceae
  Filipendula
  Filipendula vulgaris Moench
  gilles carcasses
  2013-7-16
  Ipiais-Rhus
  49.12491
  2.06075
  PlantCLEF2015
  
  
  Train

                         
                         
                         
                         
                         
                
        
        
                
                         PNG