<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Adaptation of Compositional Data Analysis in Deep Learning to Predict Pasture Biomass Proportions</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Badri Narayanan</string-name>
          <email>badri.narayanan@insight-centre.org</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mohamed Saadeldin</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Paul Albert</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kevin McGuinness</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Noel E. O'Connor</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Brian Mac Namee</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Insight SFI Centre for Data Analytics</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>School of Computer Science, University College Dublin</institution>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>School of Electronic Engineering</institution>
          ,
          <addr-line>Dublin City University</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>Dry biomass weight measurements from a quadrat in a paddock for grass, clover and weeds when expressed as percentages of total dry herbage mass are compositional in nature. Unlike real valued regression problems, prediction of compositional data is handled di erently in statistics because of its closure property where the components of the composition are positive data adding up to a constant sum and is therefore constrained in the simplex space, in our case 100%. Our motivation in this paper was to study whether the adaptation of compositional data analysis (CoDa) techniques in deep learning improves the prediction results over the best performing deep learning model we used in our earlier paper [Narayanan et al., 2021]. Although the log ratio transformation of targets is an appropriate adaptation of CoDa and is interesting for Biomass prediction, our study indicates that the CoDa adaptation does not improve the prediction errors over our earlier method.</p>
      </abstract>
      <kwd-group>
        <kwd>Deep Learning</kwd>
        <kwd>Compositional Data Analysis</kwd>
        <kwd>Isometric Log Ratio</kwd>
        <kwd>Simplex</kwd>
        <kwd>Softmax</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>The dairy industry uses clover and grass as fodder for cows. Grass and clover
are grown together in elds to improve the consistency of high-quality biomass
yield and to reduce the need for external fertilizers. Accurate estimation of the
dry biomass percentages of grass and clover species (as well as weeds) in elds
is very important for determining optimal seeding density, fertilizer application
and elimination of weeds. The dry biomass weights of the individual components,
when expressed as percentages of overall weight of the harvested and dried
biomass, are compositional in nature.</p>
      <p>
        Compositional data are positive data summing to a constant value, and
measure relative changes in the components. They are constrained in the simplex
space. Standard multivariate statistical analysis and regression techniques assume
Copyright 2021 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0)
the sample space to be R. The sample space for compositional data, however, is
restricted in the simplex due to the sum constraint. Compositional data analysis
(CoDa)
        <xref ref-type="bibr" rid="ref2">Aitchison [2005</xref>
        ] is a set of mathematical techniques that helps in analysing
the relative proportions of individual components.
      </p>
      <p>
        In this paper, we examine the applicability of the principles of CoDa to
the problem of predicting biomass composition from farm imagery using deep
learning. We present an adaptation of the approach used in statistical CoDa
where the compositional data is transformed from the simplex space to the real
space using the isometric log ratio (ILR) transformation [
        <xref ref-type="bibr" rid="ref6">Egozcue et al., 2003</xref>
        ].
      </p>
      <p>
        These transformed values are used as targets with a model that predicts the
dry mass fractions of grass, clover and weeds from images of a section of grass
marked with a square frame (known as a quadrat ). We compare the prediction
results to the best performing model from our previous paper [
        <xref ref-type="bibr" rid="ref14">Narayanan et al.,
2021</xref>
        ] that uses the composition data directly as a target. This comparison shows
that the addition of the approaches from CoDa do not improve the performance
of the model.
      </p>
      <p>In the rest of this paper, Section 2 outlines related work for CoDa techniques
in applied statistics and highlights the few available adaptations by the machine
learning community. This is followed by a description of our experimental design
in Section 3 and a discussion of the results in Section 4. Finally, Section 5
summarises the ndings from this paper and suggests directions for future work.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>This section introduces compositional data analysis, establishes the relationship
between the softmax transformation and the simplex, and reviews the limited
applications of CoDa to machine learning and biomass composition prediction.
2.1</p>
      <sec id="sec-2-1">
        <title>Compositional Data Analysis</title>
        <p>
          Compositional data of D-parts is constrained in the simplex space SD by a
constant sum of the components. The sum constraint induces negative correlations
between variables [
          <xref ref-type="bibr" rid="ref5">Chayes, 1960</xref>
          ], which violates the independence assumptions
and the central limit theorem [
          <xref ref-type="bibr" rid="ref1">Aitchison, 1982</xref>
          , 2005]. Therefore, the sum
constraint needs to be broken before standard statistical methods can be applied for
analysis. This is often achieved by transforming the compositional data from the
simplex space into real space using log ratio transformations [
          <xref ref-type="bibr" rid="ref16">Pawlowsky-Glahn
and Egozcue, 2006</xref>
          ].
        </p>
        <p>
          <xref ref-type="bibr" rid="ref1">Aitchison [1982</xref>
          ] formalises three key principles of CoDa: scale invariance,
permutation invariance, and subcompositional coherence. Any statistical analysis
of compositional data must conform to these principles. Scale invariance is
characterised by the relative information that the compositional data carry, rather
than the individual size of the components. Permutation invariance mandates that
any statistical inference should be independent of the ordering of the components
within the composition. Finally, subcompositional coherence stipulates that results
from the analysis of components in a full composition should not contradict the
results from a subcomposition, i.e., the distances between two compositions
should decrease when subcompositions of the original ones are considered, and
that scale invariance is preserved within arbitrary subcompositions.
Log Ratio Transformations The centered log ratio (CLR) and isometric log
ratio (ILR) are the prevalent log ratios used in modern CoDa applications. In
a composition x of D 2 components, the sum constraint of compositional
data implies that there is at least one component that is negatively correlated
with another in the composition, and that there are at most D 1 independent
components. The composition is therefore constrained in a D 1 dimension vector
space of SD, de ned as a D-part simplex on R. The values of these components,
when scaled by their geometric mean and then log transformed, are mapped to a
hyperplane in RD and referred to as the Centred Log Ratio (CLR):
clr(x) = z = ln
x1
g(x)
x2
g(x)
; ln
; : : : ; ln
xD
g(x)
;
where g(x) is the geometric mean of the k components of x:
        </p>
        <p>
          g(x) = Dpx1x2 : : : xD:
          <xref ref-type="bibr" rid="ref17">Pawlowsky-Glahn et al. [2007</xref>
          ] highlight that the CLR introduces a mathematical
complexity in the form of a singular covariance matrix where the determinant is
zero. Additionally, the CLR transformation is not subcompositionally coherent
as the geometric mean of a subcomposition will di er from that of the whole
composition.
        </p>
        <p>
          These drawbacks led to the introduction of the isometric log ratio (ILR)
by
          <xref ref-type="bibr" rid="ref6">Egozcue et al. [2003</xref>
          ] where an isometry from SD to RD 1 is achieved from
an orthonormal basis derived from Gram-Schmidt orthogonalization. In addition
to being an isometry, the ILR is an isomorphism too, and conforms to the three
CoDa principles outlined above. Following Egozcue et al. ILR can be de ned as
follows:
        </p>
        <p>ilr(x) = [hx; e1ia; hx; e2ia; : : : ; hx; eD 1ia];
where [e1; e2; : : : ; eD 1] is an orthonormal basis in the simplex, the default
one being the orthonormal basis built by Egozcue et al. using Gram-Schmidt
orthogonalization. hx; eiia represents the Aitchison inner product between x and
ei. The inverse of ILR transformation is given by
x = ilr 1(y) =</p>
        <p>
          D 1
M(hy; ~eiia
i=1
ei);
where ~ei = ilr(ei) for all i. L and J denote the compositional operations of
perturbation and power transformation described in the Aitchison geometry of
the simplex [
          <xref ref-type="bibr" rid="ref16">Pawlowsky-Glahn and Egozcue, 2006</xref>
          ].
        </p>
        <p>
          Although both the CLR and the ILR are isometric and allow for statistical
operations in the Euclidean space, the ILR is the most prevalent in modern
applications of CoDa, simply because of its representation of the composition in
an orthogonal coordinate system. Unlike CLR, the ILR allows for the association
of angles and distances in the simplex with those in the real space, and adheres
to the 3 key principles of CoDa, thereby making it a better choice. For interested
readers,
          <xref ref-type="bibr" rid="ref24">Tolosana-Delgado [2008</xref>
          ] provide a short and comprehensive mathematical
representation of these log ratios and other foundational aspects of CoDa.
Handling zero values Zero values in compositional data, if not handled, can be
problematic. Essential zeros refer to the absence of a component in the observation,
whereas rounded zeros indicate approximate recording of a component below
detection limit [
          <xref ref-type="bibr" rid="ref13">Mart n-Fernandez et al., 2003</xref>
          ] and need to be addressed. Rounded
zeros are replaced with a threshold value using a multiplicative replacement
method that maintains the constant sum of the composition.
        </p>
        <p>
          Applications of CoDa Applied statistics has seen many applications of
Compositional Data Analysis (CoDa) in geostatistics [
          <xref ref-type="bibr" rid="ref25">Tolosana-Delgado et al., 2019</xref>
          ],
bioinformatics, environmental science and chemistry [
          <xref ref-type="bibr" rid="ref7">Filzmoser et al., 2010</xref>
          ]
where many problems are compositional in nature.
          <xref ref-type="bibr" rid="ref12">Liu et al. [2016</xref>
          ] trace the
underlying factors that in uence rock weathering and mineralisation in stream
sediments of the Nanling tectono-magmatic belt using robust factor analysis and
compositional data analysis.
        </p>
        <p>
          In the biomass composition problem studied in this paper the targets are
compositional.
          <xref ref-type="bibr" rid="ref2">Aitchison [2005</xref>
          ] presents an example similar to this that quanti es
the extent of dependence of sediment composition on water depth in arctic lakes.
Three mutually exclusive and exhaustive constituents (sand, silt and clay) are
recorded in their proportions by weight for 39 samples at di erent water depths.
The objective is to quantify the extent of dependence of sediment composition on
water depth and hence, identify the nature of sedimentation process. In another
similar example the relative mass of water, fat and protein in a meat sample is
predicted from its NIR spectrum [
          <xref ref-type="bibr" rid="ref26">Verwaeren, 2014</xref>
          ].
2.2
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Softmax and the Simplex</title>
        <p>In the context of the deep learning experiments in this work, it is necessary to
understand the relationship between the softmax activation function used in
neural networks and the simplex space. The softmax is a mathematical function
most commonly used in the output layers of neural networks for multi-class
classi cation, and provides a generalisation of the sigmoid function in logistic
regression. The softmax function has been extensively used in state-of-the-art
deep neural network models and has been used very successfully in classi cation
and regression problems.</p>
        <p>In a typical multi-class classi cation setting, the softmax function converts a
vector of k real values into a vector of k probabilities that sum to 1|a probability
distribution for the predicted classes in the target. Each of these probabilities is
a proportion of the relative scale of the corresponding individual component of
the input vector:
(zi) =</p>
        <p>ezi
PK
j=1 ezj</p>
        <p>
          for i = 1; : : : ; K and z = (z1; :::; zK ) 2 RK :
[
          <xref ref-type="bibr" rid="ref3">Amos, 2019</xref>
          , Theorem 4, pg 13] provides a theorem and proof that establishes
the relationship between the softmax activation function and the simplex, where
the softmax acts as the projection of a point x 2 Rn onto the interior of the
(n 1)-simplex.
2.3
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>Applications of CoDa in Machine Learning &amp; Biomass</title>
      </sec>
      <sec id="sec-2-4">
        <title>Prediction</title>
        <p>
          There are limited examples in the literature of the adaptation of the CoDa
principles to machine learning. The use of random forest models trained on
data pre-processed with log ratio transformations [
          <xref ref-type="bibr" rid="ref8">Harris and Grunsky, 2015</xref>
          ;
          <xref ref-type="bibr" rid="ref23">Talebi et al., 2018</xref>
          ] illustrate the few attempts in the use of machine learning;
however, this area remains largely unexplored and there are no speci c instances
of literature of experiments / bene ts of adapting CoDa techniques with deep
learning.
        </p>
        <p>
          A body of recent research [
          <xref ref-type="bibr" rid="ref20">Skovsen et al., 2018</xref>
          ;
          <xref ref-type="bibr" rid="ref11">Larsen et al., 2018</xref>
          ;
          <xref ref-type="bibr" rid="ref19">Sindic
and Riday, 2020</xref>
          ;
          <xref ref-type="bibr" rid="ref4">Castro et al., 2020</xref>
          ;
          <xref ref-type="bibr" rid="ref22">Sun et al., 2021</xref>
          ] employs state-of-the-art
deep learning techniques to predict dry matter yield from proximal and UAV
images of grass paddocks. These works typically rely on transfer learning [
          <xref ref-type="bibr" rid="ref15">Pan
and Yang, 2009</xref>
          ] to transfer latent representations that were learnt from large
corpus of images by deep networks like VGG16 [
          <xref ref-type="bibr" rid="ref18">Simonyan and Zisserman, 2014</xref>
          ]
and Resnet [
          <xref ref-type="bibr" rid="ref9">He et al., 2016</xref>
          ]. To our knowledge, there are no references to the
adaptation of CoDa techniques in these deep learning approaches. Therefore it
is interesting to explore the integrated approach of CoDa principles and deep
learning to solve the biomass composition prediction problem. The next section
describes the design of an experiment to assess the e ectiveness of adapting the
concept of isometric log ratio transformation (ILR) introduced by
          <xref ref-type="bibr" rid="ref6">Egozcue et al.
[2003</xref>
          ] to the deep learning architecture used in
          <xref ref-type="bibr" rid="ref14">Narayanan et al. [2021</xref>
          ].
3
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Experimental Design</title>
      <p>This section describes the design of a set of experiments undertaken to assess the
e ectiveness of adopting CoDa techniques in deep learning models for biomass
composition prediction. The section describes the dataset used, the architecture
of the models built and the experimental method used.
3.1</p>
      <sec id="sec-3-1">
        <title>Data Description</title>
        <p>
          The Grass Clover Image Dataset for the Biomass Prediction Challenge [
          <xref ref-type="bibr" rid="ref21">Skovsen
et al., 2019</xref>
          ] provides us with 261 images of quadrats of grass with corresponding
dry biomass composition of grass, white clover, red clover and weeds. These are
expressed in terms of their weights proportional to the total biomass, and sum
to 1. Five example images and their target values are presented in Table 1.
Image
        </p>
        <p>Grass</p>
        <p>Weeds
ilr 1
ilr 2
ilr 3
Proportions of weight</p>
        <p>White
clover</p>
        <p>Red
clover</p>
        <p>ILR coe cients</p>
        <p>
          The dataset was divided into 209 training examples and 52 validation examples.
All the images were standardized to 500 500 pixels and, to ensure adequate
training examples for the network to learn e ciently, the training set images were
subject to 10 expansion through runtime augmentations [
          <xref ref-type="bibr" rid="ref10">Krizhevsky et al., 2012</xref>
          ].
The transformations in the augmentation included a rotation (up to 15 ), zoom
( 15%), height and width shift (20%), shearing ( 15%), horizontal re ections,
channel shift ( 50), and image wrapping to minimize loss of information.
        </p>
        <p>We use the Compositional Statistics package5 in Python for the ILR and
inverse ILR transformations in this work. Given the non-zero minimum values of
each component in the dataset presented in Table 2, a threshold value of 0.001
5 Compositional Statistics: https://composition-stats.readthedocs.io/</p>
        <p>Grass
0.051104</p>
        <p>White Clover
was selected for undetectable measurements. Zero values in the data are then
replaced with this minimum threshold value using the multiplicative replacement
method, while ensuring the sum closure of 1. The transformed targets from the
ve examples presented in Table 1 are also shown.
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Model Architecture</title>
        <p>
          The model architecture used in this study is the same convolutional neural
network (CNN) architecture from our previous work [
          <xref ref-type="bibr" rid="ref14">Narayanan et al., 2021</xref>
          ]
where we used weak supervision [
          <xref ref-type="bibr" rid="ref27">Zhou, 2018</xref>
          ], transfer learning from a VGG-16
model pre-trained on the ImageNet dataset, and a multi-target output layer
with softmax regression trained to minimise root mean squared error (RMSE)
loss. The weak supervision is necessary as the dataset has missing values in 104
examples for the red and white clover subcomposition while the overall clover
values were available, and therefore had to be imputed with their corresponding
mean values and readjusted to match the overall clover proportion in the total
biomass. In doing so, the approximated examples were given a lesser weighting
during the loss calculation in a ratio of 1:1.5 with respect to the examples with
recorded values. Latent feature representations through transfer of weights from
the nal convolutional layer of the pre-trained VGG-16 network enabled faster
and better optimization of the two trainable dense layers with 4,096 and 256
neurons. The dense layers were equipped with ReLU activations and uniform
random kernel initialization, and each of the dense layers was followed by a layer
of batch normalization to help prevent over tting. The network was compiled
with the Adam optimizer with an initial learning rate 0:001 and decay factor
10 3=200. The output layer had 4 neurons with softmax activation, each neuron
corresponding to regression output for grass, white clover, red clover and weeds. As
a direct interpretation of
          <xref ref-type="bibr" rid="ref3">Amos's theorem [Amos, 2019</xref>
          ], the softmax probabilities
from the output layer can be construed as equivalent to the simplical proportions
of the predicted values of the individual components. This provides a framework
for interpreting the results of the model with softmax outputs in the context of
the simplex. In our current work, we transform the 4 target variables of grass,
white clover, red clover and weeds into 3 ILR coe cients. We modify the output
layer of the network in this experiment to 3 neurons with linear activations for
real valued outputs, and use RMSE as the loss function.
An examination of the training and validation losses during model training,
presented in Figure 1, con rms the ability of the model to learn from the ILR
transformed targets in the training data. The top row in Figure 2 shows scatter
plots from the baseline model results from our previous work, of the actual values
versus predicted values for each component of the biomass composition for data
in the validation set. Similarly, the bottom row shows scatter plots of actual vs
predicted values for the ILR transformed targets. It is interesting to note that
the ILR model predicts the proportions of grass and white clover reasonably
well. In the case of red clover, however, the predictions are generally erroneous.
Prediction of weeds is reasonably accurate when the actual weed percentage is
less than 20% of the composition, but erroneous results can be observed above
this range.
        </p>
        <p>The results from our previous work is the baseline for comparison with the
results of the CoDa adaptation experiment. Table 3 compares the performance of
this baseline model against the model trained to predict ILR transformed targets.
It is clear that the baseline model outperforms the CoDa-inspired model.</p>
        <p>This experiment shows convincing evidence of the ability to learn from ILR
transformed compositional data. Nevertheless, the CoDa adaptation results do
not improve upon the performance of this baseline model. We surmise that
there are two reasons for this. First, it is interesting to note that the softmax
function projects the real valued output vector of the network onto the simplex
(as explained in Section 2.2). In our case the training targets too are in the same
simplical dimensions, and therefore, it is e ective within the simplex. Second,
the premise of CoDa is to ensure data transformations that will satisfy the
requirements of standard statistical analyses, like the central limit theorem and
conformance to the rules of linear independence. On the contrary, deep neural
networks do not require such assumptions and have the ability to e ectively
approximate a non-linear estimation function to t an unknown distribution of
the target data. The problem that CoDa is designed to solve using standard
statistical methods does not exist in deep neural networks. Therefore, the CoDa
adaptation for deep neural networks using the ILR transformation of the targets is
an additional step over a network with an intrinsic ability to learn these non-linear
functions. We believe that these two reasons provide a plausible explanation for</p>
        <p>Validation metrics RMSE, MAE
Grass</p>
        <p>White clover</p>
        <p>Red clover</p>
        <p>Weeds</p>
        <p>Overall
Model</p>
        <p>RMSE MAE RMSE MAE RMSE MAE RMSE MAE RMSE MAE
Baseline 8.00
ILR 8.87
6.21
7.04
7.44
12.64
5.99
8.94
7.33
11.98
5.63
7.39
5.68
5.63
4.20
2.95
7.11
9.78
5.51
6.58
5</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Conclusion</title>
      <p>the better performance of the network with softmax activation over the ILR
transformed approach.</p>
      <p>In this paper we explored the usefulness of techniques from statistical
compositional data analysis in a deep learning context. In particular, we tested this
with a pasture biomass prediction problem, which is compositional in nature. We
presented an approach that transformed the biomass composition data using the
isometric log ratio (ILR) from the simplex space onto the real space and used
these transformed targets for training a deep network. This paper demonstrates
that it is possible to train a reasonably accurate prediction model using this
approach. Nevertheless, based on the evidence of the results, we conclude that
the softmax works better in the deep learning context than a model trained to
predict targets transformed using ILR. This suggests that it is not useful to adapt
techniques from statistics CoDa to deep learning models. Our further work will
focus on improving the prediction for red clover and weeds.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgements</title>
      <p>This publication has emanated from research conducted with the nancial support
of Science Foundation Ireland under Grant number [16/RC/3835]. For the purpose
of Open Access, the author has applied a CC BY public copyright licence to any
Author Accepted Manuscript version arising from this submission.</p>
      <p>Our sincere thanks to Prof. Claire Gormley, School of Mathematics and
Statistics, University College Dublin, for her suggestion to explore Compositional
Data Analysis.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Aitchison</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>The statistical analysis of compositional data</article-title>
          .
          <source>Journal of the Royal Statistical Society: Series B (Methodological)</source>
          <volume>44</volume>
          (
          <issue>2</issue>
          ),
          <volume>139</volume>
          {
          <fpage>160</fpage>
          (
          <year>1982</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Aitchison</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>A Concise Guide to Compositional Data Analysis p</article-title>
          .
          <volume>134</volume>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Amos</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Di erentiable optimization-based modeling for machine learning</article-title>
          .
          <source>Ph.D. thesis</source>
          ,
          <source>PhD thesis</source>
          . Carnegie Mellon University (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Castro</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Marcato</surname>
            <given-names>Junior</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Polidoro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Osco</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.P.</given-names>
            ,
            <surname>Goncalves</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            ,
            <surname>Rodrigues</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Santos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Jank</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Barrios</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Valle</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.</surname>
          </string-name>
          , et al.:
          <article-title>Deep learning applied to phenotyping of biomass in forages with uav-based rgb imagery</article-title>
          .
          <source>Sensors</source>
          <volume>20</volume>
          (
          <issue>17</issue>
          ),
          <volume>4802</volume>
          (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Chayes</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>On correlation between variables of constant sum</article-title>
          .
          <source>Journal of Geophysical research</source>
          <volume>65</volume>
          (
          <issue>12</issue>
          ),
          <volume>4185</volume>
          {
          <fpage>4193</fpage>
          (
          <year>1960</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Egozcue</surname>
            ,
            <given-names>J.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pawlowsky-Glahn</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mateu-Figueras</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Barcelo-Vidal</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Isometric logratio transformations for compositional data analysis</article-title>
          .
          <source>Mathematical geology</source>
          <volume>35</volume>
          (
          <issue>3</issue>
          ),
          <volume>279</volume>
          {
          <fpage>300</fpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Filzmoser</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hron</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reimann</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>The bivariate statistical analysis of environmental (compositional) data</article-title>
          .
          <source>Science of The Total Environment</source>
          <volume>408</volume>
          (
          <issue>19</issue>
          ),
          <volume>4230</volume>
          {4238 (Sep
          <year>2010</year>
          ), https://linkinghub.elsevier.com/retrieve/pii/ S0048969710004845
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Harris</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grunsky</surname>
          </string-name>
          , E.:
          <article-title>Predictive lithological mapping of Canada's North using Random Forest classi cation applied to geophysical and geochemical data</article-title>
          .
          <source>Computers &amp; Geosciences</source>
          <volume>80</volume>
          ,
          <issue>9</issue>
          {
          <fpage>25</fpage>
          (Jul
          <year>2015</year>
          ), https://linkinghub.elsevier.com/ retrieve/pii/S0098300415000709
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>He</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ren</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sun</surname>
          </string-name>
          , J.:
          <article-title>Deep residual learning for image recognition</article-title>
          .
          <source>In: Proceedings of the IEEE conference on computer vision and pattern recognition</source>
          . pp.
          <volume>770</volume>
          {
          <issue>778</issue>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Krizhevsky</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sutskever</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hinton</surname>
          </string-name>
          , G.E.:
          <article-title>Imagenet classi cation with deep convolutional neural networks</article-title>
          .
          <source>In: Advances in neural information processing systems</source>
          . pp.
          <volume>1097</volume>
          {
          <issue>1105</issue>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Larsen</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Steen</surname>
            ,
            <given-names>K.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grooters</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Green</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nyholm</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , et al.:
          <article-title>Autonomous mapping of grass-clover ratio based on unmanned aerial vehicles and convolutional neural networks</article-title>
          .
          <source>In: International Conference on Precision Agriculture. International Society of Precision Agriculture</source>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Liu</surname>
            , Y., Cheng,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhou</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xia</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          :
          <article-title>Multivariate analysis for geochemical process identi cation using stream sediment geochemical data: A perspective from compositional data</article-title>
          .
          <source>Geochemical Journal</source>
          <volume>50</volume>
          (
          <issue>4</issue>
          ),
          <volume>293</volume>
          {
          <fpage>314</fpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Mart</surname>
            n-Fernandez,
            <given-names>J.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Barcelo-Vidal</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pawlowsky-Glahn</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Dealing with zeros and missing values in compositional data sets using nonparametric imputation</article-title>
          .
          <source>Mathematical Geology</source>
          <volume>35</volume>
          (
          <issue>3</issue>
          ),
          <volume>253</volume>
          {
          <fpage>278</fpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>Narayanan</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Saadeldin</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Albert</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McGuinness</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mac Namee</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Extracting pasture phenotype and biomass percentages using weakly supervised multi-target deep learning on a small dataset</article-title>
          .
          <source>arXiv preprint arXiv:2101.03198</source>
          (
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>Pan</surname>
            ,
            <given-names>S.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          :
          <article-title>A survey on transfer learning</article-title>
          .
          <source>IEEE Transactions on knowledge and data engineering</source>
          <volume>22</volume>
          (
          <issue>10</issue>
          ),
          <volume>1345</volume>
          {
          <fpage>1359</fpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <surname>Pawlowsky-Glahn</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Egozcue</surname>
            ,
            <given-names>J.J.</given-names>
          </string-name>
          :
          <article-title>Compositional data and their analysis: an introduction</article-title>
          .
          <source>Geological Society</source>
          , London, Special Publications
          <volume>264</volume>
          (
          <issue>1</issue>
          ),
          <volume>1</volume>
          {
          <fpage>10</fpage>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <surname>Pawlowsky-Glahn</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Egozcue</surname>
            ,
            <given-names>J.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tolosana</surname>
            <given-names>Delgado</given-names>
          </string-name>
          , R.:
          <source>Lecture notes on compositional data analysis</source>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <surname>Simonyan</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zisserman</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Very deep convolutional networks for large-scale image recognition</article-title>
          .
          <source>arXiv preprint arXiv:1409.1556</source>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <surname>Sindic</surname>
            ,
            <given-names>C.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Riday</surname>
          </string-name>
          , H.:
          <article-title>Using image object recognition to increase biomass in red clover (trifolium pratense l</article-title>
          .)
          <source>breeding. Crop Science</source>
          <volume>60</volume>
          (
          <issue>4</issue>
          ),
          <volume>1770</volume>
          {
          <fpage>1781</fpage>
          (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <surname>Skovsen</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dyrmann</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eriksen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gislum</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Karstoft</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , J rgensen, R.N.:
          <article-title>Predicting dry matter composition of grass clover leys using data simulation and camera-based segmentation of eld canopies into white clover, red clover, grass and weeds</article-title>
          .
          <source>In: Proceedings of the 14th International Conference on Precision Agriculture</source>
          . Montreal, CA: International Society of Precision Agriculture (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <surname>Skovsen</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dyrmann</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mortensen</surname>
            ,
            <given-names>A.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Laursen</surname>
            ,
            <given-names>M.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gislum</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eriksen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Farkhani</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Karstoft</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jorgensen</surname>
            ,
            <given-names>R.N.:</given-names>
          </string-name>
          <article-title>The grassclover image dataset for semantic and hierarchical species understanding in agriculture</article-title>
          .
          <source>In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops</source>
          . pp.
          <volume>0</volume>
          {
          <issue>0</issue>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <string-name>
            <surname>Sun</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liang</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zuo</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsons</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Morel</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shi</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Luo</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhao</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fang</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , et al.:
          <article-title>Estimation of botanical composition in mixed clover{grass elds using machine learning-based image analysis</article-title>
          .
          <source>Frontiers in Plant Science</source>
          <volume>12</volume>
          ,
          <issue>87</issue>
          (
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <surname>Talebi</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mueller</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tolosana-Delgado</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grunsky</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McKinley</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Caritat</surname>
          </string-name>
          , P.d.:
          <article-title>Sur cial and Deep Earth Material Prediction from Geochemical Compositions</article-title>
          .
          <source>Natural Resources Research</source>
          <volume>28</volume>
          (
          <year>Oct 2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <string-name>
            <surname>Tolosana-Delgado</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>Compositional data analysis in a nutshell</article-title>
          . University of Gottingen on-line
          <string-name>
            <surname>reference</surname>
          </string-name>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <string-name>
            <surname>Tolosana-Delgado</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mueller</surname>
          </string-name>
          , U., van den Boogaart, K.G.:
          <article-title>Geostatistics for compositional data: an overview</article-title>
          .
          <source>Mathematical geosciences</source>
          <volume>51</volume>
          (
          <issue>4</issue>
          ),
          <volume>485</volume>
          {
          <fpage>526</fpage>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          <string-name>
            <surname>Verwaeren</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Mathematical optimization methods for the analysis of compositional data: subset selection, unmixing and prediction</article-title>
          .
          <source>Ph.D. thesis</source>
          , Ghent University (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          <string-name>
            <surname>Zhou</surname>
            ,
            <given-names>Z.H.:</given-names>
          </string-name>
          <article-title>A brief introduction to weakly supervised learning</article-title>
          .
          <source>National Science Review</source>
          <volume>5</volume>
          (
          <issue>1</issue>
          ),
          <volume>44</volume>
          {
          <fpage>53</fpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>