<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Understanding Media Memorability From Event-Related Potential Records And Visual Semantics</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ricardo Kleinlein</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Enrique R. Sebastián</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fernando Fernández-Martínez</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Grupo de Tecnología del Habla y Aprendizaje Automático, Information Processing and Telecommunications Center, E.T.S.I. de Telecomunicación, Universidad Politécnica de Madrid 28040</institution>
          <addr-line>Madrid</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Instituto Cajal, CSIC</institution>
          ,
          <addr-line>Madrid</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2010</year>
      </pub-date>
      <abstract>
        <p>The memorability of a video has been de ned in the literature as an intrinsic property of its visual features, expressed as the proportion of an audience that successfully remembers having watched that video on a subsequent viewing. Hence our brains must cope not only with information about pixel statistics and scene semantics, but also to encode whether it is worth keeping information about them in memory for future retrieval. These are the hypothesis behind the 5th edition of the Predicting Media Memorability challenge, which we tackle from a two-fold perspective: rst we pursue a semantics-based approach, using both pre-trained and ne-tuned visual and textual Transformers; on the other hand, we process Event-Related Potential (ERP) data according to two feature extraction methods to obtain a representation compatible with cross-subject predictive models of media memorability, namely: (1) to extract sample-level functionals and feed them as input features to a random forest classi er, and (2) to compute coherence maps between sensor recordings at four frequency bands, training a shallow neural classi er from them. Ultimately, we seek to further comprehend why, whereas some of our visual models display performances that rival that of the current state-of-the-art predictive systems, ERP-based approaches pose a far more complex challenge.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        A detailed scienti c modelling of the factors by which some visual memories remain attached
to us for a long time while others fade shortly after has eluded a mathematical formulation for
decades. Recent studies point to the possibility that all the visual information that reaches our
eyes carry along a measure that would account for its likelihood to be remembered in subsequent
viewings, i.e., its intrinsic memorability [
        <xref ref-type="bibr" rid="ref1 ref2 ref3">1, 2, 3</xref>
        ]. With the rise of social media, an automatic
system able to classify a video on these terms is of the utmost interest, both from a commercial
and a scienti c perspective. In this paper, we report on our experience during the5th edition of
the Predicting Media Memorability Challenge [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. The availability of Electroencephalography
(EEG) data enables us not only to study the link between visual features and memorability but
also to explore possible mechanisms by which human brain stores that information, building
predictive models of media memorability accordingly.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        Although studies on the issue date back to R.N. Shepard (1967) and Standing (1973) [
        <xref ref-type="bibr" rid="ref5 ref6">5, 6</xref>
        ], it has
not been until the work of Isola et al.[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] that researchers began to think of media memorability as
a deterministic function of fundamental visual properties (such as image colour or its brightness)
and/or the high-level semantic features of a multimedia clip [
        <xref ref-type="bibr" rid="ref7 ref8 ref9">7, 8, 9</xref>
        ]. We use Transfomers,
highly successful in an array of di erent tasks [
        <xref ref-type="bibr" rid="ref10 ref11">10, 11</xref>
        ], either as visual and textual feature
extractors or ne-tuning them as predictive models of media memorability (Section3.1).
      </p>
      <p>
        EEG data open the path for further understanding of the mechanisms underpinning the
encoding of media memorability by the human brain. Much of the di culty lies in the
entanglement between di erent brain regions operating simultaneously along the process [
        <xref ref-type="bibr" rid="ref12 ref13 ref14">12, 13, 14</xref>
        ].
However, coherence between di erent brain areas (a measure of the strength of the coupling
between the signal recorded by two sensors at speci c frequency bands) has been found to
relate to memory impairment in Alzheimer’s disease [
        <xref ref-type="bibr" rid="ref15 ref16">15, 16</xref>
        ] and other dementia-related health
disorders [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. Furthermore, techniques based on similar functional connectivity between EEG
channels has been demonstrated to correlate with long-term semantic memory [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. Therefore
in Section 3.2 we propose two alternative preprocessing methods for ERP data, both outlined in
Figure 1.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Experimental setup and results</title>
      <p>
        A detailed description of both the requirements and the data resources available for each subtask
can be consulted at [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. During the experimental phase we placed a special emphasis not only
on accurately predicting memorability but also on explaining the decisions made by our models.
      </p>
      <p>System description
Statistical Functionals 0.529
Delta channel only 0.490
Beta channel only 0.514
Late-fusion of all channels (Median) 0.534
Late-fusion of all channels (Max.) 0.529</p>
      <p>Val.*</p>
      <sec id="sec-3-1">
        <title>3.1. Subtask 1: Predicting memorability rates from visual features</title>
        <p>
          Our fundamental hypothesis, supported by previous experiences [
          <xref ref-type="bibr" rid="ref19 ref9">9, 19</xref>
          ], is that video-level
semantic features are robust indicators of video memorability, given the strong correlation
found between certain topics and the average memorability rates of videos depicting them.
Here we elaborate on this idea: either keeping a frame-wise (extracted at 1FPS) pre-trained
CLIP Visual Transformer (ViT) as a feature extractor upon which a linear regressor is trained
on the task of media memorability (run #4), or ne-tuning a ViT and its textual counterpart on
Memento10K data [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] (run #1, run #2). We also investigate the degree to which both modalities
can help each other in making a prediction, and hence the output of the run #3 is the average
between the prediction made by run #1 and run #2, while run #5 is the analogous for run #2
and run #4. In all cases, ne-tuning is performed optimizing the mean square loss between
predicted labels and the ground-truth memorability scores for 10 epochs. Prediction rates at
both validation and testing are shown in Table 1.
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Subtask 3: Memorability classification from ERP</title>
        <p>
          We propose two di erent processing pipelines, illustrated in Figure1, aimed both at computing
useful numerical representations for the nal task of predicting whether a video will be
remembered, irrespective of the subject data comes from. This is an inherently complex scenario,
since two subjects can respond very di erently to the same video. Validation and testing
classication Area Under the Curve (AUC) rates are shown in Table2. Our rst approach consists
on concatenating statistical functionals - mean value, standard deviation, median, maximum
and minimum values, kurtosis index and the rst three quartiles of a sample - to describe
each trial (subject-video pair). As predictive algorithm, we train a random forest model. For
our second approach, for each subject and video we compute the coherency between each
ERP channel pairwise. We used the function “coherencyc” from Matlab’s® third party toolbox
Chronux1 to compute the mean coherency value for di erent power bands: delta (0.5-4Hz),
theta (4-8Hz), alpha (8-14Hz) and beta (14-30Hz). This yields a 28x28x4 matrix of coherencies
between channels in speci c spectral bands. These values, once arranged as a single vector
embedding, conform to the input features for a shallow neural network whose hidden layer
has 256 neurons, with a ReLU activation function [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ] and Adam optimizer [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ] and adaptive
learning rate.
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Discussion and outlook</title>
      <p>
        Interestingly enough, a ne-tuned ViT performs worse than a simpler linear regressor trained
from the features obtained by a pretrained version of the full model, even though the same
does not seem to happen in the case of text. Computing explanations using a custom version of
LIME [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ], a popular post-hoc local surrogate method [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ], we notice that while the text-based
model bases its predictions on concepts that we know correlate well with memorability [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], our
ne-tuned ViT (run #1) might be generalising worse due to over tting (Fig2.). As illustrated in
Figure 3, it is hard to notice a clear pattern of neural activity amidst the subjects when using
ERP data to predict memorability. Di erent people show high memorability rates (subjects 4
and 9), yet the rest fail about 80% of the time, hence leaving an extremely unbalanced dataset
that adds up to the overall complexity of the task. As a future line of research, we believe it
would be particularly interesting to explore multimodal EEG-visual-textual models, in order to
further develop scienti c knowledge on what information from a video clip is actually leaving
a lasting footprint on our brains.
      </p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgements</title>
      <p>Our work has been supported by the Spanish Ministry of Science and Innovation:
projects GOMINOLA (PID2020-118112RB-C21, PID2020-118112RB-C22, funded by
MCIN/AEI/10.13039/501100011033), AMIC-PoC (PDC2021-120846-C42, funded by
MCIN/AEI/10.13039/501100011033 and by the European Union “NextGenerationEU/PRTR”),
and the Spanish Ministry of Education (FPI grant PRE2018-083225).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>P.</given-names>
            <surname>Isola</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Parikh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Torralba</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Oliva</surname>
          </string-name>
          ,
          <article-title>Understanding the intrinsic memorability of images</article-title>
          ,
          <source>in: Proceedings of the 24th International Conference on Neural Information Processing Systems (NIPS'11)</source>
          , Curran Associates Inc.,
          <string-name>
            <surname>Red</surname>
            <given-names>Hook</given-names>
          </string-name>
          ,
          <string-name>
            <surname>NY</surname>
          </string-name>
          , USA,
          <year>2011</year>
          , p.
          <fpage>2429</fpage>
          -
          <lpage>2437</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>P.</given-names>
            <surname>Isola</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Xiao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Torralba</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Oliva</surname>
          </string-name>
          ,
          <article-title>What makes an image memorable?</article-title>
          ,
          <source>in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR)</source>
          ,
          <year>2011</year>
          , pp.
          <fpage>145</fpage>
          -
          <lpage>152</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>P.</given-names>
            <surname>Isola</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Xiao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Parikh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Torralba</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Oliva</surname>
          </string-name>
          ,
          <article-title>What makes a photograph memorable?</article-title>
          ,
          <source>IEEE Transactions on Pattern Analysis and Machine Intelligence</source>
          <volume>36</volume>
          (
          <year>2014</year>
          )
          <fpage>1469</fpage>
          -
          <lpage>1482</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>L.</given-names>
            <surname>Sweeney</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. G.</given-names>
            <surname>Constantin</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.-H. Demarty</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Fosco</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>García Seco de Herrera</surname>
            , S. Halder,
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Healy</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Ionescu</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Matran-Fernandez</surname>
            ,
            <given-names>A. F.</given-names>
          </string-name>
          <string-name>
            <surname>Smeaton</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <article-title>Sultana, Overview of the MediaEval 2022 predicting video memorability task</article-title>
          , in: MediaEval Multimedia Benchmark Workshop Working Notes,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>R. N.</given-names>
            <surname>Shepard</surname>
          </string-name>
          ,
          <article-title>Recognition memory for words, sentences, and pictures</article-title>
          ,
          <source>Journal of Verbal Learning and Verbal Behavior</source>
          <volume>6</volume>
          (
          <year>1967</year>
          )
          <fpage>156</fpage>
          -
          <lpage>163</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>L.</given-names>
            <surname>Standing</surname>
          </string-name>
          , Learning 10000 pictures,
          <source>Quarterly Journal of Experimental Psychology</source>
          <volume>25</volume>
          (
          <year>1973</year>
          )
          <fpage>207</fpage>
          -
          <lpage>222</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Bylinskii</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Goetschalckx</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Newman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Oliva</surname>
          </string-name>
          ,
          <string-name>
            <surname>Memorability:</surname>
          </string-name>
          <article-title>An image-computable measure of information utility</article-title>
          ,
          <year>2021</year>
          . arXiv:
          <volume>2104</volume>
          .
          <fpage>00805</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>A.</given-names>
            <surname>Newman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Fosco</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Casser</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>McNamara</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Oliva</surname>
          </string-name>
          ,
          <article-title>Multimodal memorability: Modeling e ects of semantics and decay on video memorability</article-title>
          ,
          <year>2020</year>
          .arXiv:
          <year>2009</year>
          .02568.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>R.</given-names>
            <surname>Kleinlein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Luna-Jiménez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Arias-Cuadrado</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ferreiros</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Fernández-Martínez</surname>
          </string-name>
          ,
          <article-title>Topicoriented text features can match visual deep models of video memorability</article-title>
          ,
          <source>Applied Sciences</source>
          <volume>11</volume>
          (
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>A.</given-names>
            <surname>Vaswani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Shazeer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Parmar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Uszkoreit</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Jones</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. N.</given-names>
            <surname>Gomez</surname>
          </string-name>
          , L. u. Kaiser,
          <string-name>
            <surname>I. Polosukhin</surname>
          </string-name>
          ,
          <article-title>Attention is all you need</article-title>
          , in: I. Guyon,
          <string-name>
            <given-names>U. V.</given-names>
            <surname>Luxburg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Bengio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wallach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Fergus</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Vishwanathan</surname>
          </string-name>
          , R. Garnett (Eds.),
          <source>Advances in Neural Information Processing Systems</source>
          , volume
          <volume>30</volume>
          ,
          <string-name>
            <surname>Curran</surname>
            <given-names>Associates</given-names>
          </string-name>
          , Inc.,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>A.</given-names>
            <surname>Radford</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. W.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Hallacy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ramesh</surname>
          </string-name>
          , G. Goh,
          <string-name>
            <given-names>S.</given-names>
            <surname>Agarwal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Sastry</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Askell</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Mishkin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Clark</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Krueger</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Sutskever</surname>
          </string-name>
          ,
          <article-title>Learning transferable visual models from natural language supervision</article-title>
          ,
          <source>in: International Conference on Machine Learning (ICML)</source>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>J.</given-names>
            <surname>Han</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Shao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Hu</surname>
          </string-name>
          , J. Han,
          <string-name>
            <surname>T</surname>
          </string-name>
          . Liu,
          <article-title>Learning computational models of video memorability from fmri brain imaging</article-title>
          ,
          <source>IEEE Transactions on Cybernetics</source>
          <volume>45</volume>
          (
          <year>2015</year>
          )
          <fpage>1692</fpage>
          -
          <lpage>1703</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>R. F.</given-names>
            <surname>Thompson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. J.</given-names>
            <surname>Kim</surname>
          </string-name>
          ,
          <article-title>Memory systems in the brain and localization of a memory</article-title>
          ,
          <source>Proceedings of the National Academy of Sciences</source>
          <volume>93</volume>
          (
          <year>1996</year>
          )
          <fpage>13438</fpage>
          -
          <lpage>13444</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>A.</given-names>
            <surname>Jaegle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Mehrpour</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Mohsenzadeh</surname>
          </string-name>
          , T. Meyer, A. Oliva,
          <string-name>
            <given-names>N.</given-names>
            <surname>Rust</surname>
          </string-name>
          ,
          <article-title>Population response magnitude variation in inferotemporal cortex predicts image memorability</article-title>
          ,
          <source>eLife</source>
          <volume>8</volume>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>G.</given-names>
            <surname>Adler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Brassen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Jajcevic</surname>
          </string-name>
          ,
          <article-title>Eeg coherence in alzheimer's dementia</article-title>
          ,
          <source>Journal of Neural Transmission</source>
          <volume>110</volume>
          (
          <year>2003</year>
          )
          <fpage>1051</fpage>
          -
          <lpage>1058</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>M. J. Hogan</surname>
            , G. Swanwick,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Kaiser</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Rowan</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Lawlor</surname>
          </string-name>
          ,
          <article-title>Memory-related eeg power and coherence reductions in mild alzheimer's disease</article-title>
          ,
          <source>International Journal of Psychophysiology</source>
          <volume>49</volume>
          (
          <year>2003</year>
          )
          <fpage>147</fpage>
          -
          <lpage>163</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>D.</given-names>
            <surname>Laptinskaya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Fissler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O. C.</given-names>
            <surname>Küster</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wischniowski</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Thurm</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Elbert</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. A. F.</given-names>
            von
            <surname>Arnim</surname>
          </string-name>
          , I.-T. Kolassa,
          <article-title>Global eeg coherence as a marker for cognition in older adults at risk for dementia</article-title>
          ,
          <source>Psychophysiology</source>
          <volume>57</volume>
          (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>S.</given-names>
            <surname>Hanouneh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. U.</given-names>
            <surname>Amin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. M.</given-names>
            <surname>Saad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. S.</given-names>
            <surname>Malik</surname>
          </string-name>
          ,
          <article-title>Eeg power and functional connectivity correlates with semantic long-term memory retrieval</article-title>
          ,
          <source>IEEE Access 6</source>
          (
          <year>2018</year>
          )
          <fpage>8695</fpage>
          -
          <lpage>8703</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>R.</given-names>
            <surname>Kleinlein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Luna-Jiménez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Fernández-Martínez</surname>
          </string-name>
          ,
          <article-title>Thau-upm at mediaeval 2021: From video semantics to memorability using pretrained transformers</article-title>
          ,
          <source>in: MediaEval'21 Online</source>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>A. F.</given-names>
            <surname>Agarap</surname>
          </string-name>
          ,
          <article-title>Deep learning using recti ed linear units (relu</article-title>
          ), ArXiv abs/
          <year>1803</year>
          .08375 (
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>D. P.</given-names>
            <surname>Kingma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ba</surname>
          </string-name>
          ,
          <article-title>Adam: A method for stochastic optimization</article-title>
          ,
          <source>CoRR abs/1412</source>
          .6980 (
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>R.</given-names>
            <surname>Kleinlein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hepburn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Santos-Rodríguez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Fernández-Martínez</surname>
          </string-name>
          ,
          <article-title>Sampling based on natural image statistics improve local surrogate explainers</article-title>
          ,
          <source>in: The 33rd British Machine Vision Conference</source>
          ,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>M. T.</given-names>
            <surname>Ribeiro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Guestrin</surname>
          </string-name>
          ,
          <article-title>"why should i trust you?": Explaining the predictions of any classi er</article-title>
          ,
          <source>in: Proceedings of the 22nd ACM International Conference on Knowledge Discovery and</source>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>