<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Steven Hicks</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pål Halvorsen</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Trine B. Haugen</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jorunn M. Andersen</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Oliwia Witczak</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Konstantin Pogorelov</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hugo L. Hammer</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Duc-Tien Dang-Nguyen</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mathias Lux</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Michael Riegler</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Alpen-Adria-Universität Klagenfurt</institution>
          ,
          <country country="AT">Austria</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Oslo Metropolitan University</institution>
          ,
          <country country="NO">Norway</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>SimulaMet</institution>
          ,
          <country country="NO">Norway</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>University of Bergen</institution>
          ,
          <country country="NO">Norway</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <fpage>27</fpage>
      <lpage>29</lpage>
      <abstract>
        <p>The Medico: Multimedia for Medicine Task is running for the third time as part of MediaEval 2019. This year, we have changed the task from anomaly detection in images of the gastrointestinal tract to focus on the automatic prediction of human semen quality based on videos. The purpose of this task is to aid in the assessment of male reproductive health by providing a quick and consisted method of analyzing human semen. In this paper, we describe the task in detail, give a brief description of the provided dataset, and discuss the evaluation process and the metrics used to rank the submissions of the participants.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>
        The 2019 Medico: Multimedia for Medicine Task tackles the
challenge of predicting certain quality measurements of sperm using
a multimodal dataset consisting of microscopic video recordings
of human semen, associated sensor data, and participant-related
data. Male infertility accounts for approximately 60% of cases
involving couples having problems conceiving a child [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The first
step to evaluating male fertility is often through the assessment
of semen, where a clinician measures the number of living sperm
and looks for any abnormalities that may be present in the
spermatozoa (living sperm). This process is done through a microscope,
where the clinician has to count each visible sperm manually to
calculate specific metrics related to the movement and shape of the
spermatozoa. Having a tool that could aid in the detection of sperm
and calculating these metrics would not only reduce the time spent
per sample but also lessen the large inter- intra-observer variability
between and within clinics [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>For the 2019 edition of Medico, we present three diferent tasks.
The first two tasks aim to predict key indicators of good semen
quality, specifically the motility and morphology of sperm.
Motility and morphology are two of the most common metrics used to
evaluate spermatozoa, and to predict these measurements gives the
participants a good opportunity to utilize both the video and sensor
data available in the dataset. The third task relates to looking at
individual sperm to figure out which one moves the fastest. For
all tasks, we require participants to measure and report the data
processing performance in terms of time spent on each frame
being analyzed. As we aim for real-time applications, the processing
speed is an important factor, especially for the individual
spermatozoon selection task during the in vitro fertilization procedure
(a procedure where a sperm is injected into an egg outside of the
body).
2</p>
    </sec>
    <sec id="sec-2">
      <title>DATASET DETAILS</title>
      <p>
        The VISEM dataset [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] contains more than 35 gigabytes of videos
and related data from 85 male participants aged 18 years or older.
For each participant, we include a set of measurements collected
from an analyzed semen sample. This includes a video of the live
spermatozoa, a sperm fatty acid profile, the fatty acid composition
of serum phospholipids, some study participant-related data and
World Health Organization (WHO) analysis data [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. In total, the
dataset consists of six CSV-files, five for data and one which maps
video IDs to study participants IDs. The duration of each video
varies between two to seven minutes and runs at approximately 50
frames-per-second. The resolution for each video is 640 × 480 and is
stored in a ".avi" file. The name of each video file contains the videos
ID, the date it was recorded, a small optional description, and ends
with the code of the person who assessed the video. For the purpose
of this task, we have split the videos within the video directory into
three separate "folds". This was done in order to make it easier for
participates to perform the required three-fold cross-validation.
      </p>
      <p>For the analysis data, sensor data, and participant-related data,
the following CSV-files are provided:
• semen_analysis_data: The results of standard semen
analysis.
• fatty_acids_spermatozoa: The levels of several fatty acids
in the spermatozoa of the participants.
• fatty_acids_serum: The serum levels of the fatty acids of the
phospholipids (measured from the blood of the participant).
• sex_hormones: The serum levels of sex hormones measured
in the blood of the participants.
• study_participant_related_data: General information about
the participants such as age, abstinence time and Body
Mass Index (BMI).
• videos: Overview of which video-file belongs to what
participant.</p>
      <p>VISEM is publicly available 1 without any restriction. Each study
participant agreed to donate their data for research and provided
the necessary consent to distribute it. It is also important to point
out that all data is fully anonymized and follows the state of the art
with respect to the privacy of medical information.
3</p>
    </sec>
    <sec id="sec-3">
      <title>EVALUATION AND TASK DESCRIPTIONS</title>
      <p>Medico 2019 presents three diferent tasks, each meant to target a
specific use-case within male fertility assessment. The three tasks
for this year’s Medico is the prediction of morphology task, the
prediction of motility task, and the unsupervised sperm tracking
task. Of these three tasks, the first two are mandatory in order to
participate in the challenge. The third task is optional, but highly
recommended, as it is an important problem within assisted
reproductive technology. For the performance evaluation, the same
dataset will be used for development and testing, but for testing,
we ask the participants to perform three-fold cross-validation. For
the processing speed evaluation, we will use minimum, average
and maximum frame processing times in seconds measured by the
participants as a time interval from the moment when an image
has been completely loaded into memory to the moment of the
ifnal decision has been made by the corresponding task analysis
algorithm. In the following, we will give a more detailed description
of each task and discuss how each will be evaluated and ranked.</p>
    </sec>
    <sec id="sec-4">
      <title>3.1 Prediction of Motility Task</title>
      <p>Motility is the ability of an organism to move independently. In
semen analysis, this is commonly split into three distinct categories,
namely progressive, non-progressive, and immotile spermatozoa. A
progressive spermatozoon is one that is able to move forward at a
slow or fast pace, a non-progressive spermatozoon is a sperm that
moves without forward progression, and immotile sperm do not
move at all. The purpose of this task is to automatically find the
percentage of progressive, non-progressive, and immotile
spermatozoa for a given semen sample. We urge participants to perform
multi-frame analysis over single-frame analysis. This is important
due to the fact that single-frame analysis loses most of the
temporal information needed to accurately assess the movement of the
spermatozoa.</p>
      <p>In order to participate in this task, task participants will submit a
".csv" file containing one prediction for each of the 85 study
participants. Each line should be made up of five comma-separated values;
ID of the study participant, percentage of progressive sperm,
percentage of non-progressive sperm, percentage of immotile sperm,
average frame processing time. To evaluate this task we will use
root mean squared error and mean absolute error to evaluate the
submissions. Submissions will be ranked based on the achieved
mean absolute error as it shows the improvement of the automatic
prediction compared to that of the manual assessment.</p>
    </sec>
    <sec id="sec-5">
      <title>3.2 Prediction of Morphology Task</title>
      <p>Evaluating the morphology of sperm involves looking for any
abnormalities that may be present in the three parts that make up
a spermatozoon, namely the head, midpiece, and tail. This task
should predict the percentage of sperm with head defects, midpiece
defects, and tail defects for a single semen sample.</p>
      <p>As with the prediction of motility task, participants have to submit
a ".csv" file containing one prediction of morphology assessment for
each of the 85 study participants. Each line should be made up of the
following five comma-separated values; ID of the study participant,
percentage of sperm with tail defects, percentage of sperm with
midpiece defects, percentage of sperm with head defects, average
frame processing time. Similar to the Prediction of motility task, we
will use mean squared error and mean absolute error to evaluate
the submissions. Submissions will be ranked based on the achieved
mean absolute error.</p>
      <p>Hicks et al.</p>
    </sec>
    <sec id="sec-6">
      <title>3.3 Unsupervised Sperm Tracking Task</title>
      <p>This task is about finding the spermatozoon that moves fastest
compared to all others, and requires that task participants track
individual spermatozoon in order to evaluate the speed at a given
point in time. Within this task, we defined the "fastest"
spermatozoon in two diferent ways:
(1) Fastest average speed: The one that moves the longest
distance during a video defined by the total distance divided
by the length of the video.
(2) Highest top speed: the one that has the highest intermediate
speed.</p>
      <p>One specific challenge of this third task is that the video also
changes due to the sample being moved while under the
microscope to observe the sample in its entirety. We suggest that the
participants track each sperm per viewpoint rather than per video.
To evaluate this task, we will use manual evaluation with the help
from three diferent experts within human reproduction.</p>
    </sec>
    <sec id="sec-7">
      <title>4 DISCUSSION AND OUTLOOK</title>
      <p>We believe the area of automatic semen analysis is an important,
yet overlooked, area of research which has the potential of being
very beneficial for those working to improve reproductive health.
From a computer science point of view, the task is compelling as
it features more than just simple image analysis as it requires the
participants to analyze the temporal information present from one
frame to another in order to make quality predictions. We hope
that this task will encourage the multimedia community to aid
in the development of computer-assisted reproductive health, and
discover new and clever ways of analyzing multimodal datasets.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Sandro</surname>
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Esteves</surname>
            , Ricardo Miyaoka, and
            <given-names>Ashok</given-names>
          </string-name>
          <string-name>
            <surname>Agarwal</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>An update on the clinical assessment of the infertile male</article-title>
          .
          <source>Clinics</source>
          <volume>66</volume>
          (00
          <year>2011</year>
          ),
          <fpage>691</fpage>
          -
          <lpage>700</lpage>
          . https://doi.org/10.1590/S1807-59322011000400026
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Trine</surname>
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Haugen</surname>
          </string-name>
          , Steven A.
          <string-name>
            <surname>Hicks</surname>
          </string-name>
          ,
          <string-name>
            <surname>Jorunn M. Andersen</surname>
            , Oliwia Witczak, Hugo L. Hammer, Rune Borgli, Pål Halvorsen, and
            <given-names>Michael A.</given-names>
          </string-name>
          <string-name>
            <surname>Riegler</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>VISEM: A Multimodal Video Dataset of Human Spermatozoa</article-title>
          .
          <source>In Proceedings of the 10th ACM on Multimedia Systems Conference (MMSys'19)</source>
          . ACM, New York, NY, USA. https://doi.org/10.1145/3304109.3325814
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>RW</given-names>
            <surname>Manderscheid</surname>
          </string-name>
          , CD Ryf, EJ Freeman,
          <string-name>
            <given-names>L</given-names>
            <surname>McKnight-Eily</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S</given-names>
            <surname>Dhingra</surname>
          </string-name>
          , and
          <string-name>
            <given-names>TW</given-names>
            <surname>Strine</surname>
          </string-name>
          .
          <year>2009</year>
          .
          <article-title>Evolving definitions of mental illness and wellness</article-title>
          .
          <source>Preventing chronic disease 7</source>
          , 1 (dec
          <year>2009</year>
          ),
          <fpage>A19</fpage>
          -
          <lpage>A19</lpage>
          . https://www.ncbi.nlm.nih.gov/pubmed/ 20040234https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2811514/
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>World</given-names>
            <surname>Health</surname>
          </string-name>
          Organization et al.
          <year>2010</year>
          .
          <article-title>WHO laboratory manual for the examination and processing of human semen</article-title>
          . (
          <year>2010</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>