<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Overview of MediaEval 2020 Insights for Wellbeing: Multimodal Personal Health Lifelog Data Analysis</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Peijiang Zhao</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Minh-Son Dao</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ngoc-Thanh Nguyen</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Thanh-Binh Nguyen</string-name>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Duc-Tien Dang-Nguyen</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Cathal Gurrin</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Dublin City University</institution>
          ,
          <country country="IE">Ireland</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>National Institute of Information and Communications Technology</institution>
          ,
          <country country="JP">Japan</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Bergen</institution>
          ,
          <country country="NO">Norway</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>University of Information Technology (VNUHCM-UIT)</institution>
          ,
          <country country="VN">Vietnam</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>University of Science (VNUHCM-US)</institution>
          ,
          <country country="VN">Vietnam</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2020</year>
      </pub-date>
      <fpage>11</fpage>
      <lpage>14</lpage>
      <abstract>
        <p>This paper provides a description of the MediaEval 2020 “Multimodal personal health lifelog data analysis". The purpose of this task is to develop approaches that process the environment data to obtain insights about personal wellbeing. Establishing the association between people's wellbeing and properties of the surrounding environment which is vital for numerous research. Our task focuses on the internal associations of heterogeneous data. Participants create systems that derive insights from multimodal lifelog data that are important for health and wellbeing to tackle two challenging subtasks. The first task is to investigate whether we can use public/open data to predict personal air pollution data. The second task is to develop approaches to predict personal air quality index(AQI) using images captured by people (plus GAQD). This task targets (but is not limited to) researchers in the areas of multimedia information retrieval, machine learning, AI, data science, event-based processing and analysis, multimodal multimedia content analysis, lifelog data analysis, urban computing, environmental science, and atmospheric science.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>
        The association between people’s wellbeing and the properties of
the surrounding environment is an essential area of investigation[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
Although these investigations have a long and rich history[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], they
have focused on the general population. There is a surprising lack
of research investigating the impact of the environment on the scale
of individual people. On a personal scale, local information about
air pollution (e.g., PM2.5, NO2, O3), weather (e.g., temperature,
humidity), urban nature (e.g., greenness, liveliness, quietness), and
personal behavior (e.g., psychophysiological data) play an essential
role. It is not always possible to gather plentiful amounts of such
data as [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. As a result, a key research question remains open: Can
sparse or incomplete data be used to gain insight into wellbeing?
Is there a hypothesis about the associations within the data so that
wellbeing can be understood using a limited amount of data?
Developing hypotheses about the associations within the heterogeneous
data contributes towards building good multimodal models that
make it possible to understand the impact of the environment on
wellbeing at the local and individual scale. Such models as [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]
are necessary since not all cities are fully covered by standard air
pollution and weather stations, and not all people experience the
same reaction to the same environment situation. Moreover,
images captured by the first-person view could give essential cues to
understand that environmental situations in cases in which precise
data from air pollution stations are lacking[
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
      <p>
        Let us imagine the following scenario. Yamamoto-san is using
the Image-2-AQI app to know how harmful air pollution is by
merely feeding captured images to the app. Simultaneously, at the
urban air pollution center, the air pollution map is updated with
Yamamoto-san’s contribution (e.g., images, annotation). Satoh-san,
with some clicks on his smartphone, the environmental-based risk
map application can show him the excellent route from A to B with
less congestion and harmful air pollution as trafic risk prediction
[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Simultaneously, less congestion from A to B is due to fewer
people coincidentally traveling on the same route. Such simple apps
are parts of the human-environment sustainable and co-existing
system that have changed people’s pro-environmental behaviors.
      </p>
      <p>The critical research question here is, “does the personal air
quality be predicted by using other data that is easy to obtain?”
2</p>
    </sec>
    <sec id="sec-2">
      <title>TASK DESCRIPTION</title>
      <p>Task participants create systems that derive insights from
multimodal lifelog data that are important for health and wellbeing. The
ifrst dataset, namely “personal air quality data” (PAQD), includes
air pollution data (PM2.5, O3, and NO2) and lifelog data (e.g.,
physiological data, tags, and images) collected by using sensors boxes,
lifelog cameras, and smartphones along the predefined routes in a
city. The second dataset, namely “global air quality data” (GAQD),
includes weather and air pollution data collected over the city and
provided by the government and crawled from related websites.</p>
      <p>Personal Air Quality Prediction with public/open data.
Participants predict the value of personal air pollution data (PM2.5, O3,
and NO2) using only weather data (wind speed, wind direction,
temperature, humidity) and air pollution data (PM2.5, O3, and NO2)
from public/open data sources (e.g., stations, website). This
subtask’s target is to investigate whether we can use public/open data
to predict personal air pollution data. The personal air pollution
data can be concerned as the regional air pollution data since these
data a locally collected by people who carry personal equipment.
In other words, the ground truth is data collected by sensor boxes
carried by people.</p>
      <p>Personal Air Quality Prediction with lifelog data. Participants
predict the personal Air Quality Index using images captured by
people (plus GAQD). The purpose of this subtask is whether we can
use only lifelog data (i.e., pictures of the surrounding environment,
annotations, and comments), plus some data from open sources
(e.g., weather, air pollution data) to predict the personal air pollution
data.
3</p>
    </sec>
    <sec id="sec-3">
      <title>DATA DESCRIPTION</title>
      <p>
        Personal air quality dataset (PAQD). PAQD were collected from
March to April 2019 along the marathon course of the Tokyo 2020
Olympics and the running course around the Imperial Palace using
wearable sensors [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ](Fig. 1). There were five data collection
participants assigned to five routes to collect the data. Routes 1–4 were
along the marathon course for the Tokyo 2020 Olympics. Route
5 was the running course around the Imperial Palace. The length
of each route was approximately 5 km. Each participant started
data collection at 9 am every weekday, and it took approximately
one hour to walk each route. Collected data contain weather data
(e.g., temperature, humidity), atmospheric data (e.g., O3, PM2.5, and
NO2), GPS data, and lifelog data (e.g., images, annotation).
      </p>
      <p>
        Glocal air pollution dataset (GAPD). GAPD contains the
atmospheric monitoring station data collected by the Atmospheric
Environmental Regional Observation System ( AEROS ) in Japan [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
AEROS contains real-time atmospheric data at every hour for 2032
meteorological monitoring stations across Japan. The atmospheric
data includes eleven types of air pollutant data (SO2, NOx, NO,
NO2, CO, Ox, NMHC, CH4, THC, SPM, and PM2.5), and four types
of meteorological data (wind direction, wind speed, temperature,
and humidity).
4
      </p>
    </sec>
    <sec id="sec-4">
      <title>GROUND TRUTH AND EVALUATION</title>
      <p>The ground truth for the dataset of the two subtasks is collected as
follows:
• For the Personal Air Quality Prediction with public/open
data subtask: some parts of personal (PM2.5, O3, and NO2)
data are deleted and saved as the ground truth.
• For the Personal Air Quality Prediction with lifelog data
subtask: the set of personal AQI are hidden and saved as
the ground truth.</p>
      <p>We use symmetric mean absolute percentage error (SMAPE),
root mean squared error (RMSE), and mean absolute error (MAE)
to do the evaluation. Each evaluation metrics defined as follow:
(1)
(2)
(3)
 =
 =
1 Õ</p>
      <p>| −   |
 =1 ( +   )/2</p>
      <p>( −   )2
 =1
1 Õ
 =1
 =
| −   |
Where  and   mean the real value and the prediction value at
time  , and n is the total number of predicted data.</p>
      <p>For each subtask, the evaluation method is applied as follows:
• For the Personal Air Quality Prediction with public/open
data subtask: We use the SMAPE /RMSE /MAE for
comparing each air pollution factor PM2.5, O3, and NO2 with
the ground truth.
• For the Personal Air Quality Prediction with lifelog data
subtask: We use the SMAPE/RMSE /MAE for comparing
predicted AQI to the ground truth.
5</p>
    </sec>
    <sec id="sec-5">
      <title>DISCUSSION AND OUTLOOK</title>
      <p>Details on the methods and results of each individual participant
team can be found in the working note papers of the MediaEval
2020 workshop proceedings.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <fpage>2020</fpage>
          .
          <article-title>Atmospheric Environmental Regional Observation System</article-title>
          . http: //soramame.taiki.go.jp/.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Zhao P. J Sato T. Zettsu</surname>
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Dang-Nguyen D. T. Gurrin</surname>
            <given-names>C. Nguyen N. T.</given-names>
          </string-name>
          <string-name>
            <surname>Dao</surname>
            ,
            <given-names>M. S.</given-names>
          </string-name>
          <year>2019</year>
          .
          <article-title>Overview of MediaEval 2019: Insights for Wellbeing Task - Multimodal Personal Health Lifelog Data Analysis</article-title>
          .
          <source>In MediaEval Benchmarking Initiative for Multimedia Evaluation.</source>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Salvador</given-names>
            <surname>Ruiz-Correa Darshan</surname>
          </string-name>
          Santani and Daniel Gatica-Perez.
          <year>2018</year>
          .
          <article-title>Looking South: Learning Urban Perception in Developing Cities</article-title>
          .
          <source>Trans. Soc. Comput 13</source>
          ,
          <issue>1</issue>
          (
          <year>December 2018</year>
          ),
          <fpage>291</fpage>
          -
          <lpage>301</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M. Dao N.</given-names>
            <surname>Nguyen</surname>
          </string-name>
          and
          <string-name>
            <given-names>K.</given-names>
            <surname>Zettsu</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Complex Event Analysis for Trafic Risk Prediction based on 3D-CNN with Multi-sources Urban Sensing Data</article-title>
          .
          <source>In 2019 IEEE International Conference on Big Data</source>
          .
          <fpage>1669</fpage>
          -
          <lpage>1674</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>M. Dao P.</given-names>
            <surname>Vo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Phan</surname>
          </string-name>
          and
          <string-name>
            <given-names>K.</given-names>
            <surname>Zettsu</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Association Model between Visual Feature and AQI Rank Using Lifelog Data</article-title>
          .
          <source>In 2019 IEEE International Conference on Big Data</source>
          .
          <fpage>4197</fpage>
          -
          <lpage>4200</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>T</given-names>
            <surname>Sato</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.S</given-names>
            <surname>Dao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K</given-names>
            <surname>Kuribayashi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>K.</given-names>
            <surname>Zettsu</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>SEPHLA: Challenges and Opportunities within Environment - Personal Health Archives</article-title>
          .
          <source>In International Conference on Multimedia Modeling</source>
          .
          <fpage>325</fpage>
          -
          <lpage>337</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Sajedi H. Ramezani F. Vahdatpour</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <year>2018</year>
          .
          <article-title>Air pollution forecasting from sky images with shallow and deep classifiers</article-title>
          .
          <source>Earth Sci Inform</source>
          <volume>11</volume>
          (
          <year>2018</year>
          ),
          <fpage>413</fpage>
          -
          <lpage>422</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>P. J</given-names>
            <surname>Zhao</surname>
          </string-name>
          and
          <string-name>
            <given-names>K</given-names>
            <surname>Zettsu</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Decoder Transfer Learning for Predicting Personal Exposure to Air Pollution</article-title>
          .
          <source>In 2019 IEEE International Conference on Big Data</source>
          .
          <fpage>5620</fpage>
          -
          <lpage>5629</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>