<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>TravelGANs. A design approach to style transfer</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Azzurra Pini</string-name>
          <email>azzurra.pini@fjordnet.com</email>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Dublin</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ireland</string-name>
        </contrib>
      </contrib-group>
      <abstract>
        <p>1 Generative AI has been used and researched mostly for artistic purposes and with a technology driven approach. We present an application of style transfer with GANs to design a service for the travel industry. Our solution is intended to support a mindset shift in the travel booking process, which is claiming for a more visual based search. TravelGANs is designed for sourcing local photos from the web and seasonalize them on the fly based on users' preferred travel dates.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>Generative AI has been around for quite a while to support human
in specific tasks, mainly for data augmentation, to fill gaps of data
and information in various contexts. Although the military has been
a primary field of application for most AI techniques, the research
has then evolved mostly in academia for computer vision research
and applied to artistic projects.</p>
      <p>
        Our project has been focusing in particular on the application of
generative models, such as Generative Adversarial Networks, a class
of machine learning systems invented in 2014 [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], that is now one of
the most popular techniques used to learn patterns from data and
reproduce its characteristics into new visual samples. Since then the
technique has been used mostly to generate realistic photographs,
text to image, video simulation, face ageing, image blending, super
resolution and similar applications. At the technical level the
capabilities of GANs, VAE and generative models in general have
improved substantially over the years and we are now witnessing an
impressive level of accuracy in reproducing realistic content.
Nevertheless, only a few examples of real-world applications can be
found, mostly for the fashion industry [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ][
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], for generating
realistic models and apply different styles to clothing items. Some
more rare ones are registered in the science domain [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] and
engineering such as for autonomous driving, to simulate the possible
different environmental context of roads in terms of lighting and
weather. [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]
      </p>
      <p>As opposed to the majority of the current research, instead of
taking a technology driven approach, we tackled generative AI from
a design perspective, looking for different ways where generative
models can support a user experience or a mindset change.</p>
      <p>TravelGANs tackles a behavioural change in the travel booking
process which we claim to be more and more relying on visual
features. The prototype aims at providing an AI powered visual
booking service, capable of quickly sourcing local photos and
seasonalise them on the fly in response to visitors’ preferred travel
period.</p>
      <p>
        In this paper we will introduce the design concept, some
experiments with the CycleGAN [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] style transfer and its
email:
implementation, the user interface and finally we will propose some
directions for future research.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>A-EYE SEASON TRAVEL</title>
      <p>
        The project started observing a mindset change in travel booking
towards a more visual experience, while the formats around
communicating destinations have stayed static. Some recent
initiatives by a few travel companies are have already acknowledged
this cultural and behavioural shift, and started to move in that
direction, providing for example users with the possibility to book
travels through Instagram. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ][
        <xref ref-type="bibr" rid="ref4">4</xref>
        ][
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]
      </p>
      <p>With this idea in mind, the concept originated in particular from
the idea of gathering travel inspiration from online pictures, and
specifically to answer the question on ‘how does (place) look like in
(season)?’. Even the most visual platforms, more or less specifically
dedicated to travel, do not in fact provide alternative photos of a
place in different conditions – a functionality that we claim could be
extremely relevant for travel planning, particularly for certain
destinations with high seasonal variability.</p>
      <p>From a technology perspective several options have been
explored in order to tackle this challenge, ranging from computer
vision techniques to more complex machine learning models. While
classic computer vision techniques would support the analysis and
editing of existing pictures, they are not accurate enough for
transforming images without paired examples provided.</p>
      <p>
        An interesting opportunity for experimentation has been
identified in Generative Adversarial Networks (GANs), in particular
we have been exploring the applications in relation to style transfer,
an increasingly popular sub-set of algorithms originally started for
artistic purposes [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] and now used to translate different types
pictures into different styles, e.g. converting a photograph into the
style of a famous painter or a horse into a zebra. More specifically,
recent research has provided examples of application of GANs style
transfer for translating photos of locations from winter to summer
and vice versa. [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] Within this framework more recent research has
also been made on transferring an image from night to day [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] and
into different lighting or weather conditions. [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]
      </p>
      <p>
        The CycleGAN style transfer [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] is an approach for unpaired
Image-to-Image translation developed at UC Berkley to learning to
translate an image from a source domain to a target domain in the
absence of paired examples. Among the different examples of
transfer presented in the paper – the most popular being the artistic
applications – we explored a later implementation into a season
transfer for transforming pictures from winter to summer and vice
versa. The current model is trained on 854 winter photos and 1273
summer photos of Yosemite downloaded from Flickr.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>TravelGANs</title>
      <p>With our prototype the challenge was to design a service to support
a visual exploration of holiday destinations and for this purpose to
exploit the potential of generative network. The service has been
conceived mainly to support the travel exploration phase and it’s
meant to complement a regular booking platform (such as
Booking.com). TravelGANs is meant to provide inspiration for
holiday booking, in situations where there are less constraints in
terms of destination and period with respect to other travel use cases.
Specifically designed for ad-hoc holiday bookers, the service is
meant to support an experience where users can find holiday
destinations based on visual preferences rather than time and budget.
The core concept is for them to understand how a certain destination
will look like at the moment of the trip. The service is also tackling
the use case of a B&amp;B or small hotel owner as well as travel agents
assembling packages. In this second case TravelGANs can help
them rapidly assemble, customize and seasonalize their listings to
increase the user engagement.</p>
      <p>
        In order to test the concept we developed a web app prototype,
using the current season transfer model of CycleGAN [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. The
design and development were performed in a rapid prototyping
environment by a small team and delivered in a 6 weeks period.
      </p>
    </sec>
    <sec id="sec-4">
      <title>Building a travel DB 3.1</title>
      <p>From a software perspective we were aiming at the following
highlevel pipeline:
1. Selecting a set of popular travel destinations organized by type:
seaside, mountain, cities, nature;
Building a static dataset for those destinations in different
seasons, 3. Selecting winter and summer alternatives;
Run the images through the pre-trained model for season
transfer;</p>
      <sec id="sec-4-1">
        <title>Feed back the transformed images to the data base.</title>
        <p>Our initial database has been populated by a web app we developed
to access the FlickrAPIs, which allowed us to automate the
collection of images, and in particular:
• Select a location
• Select a season
• Toggle for hemisphere (to support the seasons inversion)
• Filter licences for usage rights
• Pick preferred pictures and download a .zip file
• Feed the downloaded pictures to the CycleGAN season
transfer model.</p>
        <p>The web app is initially intended to support the development of the
prototype, aiming at a future integration with a travel portal, but also
to support the use case of the host, who can explore and select
royalty free photos to add to his/her property page.</p>
        <p>As displayed in Figure 1, based on the results of our test we also
included some pre-processing steps that we found useful to improve
2 https://github.com/junyanz/CycleGAN.
3 https://www.cs.ccu.edu.tw/~wtchu/projects/Weather/index.html
the transfer. The details of the season transfer tests and
implementation will be discussed in the following section.
3.2</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>A CycleGAN application</title>
      <p>The transformation of our pictures was made using the pretrained
summer2winter_yosemite model2.</p>
      <p>Because the model has been trained mostly on pictures of
Yosemite, when used to transfer seasonal style for other types of
locations it visibly revealed a fundamental bias towards mountain
landscapes of a “continental” latitude, with particular colour and
lighting conditions. One of the challenges of our research was also
to test the performance of the model on different types of landscapes.</p>
      <p>Aware of this bias we also made some initial tests with a dataset
containing a weather classification3 to complement and improve the
performance of the model, forcing associations between
meteorological conditions and seasons. With the support of this
classification we selected pictures tagged with peculiar seasonal
weather conditions before feeding them to the model for the transfer.</p>
      <p>Some tests returned particularly positive results [Figure 2],
regardless of the similarity of the images with the training set,
suggesting a valuable direction for future improvement of the season
transfer. In order to gauge the applicability of the CycleGAN for the
design, we tested the model also for other types of landscapes.
Because of the limitations of the original training dataset we proved
that the application of the transfer generally works better for pictures
of mountains and similar natural landscapes. In particular we
confirmed the hypothesis that the winter2summer transfer works
pretty well with strong seasonal traits, e.g. snow, clouds, cold
colours, thus supporting the utility of a complementary weather
classification. At the same time, we discovered that generally the
transfer doesn’t work well when the original picture is very different
from the examples of the training set, in the cases where the image
doesn’t present strong classic seasonal traits, i.e. if it’s rather
colourful and sunny. In this case sometimes the perceived effect is a
reversed transfer, converting the image to winter (instead of
summer). (Figure 3)</p>
      <p>c.</p>
      <p>Figure 3. winter2summer failure cases
A similar phenomenon unexpectedly happens also for environments
that should have been similar to the training dataset, suggesting the
need of a wider training also for these locations. (Figure 3b)
Since the prototype was designed to include different types of
locations, further experiments have been made also with radically
different landscapes. When an unexpected natural element is found
(e.g. sand) the model fails to recognise it, and returns more
“creative” results, mistaking for example a desert for sea and
therefore replacing wrong colours (Figure 3c). As we expected the
model doesn’t also work very well with pictures of interiors, such as
hotel rooms, that could be rather common for travel photos,
especially as the pictures are gathered in an automatic way (e.g.
through APIs).</p>
      <p>Some of these experiments also revealed that the performance of
the model increases significantly when the original picture is in
black and white. In this case we hypothesize that the model is able
to translate the images in a more straightforward way. Without the
confusion created by unexpected colours, the model can simply
apply the ‘seasonal’ style. Some tests of the winter2summer model
on our dataset produced pretty good results. For this reason, we
implemented the black and white conversion as a step in our pipeline
before feeding the images to the model as displayed in Figure 1. As
shown in Figure 4, after the conversion the hypothesised
colourisation recognises and assigns pretty much all the right
colours. Sometimes the fake images can be difficult to distinguish
from the original colour version, but in terms of season transfer the
results appear definitely more accurate and realistic.</p>
      <p>real (winter)</p>
      <p>fake (summer)
before
after
Figure 4. winter2summer with black&amp;white conversion
4</p>
    </sec>
    <sec id="sec-6">
      <title>The User interface</title>
      <p>TravelGANs’ user interface is designed to support two different use
cases. The main one we designed for is for a regular user who’s
exploring options for choosing a holiday destination.</p>
      <p>The user is presented with a set of photos of travel destinations,
randomly selected from our database through the Flickr API web
app. The photos are arranged in an image board layout without any
description indicating the location or season, to be used for visual
inspiration. The user can refresh the selection and is prompted to
select one that resonates with her/him.</p>
      <p>After selecting a photo, the destination and the season of the
image is revealed to the user. We use the metadata automatically
associated to each photo - geographical coordinates, type of location
and timestamp - to provide the user for other suggestions of similar
destinations to explore.</p>
      <p>When a GAN generated image is selected, the original photo is
displayed for comparison and the user is then guided towards the
traditional booking process.</p>
    </sec>
    <sec id="sec-7">
      <title>FUTURE WORK</title>
      <p>TravelGANs is a prototype of a service that suggests a new approach
at holiday exploration, based on users’ visual preferences. The
implementation exploits the potential of deep generative networks
to provide seasonal alternatives of travel pictures.</p>
      <p>From a design perspective there is planned future work for an
extensive user research, especially on the perception of AI generated
images in relation to a travel booking service. Broader research is
also required to design functionalities more focused on visual
similarity and personalization of features.</p>
      <p>From a technology perspective we discovered a great potential
for further exploration with respect to GANs and more in general on
deep learning models for generative design.</p>
      <p>More specifically, we identified several opportunities for future
work in:
•</p>
      <p>Training of different models for different location types,
e.g. Seaside, cities and interiors.
•
•
•
•
•</p>
      <p>
        Training models on clustered locations based on similar
visual characteristics, using external classifications such as
the weather dataset [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ];
Implement computer vision techniques to analyse images
upfront, such as image segmentation and classification
integrating stuff classes [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ];
      </p>
      <sec id="sec-7-1">
        <title>Contextual latitude/longitude</title>
        <p>improve seasonal accuracy;
time
classification
Semantic analysis and content recognition: recognising
semantic elements in pictures would allow to apply
different style rules to different objects (e.g. sea VS sand)
Perform tests with similar generative techniques as
Variational Auto-encoders (VAE).</p>
        <p>The exploration of these technologies would allow also to expand
the design of a more visual travel booking experience with
complementary use cases to be explored.</p>
        <p>• The simple application of content recognition would allow
users for a semantic search of specific visual elements into
travel photos. Such a travel service would therefore be able
to respond to questions like “I want to go to a place with
trees and museums”.
•
•</p>
        <p>Dominant color extraction and similar techniques would
allow to explore travel locations by their most frequent
hues, responding to users’ questions like “I want to go to a
place that looks mostly yellow and blue”.</p>
        <p>All these techniques can be integrated and provide live
updated personalized visual cues and trends of the most
searched terms or characteristics.</p>
        <p>The project also became a starting point to deepen the reflection on
human-machine interaction, in particular in relation to service
design applications of generative AI and the need for parameters for
the exploration and representation of the latent space.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Anoosheh</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sattler</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Timofte</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pollefeys</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Gool</surname>
            ,
            <given-names>L. Van.</given-names>
          </string-name>
          (
          <year>2019</year>
          ).
          <article-title>Night-to-day image translation for retrieval-based localization</article-title>
          .
          <source>Proceedings - IEEE International Conference on Robotics and Automation</source>
          , 2019-May,
          <fpage>5958</fpage>
          -
          <lpage>5964</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Caesar</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Uijlings</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Ferrari</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>COCO-Stuff: Thing and Stuff Classes in Context</article-title>
          .
          <source>Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition</source>
          ,
          <fpage>1209</fpage>
          -
          <lpage>1218</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Dietmar</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <article-title>"GANs and Deepfakes Could Revolutionize The Fashion Industry"</article-title>
          . Forbes.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Coffey</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <article-title>Easyjet launches app allowing passengers to book flights based on Instagram pictures</article-title>
          , (
          <year>2018</year>
          ), Indipendent.co.uk https://www.independent.co.uk/travel/news-and
          <article-title>-advice/easyjet-applook-and-book-instagram-pictures-flights-social-mediaa8588481.html</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Gatys</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ecker</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Bethge</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>A Neural Algorithm of Artistic Style</article-title>
          .
          <source>Journal of Vision</source>
          ,
          <volume>16</volume>
          (
          <issue>12</issue>
          ),
          <fpage>326</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Ian</surname>
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Goodfellow</surname>
          </string-name>
          , Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville,
          <article-title>Yoshua Bengio "Generative Adversarial Networks"</article-title>
          ,
          <source>in NIPS 2014</source>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>M. Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Breuel</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Kautz</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Unsupervised image-toimage translation networks</article-title>
          .
          <source>In Advances in neural information processing systems</source>
          (pp.
          <fpage>700</fpage>
          -
          <lpage>708</lpage>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Lu</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jia</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Tang</surname>
            ,
            <given-names>C. K.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Two-Class Weather Classification</article-title>
          .
          <source>IEEE Transactions on Pattern Analysis and Machine Intelligence</source>
          ,
          <volume>39</volume>
          (
          <issue>12</issue>
          ),
          <fpage>2510</fpage>
          -
          <lpage>2524</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Miller</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          , How Instagram is changing travel https://www.nationalgeographic.com/travel/travel-interests/
          <article-title>arts-andculture/how-instagram-is-changing-travel/</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Mustafa</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bard</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bhimji</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          et al.
          <article-title>CosmoGAN: creating highfidelity weak lensing convergence maps using Generative Adversarial Networks</article-title>
          .
          <source>Comput. Astrophys. 6</source>
          ,
          <issue>1</issue>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Ritschel</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <article-title>When booking holidays think of Instagram photos above all other factors, Indipendent</article-title>
          .co.uk, (
          <year>2017</year>
          ) https://www.independent.co.uk/life-style/
          <article-title>millennials-holidaydecision-instagram-photos-factors-think-first-social-mediaa8131731.html</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Chu</surname>
            ,
            <given-names>W. T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zheng</surname>
            ,
            <given-names>X. Y.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Ding</surname>
            ,
            <given-names>D. S.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Camera as weather sensor: Estimating weather information from single images</article-title>
          .
          <source>Journal of Visual Communication and Image Representation</source>
          ,
          <volume>46</volume>
          ,
          <fpage>233</fpage>
          -
          <lpage>249</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Wong</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <article-title>"The Rise of AI Supermodels"</article-title>
          . (
          <year>2019</year>
          )
          <article-title>CDO Trends</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Zhu</surname>
            ,
            <given-names>J. Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Park</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Isola</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Efros</surname>
            ,
            <given-names>A. A.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Unpaired Imageto-Image Translation Using Cycle-Consistent Adversarial Networks</article-title>
          .
          <source>Proceedings of the IEEE International Conference on Computer Vision</source>
          , 2017-Octob,
          <fpage>2242</fpage>
          -
          <lpage>2251</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>