=Paper=
{{Paper
|id=Vol-3290/long_paper6539
|storemode=property
|title=The Computational Memorability of Iconic Images
|pdfUrl=https://ceur-ws.org/Vol-3290/long_paper6539.pdf
|volume=Vol-3290
|authors=Lisa Saleh,Nanne van Noord
|dblpUrl=https://dblp.org/rec/conf/chr/SalehN22
}}
==The Computational Memorability of Iconic Images==
The Computational Memorability of Iconic Images Lisa Saleh, Nanne van Noord∗ Multimedia Analytics Lab, University of Amsterdam Abstract The perception of historic events is frequently shaped by speci昀椀c images that have been ascribed an iconic status. These images are widely reproduced and recognised and can therefore be considered memorable. A question that arises given such images is whether the memorability of iconic images is intrinsic or whether it is shaped. In this work we analyse the memorability of iconic images by means of computational techniques that are speci昀椀cally designed to measure the intrinsic memorability of images. To judge whether iconic images are inherently more memorable we establish two baselines based on datasets of diverse imagery and of newspaper imagery. Our 昀椀ndings show that iconic images are not more memorable than modern day newspaper imagery or when compared to a diverse set of everyday images. In fact, by and large many of the iconic images analysed score on the low end of the memorability spectrum. Additionally, we explore the variation in memorability of reproductions of iconic images and 昀椀nd that certain images have been edited resulting in higher memorability scores, but that the images by and large are reproduced with memorability close to the original. Keywords Memorability, Iconicity, Computer Vision 1. Introduction The need to capture historic events in a visual frame has been around long enough to have captured events from centuries ago [18]. From ancient greek wall paintings of the battle of Marathon (490 BC) to the Black Death (14th century) in elaborate oil paintings [24, 6]. His- toric events are o昀琀en remembered through the visual imagery it is captured in [18]. Due to technological development in recent history, it has become conventional to capture events with photography. A unique share of historic photographs displayed in media are considered iconic. To illustrate, Tank Man, the image of a man in front of the tanks in a street of Tienanmen Square in Beijing, is an image which many people immediately can recall. Iconic photographs can be de昀椀ned as “photographs that are widely recalled and recognized by individuals across social groups and generations, that they connect with speci昀椀c historical events, and that have emotional signi昀椀cance for them and their national community” [7]. Iconic photographs are thus inherently connected with historical events and may in昀氀uence the views of people on this historic event. For example, the photograph of Alan Kurdi during the Syrian refugee crisis in 2015 caused such an emotional response with most individuals that would in昀氀uence the interna- tional politics surrounding the Syrian refugee crisis [1]. Thus iconic images may even in昀氀uence CHR 2022: Computational Humanities Research Conference, December 12 – 14, 2022, Antwerp, Belgium ∗ Corresponding author. £ lisa.saleh@student.uva.nl (L. Saleh); n.j.e.vannoord@uva.nl (N. van Noord) ȉ 0000-0002-5145-3603 (N. van Noord) © 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). CEUR Workshop Proceedings http://ceur-ws.org ISSN 1613-0073 CEUR Workshop Proceedings (CEUR-WS.org) 55 the course of history. Due to this in昀氀uence iconic images have been widely researched with the aim to understand what makes them unique. Di昀昀erent aspects of iconic images have been researched. One o昀琀en noted criteria for iconic- ity is the importance of the captured historic event. Due to the democratisation of photography, images of recent historic events are abundant. Thus an image that merely displays a signi昀椀cant historic event is not su昀케cient to reach iconic status. For an image to become iconic it must be widely spread in the media [11]. Therefore it is useful to know what images the media tends to publish and which characteristics they carry. For example, one study on media portrayal of photographs of hurricane Katrina uncovered that photographs with some visual themes are more o昀琀en published than others [8]. Besides visual themes, there are plenty of other aspects that can in昀氀uence if an image can be iconic. To recall the de昀椀nition of an iconic photograph, the photograph must be widely recalled and recognized. An image that is widely recalled and recognized needs either to be really memorable or to be displayed very o昀琀en. A more memo- rable image is more likely to be recalled. Thus high memorability of an image may contribute to its iconic status. As there has been no research on the intrinsic memorability of iconic im- ages, this will be the focus of this paper. Speci昀椀cally, we investigate whether there are general patterns concerning memorability of iconic images that are salient even when considering the unique circumstances by which these images have become iconic. Memorability of images is a complex characteristic to predict; Using only the surface features of the image for predicting memorability is not comprehensive. Automating the prediction of memorability with Convolutional Neural Networks (CNN) has been a successful solution to this problem [16]. It is proven that features such as object size and brightness do a昀昀ect image memorability [10]. However, a CNN like MemNet outperforms these surface features in predicting memorability [16]. When considering image memorability what a method like MemNet measures is the intrinsic memorability of an image, which can be interpreted as the innate probability of an image to be remembered. It does not guarantee that an image will be remembered, and it is certainly not the only factor, but all other factors being equal an image with a higher intrinsic memorability is more likely to be remembered. Besides the advantages of automatizing the process by using a CNN, the usage of a predic- tion CNN has another advantage: As iconic photographs are images that are widely spread it is likely that a great share of people has already seen said image. Therefore it would be di昀케- cult to test the memorability of these images with a memory task. It would be a challenge to guarantee that these people have never seen these images before. Thus the challenge of test- ing the memorability of iconic photographs context-independent can be solved by using the above-mentioned MemNet. Additionally, the iconicity of some images is determined by the fame of the subject. MemNet is not trained to recognize celebrities or other symbols. In this research, the focus is on what image aspects the memorability in昀氀uenced, without analysis of the context of the images. This makes the usage of MemNet a great 昀椀t. In this paper, iconic photographs are tested on their memorability by the use of the CNN MemNet to explore to what extent the memorability of iconic photographs can be measured. To investigate this we formulate the following two research questions: RQ 1: How does the memorability of iconic images compare to a baseline? RQ 2: To what extent does the memorability of iconic images di昀昀er across variations? 56 2. Related work 2.1. Iconic Images There has been little to no computational research on iconic images, but in the following we will give a brief overview of other research directions. As iconic images are not a bound set, di昀昀erent de昀椀nitions and what criteria they should meet have been proposed [7, 17, 2, 11]. There are di昀昀erences in the criteria brought up, but the following six proposed by Perlmutter [22] are used most o昀琀en: ”(1) signi昀椀cance of the reported event; (2) capacity to represent the event as a whole; (3) celebrity of the image promoted by the media; (4) prominence of display of the image; (5) frequent repetition of the image across media outlets; (6) ability to generate a primordial theme in society such as good versus evil.” These criteria are not met to the same extent for every iconic image; it is not a blueprint for iconic images. For example, both the image of the Falling Man of the Twin Towers and the hijacked plane crashing into the Twin Towers represent the September 11 attacks. The image of the Falling Man has a less capacity to represent the event as a whole than the image of the airplane, as the attack itself is displayed more comprehensively in the latter image. But even though the Falling Man image does not meet the second criteria to the full extent, it is still an image to be considered iconic. Additionally, other studies have added criteria to be more comprehensive. For example, in some studies more stress has been placed on the symbolic meaning that people have of the image [7]. An image can become a symbol with a meaning beyond the historical image it is attached to. The Guerrillero Heroico image of Che Guevara is used as a symbol for revolution beyond in Cuba [20]. Even though this is not signi昀椀cant for every iconic image, it is still a unique aspect for some. Furthermore, the criteria from Perlmutter [22] put the focus on the image and less on the reception of the image. One aspect of the reception that is unmentioned in these six criteria is that an iconic image also should be widely recalled and part of the col- lective memory of a certain with a group’s identity [7]. This is another criterion o昀琀en added to complement the criteria from Perlmutter. That an image must be widely recalled is impor- tant to add because this can be a way of researching what image have an iconic status with qualitative research as done by for example Hoeven [12]. But as other factors of the reception, like the symbolic meaning or being a part of the col- lective memory, of an iconic image are not fully encoded in the image itself and are subjective, it is yet challenging to study computational [19]. It is yet di昀케cult to study the semantics of images computationally, as this o昀琀en is can not be conducted from the visual data or metadata. Therefore the study of the perception of iconic images is o昀琀en limited to interviews or large- scale surveys [19]. Hence the study of iconic images in computer vision is yet to be immersed, and potentially the computational study of memorability can open this door. 2.2. Memorability Due to technological advancements capturing and sharing have become more convenient. Mass media and its consumption lead people to be exposed to countless images every day. People are exposed to many images daily and some of those are better remembered than oth- ers. Which image people remember and which they forget is related to the memorability of the image. Di昀昀erent academic 昀椀elds have an interest in image memorability. For example, 57 research in cognitive sciences answered how certain activities in the brain correlate to the memorability of images. Additionally, image memorability has been a subject of research in the computer vision 昀椀eld in recent years [16, 10, 9, 14, 4]. Taking the next step from com- puting how memorable images are with MemNet, to analyzing what image attributes leads to memorable image and what connection memorability has with other qualities like the emotion it portrays. In line with researching what attributes in昀氀uence memorability, GANalyze was created [10]. With GANalyze images are tweaked to increase memorability. Some attributes emerged to o昀琀en be increased when optimizing the memorability, such as redness, brightness and object size [10]. Thus images that have higher redness, brightness or a bigger object size are more likely to be memorable. Even though these attributes might be simple and intuitive ways to predict memorability, the prediction of memorability is complex and can not solely be explained by these attributes [16]. When diving further into the correlation between memorability and some image aspects, collected across di昀昀erent studies, the following observations were made. In some studies, it appeared that a number of object categories, like people, animals and vehicles were relatively more memorable [21]. Images with humans are especially memorable when the face is visi- ble and has eye contact with the camera [13]. In scenic images, indoor scenes appeared to be relatively more memorable [13]. Additionally, it appeared that spatial layout is highly corre- lated with memorability; Images where the object is bigger and centred in the image are more likely to be memorable [21]. This is in line with research which showed that images with a high aesthetic level are also more likely to be memorable [15]; the spatial layout of an image is strongly connected to its aesthetics. Moreover, it also appeared that images in which humans portray negative emotions like disgust and fear, tend to be recalled better [16]. Finally, only the most memorable images have a correlation with popularity [16]. Even though mapping for these above attributes and many more, there is still about 25% of the variance in memorability that is unaccounted for [23]. Thus looking for the attributes in images that are correlated with memorability will give some intuitive sense of memorability, but will not be comprehensive. 2.3. MemNet & LaMem In this paper, memorability will be measured computationally through a score given by Mem- Net. MemNet is a CNN trained on an annotated dataset called LaMem. LaMem is a large-scale dataset constructed of multiple already existing datasets that were annotated in a perception study with human subjects who performed a memorability task. The datasets used are highly diverse, which leads to LaMem being a diverse dataset totalling 60.000 images [16]. It contains images of humans, animals, landscapes, and even art, which also includes abstract images. This diversity of image types is displayed in Figure 1. For the LaMem annotation task a stream of images was shown to participants and a昀琀er a varying number of distractors the participants had to indicate if they had seen the image before. Thus when an image was remembered o昀琀en the image would be scored as more memorable. A昀琀er performing this task on a group of participants a memorability score for each image could be calculated. This score is the annotation for each image in the LaMem dataset. This makes LaMem thereby a su昀케cient foundation for a CNN to compute a memorability score (0 − 1 scalar) for di昀昀erent types of images [16]. LaMem was shown to have an inter-annotator 58 Figure 1: Sample images from LaMem [16] with diversity in image subject. The images are arranged by their memorability score, decreasing from le昀琀 to right. rank correlation of 0.68, with MemNet achieving a remarkably high rank correlation of 0.64 [16, p. 6]. MemNet has since been further validated in subsequent studies and is, therefore, a reliable measure of intrinsic image memorability [23, 21, 14, 4]. 3. Memorability of Iconic Images The following step-by-step procedure, shown in Figure 2 represents our proposed method that can be used for measuring a memorability across a set of images. Figure 2: Flowchart overview of the memorability comparison procedure for two datasets. The procedure we propose involves a comparative process between two datasets (A and B). Here, dataset A represents the collection to be studied, whereas dataset B functions as a baseline. The general idea of this approach is to not only measure the memorability scores but also judge whether the scores di昀昀er from a meaningful baseline. Each dataset is pre-processed by transforming the images to a suitable input resolution for the CNN (typically a low resolution like 256 × 256) and normalizing them. Once the dataset are pre-processed the images will be passed through the pre-trained CNN to obtain a per-image memorability score. The method for comparing between the memorability scores of datasets A and B depends on the sizes of the respective datasets. In the case of small datasets, the analysis will be mostly done by qualitatively looking at the images to draw conclusions based on the speci昀椀c images. When dealing with large-scale datasets, the comparison can be based on a statistical analysis. Whilst our baseline datasets are large-scale, the dataset of iconic images used is too small scale to reliably perform statistical analysis. Further analysis beyond the comparative analysis will consist of sampling images which are 59 either random or statically interesting, according to their attribute score. Sampling random images will give an indicative perspective on how the data is constructed. This method of analysis is based on Distant Viewing [3], a method for studying large visual corpora, where the main take way is to perform the computational analysis whilst also viewing the corpora. The latter is import because the intuitive semantics that may get lost in a computational analysis will thereby also be studied. As the semantics are important for iconic images, this method suits this paper well. Conclusions drawn from these analyses can then be linked to prior work. 3.1. Comparing the Memorability of Iconic Images For our dataset of iconic images we use a selection of 26 images that were initially in a large- scale survey on iconic images [12]. Smits and Ros [25] further used this set and collected variations of the images in the form of online circulations. These 26 images are a part of a global visual memory: ’a limited set of images that people all over the world have seen and remembered’ [12]. Each image and its scores are depicted in Figure 3. These images are in order from highest to lowest memorability score. When comparing the scores with the images we make the following observations. Firstly, it is notable is that the three images with the highest memorability score are similar. The images of Che Guevara, Sharbat Gula, and the Migrant Mother are all portraits where the face of the subject 昀椀lls much of the image. Their expression is visible to the viewer as all three are looking in the (rough) direction of the camera. That these portraits have a high memorability score is in line with the prior work; Images with bigger object size, a face that has eye contact with the camera and humans as their main object are more likely to have a higher memorability score. Which makes it in the lines of expectation that images like the Holocaust survivors, Raising a 昀氀ag over the Reichstag and Tank Man have a lower memorability score; These images have a small object size and either has no human faces or a high amount of di昀昀erent faces. On the whole we see a diverse range of memorability scores across the dataset. 3.2. Iconic Images compared to a Baseline To place the memorability scores in context, we compare to two di昀昀erent baselines. The 昀椀rst baseline used is the LaMem dataset; This dataset consists of images with di昀昀erent subjects and themes, and functions as a diverse baseline representing a ‘general image collection’ [16]. This is an annotated dataset for memorability, but the annotations were not used; Images of the LaMem dataset were inserted in MemNet to create a memorability score generated with the same circumstances as the dataset it is compared with. The second baseline used was the GoodNews dataset, which consists of all kinds of images used in the New York Times from 1818 until 2019 [5]. These images vary from news photography to sports photography, to images from the cooking appendix and so on. The range of types of images is displayed in Figure 4. These images are mainly images that were made by professional photographers and selected by the editors of the paper. Thus these images o昀琀en have some journalistic value and meet the aesthetic standards of the paper. Therefore the GoodNews dataset is used as a baseline to represent images depicted in media. As iconic images are widely published in the media, the GoodNews dataset will help us determine whether iconic images are particularly memorable 60 Sharbat Gula Che Guevera Migrant Mother 0.9120 0.8894 0.8767 Survivor of Hutu The Falling Man on the Moon death Man 0.8168 camp 0.8190 0.8290 Abu Hinden- Ghraib burg Alan Kurdi prisoner Disaster 0.7897 0.8098 0.7940 Mohandas Mao Zedong Hijacked Airplane Ghandi 0.7767 0.7697 0.7725 Viet Cong Times Square Kiss Napalm Girl Execution 0.7610 0.7588 0.7533 Burning Vultre and Situation Monk the girl Room 0.7476 0.7462 0.7447 Raising the Assassination of Kent State flag on Inejiro Anasuma Shootings Iwo Jima 0.7248 0.7151 0.7173 Spanish Coup in Tank Man Soldier Chile 0.6939 0.6982 0.6981 Raising a Flag over Holocaust the survivors Reichstag 0.6449 0.6796 Figure 3: Iconic images sorted on their memorability score. Each image is captioned with the name it is known as and its memorability score. The photographer and year of origin can be found in Table 1. 61 when compared to other media images. Figure 4: Samples from the GoodNews dataset, which contains diverse images from The New York Times [5]. The distribution of the memorability scores of the LaMem and GoodNews dataset is visu- alised in Figure 5. Additionally, the scores of the iconic images are plotted with a scatter plot at the bottom of the Figure. From the distributions we can observe that the GoodNews images are generally more memorable than the LaMem images. Additionally, the LaMem dataset has a wider distribution than the GoodNews dataset. There are fewer images in the GoodNews dataset with a very low memorability score, which might re昀氀ect that all images in GoodNews have been approved by an editor of the paper. It is very likely an editor would select a pic- ture with at least some traits that correlate with memorability. For example, images that are selected by an editor o昀琀en need to have either a clear main object, thus big object size or some aesthetic value. Taking into account scatter plot of the iconic images we observe that iconic images are generally (slightly) less memorably than the images depicted in the media. To further clarify how the iconic images compare to both datasets we plot the distribution of the iconic images according to the quartiles of both LaMem and GoodNews in Figure 6. We can observe that most iconic images fall into the lower quartiles for both datasets, scoring slightly higher when compared to LaMem. As the LaMem dataset is representative of all kinds of images, it can be concluded that these iconic images are slightly less memorable images. When comparing the iconic images to the GoodNews dataset, it stands out that most of the iconic images have a memorability score that overlaps with the 昀椀rst quantile of the GoodNews dataset. The distribution of the iconic images in the quartiles of the GoodNews dataset is signi昀椀cantly shi昀琀ed to the less memorable side. Thus iconic images are less memorable than images depicted in the media. 4. Memorability across Variations Until now we have analysed the most canonical versions of the 26 iconic images. However, it is known that edits have been made in prominently published versions of iconic images. To explore whether these edits have been made (implicitly or explicitly) to boost the memorability of iconic images we further analyse the circulations collected by Smits and Ros [25]. For each of the 26 iconic images they used the Google Cloud Vision API to retrieve online circulations, thereby creating a dataset of 900k images. Per image the number of variations di昀昀ers from the Che Guevara portrait having over 100k variations retrieved to the image of Mao Zedong and 62 Figure 5: The upper graph is the distribution plot of the LaMem and GoodNews dataset, where the distribution is the density for the memorability scores. The graph below is a scatter plot of the iconic images and how they compare the datasets. Figure 6: Distribution of the iconic images across the LaMem and GoodNews datasets. The y-axis represents how many iconic images have the memorability score for that quartile. the founding of the PRC, having less than 3k variations. Figure 7 shows how many images were retrieved for each of the images. Due to how the variations were collected, in iterations based on previous retrieval results, the 63 Table 1 Reproduction of table from Smits and Ros [25] with the amount of circulations per image. known as photographer year historical event circulations Migrant mother Dorothea Lange 1936 Great Depression 41697 Falling Soldier Robert Capa 1936 Spanish Civil War 18194 The Hindenburg Disaster Sam Shere 1937 Zeppelin 36683 Times Square Kiss Alfred Eisenstaedt 1945 V-Day 65164 Raising the Flag on Iwo Jima Joe Rosenthal 1945 Pacific War 63249 Holocaust survivors Lee Miller 1945 Holocaust 18343 Raising a Flag over the Reichstag Yevgeny Khaldei 1945 World War II 90344 Gandhi and the Spinning Wheel Margaret Bourke-White 1946 Mohandas Gandhi 10893 The Founding of the PRC Hou Bo 1949 Mao Zedong 2865 Assassination of Inejiro Asanuma Yasushi Nagao 1960 post-war Japan 3921 Guerillero heroico Alberto Korda 1960 Che Guevara 108288 The Burning Monk Malcom Browne 1963 Vietnam War 18122 Saigon Execution Eddie Adams 1968 Vietnam War 18305 A Man on the Moon Neil Armstrong 1969 Space Race 186921 Kent State Shootings John Filo 1970 Kent State 7320 Accidental Napalm (Napalm girl) Nick Ut 1972 Vietnam War 38619 Allende’s Last Stand Luis Orlando 1973 South-American Coups 6997 Afghan Girl Steve McCurry 1984 Afghan War 47892 Tank Man Je昀昀 Widener 1989 Tiananmen Square Protest 63182 The vulture and the little girl Kevin Carter 1993 Sudan famine 30121 Survivor of Hutu death camp James Nachtwey 1994 Rwandan genocide 3395 The Falling Man Richard Drew 2001 9/11 11681 Hijacked airplane unknown 2001 9/11 6938 Abu Ghraib prisoner Sergeant Ivan Frederick 2003 Iraq War 3601 The Situation Room Pete Souza 2011 War on Terrorism 20102 Alan Kurdi Nilüfer Demir 2015 Refugee crisis 24432 total 947269 di昀昀erences to the original get progressively larger the deeper we get into the list of variations. The variations of the iconic image vary in di昀昀erent crops, di昀昀erent shades of colors and so on. There are also variations where other images and text are added to the iconic image, this can di昀昀er from a logo of a broadcaster to a book cover where the original image also appears. Furthermore, internet memes, collages and other types of photo-shopped images appear in the dataset. Additionally, the API is not infallible, there are some images in the dataset where the original image does not appear at all. Roughly speaking we observe that images which have been circulated less there are also fewer edits. To control for some of this variation, and to limit the scale of the comparison, we only use the 昀椀rst 10k variations for each iconic image. The memorability scores for the variations of each iconic image are visualised in Figure 7. Images to the right on the x-axis should generally have less resemblance with the original image. When viewing the distributions of scores for the variations some patterns can be found. We recognise four groups: (1) distributions with relatively little spread across the variations, (2) distributions that fan outward towards the end of the plot, (3) distributions that have a large spread from start to 昀椀nish, and (4) distributions where we can recognise clear clusters. Examples of the 昀椀rst group, with little spread, at the Migrant Mother, the Situation Room, and the assassination of Anasuma. Within this group we can recognise two subcategories, as they 64 Figure 7: Grid with a scatter plot per iconic image, where each scatter plot visualises the memorability score across the variations in sequence. Higher x values are further from the original image. are either images that have less than 10k variations or they are images where the canonical form is most dominant. Because for the latter subcategory we have only looked at the 昀椀rst 10k images retrieved, it is still very likely that there are also many variations that di昀昀er more strongly circulating on the internet. But these would only be retrieved further in the dataset than the 昀椀rst 10k images selected. However, it is still noticeable that the 昀椀rst 10k variations have more resemblance with the original iconic image than for other widely circulated iconic images. For the second group examples have most of their spread at the end of the graph, such as the Burning Monk, the Falling man and the Tank Man images. In these images, it appears that there is de昀椀nitely a big share of images that have a big resemblance with the original picture but variations that di昀昀er more already appear within the 昀椀rst 10k variations. In the third group there is a lot of spread from beginning to 昀椀nish, these are images like Gandhi, Raising the 昀氀ag over Iwo Jima and the Image of the Spanish Soldier. These images appear in a lot of di昀昀erent variations. This can be explained by it being popular photos for web pages, book covers and other types of editing where the context of the image gets altered heavily. Lastly is the group where we can recognise di昀昀erent clusters. This group includes images 65 like Napalm Girl, the Hijacked Plane and the portrait of Sharbat Gula. All these images have an alternative popular variation, which leads to a cluster forming in the 昀椀gure. In the following section, the Napalm Girl image will be highlighted to give an example of what these variations look like. Two variations of the Sharbat Gula image and the Hijacked plane are displayed in Figure 8 with their memorability score in the caption of each image. The variations that are highlighted in this 昀椀gure are average images from the main clusters. In the Sharbat Gula variations graph, the bigger cluster on the top of the graph is the original image (Figure 8a) and the cluster that lies under most of the images are depicted in Figure 8b. In the Hijacked Plane, there is a similar pattern, the graph of the variations has two distinct clusters, one at the top which consists of images like the original image (Figure 8c and the other cluster which has a lower memorability score than the 昀椀rst cluster. This cluster consists of images like in Figure 8d, which has a wider crop than the original image such that the building on the le昀琀 still remains in the image. (b) The variation of the original with a recent im- (a) The original image with a memorability score age placed alongside has a memorability score of 0.9120 of 0.8511 (c) The published image with a memorability (d) A variation of the published image with a score of 0.7697 wider crop and a memorability score of 0.7095 Figure 8: Two variations of the Sharbat Gula and the Hijacked Plane image. Where the Figure 8a is the original portrait of Sharbat Gula and Figure 8b is a variation where a more recent portrait is placed alongside the original. The Figure 8c is the original published version of the Hijacked Plane and the wider crop variation in Figure 8d 66 4.1. The Memorability of the Variations of the Napalm Girl Image Figure 9: Scatter plot of the memorability scores on the y-axis and the sequence of variations of the Napalm Girl image on the x-axis, where the red dotted line is a trend line. A number of di昀昀erent crops of the Napalm girl image circulate online. The scatter plot which displays the memorability scores and their variations is depicted in Figure 9. The outliers on the upper side of this plot, images with a score above 0.85, are either images heavily photoshopped or images that got in this dataset but in which the Napalm Girl image does not appear. One of the aspects that stands out from this graph is the big cluster of images which are under the trend line and appear from about 4.000 on the x-axis. When sampling these images it appears that these are mainly the Napalm Girl image with a tighter crop than the original image. This is displayed in Figure 10. Where the original image is displayed in Figure 10a and the tighter crop in Figure 10b. The part of the image that is only visible in the wider crop consists of a big part of the sky and a photographer on the right. The tight crop is more focused on the children, which are the main subject of the picture, and thereby take up a larger portion of the image. The tighter crop having a higher memorability score 昀椀ts the notion that images with a bigger object size are more memorable and hints that memorability might be an (implicit) criteria for edits done by photo-editors. Another frequent variation was the image in Figure 10c. This image has a signi昀椀cantly higher score than the original picture and was the only variation with a score in this range that still fully depicted the original image. The original image was edited with a red-hued 昀椀lter over the image. That this red-hued image has such a higher memorability score is expected as the redness of an image has been demonstrated to positively a昀昀ect memorability. 5. Discussion The 昀椀ndings in this paper are based on the result generated by MemNet. Even though MemNet is a validated method of predicting memorability, the actual memorability and the computed 67 (a) The original full photograph with a memora- (b) Widely published tighter crop with a memo- bility of 0.6767 rability of 0.7652 (c) A variation with a red 昀椀lter and a memorability of 0.8200 Figure 10: Three di昀昀erent variations of Napalm Girl found in online circulation. memorability score could still di昀昀er. In conducted research, it became apparent that image memorability and some qualities were correlated, like aesthetics and certain emotions. But even though they are correlated, MemNet is not trained on these qualities; Thus, for example, images that do not have the more common features that positively in昀氀uence the memorability, like a big object size, but are very memorable because they portray a strong negative emotion that also in昀氀uences memorability, possibly do not get a high memorability score from MemNet. This highlights a mismatch between intrinsic memorability and actually being remembered. A clear example of this mismatch is the Napalm Girl image, this image has a memorability score below the average of LaMem. But Napalm Girl displays a scene of terror, that evokes strong negative emotions like sadness and anger. Those strong emotions make this image more mem- orable and highlight a limitation of (computational) intrinsic memorability methods. Moreover, frequent exposure of less memorable images might also lead to increased remembrance, which might also play a role for this image. A possible limitation of this work is that the dataset was selected by Dutch researchers. Despite being selected with the aim to represent international iconic images, the images are predominantly known in the Western World. This could in昀氀uence the results, but this might 68 also interact with MemNet. As MemNet is trained on LaMem which consists mainly of images from the Western World, this bias is is matched - whilst it should not in昀氀uence the analysis itself it does limit to what extent we can generalize about the results. Additionally, the dataset of the iconic images is mostly from the 20th century when color photography was not as common. Most of the 26 images are in black and white. When experimenting with the di昀昀erences in memorability score for the same image in greyscale as in color, it a昀昀ected the memorability score; While the LaMem dataset averaged a memorability score of 0.7645 this score dropped to 0.7456 when all images were converted to greyscale. From this we can observe that colour plays a role, but not to the extent that is changes our conclusions. Even with taking these points into account, the observations we made align with existing theories on memorability. 6. Conclusion In answering the research question: How does the memorability of iconic images compare to a baseline? It appeared that of the iconic images, the portraits where the face was a big part of the image had a higher memorability score. This con昀椀rmed the research on memorability where a correlation between higher memorability and big object size, and a face as an image object was established. Additionally, the iconic images with small object sizes and small faces had a lower memorability score. The memorability scores for these iconic images align with theories on memorability. Furthermore, it appeared that images that are depicted in the media are generally more memorable than all other images. Few images are depicted in media that have a relatively low memorability score since all images depicted in media are selected by editors. Iconic images are generally slightly less memorable than other images and are on the lower side of the memorability of images in the media. To answer the second research question: To what extent does the memorability iconic images di昀昀er across variations? When comparing the spread of the memorability of the variations of the iconic images, there were certain patterns to be found. Some had clear clusters, which were other popular variations of the original image. Looking at examples of di昀昀erent variations and memorability scores they show that altering an image can in昀氀uence the memorability score. In these examples, it appeared that variations of the images with a red hue or tighter crop were more memorable. This is in line with previous work. On the whole we can conclude that computational measures of memorability do not fully capture the memorability of iconic images, as many iconic images are remembered much better than what their memorability score would imply. While the reasons for this may be manifold we expect that frequent exposure and strong emotional content play an important role. References [1] R. Adler-Nissen, K. E. Andersen, and L. Hansen. “Images, emotions, and international politics: the death of Alan Kurdi”. In: Review of International Studies 46.1 (2020), pp. 75– 95. doi: 10.1017/s0260210519000317. 69 [2] K. Andén-Papadopoulos. “The Abu Ghraib torture photographs: News frames, visual culture, and the power of images”. In: Journalism 9.1 (2008), pp. 5–30. doi: 10.1177/1464 884907084337. [3] T. Arnold and L. Tilton. “Distant viewing: analyzing large visual corpora”. In: Digital Scholarship in the Humanities 34.Supplement_1 (2019), pp. i3–i16. doi: 10.1093/llc/fqz01 3. [4] Y. Baveye, R. Cohendet, M. Perreira Da Silva, and P. Le Callet. “Deep Learning for Image Memorability Prediction: The Emotional Bias”. In: Proceedings of the 24th ACM Interna- tional Conference on Multimedia. Mm ’16. Amsterdam, The Netherlands: Association for Computing Machinery, 2016, pp. 491–495. doi: 10.1145/2964284.2967269. url: https://d oi.org/10.1145/2964284.2967269. [5] A. F. Biten, L. Gómez, M. Rusiñol, and D. Karatzas. “Good News, Everyone! Context driven entity-aware captioning for news images”. In: CoRR abs/1904.01475 (2019). arXiv: 1904.01475. [6] P. Bruegel the Elder. Triumph of Death. 1563. url: https://www.museodelprado.es/en/th e-collection/art-work/the-triumph-of-death/d3d82b0b-9bf2-4082-ab04-66ed53196ccc. [7] A. A. Cohen, S. Boudana, and P. Frosh. “You Must Remember This: Iconic News Pho- tographs and Collective Memory”. In: Journal of Communication 68.3 (2018), pp. 453– 479. doi: 10.1093/joc/jqy017. [8] N. S. Dahmen and A. Miller. “Rede昀椀ning iconicity: A 昀椀ve-year study of visual themes of Hurricane Katrina”. In: Visual Communication Quarterly 19.1 (2012), pp. 4–19. [9] R. Dubey, J. Peterson, A. Khosla, M.-H. Yang, and B. Ghanem. “What Makes an Object Memorable?” In: 2015 IEEE International Conference on Computer Vision (ICCV). 2015, pp. 1089–1097. doi: 10.1109/iccv.2015.130. [10] L. Goetschalckx, A. Andonian, A. Oliva, and P. Isola. “GANalyze: Toward Visual De昀椀ni- tions of Cognitive Image Properties”. In: Proceedings of the IEEE/CVF International Con- ference on Computer Vision (ICCV). 2019. [11] R. Hariman and J. L. Lucaites. No caption needed: Iconic photographs, public culture, and liberal democracy. University of Chicago Press, 2007. [12] R. van der Hoeven. “The Global Visual Memory: A Study of the Recognition and Inter- pretation of Iconic and Historical Photographs”. PhD thesis. Universiteit Utrecht, 2019. [13] P. Isola, D. Parikh, A. Torralba, and A. Oliva. “Understanding the Intrinsic Memorability of Images”. In: Advances in Neural Information Processing Systems. Ed. by J. Shawe-Taylor, R. Zemel, P. Bartlett, F. Pereira, and K. Weinberger. Vol. 24. Curran Associates, Inc., 2011. [14] P. Jing, Y. Su, L. Nie, and H. Gu. “Predicting Image Memorability Through Adaptive Transfer Learning From External Sources”. In: IEEE Transactions on Multimedia 19.5 (2017), pp. 1050–1062. doi: 10.1109/tmm.2016.2644866. [15] P. Jing, Y. Su, L. Nie, and H. Gu. “Predicting Image Memorability Through Adaptive Transfer Learning From External Sources”. In: IEEE Transactions on Multimedia 19.5 (2017), pp. 1050–1062. doi: 10.1109/tmm.2016.2644866. 70 [16] A. Khosla, A. S. Raju, A. Torralba, and A. Oliva. “Understanding and Predicting Image Memorability at a Large Scale”. In: 2015 IEEE International Conference on Computer Vision (ICCV). 2015, pp. 2390–2398. doi: 10.1109/iccv.2015.275. [17] M. Mortensen. “Constructing, con昀椀rming, and contesting icons: the Alan Kurdi imagery appropriated by #humanitywashedashore, Ai Weiwei, and Charlie Hebdo”. In: Media, Culture & Society 39.8 (2017), pp. 1142–1161. doi: 10.1177/0163443717725572. [18] M. Moss. Toward the visualization of history: the past as image. Lexington Books, 2008. [19] N. van Noord. “A survey of computational methods for iconic image analysis”. In: Digital Scholarship in the Humanities (2022). doi: 10.1093/llc/fqac003. [20] M. Parker. “The Retrospective Iconicity of ‘Guerrillero Heroico’”. In: Salford Postgraduate Annual Research Conference. 2009, p. 292. [21] S. Perera, A. Tal, and L. Zelnik-Manor. “Is Image Memorability Prediction Solved?” In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). Los Alamitos, CA, USA: IEEE Computer Society, 2019, pp. 800–808. doi: 10 .1109/cvprw.2019.00108. [22] D. D. Perlmutter. Photojournalism and foreign policy : icons of outrage in international crises. Praeger series in political communication. Westport, CO [etc: Praeger, 1998. [23] N. C. Rust and V. Mehrpour. “Understanding Image Memorability”. In: Trends in Cognitive Sciences 24.7 (2020), pp. 557–568. doi: https://doi.org/10.1016/j.tics.2020.04.001. [24] Sarcophagus with the Battle of Marathon. 490 Bc. url: https://www.livius.org/pictures/it aly/brescia-brixia/marathon-relief/. [25] T. Smits and R. Ros. “Quantifying Iconicity in 940K Online Circulations of 26 Iconic Photographs”. In: Computational Humanities Research. 2020. 71