Embroidered Ephemera: Crafting Qualitative Data
Physicalization Designs from Twitter Data
Anne Sullivana
a
    StoryCraft Lab, Georgia Institute of Technology, Atlanta, GA, USA


                                         Abstract
                                         Embroidered Ephemera explores the juxtaposition of tweets—which can take as little as a few seconds to
                                         create, share, and ultimately disappear—and embroidery, which can take hundreds of hours to create
                                         yet also last hundreds of years. Embroidered Ephemera is an online tool that allows users to input a
                                         Twitter user or hashtag and then generates embroidery sampler designs based on collected tweets. The
                                         generator uses an assortment of text analysis tools to choose words, motifs, and colors based on the
                                         chosen tweet from the specified user or hashtag. The text analysis influences the possibility space for
                                         the aesthetic aspects of the generated sampler design but is not used to define the output such as would
                                         be seen in typical quantitative data visualization.

                                         Keywords
                                         Computational craft, embroider, casual creator, procedural content generation


1. Introduction
Embroidery is a form of handcraft that has been around since at least 30,000 BC – which is the
date of the earliest known fossilized remains of an embroidered piece – and is a craft practice
that continues through today. Embroidery uses a handheld needle to apply thread or yarn as
a decorative feature to a piece of cloth or other material. Through the years, embroidery has
been used to embellish clothing and other items, and due at least partially to the amount of
time that is required to create the needlework patterns, it imbues a sense of importance and
value to whatever item has been embroidered.
   Embroidery encompasses a large variety of stitch types, each requiring practice to develop the
skills required to create high quality designs on valued clothing or items. To gain proficiency,
different stitches would historically be practiced on scraps of cloth. These scraps also became a
way to share stitch knowledge between people and allowed for easier information exchange.
Over time, this practice evolved into more aesthetically driven and decorative designs. These
pieces are now referred to as embroidery samplers, and while partly ornamental, they are still
used as a way to show expertise at one or more stitch types.
   As samplers have evolved, they began to become more complex, and often include a combi-
nation of motifs, alphabets, quotes, figures, borders, etc. These samplers take weeks or months

Joint Proceedings of the ICCC 2020 Workshops (ICCC-WS 2020), September 7-11 2020, Coimbra (PT) / Online
Envelope-Open unicorn@gatech.edu (A. Sullivan)
GLOBE http://asdesigned.com/ (A. Sullivan)
Orcid 0000-0003-4258-7279 (A. Sullivan)
                                       © 2020 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
    CEUR
    Workshop
    Proceedings
                  http://ceur-ws.org
                  ISSN 1613-0073
                                       CEUR Workshop Proceedings (CEUR-WS.org)
Figure 1: Left: A cross-stitch embroidery sampler made by Mary Lotspeich, age 11, ca 1842. Photo by
Sailko, CC BY 3.0, via Wikimedia Commons. Minor color correction applied. Right: A design generated
by Embroidered Ephemera using tweet data by the Twitter company account @twitter.


to complete, and in some cases last for centuries (see Figure 1, left). These designs also began to
often include the name of the person who embroidered it and the date that it was completed.
This is not typical of other embroidered designs, and it is likely this practice developed as sam-
plers began to be known as a “specimen of phenomenal achievement” [1] and displayed as their
own stand-alone art piece. Because of the length of time embroidery can survive and because
samplers regularly include names and dates; it has provided a legacy for the embroiderers; both
highlighting their skills and what things they valued enough to include within the sampler.
   In contrast to the time commitment required for embroidery, Twitter is an online format
that allows users to quickly share their thoughts to a global audience, in a matter of seconds or
minutes. In turn, a tweet can then easily be lost in a sea of other tweets within a very short time
frame. Because of this, while Twitter has become a powerful cultural force, the vast majority
of tweets, when taken on their own, are not considered particularly valuable by the author or
audience.
   Embroidered Ephemera is a casual creator that was designed to explore this juxtaposition
between cross-stitch embroidery samplers and tweets. What could it look like to have tweets
highlighted as something valuable and special like embroidery? In what ways could we capture
some qualities of the tweet author and what is valuable to them?
   To answer these questions, we created an online tool that generates sampler designs based
on data derived from tweets from a user-specified person or hashtag. The system combines
these tweets with generated motifs and crowd-sourced color palettes to constrain the possibility
space of the generator which will then generate an embroidery sampler design (see Figure 1,
right).
   In the rest of this paper we will discuss the Embroidered Ephemera system in more detail
and describe the design decisions that were made while creating it. We will also describe what
we learned by creating this system and discuss possible future work.


2. Related Work
Embroidered Ephemera draws inspiration from many fields including craft research, graphic
design, steganography, craftivism, casual creators, qualitative data visualization, feminist data
visualization, and data physicalization.
   In particular, the rich history of encoding data in textile crafts was a strong influence for this
project. Textiles have a long history of use in steganography, most famously fictionalized by
Charles Dickens in Tale of Two Cities, with the character Madame Defarge, who knit names of
nobles to be sent next to the guillotine [2]. This was not just fantasy; for instance, there are
records of women knitting information about train schedules during wartime in 1914 Belgium
[3].
   More recently, textile creators have been exploring aspects of creative and personal expression
with the use of covertly embedding data into textile creation. For example, Spyn [4] uses
infrared ink to allow the knitted artifact to be “read” through computer vision to recall what
was happening to the knitter during the time they were knitting that area. Similarly, Haring’s
“Subtle Distress” [5] uses purl and knit stitches to represent Morse code within the stitches of a
sweater.
   Most steganography looks to camouflage data while still being decipherable, however with
Embroidered Ephemera we are not aiming to hide information, but rather display the data
beyond a purely informational way. This aligns with the ideals of feminist data visualization,
particularly embracing affective experiences of the viewer [6].
   This is often seen with textile-based data physicalization, which gives people a chance to
interact with the data in a tangible and more affective way [7]. For instance, many crafters have
knit, crocheted, or sewed blankets or quilts that represent local or historical temperature data
[8]. The final artifacts may encode data, but they can also be appreciated as both a useful object
and as an aesthetic object. Most of these projects use quantitative data, but for Embroidered
Ephemera, we were interested in using qualitative data for designs that could be used for data
physicalization.
   It was also important to us that the tool was light-weight, easy to use, and playful. For this,
casual creators provided the inspiration as a “pleasurable exploration of possibility space” [9].
Smaller casual creators such as the magic shop generator and other RPG-based tools by donjon
[10] with very limited controls encourages more exploration. However, unlike these simpler
generators, we wanted Embroidered Ephemera to use data to shape the generative space of the
system.
3. System Details
Embroidered Ephemera is a web-based tool that generates embroidery sampler designs that
can be used to create a physically embroidered piece. The system is written in JavaScript using
jQuery and JSON and is available open source at https://github.com/anneandkita/twittersampler.
   The focus of this project is to generate designs that are informed by the traditions of em-
broidery samplers but use modern-day data as inputs for the generator. The goal was to create
samplers that could aesthetically blend with traditional embroidered samplers. Therefore, aes-
thetics is a large consideration for the output of the system. The output is not meant to be
clearly interpreted data visualization; instead the design is aesthetically influenced by the data
to create an affective experience.
   As embroidered samplers have evolved over centuries, there have been a number of sampler
styles. For this project, we chose to use the “band sampler” style, which is made up of rows of
horizontally repeating motifs, and of-ten include text. Band samplers became popular when the
price of fabric made it difficult to get large pieces of cloth, as the design can easily be modified
to fit different aspect ratios and sizes. This was chosen as a well-suited style to represent the
limited character size of tweets, which often causes users to similarly modify their words to fit
within the limit.
   As mentioned earlier in the paper, embroidery encompasses a large number of stitch types.
For this project, we chose to limit it to cross-stitch, which is a stitch type that is made by creating
an X-shaped stitch over the weave of the cloth. Cross-stitch is used regularly in samplers as it
can be used for many different types of designs, and each X can be viewed as a pixel, which is
well-suited for our computational needs.


4. Output Generation
To start using Embroidered Ephemera, an interactor enters a Twitter username or hashtag onto
the site. From there, the system uses the Twitter API to search the last 30 days of tweets from
that user or hashtag [11]. The tweets are ranked based on engagement – calculated based on
the number of comments, re-tweets, and likes – with higher engagement giving the tweet a
higher rank. Using the ranks as weights, the system makes a weighted random choice, with
higher engagement tweets more likely to be chosen.
   The text of the tweet is converted into a cross-stitch font using a custom-designed library
of font glyph patterns. Once converted to a cross-stitch pattern, the height of the text block
is calculated and placed using the rule of thirds as a rough guideline [12]. The text block is
always placed higher than the bottom one third of the design, to give the overall composition
a balanced feel [12]. Additionally, the name of the twitter user and the date are converted to
the cross-stitch font and placed within the design to mimic the dated signature of the sampler
creator. The block containing the user and date is placed at approximately the top or bottom
one-third line, opposite of the text block to create balance and symmetry.
   From here, the system generates the rest of the design around the text blocks. Since Embroi-
dered Ephemera generates band samplers, the system generates one band (or row) at a time. A
row can either be a repeating graphic that is tiled horizontally, or a set of words that is taken
from a list of most-used words.

4.1. Most Used Words
The sampler design has the possibility of including one or more bands made up of words. This
is inspired by traditional samplers which would sometimes include important concepts to the
creator such as home, love, or family. To find what words were important to the twitter user
or hashtag specified, the system uses the 30 days of tweets returned initially by the Twitter
API to find the top 10 most used words, modified by uniqueness. To accomplish this, we use
term frequency–inverse document frequency (TF-IDF) analysis. The analysis calculates term
frequency using the 30 days of tweets and makes use of a corpus of tens of thousands of tweets
(primarily written in English) as the document for the inverse document frequency. This means
that words such as “the”, “and”, “you”, etc. will be less likely to appear.
   The list of top 10 unique words can then be used within the design of the sampler. Once
a word has been used in a band in the sampler design, it cannot be used again within that
same design. Once all the words have been used in a design, no more bands with words can be
created. Bands with multiple words use the ‘+’ character between words to help keep the word
lists from appearing to be part of the original tweet.

4.2. Graphics
Once a tweet has been selected, the system uses Google’s sentiment analysis API [13] which
calculates a numerical “positivity” rating and a magnitude (or confidence in that rating) based
on the chosen tweet. From these two numbers, the system calculates an “activation” rating
with lower positivity scores having lower activation. The activation rating is put into one of
three categories: low, medium, or high, with lower numbers being put into the low category,
etc. The magnitude returned by the sentiment analysis API is used as a modifier such that lower
magnitudes (lower confidence) trend towards medium activation. The activation rating is then
used as a constraint for generating graphics and colors within the sampler design.
   For the graphics, we created a library of 100 horizontally tiling cross-stitch designs. These
designs are either created specifically for this project or are sourced from open source and public
domain patterns, and feature both traditional and modern motifs. All designs were converted
for our library to work with our system.
   Each design in the library is hand-tagged with an activation rating based on the design
principle of movement, which relates to the way the viewers eye travels through a piece of art
[12]. Therefore, motifs with squares, grid-like qualities, or primarily horizontal or vertical lines
are rated low activation, organic motifs or curvy, flowing designs are rated medium activation,
and angled or jagged motifs are rated high activation (see Figure 2).
   To choose a row, the system uses the activation rating of the tweet and chooses randomly from
the designs that have the same activation rating, with a small chance of choosing a motif that
has an activation rating one step away from the tweet’s activation rating (e.g. a low activation
tweet has a small chance to include a medium activation motif, but would never have a high
activation motif.)
Figure 2: The tweet on the left was designated as a “medium” activation tweet, therefore the repeated
motifs include a lot of organic and curved lines. The tweet on the right on the other hand was categorized
as ”low” activation, and therefore uses squares, grid pat-terns, and horizontal or vertical lines in the
design. While the colors are similar, the difference in motifs make the two pieces noticeably different.


4.3. Colors Used
In every design, black is used as the primary color, which is used for all text and is included as a
color in almost every graphic motif. However, motifs can have up to two other colors, so other
colors need to be generated for each sampler design.
   To choose colors, the system again uses the activation rating of the tweet. Here, we use
US-specific culture-based views of colors to assign a range of colors to each activation level.
For instance, blue is considered a calm color in the US, so it is associated with low activation.
For our system we use the following list:
    • Violet or blue – low activation

    • Green or yellow – medium activation

    • Orange or red – high activation
  One of the two colors is chosen randomly from the associated activation rating and is used in
combination with the COLOURlovers API [14]. The system requests a random palette from the
top 100 rated palettes on COLOURlovers which includes the specified color. From the palette
that is returned, the perceived brightness is calculated for each color. Any color that has a
perceived brightness too close to the default background color is removed from the list to ensure
the motifs will show up against the background.
   Two colors are chosen from the remaining colors in the palette, and black is used if there are
not enough colors remaining. As the color palettes are rarely monochromatic (for instance a
palette with green in it could also include pink and blue), and the colors are chosen randomly,
the initial seed color does not always appear in the final design. We chose to allow this to
increase the generative space for the colors, as always using a specific color led to an overly
limited aesthetics.


5. Discussion and Future Work
As detailed earlier, Embroidered Ephemera was designed to explore the contrast between the
speed and ephemeral nature of Twitter versus the meditative and lengthy practice of embroidery.
There is an immediacy to posting to Twitter that in many ways is similar to that of a generator
such as Embroidered Ephemera. This speed of creation often leads to a very “noisy” space, and
it is difficult for something to stand out.
    Towards this point, the average typing speed on a phone keyboard is 25 wpm. This means
that someone can write and send a full-length tweet in a little less than 2 minutes, and this can
be done even faster when using a keyboard or sending a shorter tweet. Embroidered Ephemera
can create a new sampler design in approximately 4-5 seconds. This means that it would take
6-8 minutes to create 100 designs.
    In contrast, embroidery is a much slower process. Even with cross-stitch which does not
have the cognitive load of switching stitch types, it is a labor-intensive process. To embroider
one of the generated samplers from Embroidered Ephemera on 18-count Aida cloth (a type
of woven cloth that has 18 holes per inch), it takes approximately 1 hour to stitch one line of
one color. It is important to note that this refers to one line of “pixels” or stitches, not one full
design band (see Figure 3). At this rate, based on the size of the sampler designs, it would take
roughly 300 hours to complete. Therefore, in the time it takes to create one sampler, someone
could generate 28,000 designs, or send 9,000 full length tweets from their phone.
    With such a large time investment, the user experience of the system becomes particularly
important.

5.1. User Experience
While Embroidered Ephemera is in early stages and therefore has not had extensive user testing,
we have gathered informal feedback as well as noted our personal experiences while using the
system.
   There are two stages of the experience with the system. The first is using the generator, while
the second is creating the embroidery from the generated design. While a number of users have
interacted with the generative system, only the author has taken the next step in working on
an embroidery of a generated design.
   This mismatch in engagement likely speaks to the unequal human labor requirements between
generating a design and hand embroidering that design. The speed at which the user can generate
new designs, and the level of input the user has with the system fits the diversionary aspect of
a casual creator. However, while embroidery and other craft-based hobbies are also sometimes
seen as diversions (due to the devaluation of reproductive labor; crafting is not economically
viable, therefore it must not be a serious undertaking [15]), the time investment and depth of
the interaction show a level of commitment to the process that is not found in casual creators.
   The difference in commitment also touches on another challenge in generative design; that of
perceived value of the output of the system by the user. Even when there is initial perceptible
uniqueness [16] in the output, as the user explores the generative space, that uniqueness
diminishes the more designs are generated. This is even more pronounced with a shallow
generative space such as is found in Embroidered Ephemera.
   Beyond this, in textile crafting (and likely most crafting), creative expression is one of the core
values of practitioners [17]. However, with the current version of Embroidered Ephemera, the
generative system is doing all of the design work: choosing a tweet, choosing borders, choosing
colors, choosing a layout. This leaves the user with very limited opportunities for expressing
their own creativity in the design generation beyond choosing a user or hash tag and hitting
the re-roll button.
   Our experience creating the embroidered artifact aligns with these observations. Because all
of the creative decisions have already been made, the act of creating the embroidery becomes
machine-like and thoughtless. And while the opposing values of tweets and embroidered
samplers leads to some playful interactions and design, this is shallow enough that it is unlikely
to carry the embroiderer through the hundreds of hours required to finish the physical artifact.
In its current incarnation, the user’s creative expression only exists where the embroiderer
chooses not to follow the design.
   This could be addressed by allowing the user to retain more creative control during the design
process and/or to encourage moments of user creativity during creation of the artifact. To work
within the system as it exists now, creative expression exists primarily at the curation level;
what Twitter users, tweets, and designs are chosen to be embroidered.
   The curation of tweets as creative expression is what we relied on in our experience of
choosing tweets to embroider. In the example seen in Figure 3, a selection of tweets by political
figures were chosen to highlight the differences in tone and content and how this might affect
the final design. The tweet shown was chosen in part because it characterizes the general tone
of the author’s tweets and highlights the contrast between this style of delivery versus that
of Michelle Obama as seen in Figure 2. It was also chosen because it brings into focus the
conflict of politics and twitter; twitter is designed for short form and quick discourse but is
being used by the current US administration for political statements. By committing a quick
and reactionary tweet to something that as long-lasting as embroidery, it highlights the power
and control those words have, even if they are not intended or used that way.

5.2. Privacy
An issue we had not considered when starting this project is that of data privacy. While it is
possible to search any username on twitter and see their tweets if their privacy settings are
set up to do so, our casual creator is still bringing focus to tweets that may not be something
that the original author may want. Because the Twitter API and our tool do not ask for user
permission, it raises ethical concerns.
  In particular, a design can be saved and shared which contains a tweet which the original
author may have deleted or may not have wanted to give that type of attention to. Given the
Figure 3: Clockwise from top left: The original tweet. The design created by Embroidered Ephemera
based on the original tweet. The be-ginning of a physical embroidered piece based on the design.


length of time that embroidery can survive, if a design is embroidered into a physical artifact,
this means that the tweet may outlive the author themselves.
   While this is beyond the scope of this paper and an area of active research, we do not have
a current solution to this beyond asking people to be considerate when using the tool, or not
allowing other people to use it at all. In the future if and when the Twitter API is extended, we
would like to change the tool such that people can only look up their own username or those of
public figures.

5.3. US-Centric
Finally, a large constraint on our system is that it is no better than a random generator when
using tweets that are not in English or when looking beyond color-based cultural references
from the US. In the future, this could be addressed by using either the location of the tweet
author or language of the tweet to modify how the data is interpreted. This would also rely on
the sentiment analysis API having support for different languages beyond English.

5.4. Future Work
In many ways this is a data visualization project, however the end goal was never to create an
artifact that affords the data to be easily interpreted such as is expected from traditional data
visualization. Instead, the data is analyzed and used to shape the aesthetics of the piece, but it
does not define them entirely. We made this choice in part due to the fact that we are using
qualitative rather than quantitative data.
   The system currently quantizes the sentiment analysis results by assigning them to the three
activation levels. This leads to a lack of nuance within and between the designs, although that
lack of nuance may help with perceptible uniqueness and differentiation between categories of
designs.
   We would like to further develop and incorporate the activation concept by using continuous
values and refining the way we categorize colors and graphical motifs. It is also possible to
incorporate this concept into more of the design by looking at heights of motifs, different types
of samplers beyond the band sampler, and incorporating colors in the text and background.
The system would also be improved with additional motifs and fonts in the library. All of these
changes would increase the generative space and allow the user to explore more areas of the
possibility space through the generator.
   Finally, the user’s exploration of that space is currently done through a very limited user
interface. While this does invite exploration, the interaction is very shallow, and limits the
user’s creative expression as mentioned above. Adding the ability to modify meaningful pieces
of the design would provide a better balance between control and ease of use. For instance,
being able to lock or re-generate a specific tweet, color palette, or row design would give the
user ways to further influence the design while still allowing room for exploration.


6. Conclusion
In this paper we have presented Embroidered Ephemera, a casual creator which designs
traditional-inspired embroidery samplers from Twitter data. Text analysis is used to shape
the aesthetics of the design, by constraining the options of motifs and colors available to the
generator. Each sampler design is therefore aesthetically influenced by the data but is not
defined by it.


References
 [1] M. B. Huish, Samplers & Tapestry Embroideries, London, New York [etc.] Longmans,
     Green and co., 1913. URL: http://archive.org/details/samplerstapestry00huisrich.
 [2] C. Dickens, A Tale of Two Cities, Gawthorn, 1899.
 [3] J. Witkowski, Knit for Defense, Purl to Control, InVisible Culture (2015). URL: https:
     //ivc.lib.rochester.edu/knit-for-defense-purl-to-control/#fn-3529-26.
 [4] D. K. Rosner, Spyn: Weaving Stories into Handcrafted Artifacts (2008) 19.
 [5] Yidan, Hiding in the Light, 2019. URL: http://textileartscenter.com/blog/
     hiding-in-the-light/.
 [6] C. D’Ignazio, L. F. Klein, Feminist Data Visualization, in: Workshop on Visualization for
     the Digital Humanities (VIS4DH), Baltimore. IEEE, 2016.
 [7] Y. Jansen, P. Dragicevic, P. Isenberg, J. Alexander, A. Karnik, J. Kildal, S. Subramanian,
     K. Hornbæk, Opportunities and Challenges for Data Physicalization, in: Proceedings of
     the 33rd Annual ACM Conference on Human Factors in Computing Systems, CHI ’15,
     Association for Computing Machinery, Seoul, Republic of Korea, 2015, pp. 3227–3236. URL:
     https://doi.org/10.1145/2702123.2702180. doi:1 0 . 1 1 4 5 / 2 7 0 2 1 2 3 . 2 7 0 2 1 8 0 .
 [8] R. Onion, The Quilters and Knitters Who Are Mapping Climate Change, 2020. URL:
     https://slate.com/technology/2020/02/quilts-knitting-cross-stitch-climate-change.html.
 [9] K. Compton, M. Mateas, Casual Creators, in: Proceedings of the Sixth Interna-
     tional Conference on Computational Creativity, 2015, p. 8. URL: http://axon.cs.byu.edu/
     ICCC2015proceedings/10.2Compton.pdf.
[10] donjon; RPG Tools, n.d.. URL: https://donjon.bin.sh/.
[11] Twitter Developer Docs, n.d.. URL: https://developer.twitter.com/en/docs.
[12] B. Peterson, Learning to See Creatively: Design, Color, and Composition in Photography,
     Amphoto Books, 2015.
[13] Analyzing Sentiment | Cloud Natural Language API, n.d.. URL: https://cloud.google.com/
     natural-language/docs/analyzing-sentiment.
[14] COLOURlovers API Documentation, n.d.. URL: https://www.colourlovers.com/api.
[15] C. Hughes, Gender, Craft Labour and the Creative Sector, International journal of cultural
     policy 18 (2012) 439–454.
[16] K. Compton, Casual Creators: Defining a Genre of Autotelic Creativity Support Systems,
     Ph.D. thesis, University of California, Santa Cruz, 2019.
[17] A. Sullivan, A. Salter, G. Smith, Games Crafters Play, in: Proceedings of the 13th Interna-
     tional Conference on the Foundations of Digital Games, 2018, pp. 1–9.