<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Analyzing literary texts in Lithuanian Sign Language with Computer Vision: a proof of concept</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Vadim Kimmelman</string-name>
          <email>vadim.kimmelman@uib.no</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Anželika Teresė</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Bergen</institution>
          ,
          <addr-line>Sydnesplassen 7, Bergen 5007</addr-line>
          ,
          <country country="NO">Norway</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Vilnius University</institution>
          ,
          <addr-line>St University 3, Vilnius</addr-line>
          ,
          <country country="LT">Lithuania</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <abstract>
        <p>Sign languages are natural languages existing in the visual-gestural modality used by the deaf communities. Previous research found that literary texts in sign languages use a number of features that distinguish them from non-literary texts, including phonetic features such as the hand movement amplitude, symmetry and enhanced use of nonmanual markers (facial expressions and body movements). However, this type of research has been largely based on manual annotation and observation. We use Computer Vision (MediaPipe Holistic) to automatically extract measurements from video recordings in order to quantify such phonetic features. The data set comprises of 2D video recordings of five short literary pieces in Lithuanian Sign Language that were recorded by their authors (originals) and then retold by the same authors in a non-literary form (as “prose” retellings). We extract landmark coordinates from the videos and measure the following features: variation in hand movement amplitude; symmetry as a comparative measure of activity of the two hands; sideward body leans; eyebrow movement. We discuss the steps necessary to use the measurements, namely filtering and normalization by body size. We compare the original literary pieces with their retelling according to these features. We find that the original literary pieces clearly have more/larger hand movements, larger sideward body leans, and more use of eyebrow movement. However, we do not find clear symmetry diferences between the originals and the retellings. The study is a proof of concept for the use of Computer Vision in phonetic analysis of sign languages, also in the context of literary analysis.</p>
      </abstract>
      <kwd-group>
        <kwd>sign language</kwd>
        <kwd>literary texts</kwd>
        <kwd>phonetic analysis</kwd>
        <kwd>Computer Vision</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Sign languages are natural languages existing in the visual-gestural modality used by the deaf
communities. Being natural languages, they are used in all possible domains where human
languages can be used, including for creation of literature [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. While analysis of sign language
literature has been a domain of inquiry for many years, in this paper we explore how Computer
Vision tools can be applied to initiate a quantitative analysis of literary texts in sign languages.
      </p>
      <sec id="sec-1-1">
        <title>1.1. Sign language literature</title>
        <p>
          According to sign language literature researchers R. Sutton-Spence and M. Kaneko [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ], sign
language literature is a body of creative and artistic sign language works (such as poems, stories
and jokes) composed and performed by deaf people. These are works of diferent form and
content conveyed by signs and non-manual elements [
          <xref ref-type="bibr" rid="ref2 ref3">2, 3</xref>
          ], in which rhyme (in the case of
poetry), rhythm, metaphors, etc. are created by using diferent means of expression from those
used in spoken languages [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. The analysis of sign language literature (starting from American,
British, and South African Sign Language literature) was first undertaken around 1993 by C.
Valli [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]. Sign language literature has been mainly analysed by R. Sutton-Spence [
          <xref ref-type="bibr" rid="ref5 ref6">5, 6</xref>
          ] and M.
Kaneko [
          <xref ref-type="bibr" rid="ref1 ref7 ref8 ref9">7, 8, 1, 9</xref>
          ] in collaboration with other colleagues.
        </p>
        <p>
          The first studies in the field did not aim to compare oral/written literature and sign language
literature, but focused on the following questions instead: how do deaf artists convey rhyme
(deaf artists use the repetition of handshape, movement and space) or metaphors and other
means of expression; and what tools exist only in sign languages and cannot be found in
oral/written artistic texts [
          <xref ref-type="bibr" rid="ref1 ref8 ref9">1, 8, 9</xref>
          ]. Therefore, the aforementioned researchers analysed rhythm,
repetition (rhyme), symmetry (vertical, horizontal, and back-and-forth symmetry), metaphors,
neologisms, ambiguity, types of movement, nonmanual elements, such as eye gaze, eye aperture,
mouth actions, etc. In addition, in the early years of research in the field, sign language linguists
started a discussion about the concept of literature, because the traditional definition of literature
is primarily associated with oral/written forms, which leaves sign language literature outside
the definition. Although the analysis of artistic texts in sign languages is gaining momentum,
the questions of how artistic sign language pieces difer from non-artistic ones and what means
of expression are intrinsic to sign language literature alone and cannot be found in literature
produced in spoken language remain to be answered.
        </p>
      </sec>
      <sec id="sec-1-2">
        <title>1.2. Computer Vision for sign language linguistics</title>
        <p>
          In recent years, substantial advances in Computer Vision have happened, and reliable landmark
detection and tracking for human bodies in 2D videos has been achieved. A major advance of
this field was OpenPose [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ], which has been more recently followed by MediaPipe [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ].
        </p>
        <p>
          Since sign languages exist in the visual modality, and the primary data for sign languages is
represented in video recordings (or video streams), it is natural that CV has been extensively
applied to sign language research. More specifically, a large part of work in Automatic Sign
Language Recognition and Translation uses CV solutions as part of the pipeline [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ].
        </p>
        <p>
          However, a few researchers have also tested the applicability of CV solutions and applied
them to linguistic analysis of signed languages, see a brief overview in [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. Most importantly
for the current study, it has been shown that CV can be used to measure hand activity in general,
and specifically to determine hand position and symmetry [
          <xref ref-type="bibr" rid="ref13 ref14 ref15">13, 14, 15</xref>
          ]. In a series of papers,
applicability of CV to the analysis of nonmanual markers (specifically, eyebrow movements and
head tilts) has also been demonstrated [
          <xref ref-type="bibr" rid="ref16 ref17 ref18">16, 17, 18</xref>
          ].
        </p>
        <p>In the current paper, we combine insights from linguistic research on sign language literature
with the advances in using Computer Vision for sign language analysis. We provide a
proofof-concept study showing that it is possible to measure phonetic properties in recordings of
literary pieces using a Computer Vision tool, and compare literary and non-literary renditions
of the same texts based on these measurable properties.</p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>2. The dataset</title>
      <p>The current dataset contains five literary pieces in Lithuanian Sign Language and their
nonliterary (prose) retellings produces by the same signers. Figure 1 illustrates the same event
(drinking) being described in the original literary piece and in the retelling. One can see that
diferent signs are used, but also that the body posture and facial expressions are diferent.</p>
      <p>Lithuanian Sign Language (LSL) is the language of the deaf community in Lithiania. The study
into LSL began at the Surdology Centre in 1996, following the declaration by the Lithuanian
Government of LSL as the native language of the deaf a year before. The Centre’s team collected
LSL lexicon, studied LSL grammar, and embarked on LSL dictionaries (print versions). Currently,
the bulk of the work in Lithuania is devoted to the digital LSL dictionary. LSL literature research
began in 2018.</p>
      <p>The literary pieces analyzed in the current study are the following:</p>
      <sec id="sec-2-1">
        <title>1. “Black Orchid” (Juodoji orchidėja) by Paulius Jurjonas</title>
        <p>2. “Sign Language” (Gestų kalba)1 by Nijolė Karmazienė
3. “Time is running” (Bėgantis laikas) by Nijolė Karmazienė
4. “Being Yourself” (Būti savimi) by Karolis Pilipavičius
5. ”Sign Language” (Gestų kalba) by Karolis Pilipavičius</p>
      </sec>
      <sec id="sec-2-2">
        <title>See Appendix for the brief summaries of the content of the pieces.</title>
        <p>All the video recordings were re-scaled to 853x480 resolution.
1In the graphs it is represented by the older title Kurčias žmogus ”Deaf person”.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Data extraction and processing</title>
      <p>
        In order to extract landmark coordinates, we used MediaPipe Holistic [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ],2 which combines
pose, hand, and face components. We used a Python implementation of MediaPipe Holistic.
The output is .csv files for each frame of the video recording, with files produced separately for
the pose, hands and face components. In this research, we only used the pose component to
extract coordinates of the wrists (landmarks 15 and 16) and the shoulders (landmarks 11 and 12)
and of the body center (0), and the face components to extract coordinates of one landmark for
each eyebrow (landmarks 105 and 334) and the coordinate for the top of the nose (landmark 8).
      </p>
      <p>
        The individual .csv files from the pose component and from the face component were
combined into single files (with only the relevant landmarks) using the tidyverse package [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] in R
version 4.2.2 (2022-10-31) [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] run in R Studio version 2022.12.0.353 [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]. The resulting data
ifles and the R script can be found here: https://osf.io/c9p7n/.
      </p>
      <p>The pose component outputs the following features: frame number, video height and width,
landmark number, the x, y (on the scale 0 to 1) and z coordinates, and visibility (on the scale 0
to 1). The face component outputs the same features except for visibility.</p>
      <p>Exploring the data, we discovered that, despite the fact that the hand coordinates should be
between 1 and 0 for both the x and y coordinates, the model actually outputs values beyond this
interval, especially so for the y-coordinate (between -0.6 and 1.2 in our data). This correlates
with visibility: visibility is somewhat low for out of bound coordinates (below 0.5). Whatever the
explanation, it is clear that MediaPipe outputs erroneous coordinates for the hand landmarks
sometimes, and they cannot be used. We therefore filter out all the measurements that go
outside of the frame boundaries, and also all the measurements with visibility below 0.5. We
also discovered that shoulder coordinates are not always detected clearly, with extreme outlier
measurements infrequently occurring in the absence of any body movements in some videos.
Therefore, to filter out such cases, we remove any measurements that are not within +-3SD
from the mean per landmark per video.</p>
      <p>The original pose dataset contains 101175 lines (20235 frames * 5 relevant body landmarks),
and the face dataset contains 57321 lines (19107 frames * 3 relevant face landmarks).3 After
ifltering, the pose dataset contains 98474 lines, so we remove 2.7% of the data, and the face
dataset does not change as it does not contain extreme outliers. The small reduction in the size
of the pose dataset should not be considered a comment on the quality of MediaPipe output. In
fact, most of the data we filter out are simply frames where the hands are above or below the
frame boundaries, and so the measurements simply cannot be made.</p>
      <p>
        The next step that we need to take is normalization. Because the coordinates depend on the
size of the body of the person in the video recording, as well as on the distance between the
person and the camera, in order to compare across diferent videos, normalization is necessary.
We use the procedure and the code recently proposed by Börstell [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. Specifically, the hand
coordinates are adjusted to be centered around the the midpoint between the shoulders (so
the coordinates now indicate the distance from this point to the wrists), and scaled by the
mean distance between the shoulders in order to normalize for body size and distance to the
2https://github.com/google/mediapipe/blob/master/docs/solutions/holistic.md
3The diferences in the number of frames is due to the fact that in a small number of frames the hands are detected,
but the face is not.
camera. We also apply the same procedure to the eyebrow coordinates, where they are adjusted
to the top of the nose coordinate and also scaled by the mean distance between the eyebrow
coordinates. This also means that the transformed coordinate values are no longer between 0
and 1, but are centered around 0.
      </p>
    </sec>
    <sec id="sec-4">
      <title>4. Analysis</title>
      <p>As discussed in the previous sections, we decided to focus on some phonetic aspects of sign
language production that are potentially linked to literary texts, and can be measured using
Computer Vision tools at our disposal. Therefore, we conducted measurements of general hand
activity, of symmetry, of body leans, and of eyebrow movements. For each of the measures, we
compare the individual original pieces and their retellings.</p>
      <sec id="sec-4-1">
        <title>4.1. Hand activity</title>
        <p>
          To measure hand activity, we explore separately the x- and y-coordinates of the two hands. Note
that the z-coordinates in the MediaPipe pose output (unlike the ones in the face component) have
not been tested or calibrated by the creators, so they are not useful for phonetic analysis. This
means that we are not measuring actual hand position in the 3D space, but only its projection
to the 2D plane of the video recording. While this is a substantial drawback in general because
all information along the z-scale is lost, we think that it is acceptable for the purposes of this
paper. First, the signers in the videos all stand facing the camera in each video and so the
loss of information at least is expected to be similar across the videos.4 Second, because we
compare individual recordings to each other, the x- and y-coordinates already provide very
large numbers of data points that can be analyzed. Still, clearly, if it happens that literary pieces
and prose retellings are diferent in how far the hands move along the z-axis specifically, we
would not be able to detect this diference using the current method (see also [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ]).
        </p>
        <p>In Figure 2, the x-coordinates (normalized and centered) are plotted for each piece and its
retelling as boxplots. We can see that, for each piece, the original has a wider distribution for
the coordinates. This means that the hands move further to the right and to the left in the
original than in the retelling. In other words, the horizontal amplitude of the hand movements
is larger in the original.</p>
        <p>Because the diferences are not in the mean position of the hand, but in the distribution, in
Figure 3 we plot the Standard Deviations of the x-coordinates for each piece for the two hands.
Here we can also clearly see that the SDs are consistently larger in the original than in the
retelling. The only case where there is almost no diference is left hand for the last piece. The
left hand does not in fact move much in either the original or the retelling.</p>
        <p>We have tested whether the diference in SDs between the original and the retelling is
significant using Levene’s test for equality of variances, and found that it is significant overall
and for each individual piece for both hands, except for the single case described above. Note,
however, that significance testing is not very meaningful because the datasets contain very
4Of course, the signers use body and head turns for linguistic and literary purposes, so they are not always facing
the camera directly.
many points, so even very small diferences easily reach significance. We suggest that graphical
exploration is more meaningful here.</p>
        <p>In Figure 4, the y-coordinates (normalized and centered) are plotted for each piece and its
retelling. We can again see that, for each piece, the original has a wider distribution for the
coordinates. This means that the hands move further up and down in the original than in the
retelling. In other words, the vertical amplitude of the hand movements is larger in the original.</p>
        <p>Figure 5 plots the SDs of the y-coordinates across the pieces. Again, we can clearly see that
in each case the SD is higher in the original than in the retelling, even though in some pieces
the diferences are pretty small (albeit statistically significant).</p>
        <p>To sum up, it is very clear that, in our dataset, the hands move across a larger space in the
originals, both horizontally and vertically. In the retellings, the hand coordinates show less
variation and smaller extreme values.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Symmetry</title>
        <p>
          As we discussed above, symmetry can be used in literary texts in sign languages as a literary
device. In order to quantify symmetry, we follow the insights (and the specific measure)
proposed by Börstell [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. Börstell’s purpose was to classify signs according to handedness
based on relative activity of the two hands. The measure of relative activity was calculated in
the following way: we calculate the Euclidean distance between the coordinates for each wrist
between consecutive frames. The absolute diference between the distance travelled by the two
wrists is the measure that is used to analyze symmetry. The larger the diference, the more
asymmetric the movement is (larger values mean that one of the hands is moving considerably
more that the other). Börstell has shown that this measure can be used very efectively to
classify isolated signs into one-handed, two-handed symmetric and two-handed asymmetric.
        </p>
        <p>For our purposes, we calculate this measure for each frame, and compare the averages and
distributions between the original and retelling of each piece. The results (mean symmetry
measures) are represented on Figure 6.</p>
        <p>We can see that symmetry is not used consistently to contrast originals and retellings. In one
piece, the symmetry measure is much higher in the original, which means that there is much
less symmetry in the original than in the retelling. Of the other four pieces, two have higher
symmetry and two have lower symmetry in the originals, but the diferences are small.</p>
        <p>The results thus indicate that symmetry is not used as a literary device in the pieces that
we analyzed. This agrees with our qualitative of the data: we were not able to see marked
symmetry used for literary purposes in these pieces.</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Body leans</title>
        <p>Not only the hands, but also the body can move more in literary texts (recall also Figure 1).
We decided to specifically measure sideward body leans by looking at the distributions of the
x-coordinates (normalized and centered) of the two shoulders. As with the hand coordinates,
the caveat is that we do not have measurements along the z-scale. In this case we also do
not consider the y-coordinates because we conceptualize body leans as primarily sideward
movement of the shoulders. While this is reasonable for our dataset where the signers face the
camera, it is clearly not applicable to videos filmed at a diferent angle, and so other measures
should be used.</p>
        <p>The resulting distributions of the x-coordinates of the shoulders can be seen in Figure 7. It is
clear that, in four of the five pieces, both the left and the right shoulder move to much more
extreme points in the original as compared to the retelling. In one piece (the last piece on the
Figure), the diference is small, and it is in the opposite directions: the shoulders move more in
the retelling. This agrees with our observations of that recording: the signer moves their body
more in the retelling than in the original, where the body is very still. Figure 8 represents the
same data by comparing SDs of the horizontal shoudler coordinates between the original and
the retelling.</p>
        <p>To sum up, in four of the five pieces we indeed find a much more pronounced use of sideward
body leans, as represented by the more extreme and varied coordinates of the two shoulders.
However, even in our small dataset, in one of the literary pieces the signer barely moves their
body, so large body movements are not a universal feature of literary texts.</p>
      </sec>
      <sec id="sec-4-4">
        <title>4.4. Eyebrow movements</title>
        <p>We also hypothesize that more pronounced facial expressions can be used as literary devices.
Because facial expressions are complex, for this pilot study we only focus on eyebrow movement.
We operationalize eyebrow position as a measure of Euclidean distance between an eyebrow
landmark and the landmark on the top of the nose. This time we use the z-coordinates as well
because they are a stable output of the face component of MediaPipe Holistic.</p>
        <p>
          The distance between an eyebrow point and the top of the nose is a rough measure because,
as has been shown in much previous research also for signed languages [
          <xref ref-type="bibr" rid="ref16 ref17">16, 17</xref>
          ], the inner and
outer parts of the eyebrows move partially independently. For the purposes of this pilot study,
we use this rough measure, but in future, more detailed measurements can be conducted.
        </p>
        <p>
          Looking at the results, we can see in Figure 9 that, for the most part, the mean eyebrow
distance is larger in the original, and also that the variation in the original is clearly larger. The
diferences are less pronounced in the left eyebrow compared to the right eyebrow, which is not
surprising as eyebrows often move asymmetrically [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ].
        </p>
        <p>As for the hands and body movements, we think that the relevant feature to focus on is
variation and not the mean positions because variation reflects how much the eyebrows move,
not where they are on average. We can see in Figure 10 that, in all the pieces, SDs of the
eyebrow distance measures are larger in the original compared to the retelling. In one piece,
the diferences are tiny, but still in the same direction.</p>
        <p>To sum up, we find evidence that, in our dataset, facial expressions as manifested by eyebrow
movements are used more actively in the original literary pieces. However, future work should
include more detailed measures of the eyebrow position, and also of other facial articulators.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Summary and discussion</title>
      <p>
        In this paper, we presented a proof of concept for the use of Computer Vision (specifically,
MediaPipe Holistic) for the analysis of literary texts in a signed language. We have demonstrated
that some phonetic features that are used in literary devices can be measured using CV outputs,
and these measurements can be meaningfully compared across genres. Specifically, using a
dataset of five literary pieces in Lithuanian Sign Language and their prose retellings by the
same authors, we found that the originals contain more hand activity (larger and more varied
hand movements), larger and more varied lateral movements of the body (as measured by the
shoulder position), and more eyebrow movement. At the same time, we did not find clear
diferences in symmetry between the originals and the retellings, even though symmetry can
sometimes be used as a literary device [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>
        There is a number of limitations to the use of Computer Vision for analysis of sign language
data. First, MediaPipe Holistic and other CV solutions have errors in their outputs (see also [
        <xref ref-type="bibr" rid="ref17 ref18">17,
18</xref>
        ]). It is always necessary to check for clear outliers in the data, and to remove measurements
which are clearly erroneous, as well as cases of low visibility. However, if the measures are used
to explore longer videos, and not to conduct analysis on a frame-by-frame basis, these errors
are unlikely to cause any serious problems. On the level of specific videos, such measures as
average positions or SDs will be quite reliable due to the large numbers of the measurements.
      </p>
      <p>
        Second, at least for the pose component, MediPipe Holistic does not provide reliable 3D
reconstruction. Because sign languages employ the 3D space for many linguistic purposes
[
        <xref ref-type="bibr" rid="ref22 ref23 ref24">23, 24, 22</xref>
        ], this limits the analytic possibilities based on these tools. For the purposes of the
current study, this limitation does not play a big role, but in many other scenarios the lack of
depth information will be a crucial drawback.
      </p>
      <p>
        Third, the measures that we developed to compare the literary texts and their retellings
are quire rough measures, and there further improvements are clearly possible. One direction
for future improvement is segmenting the videos into smaller pieces, such as utterances or
individual signs. Segmentation into individual signs can potentially be done automatically
[
        <xref ref-type="bibr" rid="ref13 ref25">13, 25</xref>
        ]. It can be insightful to measure symmetry, as well as hand activity at the level of
individual signs, and to measure body leans at the level of utterances. This would enable more
subtle and detailed comparison of the use of these phonetic features for literary purposes.
      </p>
      <p>Despite these limitations, our study clearly shows that CV solutions such as MediaPipe can
clearly be used to measure and quantify certain phonetic features in sign languages, and these
features can be used further to answer a wide variety of research question in linguistic, and
even, as in this case, literary analysis.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>Funded by the European Union (ERC, NONMANUAL, project number 101039378). Views and
opinions expressed are however those of the author(s) only and do not necessarily reflect those
of the European Union or the European Research Council. Neither the European Union nor the
granting authority can be held responsible for them.</p>
      <p>The authors would like to thank Carl Börstell for a helpful discussion of the analysis.
A. Content summary of the literary pieces
1. “Black Orchid” by Paulius Jurjonas (2018). The poetry is about love. The piece is about a
man having a flashback upon seeing a motorcycle. In the flashback, the protagonist gives a ride
to a long-haired heart-breaker (femme fatal) woman with thick pufy lips. The flashback ends;
in the present, the heart of the protagonist is portrayed as black and dead.</p>
      <p>2. “Sign language” by Nijolė Karmazienė (2015). The poet tells the story of a powerful tree
that has grown from a small seed. The tree is swinging in the wind. In the distance, on the
top of a rock, there is a nest with a hawk watching the tree. The hawk is strong and powerful.
The bird flies down toward the tree, circles around it and finally alights on a branch. The hawk
admires the tree and distinguishes it from among all other trees. Finally, the bird returns to the
nest. The work shows that both the tree and the bird feel to have gained strength, bliss and joy
from the encounter. The poet employs metaphors: the tree stands for the deaf communities and
sign language. The hawk is a metaphor of highly qualified sign language interpreters.</p>
      <p>3. Nijolė Karmazienė in poem “Time is running” tells a story about a watch. At first the
watch works great, then it breaks. The poet metaphorically represents the eyelid of a human
eye, which opens, and tears are visible in it, which soon pour down and wash the watch‘s gears.
After that the watch is ticking again. And since the text is metaphorical, it talks about human
health through the prism of a watch, the poet talks about deteriorating well-being, illnesses,
and death. And vice versa, when the mechanism starts to work, the hands start to go (up) – the
poet talks about improving well-being, health, and life.</p>
      <p>4. Poet Karolis Pilipavičius in “Being Yourself” portrays a man tightly holding the folds of
his jacket against his chest. People go past the man one by one, touching him as if wishing to
see what is behind the jacket. The poet creates a metaphor: the jacket is a man’s identity, his
inner world. At a certain point, a passer-by touches the protagonist and his body is revealed.
Ashamed, the man wants to hide, but he decides to come out. A multi-coloured rainbow is seen
on his chest. This is also a metaphor. Of course, one of the interpretations is that the rainbow is
multi-coloured, so it is a sign of diversity and otherness. However, another interpretation is
that the rainbow is a symbol of the LGBT+ community. Ultimately, the male protagonist in the
work sets an example for others by encouraging them to come out and be open with the world,
themselves and other people. It is a sign of community, freedom and calmness.</p>
      <p>5. “Sign Language” by Karolis Pilipavičius. The poet tells the story about a deaf tourist who,
in a happy mood to get to know a new city, faces challenges: when he asks for directions in sign
language, he gets no help, when he asks what time it is, he only sees people scrambling. The
protagonist is confused by this, he starts to wonder why he is so diferent from other tourists?
Is it because he speaks sign language? It breaks his heart. Another person approaches the
protagonist and speaks with him in sign language. Soon, many sign language speakers gather
around the deaf tourist to help him. He feels happy, understood, accepted and safe.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M.</given-names>
            <surname>Kaneko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Sutton-Spence</surname>
          </string-name>
          ,
          <article-title>Metaphor in sign language</article-title>
          ,
          <source>in: The Routledge Handbook of Metaphor and Language</source>
          , Routledge,
          <year>2016</year>
          , pp.
          <fpage>281</fpage>
          -
          <lpage>298</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>C. L.</given-names>
            <surname>Valli</surname>
          </string-name>
          ,
          <article-title>Poetics of American sign language poetry</article-title>
          ,
          <source>The Union Institute</source>
          ,
          <year>1993</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>K.</given-names>
            <surname>Christie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. M.</given-names>
            <surname>Wilkins</surname>
          </string-name>
          ,
          <article-title>A Feast for the Eyes: ASL Literacy and ASL Literature</article-title>
          ,
          <source>Journal of Deaf Studies and Deaf Education</source>
          <volume>2</volume>
          (
          <year>1997</year>
          )
          <fpage>57</fpage>
          -
          <lpage>59</lpage>
          . doi:
          <article-title>1 0 . 1 0 9 3 / o x f o r d j o u r n a l s . d e a f e d</article-title>
          .
          <source>a 0 1</source>
          <volume>4 3 1 0 .</volume>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A.</given-names>
            <surname>Teresė</surname>
          </string-name>
          ,
          <source>Lithuanian Sign Language Poetry: Location as Mean of Expression of Metaphors, Taikomoji kalbotyra 17</source>
          (
          <year>2022</year>
          )
          <fpage>9</fpage>
          -
          <lpage>37</lpage>
          .
          <source>doi:1 0 . 1 5 3 8 8 / T a i k a l b o t . 2 0 2 2 . 1 7</source>
          . 1 .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>R.</given-names>
            <surname>Sutton-Spence</surname>
          </string-name>
          ,
          <article-title>Phonological ”Deviance” in British Sign Language Poetry</article-title>
          ,
          <source>Sign Language Studies</source>
          <volume>2</volume>
          (
          <year>2001</year>
          )
          <fpage>62</fpage>
          -
          <lpage>83</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>R.</given-names>
            <surname>Sutton-Spence</surname>
          </string-name>
          , R. Müller De Quadros,
          <article-title>Sign language poetry and Deaf identity</article-title>
          ,
          <source>Sign Language &amp; Linguistics</source>
          <volume>8</volume>
          (
          <year>2005</year>
          )
          <fpage>177</fpage>
          -
          <lpage>212</lpage>
          .
          <source>doi:1 0 . 1 0</source>
          <volume>7 5</volume>
          / s l l .
          <source>8 . 1 . 1 0</source>
          s u t .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>M.</given-names>
            <surname>Kaneko</surname>
          </string-name>
          ,
          <article-title>The poetics of Sign Language Haiku</article-title>
          .,
          <string-name>
            <surname>Doctoral</surname>
            <given-names>dissertation</given-names>
          </string-name>
          , University of Bristol,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Kaneko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Mesch</surname>
          </string-name>
          , Eye Gaze in Creative Sign Language,
          <source>Sign Language Studies</source>
          <volume>13</volume>
          (
          <year>2013</year>
          )
          <fpage>372</fpage>
          -
          <lpage>400</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>A.</given-names>
            <surname>Asmal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kaneko</surname>
          </string-name>
          , Visual Vernacular in South African Sign Language,
          <source>Sign Language Studies</source>
          <volume>20</volume>
          (
          <year>2020</year>
          )
          <fpage>491</fpage>
          -
          <lpage>517</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>Z.</given-names>
            <surname>Cao</surname>
          </string-name>
          , G. Hidalgo,
          <string-name>
            <given-names>T.</given-names>
            <surname>Simon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.-E.</given-names>
            <surname>Wei</surname>
          </string-name>
          ,
          <string-name>
            <surname>Y. Sheikh,</surname>
          </string-name>
          <article-title>OpenPose: realtime multi-person 2D pose estimation using Part Afinity Fields</article-title>
          , in: arXiv preprint arXiv:
          <year>1812</year>
          .08008,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>C.</given-names>
            <surname>Lugaresi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Tang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Nash</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>McClanahan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Uboweja</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hays</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.-L. Chang</surname>
            ,
            <given-names>M. G.</given-names>
          </string-name>
          <string-name>
            <surname>Yong</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>W.-T.</given-names>
          </string-name>
          <string-name>
            <surname>Chang</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          <string-name>
            <surname>Hua</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Georg</surname>
          </string-name>
          , M. Grundmann,
          <article-title>MediaPipe: A Framework for Building Perception Pipelines (</article-title>
          <year>2019</year>
          ). URL: https://arxiv.org/abs/
          <year>1906</year>
          .08172.
          <source>doi:1 0 . 4 8</source>
          <volume>5 5</volume>
          <fpage>0</fpage>
          <string-name>
            <surname>/ A R X I</surname>
          </string-name>
          <article-title>V . 1 9 0 6 . 0 8 1 7 2</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>I.</given-names>
            <surname>Adeyanju</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Bello</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Adegboye</surname>
          </string-name>
          ,
          <article-title>Machine learning methods for sign language recognition: A critical review and analysis</article-title>
          ,
          <source>Intelligent Systems with Applications</source>
          <volume>12</volume>
          (
          <year>2021</year>
          )
          <article-title>200056</article-title>
          .
          <source>doi:1 0 . 1 0 1 6 / j . i s w a . 2 0</source>
          <volume>2 1 . 2 0 0 0 5 6 .</volume>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>C.</given-names>
            <surname>Börstell</surname>
          </string-name>
          ,
          <article-title>Extracting Sign Language Articulation from Videos with MediaPipe</article-title>
          ,
          <source>in: Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)</source>
          ,
          <source>NEALT Proceedings Series, No. 52</source>
          , University of Tartu Library, Tórshavn, Faroe Islands,
          <year>2023</year>
          , pp.
          <fpage>169</fpage>
          -
          <lpage>178</lpage>
          . URL: https://aclanthology.org/
          <year>2023</year>
          .nodalida-
          <volume>1</volume>
          .
          <fpage>18</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>R.</given-names>
            <surname>Östling</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Börstell</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Courtaux</surname>
          </string-name>
          ,
          <article-title>Visual iconicity across sign languages: Large-scale automated video analysis of iconic articulators and locations, Frontiers in Psychology 9 (</article-title>
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>C.</given-names>
            <surname>Börstell</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Östling</surname>
          </string-name>
          ,
          <article-title>Iconic locations in Swedish Sign Language: Mapping form to meaning with lexical databases</article-title>
          , in: J.
          <string-name>
            <surname>Tiedemann</surname>
          </string-name>
          (Ed.),
          <source>Proceedings of the 21st Nordic Conference on Computational Linguistics (NODALIDA</source>
          <year>2017</year>
          ), Linköping University Electronic Press, Gothenburg,
          <year>2017</year>
          , pp.
          <fpage>221</fpage>
          -
          <lpage>225</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>V.</given-names>
            <surname>Kimmelman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Imashev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mukushev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sandygulova</surname>
          </string-name>
          ,
          <article-title>Eyebrow position in grammatical and emotional expressions in Kazakh-Russian Sign Language: A quantitative study</article-title>
          ,
          <source>PLOS ONE 15</source>
          (
          <year>2020</year>
          ).
          <article-title>doi:1 0 . 1 3 7 1 / j o u r n a l</article-title>
          .
          <source>p o n e . 0 2</source>
          <volume>3 3 7 3 1 .</volume>
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>A.</given-names>
            <surname>Kuznetsova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Imashev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mukushev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sandygulova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Kimmelman</surname>
          </string-name>
          ,
          <article-title>Using Computer Vision to Analyze Non-manual Marking of Questions in KRSL</article-title>
          ,
          <source>in: Proceedings of the 1st International Workshop on Automatic Translation for Signed and Spoken Languages (AT4SSL)</source>
          ,
          <article-title>Association for Machine Translation in the Americas</article-title>
          , Virtual,
          <year>2021</year>
          , pp.
          <fpage>49</fpage>
          -
          <lpage>59</lpage>
          . URL: https://aclanthology.org/
          <year>2021</year>
          .mtsummit-at4ssl.
          <fpage>6</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>A.</given-names>
            <surname>Kuznetsova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Imashev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mukushev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sandygulova</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Kimmelman</surname>
          </string-name>
          ,
          <article-title>Functional Data Analysis of Non-manual Marking of Questions in Kazakh-Russian Sign Language</article-title>
          , in: E. Efthimiou, S.-E. Fotinea,
          <string-name>
            <given-names>T.</given-names>
            <surname>Hanke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Hochgesang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Kristofersen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Mesch</surname>
          </string-name>
          , M. Schulder (Eds.),
          <source>Proceedings of the LREC2022 10th Workshop on the Representation and Processing of Sign Languages: Multilingual Sign Language Resources, European Language Resources Association (ELRA)</source>
          , Marseille, France,
          <year>2022</year>
          , pp.
          <fpage>124</fpage>
          -
          <lpage>131</lpage>
          . URL: https: //www.sign
          <article-title>-lang.uni-hamburg</article-title>
          .de/lrec/pub/22024.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>H.</given-names>
            <surname>Wickham</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Averick</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Bryan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. D.</given-names>
            <surname>McGowan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>François</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Grolemund</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hayes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Henry</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hester</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kuhn</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. L.</given-names>
            <surname>Pedersen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Miller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. M.</given-names>
            <surname>Bache</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Müller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ooms</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Robinson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. P.</given-names>
            <surname>Seidel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Spinu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Takahashi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Vaughan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Wilke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Woo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Yutani</surname>
          </string-name>
          ,
          <article-title>Welcome to the tidyverse</article-title>
          ,
          <source>Journal of Open Source Software</source>
          <volume>4</volume>
          (
          <year>2019</year>
          )
          <article-title>1686</article-title>
          .
          <source>doi:1 0 . 2 1 1 0 5 / j o s s . 0 1</source>
          <volume>6 8 6 .</volume>
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>R Core</given-names>
            <surname>Team</surname>
          </string-name>
          ,
          <string-name>
            <surname>R:</surname>
          </string-name>
          <article-title>A Language and Environment for Statistical Computing, R Foundation for Statistical Computing</article-title>
          , Vienna, Austria,
          <year>2022</year>
          . URL: https://www.R-project.
          <source>org/.</source>
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <surname>Posit</surname>
            <given-names>team</given-names>
          </string-name>
          , RStudio: Integrated Development Environment for
          <string-name>
            <given-names>R</given-names>
            ,
            <surname>Posit</surname>
          </string-name>
          <string-name>
            <surname>Software</surname>
          </string-name>
          ,
          <string-name>
            <surname>PBC</surname>
          </string-name>
          , Boston, MA,
          <year>2022</year>
          . URL: http://www.posit.co/.
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>M.</given-names>
            <surname>Flaherty</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sato</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kirby</surname>
          </string-name>
          ,
          <article-title>Documenting a Reduction in Signing Space in Nicaraguan Sign Language Using Depth and Motion Capture</article-title>
          ,
          <source>Cognitive Science 47</source>
          (
          <year>2023</year>
          )
          <article-title>e13277</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>P.</given-names>
            <surname>Perniss</surname>
          </string-name>
          ,
          <article-title>Use of sign space</article-title>
          , in: R. Pfau,
          <string-name>
            <given-names>M.</given-names>
            <surname>Steinbach</surname>
          </string-name>
          ,
          <string-name>
            <surname>B.</surname>
          </string-name>
          Woll (Eds.),
          <article-title>Sign language: An international handbook</article-title>
          , Mouton de Gruyter, Berlin,
          <year>2012</year>
          , pp.
          <fpage>412</fpage>
          -
          <lpage>431</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <surname>G. Barberà,</surname>
          </string-name>
          <article-title>The meaning of space in sign language: reference, specificity and structure in Catalan sign language discourse, number 4 in Sign languages and deaf communities</article-title>
          , De Gruyter Mouton : Ishara Press, Boston,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <surname>T. Das Jui</surname>
            ,
            <given-names>G. M.</given-names>
          </string-name>
          <string-name>
            <surname>Bejarano</surname>
            ,
            <given-names>P. Rivas,</given-names>
          </string-name>
          <article-title>A Machine Learning-based Segmentation Approach for Measuring Similarity between Sign Languages</article-title>
          , in: E. Efthimiou, S.-E. Fotinea,
          <string-name>
            <given-names>T.</given-names>
            <surname>Hanke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Hochgesang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Kristofersen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Mesch</surname>
          </string-name>
          , M. Schulder (Eds.),
          <source>Proceedings of the LREC2022 10th Workshop on the Representation and Processing of Sign Languages: Multilingual Sign Language Resources, European Language Resources Association (ELRA)</source>
          , Marseille, France,
          <year>2022</year>
          , pp.
          <fpage>94</fpage>
          -
          <lpage>101</lpage>
          . URL: https://www.sign
          <article-title>-lang.uni-hamburg</article-title>
          .de/lrec/pub/22018.pdf.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>