<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Linguistic Analysis of Startups in The Context of the Air Transport Industry Management</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Transport and Telecommunication Institute</institution>
          ,
          <addr-line>1 Lomonosova street, LV-1019, Riga</addr-line>
          ,
          <country country="LV">Latvia</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Much research has studied how a company can maximize its profit. Relatively small number of them focused on Value Proposition, though the number of authors proved that companies that deliver multiple values experience better business performance. This article describes a current research on linguistic analysis of startups in the context of the air transport industry. Analyzing startups manually is a very time consuming task, so the automation of the process would be beneficial. The author takes corpus linguistic approach, created an experiment protocol and is on the stage of conducting an experiment. Under this experiment air transportation startups' landing pages were collected in the number of 800. 100 annotators first were preliminary surveyed and then trained to annotate startups. Post-annotation training will be conducted to understand the difference in expertise level. The annotation results will be further analyzed and linguistic features and patterns will be identified. As a result of the research, an author will develop a methodology for analysis of values based on a model of automatic identification of values in the text of a startup's landing page in the air transportation industry.</p>
      </abstract>
      <kwd-group>
        <kwd>Startups</kwd>
        <kwd>Value Proposition</kwd>
        <kwd>Air Transport</kwd>
        <kwd>Annotation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Value chain analysis is a primary corporate strategy tool. Traditionally relatively few
values have been pursued; price, quality, etc. Recently, customers are demanding a
wide variety of values. Most importantly, some of the values that have previously
received little attention become dominant values (concept known as value shift).
Some examples: cars - eco-friendliness, electronics – usability, food - fairness,
organic. As new values become prominent, industries and their companies will be
transformed; not following values could result in wasted resources (pursuing the wrong
values) and becoming irrelevant (not pursuing the right ones). Hence, the ability to
identify values early on is important for business performance. Startups are often the
first to discover new values. As an example, Skytran, startup from air transportation
field, offers autonomous, zero-emission vehicles arrowing above congested streets.
The company’s landing page identifies its values as “high speed, high capacity, low
cost” [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Analyzing startups manually is a very time consuming task. The author
assumes that one startup landing page takes 10 minutes to examine. Air transport
industry offers around 10000 startups per year, requiring 1 full-time analysts to keep
track of startups of just one industry on annual basis; and there are 100’s of
intersecting industries in aviation (food &amp; beverage, hospitality, etc.). Methods that allow at
least in part automate this process would be beneficial. This research describes the
first stages of developing a methodology of analyzing values within industry based on
startups value propositions.
      </p>
      <p>The main research question is tools and procedures for analysis of value
proposition of startups in the area of air transportation from their textual description in the
frame of developing a methodology for analysis of values based on a model of
automatic identification of values in the text of a startup’s landing page in the air
transportation industry.</p>
      <p>The initial contribution of this research is the development of methodology of
analyzing values within industry based on startups value propositions. The procedures
and tools integrated in the model are either articulated or developed by the author
throughout the Thesis.</p>
      <p>The approach. The author uses a Natural Language Processing (NLP) approach to
identify the methods and features well suited for this problem. A bottom-up
(datadriven) technique is taken, i.e. the author first constructs the dataset and then analyzes
it using both computational and linguistic methods to identify which features and
methods perform the best. The author also takes the corpus linguistic approach, i.e.
reliable language analysis is more feasible with corpora collected in the field in its
natural context.</p>
      <p>The object of the research is startups’ landing pages in the field of air transport
industry.</p>
      <p>The subject of the research is value proposition of a startup.</p>
      <p>The tasks to be performed are the following:
1. To study theoretical literature on the 5. To identify features (e.g. linguistic,
research topic semantic)
2. To examine existing research on the 6. To identify patterns</p>
      <p>research topic 7. To develop testable predictions
3. To collect preliminary data 8. To build a model
4. To conduct an experiment 9. To test a model
2</p>
    </sec>
    <sec id="sec-2">
      <title>The Methodology and Methods of Research</title>
      <p>
        The research methodology is presented in Figure 1. The author takes data-driven
scientific method. The general idea was adopted from the paper describing the scope
of big data in medicine [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>With the accessibility of large datasets and advanced correlation/statistical tools,
will we still need to rely on hypotheses in scientific research? Traditionalists say that
in purely data-driven methods, one may not realize where to look for those riveting
findings if no hypotheses were formed beforehand.</p>
      <p>
        Big data advocates, on the other side, propose that with no prior beliefs, one is not
driven by established ways of thinking or creating, opening the possibilities of
breakthrough insights where nobody had been before [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>
        Research Theoretical Framework. The following theories and methods have
been used in work: the system approach [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], information extraction, exploratory
experimentation [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], the methods of statistical analysis [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], simulation modelling
approach [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], methods of quality analysis [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], surveys, pattern identification [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ],
building a model.
      </p>
      <p>Limitations of the research are linked to the research base: one industry for
startups - aviation industry, one hundred experiment participants and their level of
expertise (undergraduate students, non-native English speakers), task complexity and
ambiguity, ambiguity of natural language.</p>
      <p>
        Scientific novelty:
analysis of Value Proposition in the context of air transportation and startups;
construction of Value Proposition corpus / dataset; systematization of Value Proposition
(ontology); analysis of linguistic patterns of Value Proposition; automation of
linguistic features detection that could be used to detect Value Proposition (feature
engineering [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ])
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>State of the Art</title>
      <p>Articles reviewed here and referenced throughout the article were sourced from online
libraries using a relative research approach: papers related to Value Proposition,
startup concept, corpus linguistics, text annotation. The inclusion criteria for an article
in this review were the following: the study was of an exploratory or empirical nature
or gives an idea of the terminology source.</p>
      <p>A rich theoretical and empirical literature can be applied to the question on how to
identify Value Proposition in startups texts. As the current research focuses on
linguistic patterns of startups landing pages, the author reviews theoretical and empirical
literature on specific online text features, textual value proposition methods, value
proposition comprehension by consumers of different level of expertise.</p>
      <p>
        Michael Lanning and Edward Michaels first used the expression “value
proposition” (VP) in a 1988 work document for the consulting company “McKinsey and Co”.
In the article, which was entitled “Delivering value to customers”, the authors define
value proposition as “a clear, simple statement of the benefits, both tangible and
intangible, that the company will provide, along with the approximate price it will
charge each customer segment for those benefits”[
        <xref ref-type="bibr" rid="ref11">11</xref>
        ].
      </p>
      <p>
        In 2016, Eric Almquist suggested a strategy based on a differentiated customer
value proposition. A suggested set of value was called Elements of Value [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
Elements of Value categories are based on Maslow’s Hierarchy of Needs shown in
Figure 2.
      </p>
      <p>
        Fig. 2. Heuristic model of value with examples of companies exhibiting elements of value [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]
      </p>
      <p>
        There are comparatively few researches on analyzing value propositions in online
startups. In 2007, Su-C Li in his paper argues that a properly constructed value
proposition is essential to the value creation process in e-business, and value co-production
is the building blocks for value protection mechanism in network economy [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
      </p>
      <p>
        Äyväri, Anne, and Annukka Jyrämä in their article “Rethinking value proposition
tools for living labs” published in 2017 provide a conceptual analysis on value
proposition tools to be used in future empirical research and in building managerial insight.
The conceptual analysis focuses on a living lab framework and recent theoretical
developments around the concept of value that are reflected in the context of three
managerial tools for creating value propositions. Among findings in the context of the
living labs approach, the Value Proposition Builder seems to conflict with the ideas
and premises of user-centric innovation processes [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. In 2019, Guo, Hai, Jun Yang,
and Jiaping examined the fit between value proposition innovation and technological
innovation (exploitative vs explorative) for the performance of startups in the digital
environment. They based their research on on-site survey data of 285 digital startups
in one of the world's largest digital economies and found that explorative innovation
strengthens the positive impact of value proposition innovation on the performance of
startups, but exploitative innovation weakens this positive effect [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ].
      </p>
      <p>
        Corpus linguistics has generated a number of research methods, which attempt to
trace a path from data to theory. Wallis and Nelson in 2001first introduced what they
called the 3A perspective - Annotation, Abstraction and Analysis: Annotation consists
of the application of a scheme to texts [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. Annotations include structural markup,
part-of-speech tagging, parsing, and numerous other representations. Abstraction
consists of the translation (mapping) of terms in the scheme to terms in a theoretically
motivated model or dataset. Abstraction typically includes linguist-directed search
but may include e.g., rule-learning for parsers. Analysis consists of statistically
probing, manipulating and generalizing from the dataset. Analysis might include
statistical evaluations, optimization of rule-bases or knowledge discovery methods.
      </p>
    </sec>
    <sec id="sec-4">
      <title>Research Design and Preliminary Results</title>
      <p>First Stage – Current
A total of 800 different startups in the field of air transportation were chosen, their
landing pages consist the research base for analysis on Value Proposition. 100
participants of low level expertise - IT, Aviation, and Management undergraduate students –
were involved in a preliminary survey, post-survey, training and post-training
annotation of startups landing pages. A separate webpage was created for conducting
surveys, two-level training and annotation process. The quality of annotation as it was
performed by low-expert students was assessed hereafter by industry experts.</p>
      <p>The aim of a preliminary survey is to understand the level of expertise for
nontrained participants. The students were told about the basic concept of value
proposition. Also, they were given some examples of Value Proposition such as typical ones:
affordability, quality, speed; less typical: eco-friendliness. After that they were asked
to list as many values as they can, that are provided by the Air Transportation startups
and companies.
not be easily identified due to its vague expression or the creator of the webpage use
non-standard ways to promote their startup, e.g. video or non-trivial words.</p>
      <p>
        The participants have options to identify a landing page as not a startup, can state
that the value is hard to identify, and can name the page as not from the air
transportation industry. Also, they are asked to click a Like button if they think this startup
clearly deliver the VP.
Preliminary results. In the preliminary results the author observed that the current
assumption, based on the suggested algorithm, shows the expected ambiguity of the
natural language as well as the task complexity and ambiguity as perception of the VP
concept depends on the individual personal characteristics and background. Also, it
proves that the necessity of inter-annotator agreement is a measure of how well two
(or more) annotators can make the same annotation decision for a certain category
[
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. This moment 56% of the planned startups landing pages are annotated.
      </p>
      <p>Future Work. The immediately future work is to identify the features (e.g.
linguistic, semantic), so the author can determine specific for startups Value Proposition
patterns to develop testable predictions. Creating transportation industry values
ontology based on inter-annotated corpus with high ambiguity level is one of the objectives
of this research. Building and testing a model is the next phase of the current studies.
As the result of this work the author will elaborate a methodology for analysis of
values based on a model of automatic identification of values in the text of a startup’s
landing page in the air transportation industry.</p>
      <p>At the end stage the author plans to develop a tool in the form of software for
assisting industry analysts in their research and advisory services.</p>
      <p>Factors that influence the performance of the Value Proposition identifying model
are discovered during the annotation process. The author is analyzing participants’
comments and commits not only to identify the clearly viewed linguistic features, but
also to find not so vivid patterns that we can use to automate the process of VP
identification. There are several factors technical and non-technical that the author thinks
can influence the performance of annotation process (like what influences the attitude
towards and the time of annotation) and that are being investigated.</p>
    </sec>
    <sec id="sec-5">
      <title>Conclusions</title>
      <p>In this research the author empirically has studied the process of Value Proposition
identification in the air transportation startups. The study has been conducted
according to one of the well-recognized path of corpus linguistic research methods: 3A –
Annotation, Abstraction, Analysis. The current stage of the given research is a landing
text annotation process. An experiment design has been developed: preliminary
survey, two-level training, dataset of objects, 100 annotators.</p>
      <p>The preliminary findings show the importance of inter-annotator agreement and an
ambiguity of natural language and value identification process. The next stage of this
research will include identifying the linguistic features and patterns. It will be a basis
of elaborating a methodology of automatic identification of values in the text of a
startup’s landing page in the air transportation industry. At the end, the author plans to
develop a tool in the form of software to assist industry analysts in their research and
advisory services.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <article-title>Skytran company webpage (online), www</article-title>
          .skytran.com,
          <source>last accessed April 12</source>
          ,
          <year>2020</year>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>McCue</surname>
            <given-names>ME</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McCoy</surname>
            <given-names>AM</given-names>
          </string-name>
          .
          <article-title>The Scope of Big Data in One Medicine: Unprecedented Opportunities and Challenges</article-title>
          . Front Vet Sci, e-publishing (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Shih</surname>
            <given-names>W</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sen</surname>
            <given-names>C</given-names>
          </string-name>
          .
          <article-title>Data-Driven vs</article-title>
          .
          <source>Hypothesis-Driven Research: Making Sense of Big Data. Academy of Management Proceedings</source>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>von Bertalanffy</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          (
          <year>1968</year>
          ).
          <source>General System Theory: Foundations</source>
          , Development, Applications. New York: George Braziller (
          <year>1968</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Burian R.M. Exploratory</surname>
          </string-name>
          <article-title>Experimentation</article-title>
          . In: Dubitzky W.,
          <string-name>
            <surname>Wolkenhauer</surname>
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cho</surname>
            <given-names>KH.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yokota</surname>
            <given-names>H</given-names>
          </string-name>
          . (eds) Encyclopedia of Systems Biology. Springer, New York, NY (
          <year>2013</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Ravens</surname>
            <given-names>C</given-names>
          </string-name>
          .
          <article-title>Methods of statistical analysis</article-title>
          .
          <source>In: Internal Brand Management in an International Context. Innovatives Markenmanagement</source>
          , Springer Gabler, Wiesbaden (
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>Simulation</given-names>
            <surname>Modeling</surname>
          </string-name>
          . In: Alhajj R.,
          <source>Rokne J. (eds) Encyclopedia of Social Network Analysis and Mining</source>
          . Springer, New York, NY (
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Müller</surname>
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pickard</surname>
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bertsche</surname>
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Analysis</surname>
          </string-name>
          and
          <article-title>Inclusion of Synergies of Common Quality Management Methods for Optimised Quality Assurance</article-title>
          . In: Spitzer C.,
          <string-name>
            <surname>Schmocker</surname>
            <given-names>U.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dang</surname>
            <given-names>V.N.</given-names>
          </string-name>
          <article-title>Probabilistic Safety Assessment and Management</article-title>
          . Springer, London (
          <year>2004</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Fu</surname>
            <given-names>K.S.</given-names>
          </string-name>
          <article-title>Syntactic (Linguistic) Pattern Recognition</article-title>
          .
          <source>In: Fu K.S. Digital Pattern Recognition. Communication and Cybernetics</source>
          , vol
          <volume>10</volume>
          . Springer, Berlin, Heidelberg (
          <year>1976</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Brownlee</surname>
          </string-name>
          , Jason. Discover Feature Engineering, How to Engineer Features and How to Get Good at It, machinelearningmastery. com, (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Harvey</surname>
            <given-names>Golub</given-names>
          </string-name>
          , Jane Henry, John L. Forbis, Nitin T. Mehta,
          <string-name>
            <given-names>Michael J.</given-names>
            <surname>Lanning</surname>
          </string-name>
          , Edward G. Michaels, and
          <string-name>
            <given-names>Kenichi</given-names>
            <surname>Ohmae</surname>
          </string-name>
          .
          <article-title>Delivering value to customers. Harvard Business Review, President and Fellows of Harvard College (</article-title>
          <year>1988</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Eric</surname>
            <given-names>Almquist</given-names>
          </string-name>
          , John Senior, Nicolas Bloch, “The Elements of Value”,
          <source>Harvard Business Review</source>
          , pp.
          <fpage>46</fpage>
          -
          <lpage>53</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <given-names>S.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <article-title>"The Role of Value Proposition and Value Co-Production in New Internet Startups: How New Venture e-Businesses Achieve Competitive Advantage,"</article-title>
          PICMET '
          <fpage>07</fpage>
          - 2007 Portland International Conference on Management of Engineering &amp; Technology, Portland, OR, pp.
          <fpage>1126</fpage>
          -
          <lpage>1132</lpage>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Äyväri</surname>
          </string-name>
          , Anne; Jyrämä, Annukka: Rethinking.
          <article-title>Value proposition tools for living labs</article-title>
          .
          <source>In: Journal of Service Theory and Practice</source>
          , Vol.
          <volume>27</volume>
          , No.
          <volume>5</volume>
          , p.
          <fpage>1024</fpage>
          -
          <lpage>1039</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Guo</surname>
            , Hai,
            <given-names>Jun</given-names>
          </string-name>
          <string-name>
            <surname>Yang</surname>
          </string-name>
          , and Jiaping Han.
          <article-title>"The Fit Between Value Proposition Innovation and Technological Innovation in the Digital Environment." IEEE Transactions on Engineering Management (</article-title>
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Wallis</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Nelson</surname>
            <given-names>G.</given-names>
          </string-name>
          <article-title>Knowledge discovery in grammatically analysed corpora</article-title>
          .
          <source>Data Mining and Knowledge Discovery</source>
          ,
          <volume>5</volume>
          :
          <fpage>307</fpage>
          -
          <lpage>340</lpage>
          .(
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Artstein</surname>
            <given-names>R</given-names>
          </string-name>
          .
          <article-title>Inter-annotator Agreement</article-title>
          . In: Ide N.,
          <string-name>
            <surname>Pustejovsky</surname>
            <given-names>J</given-names>
          </string-name>
          .
          <source>Handbook of Linguistic Annotation</source>
          . Springer, Dordrecht (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>