<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Query Rewrite for Null and Low Search Results in eCommerce</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Zehong Tan</string-name>
          <email>zetan@ebay.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Canran Xu</string-name>
          <email>canxu@ebay.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mengjie Jiang</string-name>
          <email>menjiang@ebay.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hua Yang</string-name>
          <email>hyang2@ebay.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Xiaoyuan Wu</string-name>
          <email>xiaowu@ebay.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>eBay Search Science</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>In most cases when users encounter with null and low search queries, there are lots of inventory to these queries on eBay, but they can't be retrieved by regular query expansion. Therefore, retrieving more relevant items by performing more aggressive query rewrites will largely improve the user experience. This paper proposes a query rewrite system that reformulate the original query into multiple alternative queries that enlarge recall without altering the main search intent. In this paper, we present a term dropping algorithm that understands and keeps the important terms in the query, and a term replacement algorithm that learns from user behavior and language model. Experiments on randomly sampled eBay queries shows that the n-best rewrites by these two algorithms e ectively recover more relevant items. Besides, the experiments indicate that our algorithm can predict the exact user re ned query in its top-5 rewrite candidates for over 50% of the null and low queries.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION</title>
      <p>When the retrieved items of a query is lower than some
certain number (e.g. 10), the search results page results in
bad customer experiences. Therefore recovering more items
for null and low queries will dramatically increase conversion
and click through rates. Causes of this issue range from a
misspelled word to a too speci c query. Therefore, with a
few exceptions this task can be tackled by simple rule-based
solutions. In this scenario, dropping or replacing some terms
in these queries may recover more items. The current null
and low recovery algorithm randomly drops tokens in the
user query up to some certain percentage and come up with
a collection of subqueries. Subsequently a relevance model
Copyright © 2017 by the paper’s authors. Copying permitted for private and
academic purposes.</p>
      <p>In: J. Degenhardt, S. Kallumadi, M. de Rijke, L. Si, A. Trotman, Y. Xu (eds.):
Proceedings of the SIGIR 2017 eCom workshop, August 2017, Tokyo, Japan,
published at http://ceur-ws.org
will be used to boost the relevant items in the ranking phase.
However, randomly dropping or replacing terms may alter
the original search intent which could result in retrieving
irrelevant items. For example, when a null query \led brake
light nano made in taiwan" is sent for term dropping using
the current algorithm, top subqueries could preserve
unimportant segments like \made in taiwan" As a result, even
though these items contain enough number of tokens in the
user query, the relevance is low and user experience is bad.
To address this issue, we implement separate algorithms that
can drop or replace terms using techniques originated from
natural language processing. These algorithms will attempt
to recognize the important segments of the query so that
dropping or replacing terms will not distort its intent.</p>
      <p>The task of query term dropping is assisted by tagging
the part-of-speech (POS), entities, unit of measures (UOM)
and phrases. In the eCommerce domain, entities are item
properties such as brand name, material, color, shape,
gender, etc. On the other hand, for null and low queries that
are not verbose, we will reformulate the query by replacing
some of the tokens. This approach avoids accidental
turning the speci c seed query into a broad query. To achieve
this, we build a language model from large amounts of user
query logs. With a given set of randomly selected null and
low queries, we focus on the evaluation of n-best rewrites
that are produced for evaluation.</p>
      <p>To summarize, our main contributions are:</p>
      <p>We propose an e ective algorithm to drop unimportant
terms without distorting the intent of the query
We apply a language model to estimate the probability
of candidate queries for term replacement.
2.</p>
    </sec>
    <sec id="sec-2">
      <title>RELATED WORKS</title>
      <p>
        Information retrieval with verbose queries has attracted
lots of interest in recent years. Redundancies that reside in
these queries suggest that generating subqueries by dropping
unimportant terms using natural language processing (NLP)
techniques[
        <xref ref-type="bibr" rid="ref1 ref2 ref3">1, 2, 3</xref>
        ] in the original query would be helpful.
To extract patterns to reduce long queries, researchers tried
to extract features like tf-idf, POS, entities and similarity
scores to the original queries. When the retrieved documents
are available, more sophisticated work employs random walk
from each token in the original query on a graph of retrieved
documents and the score of each suggestion is ranked by the
Hadamard product of each term[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>
        Query rewrite systems can be traced back to 2001's when
Google started to provide dynamic spelling checking
systems. Almost simultaneously, Yahoo started to track
common misspelling of frequent queries such as movie stars.
Technical details for these systems are not available, but
they seem to be based on spelling algorithms and
statistical frequencies. Traditional query rewriting system tries to
use rule-based techniques such as nding synonyms of words
that are extracted from wordnet[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. In recent years,
techniques that utilize NLP and statistical models have been
widely used for tasks of query rewrite systems. Mays et.
al.[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] showed that noisy channel model could be used for
both real-word or non-word errors spelling correction. In
these works, either a set of real-word confusion pairs or a
lexicon against the edit distance is prepared[
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. [
        <xref ref-type="bibr" rid="ref8 ref9">8, 9</xref>
        ]
proposed to use a large set of ordered query pairs that are scored
by their semantic similarities. These algorithms are not
speci cally designed for null and low search queries. On the
other hand, [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] tried to add taxa constraints to sub-queries
and retrieve items from a prebuilt database of historical
purchases of null and low queries. Distribution of inferred
taxa constraints from the historical data are then added to
each subquery for secondary retrieval. Furthermore, using
query-snippet pairs from user query logs enlarges the scope
of query expansion in an improved contextual way[
        <xref ref-type="bibr" rid="ref11 ref12">11, 12</xref>
        ].
Some works[
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] tried to build a query rewrite system without
requiring a prede ned dictionary. There are several other
researches that are relevant to query rewrite, such as using
large human labeled data set to build a conditional random
elds discriminative model[
        <xref ref-type="bibr" rid="ref14">14</xref>
        ].
3.
      </p>
    </sec>
    <sec id="sec-3">
      <title>QUERY TERM DROPPING</title>
      <p>In this section, we describe the proposed approach to
query term dropping. The aim of term dropping is to
reduce a verbose query to multiple short query candidates to
increase recall as well as keep the query intent.</p>
      <p>To understand users' intent, we tag the possible product
and aspects in the user query, including brands (such as
Nike), product names (such as Macbook and Play Station
2), colors, materials, product models (such as hd-650), UOM
etc. Entities and attributes are tagged by mapping the
dictionaries mined from eBay's structured catalog. As null and
low queries are mostly tail queries, most of the tokens cannot
be covered by structured data. The syntax functionalities
of these tokens are labeled by the POS tagging. Moreover,
phrases are also detected in this phase by mining the user
query logs to make sure tokens in a phrase will be dropped
or kept as a whole. To this end, when there are multiple
possible tagging sequences, we select the sequence that tags
most of the tokens. This query tagging procedure provides a
context for core concept and attributives extraction in term
dropping.</p>
      <p>The selection of terms to drop is set by predetermined
rules. Brands, product names or rst noun phrases are
considered as Tier 1 group. They are the likely core product in
this query. Aspects (colors, materials, etc), UOM and
adjectives etc. are considered as Tier 2 group, as they are the
attributes of the core product. The stop words and
conjunctive words etc. are considered as Tier 3 group. To this end,
a term dropping suggestion is generated by the core product
terms in combination with attribute terms. Under the
requirement of minimum number of tokens of the alternative
query, the n-best suggestions are by generated a scoring
algorithm that considers both the alternate query length and
the quality of the tokens.
3.1</p>
    </sec>
    <sec id="sec-4">
      <title>Brand Disambiguation</title>
      <p>Due to the ambiguity of brands which are derived from
real word, recognition of brands such as \white" or \1928"
requires an additional disambiguation algorithm. Therefore
we build a discriminative model P (Bjq) that predicts the
term B is a brand in the context of query q. This is achieved
by computing</p>
      <p>P (Bjq) =</p>
      <p>X P (Cijq)P (BjCi);
i
(1)
where Ci is the ith shopping category. In Eq. (1), P (Cijq)
is the model that projects the context of the query onto the
shopping categories, and P (BjCi) is extracted from the data
from eBay's inventory. The model that learns P (Cijq) is
evaluated by inferred category demand. The optimal
threshold for classi cation is determined by the user click data.
Tagging procedure for brand name will be performed once
Eq. (1) has been run through the entire token list in the
query.
3.2</p>
    </sec>
    <sec id="sec-5">
      <title>Scoring Algorithm</title>
      <p>The alternative query generation is implemented as a
multipass process. Earlier passes keep more original tokens in
the alternate query, resulting in more e cient query with
less recalls. Later passes will keep fewer tokens to enlarge
the recall size. The search engine issues the term dropping
queries pass by pass, and stops whenever it gets enough
effective alternative queries and items to show to users. An
o ine experiment suggests that for 70% of the null and low
queries, it need only one pass to complete the whole process.
Suggestions from the earlier pass will always rank higher
than the ones from later passes, as overall queries with more
original tokens are believed to be closer to the original user
intent.</p>
      <p>In order to score the suggestions in one pass, we basically
prefer suggestions with more brands or nouns over
suggestions with more adjectives. The baseline tagging score is
given by a step function over their tags. After tagging, we
found that there are remaining 54% tokens are nouns. Due
to the fact that lots of null and low queries have misspelled
words that are of low document frequencies, we are inspired
to rank these nouns by the cross-entropy of their clicking
probabilities conditioning on their document frequencies:
Pclick(r) =</p>
      <p>P (clickjr) log (1</p>
      <p>P (clickjr));
(2)
where r is the document frequency of token, and P (clickjr)
is evaluated by a large sample of null queries. The clicking
score is added to the tagging score with an undetermined
coe cient, which is later optimized by maximizing the
discounted cumulative gain (DCG).</p>
    </sec>
    <sec id="sec-6">
      <title>QUERY TERM REPLACEMENT</title>
      <p>
        In this section we describe the proposed approach to term
replacement. The inputs are a query q0 consisting of a
sequence of m words (w1; : : : ; wm). As the formalism in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ],
the algorithm generates a set of candidates C(q0), and then
the best suggestion is given by the highest language model
probability. To generate the candidates, we start by
generating a set of candidate words for each input word wi that is
mined from the user behavior logs. Thus the probability for
each candidate query q1 2 C(q0) would be proportional to
P (q0jq1)P (q1), where P (q0) is the probability of the original
query modeled by the n-gram language model. Therefore,
term replacement is formulated as
q1 = arg max P (q0jq1)P (q1);
      </p>
      <p>q12C(q0)
where P (q0jq1) and P (q1) is the noisy channel model and
language model respectively. Furthermore, in practice,
considering the fact that the number of candidates in the noisy
channel model of a particular token ranges di erently to the
cardinality of bigrams, we raise the noisy channel by a power
:
q1 = arg max P (q0jq1) P (q1):</p>
      <p>q12C(q0)
4.1</p>
    </sec>
    <sec id="sec-7">
      <title>Noisy Channel Model</title>
      <p>The noisy channel model, P (q0jq1), describes the
emission probability of an original query q0 being replaced by q1.
We collect query pairs of term replacement within the same
session from user query logs. For each original query, we
assume that only one token wi is replaced by wi0 , therefore
P (q0jq1) = P (wijwi0).</p>
      <p>Note that for each observed token wi, the probability
P (wijwi0 = wi) should be counted as well. Unlike [Mays
1991], where the probability of wi not being replaced is
assigned with a tuning parameter, it is estimated by the e
ective non-replacement count:</p>
      <p>P (wijwi0 = wi) =</p>
      <p>Count(wi ! wi) ;
Count(wi ! w0)
i
where Count(wi ! wi) = (Count(wi)
wi0))=2.
4.2</p>
    </sec>
    <sec id="sec-8">
      <title>Language model</title>
      <p>Language modeling in our approach utilizes an n-gram
language model that assigns the following probability to a
string of words:</p>
      <p>m
P (w1 : : : wm) = Y P (wijw1 ) '</p>
      <p>i 1
i=1
n
Y P (wijwi n+1) (6)</p>
      <p>i 1
i=1</p>
      <p>
        As shown in [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], language models solely built from query,
instead of the mixture of query and others, would be the
Count(wi ! wi 6=
(3)
(4)
(5)
most predictive. Unlike grammatically strict natural
language for long text, the feature of user queries is even more
sparse. Therefore we only incorporate bigram features in the
language model. Remedies of the sparsity of bigram features
are achieved by Kneser-Ney smoothing technique[
        <xref ref-type="bibr" rid="ref16">16</xref>
        ].
      </p>
      <p>In the scenario of web search for eCommerce, the terms
in the queries are not necessarily following a chronicle order.
If a user types \case iphone", the intent is not far from the
query \iphone case". For this reason, we also incorporate
the backward-bigrams in the model without enhancing the
number of parameters. Since the backward-bigram
probability is not commensurate with the normal bigrams, these
probabilities are assigned with a free parameter :
P (w1 : : : wn) '
n 1 n
Y P (wi+1jwi) Y P (wi 1jwi)
i=1 i=2
(7)</p>
      <p>Finally, instead of just choosing a candidate term
replacement with the highest probability, we only accommodate the
candidates that have higher probability estimated from the
language model than the original query. This is to remedy
over-replacing issue, that is, avoid null and low queries that
are not suitable for term replacement.
5.</p>
    </sec>
    <sec id="sec-9">
      <title>EXPERIMENT</title>
      <p>We test how well our alternative query service can perform
on the eBay search engine. The evaluation scenario aims at
measuring the ability of the algorithm on recovering null and
low search result page and retrieve relevant items.
5.1</p>
    </sec>
    <sec id="sec-10">
      <title>Data Preparation</title>
      <p>We sampled two query sets of null and low queries from
search log for term dropping and replacement separately.
The training set for term dropping consists of one week of
user's search and click data. The training data for the query
term replacement consists of query pairs taken from 11 weeks
of query logs. The sampled test set contains 3,000 null and
low queries from the consecutive week of the training data.
5.1.1</p>
      <sec id="sec-10-1">
        <title>Query Term Dropping</title>
        <p>In the experiment for term dropping, we leverage eBay's
structure data dictionary for the product and attribute
recognizing in the query tagging phase. When we perform
POS tagging for search queries, we employ the Stanford
bidirectional model. However, in the scenario of web search,
the pior probability of verbs is much less than ordinary new
articles, we perform an o ine correction on top of the
Stanford POS tagging. With mined unigram and bigrams from
9 days' search queries, the tagging for phrases are obtained
by the normalized mutual information:</p>
        <p>M I(wi; wi+1) =</p>
        <p>P (wi; wi+1)2
P (wi)P (wi+1)
(8)
5.1.2</p>
      </sec>
      <sec id="sec-10-2">
        <title>Query Term Replacement</title>
        <p>We remove all non-alphanumeric characters from the queries
or spelling corrections pairs. After ltering, there are 380
millions of query pairs. A collection of data statistics of the
training data is shown in Table 1.</p>
        <p>Our algorithm is ne-tuned in order to balance the
capability of retrieving more items and their relevances. In
this section, we present the result of our experiment on the
search metrics.</p>
        <p>Users' intent is re ected by their own query re nements
when they are not satis ed with the returned search result of
the original query. We rst evaluate the probability of exact
predictions of our algorithm from one of the n-best query
rewrite candidates. The results are summarized in Table 2.</p>
        <p>In Table 3, we show the number of retrieved items, R, of
the original query and the best rewritten query. For
comparison, we also show the number for users' rewritten query.
We are recovering the null and low search result pages in
general.</p>
        <p>
          The distribution of queries over shopping categories is key
indicator of high-level relevance for query rewrite systems
in eCommerce[
          <xref ref-type="bibr" rid="ref10 ref17">10, 17</xref>
          ]. It is a su ciently strong signal that
the alternative query distorts user's search intent if the
inferred level-2 categories are changed. The category overlap
is computed as the Jaccard similarity of up-to top 50 items
retrieved from the alternative query, CA50, to the dominant
category of the original query, CO:
        </p>
        <p>J (A; O) =</p>
        <p>CA50 \ CO
jCA50 [ COj</p>
        <p>
          In Table 4, we show the category overlap of top 3
suggestions by term dropping and term replacement. Overall
71.5% (68.6%) of the items retrieved by top-1 term dropping
(replacement) suggestion has the same category with the
(9)
original query. Short queries with only 2-3 tokens are more
troublesome in term dropping with overlap rate 60.4%, as
dropping even one term may change the users' intent largely.
For example, in the query \boden shoes 30", dropping token
\30" (size 30) will alter the category from \kid's clothing and
shoes" to \women's shoes". Similar issues can be relieved by
term replacement. Our result achieved an overall 74.3%
category overlap, compared to 74%, as reported in [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ].1
        </p>
        <p>One important feature of our algorithms that lead to
better search experiences is the explicit mechanism to rewrite
search queries based on the context. In Table 5, we show
several examples of top query rewrite generated from the
algorithm.</p>
        <p>As for term dropping, in the rst query \tommy hil ger"
is detected as brand and \sport bra" as an unbreakable noun
phrase. Our term dropping rewrites the query by dropping
\prep" \free" and misspelled word \shipp". In query \10w
high quality speaker 6ohm", our algorithms recognizes that
it contains two UOM constraints \10w" \6ohm" and relaxes
the query by dropping one of the UOM constraints. As
mentioned in section 3.1, the brand names were disambiguated
by considering the shopping query. Though word \watt"
often refers to a unit at most of the time, it is a high con dent
brand when it occurs in queries in pottery&amp;craft category.
In query \watt spagehetti platter", our algorithm recognizes
\watt" as a china brand and keeps it in query rewriting. For
term replacement, the query \iphone watch" is rewritten as
\apple watch", instead of \iphone clock", because the bigram
probability Pb(watchjiphone )is much greater than the
emission probability Pe(clockjwatch). On the other hand,
\amber ball insect" is replaced by \amber sphere insect" because
the Kneser-Ney smoothing correctly compensate the
probability of \amber sphere", instead of recognizing \amber" as a
color.
6.</p>
      </sec>
    </sec>
    <sec id="sec-11">
      <title>CONCLUSION AND OUTLOOK</title>
      <p>
        In this work, we have implemented algorithms that aim
to cure the null and low search results. In particular, we
showed that tagging technique developed in NLP could be
used to generate more context relevant subqueries, and that
additionally statistical natural language model provides a
promising avenue for making progress on the path to
building full natural language understanding systems. Our
nu1To our knowledge, the most relevant work to ours is Ref[
        <xref ref-type="bibr" rid="ref10">10</xref>
        ],
in which the authors reported an algorithm to generate
search sub-queries in the scenario of null search page with
additional taxa constraints. However, the result of this
approach is not feasible to reproduce because the expense of
inferred exact taxa constrains of each query is maintaining a
dynamic database as large as the active inventory. For this
reason, this baseline is compared to the reported number in
the original literature, instead of reproducing its result for
the same query set.
merical experiments indicate that user's shopping experience
could be improved with higher level of engagement.
      </p>
      <p>The improvements achieved by our algorithms are not
unique to vanilla null and low search pages, and are readily
applicable to even broader tasks with search engine, such as
extracting structured data from item titles, general query
reformulation and anomalous query synopsis. It could lead
to signi cant improvements in shopping experiences with
similar models especially with text data of less grammatical
strictness.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Kumaran</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Carvalho</surname>
            ,
            <given-names>V. R.</given-names>
          </string-name>
          <article-title>Reducing long queries using query quality predictors</article-title>
          .
          <source>In Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval</source>
          (
          <year>2009</year>
          ), ACM, pp.
          <volume>564</volume>
          {
          <fpage>571</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          , and Zhang,
          <string-name>
            <surname>Y.-Q.</surname>
          </string-name>
          <article-title>A query substitution-search result re nement approach for long query web searches</article-title>
          .
          <source>In Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume</source>
          <volume>01</volume>
          (Washington, DC, USA,
          <year>2009</year>
          ),
          <article-title>WI-IAT '09</article-title>
          , IEEE Computer Society, pp.
          <volume>245</volume>
          {
          <fpage>251</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Duan</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Hsu</surname>
            ,
            <given-names>B.-J. P.</given-names>
          </string-name>
          <article-title>Online spelling correction for query completion</article-title>
          .
          <source>In Proceedings of the 20th International Conference on World Wide Web</source>
          (New York, NY, USA,
          <year>2011</year>
          ),
          <source>WWW '11</source>
          , ACM, pp.
          <volume>117</volume>
          {
          <fpage>126</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Maxwell</surname>
            ,
            <given-names>K. T.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Croft</surname>
          </string-name>
          , W. B.
          <article-title>Compact query term selection using topically related text</article-title>
          .
          <source>In Proceedings of the 36th Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval</source>
          (
          <year>2013</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Zhang</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Deng</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          <article-title>Concept based query expansion using wordnet</article-title>
          .
          <source>In AST '09 Proceedings of the 2009 International e-Conference on Advanced Science and Technology</source>
          (
          <year>2009</year>
          ), pp.
          <volume>52</volume>
          {
          <fpage>55</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Mays</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Damerau</surname>
            ,
            <given-names>F. J.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Mercer</surname>
          </string-name>
          , R. L.
          <article-title>Context based spelling correction</article-title>
          .
          <source>In Information Processing and Management: An International Journal</source>
          (
          <year>1991</year>
          ), vol.
          <volume>27</volume>
          , pp.
          <volume>517</volume>
          {
          <fpage>522</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Ahmad</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Kondrak</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Learning</surname>
          </string-name>
          <article-title>a spelling error model from search query logs</article-title>
          .
          <source>In Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing (Stroudsburg</source>
          , PA, USA,
          <year>2005</year>
          ), HLT '05,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computational Linguistics, pp.
          <volume>955</volume>
          {
          <fpage>962</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Jones</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rey</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Madani</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Greiner</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          <article-title>Generating query substitutions</article-title>
          .
          <source>In Proceedings of the 15th International Conference on World Wide Web</source>
          (New York, NY, USA,
          <year>2006</year>
          ),
          <source>WWW '06</source>
          , ACM, pp.
          <volume>387</volume>
          {
          <fpage>396</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Hasan</surname>
            ,
            <given-names>M. A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parikh</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Singh</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Sundaresan</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <article-title>Query suggestion for e-commerce sites</article-title>
          .
          <source>In Proceedings of the Fourth ACM International Conference on Web Search and Data Mining</source>
          (New York, NY, USA,
          <year>2011</year>
          ),
          <source>WSDM '11</source>
          , ACM, pp.
          <volume>765</volume>
          {
          <fpage>774</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Singh</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parikh</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Sundaresan</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <article-title>Rewriting null e-commerce queries to recommend products</article-title>
          .
          <source>In Proceedings of the 21st International Conference on World Wide Web</source>
          (New York, NY, USA,
          <year>2012</year>
          ),
          <source>WWW '12 Companion</source>
          , ACM, pp.
          <volume>73</volume>
          {
          <fpage>82</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Riezler</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , and Liu,
          <string-name>
            <surname>Y.</surname>
          </string-name>
          <article-title>Query rewriting using monolingual statistical machine translation</article-title>
          .
          <source>Computational Linguistics</source>
          <volume>36</volume>
          ,
          <issue>3</issue>
          (sep
          <year>2010</year>
          ),
          <volume>569</volume>
          {
          <fpage>582</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Zhou</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <article-title>Improving query spelling correction using web search results</article-title>
          .
          <source>In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning</source>
          (
          <year>2007</year>
          ), pp.
          <volume>181</volume>
          {
          <fpage>189</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Gao</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Micol</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Quirk</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Sun</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          <article-title>A large scale ranker-based system for search query spelling correction</article-title>
          .
          <source>In Proceedings of the 23rd International Conference on Computational Linguistics</source>
          (
          <year>2010</year>
          ),
          <article-title>Association for Computational Linguistics</article-title>
          , pp.
          <volume>358</volume>
          {
          <fpage>366</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Guo</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xu</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , and Cheng,
          <string-name>
            <surname>X.</surname>
          </string-name>
          <article-title>A uni ed and discriminative model for query re nement</article-title>
          .
          <source>In Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval</source>
          (
          <year>2008</year>
          ), pp.
          <volume>379</volume>
          {
          <fpage>386</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gao</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Miao</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Behr</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Giles</surname>
            ,
            <given-names>C. L.</given-names>
          </string-name>
          <article-title>Exploring web scale language models for search query processing</article-title>
          .
          <source>In WWW '10 Proceedings of the 19th international conference on World wide web (</source>
          <year>2010</year>
          ), pp.
          <volume>451</volume>
          {
          <fpage>460</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Kneser</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Ney</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <article-title>Improved backing-o for m-gram language modeling</article-title>
          .
          <source>In Acoustics, Speech, and Signal Processing</source>
          ,
          <year>1995</year>
          . ICASSP-
          <volume>95</volume>
          ., 1995 International Conference on (
          <year>1995</year>
          ), vol.
          <volume>1</volume>
          , IEEE, pp.
          <volume>181</volume>
          {
          <fpage>184</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <surname>Hasan</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heger</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Mansour</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <article-title>Spelling correction of user search queries through statistical machine translation</article-title>
          .
          <source>In EMNLP</source>
          (
          <year>2015</year>
          ), L. MA~ rquez,
          <string-name>
            <given-names>C.</given-names>
            <surname>Callison-Burch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Su</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Pighin</surname>
          </string-name>
          , and Y. Marton, Eds.,
          <source>The Association for Computational Linguistics</source>
          , pp.
          <volume>451</volume>
          {
          <fpage>460</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>