<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>raiden11@IECSIL-FIRE-2018 : Named Entity Recognition For Indian Languages</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ayush Gupta</string-name>
          <email>gupta.ayush11786@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Meghna Ayyar</string-name>
          <email>leomi7ayyar@gmail.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ashutosh Kumar Singh</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rajiv Ratn Shah</string-name>
          <email>rajivratn@iiitd.ac.in</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Delhi Technological University</institution>
          ,
          <addr-line>Delhi 110042</addr-line>
          ,
          <country country="IN">INDIA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Indraprastha Institute of Information Technology</institution>
          ,
          <addr-line>Delhi 110020</addr-line>
          ,
          <country country="IN">INDIA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper presents our solution for the Named Entity Recognition (NER) task for the Information Extractor for Conversational Systems in Indian Languages challenge (IECSIL) [5] of the FIRE 2018 conference. A subset of the Information Extraction (IE) task, NER is a key to extract information and semantics of the text from unstructured data. The objective of NER is the identification and classification of every word or token in a document into predefined categories such as names of person, location, organization, etc. For this challenge the dataset provided by IECSIL [4] comprised of multilingual text of various Indian languages like Hindi, Tamil, Malayalam, Telugu, and Kannada. We mainly focus on the identification and classification of named entities belonging to nine categories like Name, Location, Datenum, etc. We tried linear models like Naive Bayes and SVM, and also a simple Neural Network to solve this problem. The best results are achieved by the simple neural network with an accuracy of 90.33% for all languages combined. This indicates that different advanced neural networks could be possible solutions to further improve this accuracy.</p>
      </abstract>
      <kwd-group>
        <kwd>Named Entity Recognition Information Extraction Word Embeddings Neural Networks</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>The sheer volume of unstructured data available on the Web, in the form of blogs,
articles, emails, social media posts, documents et cetera is so large that the task
of deriving information from them demands an approach that does not involve
the manual annotation of this data. The data is heterogeneous, which makes
annotation by a human nearly impossible. Therefore, we would like to develop a
computer annotation approach to structure the data and make the information
from it easily extractable. Information extraction from unstructured data and
text is an NLP technique that deals with this problem and entity recognition is
one small subtask in the direction of solving it.</p>
      <p>
        NER tasks traditionally require large amounts of knowledge in the form of
feature engineering and lexicons to achieve high performance. Moreover, named
entities are open class expressions that have a lot of varieties, where new
expressions are being added constantly. NER deals with the location and identification
of the named entities that are present in the sentences given in the textual input.
It generally has pre-defined categories like names of people, places, organizations,
terms specific to a particular topic or field, industrial product names et cetera
which together comprise of the named entities [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. NER is essential for various
NLP tasks like Question Answering Systems [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], Information Retrieval [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ],
Machine Translation [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] etc. In addition to this, NER finds many applications in
multiple industries such as news and media, search engines, content
recommendations, customer support and also in academia.
      </p>
      <p>
        Several techniques have been developed to recognize named entities for the
English language and also for other foreign languages like Chinese, Japanese,
Korean, Arabic, and Spanish. Many of these techniques use either rule-based
technique shown by Kim and Woodland[
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] or some form of statistical
technique e.g. Malouf, Robert[
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] used Hidden Markov Model to perform entity
recognition. Both these approaches rely on the help of a language expert for the
creation and validation of a large dataset which can then be used for further
analysis. In contrast, for Indian languages, not much work has been reported
because of insufficient resources due to which it has been difficult to employ
statistical techniques for Indian languages. Moreover, due to the morphological
nature of the Indian languages, it needs different methods than what has been
employed for English to form a language model. For e.g. in the sentence Peter
likes Paris., NER needs to tag the entities Peter as &lt;Name of a person&gt; and
Paris as &lt;Name of place&gt;. Here one rule to identify the nouns would be that
nouns generally start with a capital letter. However, for Indian languages like
Hindi, Tamil et cetera the nouns have a non-capitalized form and also
incorporate a richer morphology as compared to English making it more challenging to
perform NER [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ].
      </p>
      <p>The corpora provided as a part of the challenge consisted of pre-processed
data for five Indian languages thereby, reducing the main difficulty of getting a
validated and annotated dataset for the NER task. As an effort to perform entity
recognition we have used some linear models as baselines, and have also used a
neural network model to demonstrate that these techniques can be extended for
the purpose of NER of Indian languages. The rest of the paper is organized as
follows. The related work is presented in Section 2 while Section 3 describes the
methodology followed in detail. Section 4 discusses the evaluation of the various
models tried for the different languages in the corpora and Section 5 concludes
the paper and presents some future work that can be pursued.</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        Saha et al. [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] developed A Hybrid Feature Set based Maximum Entropy
(MaxEnt) Hindi Named Entity Recognition. The four identified named entity (NE) by
them were Person names (P), Location names (L), Organization names (O) and
Date (D). MaxEnt, a supervised machine learning technique as given by
Birthwick and Grishman [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] was applied to solve linguistic problems with the help
of orthographic, collocation features and gazetteers lists. They used a 2-phase
transliteration model to construct about 34 entities and subsequently employed
a semi-automatic induction of context patterns used for classification. A 0.76
Fmeasure was achieved as a baseline result and 0.81 F-measure was achieved after
adding gazetteer lists and context patterns into MaxEnt based NER system.
      </p>
      <p>
        Saha et al. [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] implemented Hybrid Approach for Named Entity Recognition
in Indian Languages for 5 Indian languages - Hindi, Bengali, Oriya, Telugu, and
Urdu. They used two approaches i.e. the Linguistic approach where the typical
rules were written by linguists and the Machine Learning (ML) approach – in
which the system was trained using tags. They received poor accuracy for Oriya,
Telugu and Urdu languages compared to the other two languages due to lack
of Parts Of Speech (POS) information, morphological information,
languagespecific rules, and gazetteers lists.
      </p>
      <p>
        CRF was introduced by Lafferty et al. [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] to build a probabilistic model for
segmenting and labelling the sequence data. Ekbal et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] have worked on
Named Entity Recognition in Bengali: A Conditional Random Field Approach
by using a CRF based approach to develop NER for Bengali. They achieved
F-measure of 0.91 using CRF, more than the HMM result.
      </p>
      <p>
        Gali et al. [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] discussed the ambiguities for Indian languages that deal with
the linguistic issues like agglutinative nature and absence of capitalization, same
meaning for common name and proper name, spelling variation, patterns, and
suffixes. Using CRF’s they performed statistical tagging, resolved capitalization
etc. They achieved F - measures 0.41, 0.50, 0.39, 0.40, and 0.43 for Bengali,
Hindi, Oriya, Telugu, and Urdu respectively.
      </p>
      <p>
        Amarappa and Sathyanarayana [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] worked on Named Entity Recognition and
Classification (NERC) in Kannada language, built a SEMI-Automatic Statistical
Machine Learning NLP model based on noun taggers using HMM which was
a challenging task. In addition, they extended their work in Amarappa and
Sathyanarayana [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] to use a Multinomial Naïve Bayes (MNB) Classifier discussed
by Kibriya et al. [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] to develop a novel model for NERC.
      </p>
      <p>
        Malarkodi et al. [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] experimented with NER on Tamil database, which coped
with real-time challenges using CRF’s and could be applied for most of the Indian
languages. Kaur and Gupta [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] built an NER for Punjabi language using
rulebased and list look-up approaches. Punjabi is a language with high clung and
inflections, which leads to linguistic problems. The rule-based approach trained
the system to identify NE by writing rules manually for all NE features.
      </p>
    </sec>
    <sec id="sec-3">
      <title>Methodology</title>
      <p>The problem at hand is to use multilingual corpora to perform entity
recognition task for five Indian languages. The following sections describe in detail
the various steps that have been performed to achieve the results. Section 3.1
describes the system architecture followed by Section 3.2 which discusses feature
extraction techniques applied. Finally, Section 3.3 deals with the description of
the various models used in our approach.
3.1</p>
      <sec id="sec-3-1">
        <title>System Architecture</title>
        <p>The complete architecture of the proposed system has been summarized in Fig. 1.
The pipeline of the system starts with the given dataset, as the input. We marked
all the full-stops and punctuation marks as entities belonging to Other class and
removed them. Then, we marked the numerical entities as all of them belong to
Numbers or Datenum class. This reduced the dataset size and aided in feature
extraction as we do not have word embeddings for numerical entities. As different
languages can have different ways of writing digits, we have to modify the digit
recognizer for each new language we add. This step is followed by the process
of feature extraction which has been described in detail in Section 3.2. Once
the features were extracted as vectors, we experimented with various models like
Naive Bayes and SVM, selected the one which gave the best results in terms of
accuracy by predicting the labels on unseen data. For each entity, we selected
the class having the highest probability among all.</p>
        <p>Fig. 1. System Architecture Diagram</p>
      </sec>
      <sec id="sec-3-2">
        <title>Feature Extraction</title>
        <p>
          An effective representation for sentences is to use the Bag of Words (BoW)
model which can then be given as input to a linear classifier like Naive Bayes
or Support Vector Machine (SVM). However just using BoW poses problems
like large feature dimension and sparse vector representation to represent words.
Word embeddings can capture the semantic relations amongst the words,
unlike BoW. Hence, we used FastText, a fast word embedding/vector generator by
Facebook [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] that provides a way to represent words using numbers. We used
a pre-trained collection5 of word vectors in 294 languages. These word vectors
were trained using skip-gram model described by Bojanowski et al. [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] with
default parameters. Each word vector consists of 300 decimal numbers representing
unique features. Words which are semantically close in meaning appear together
in the 300-dimensional graph. Word vectors were downloaded for all five Indian
Languages and stored as binary files. Except for Number and Datenum, all other
classes, as shown in Table 1, have more than 90% of their entities present in these
vectors. Overall 95.83 % of the entities were present. Remaining entities were
marked as unknown and a randomly generated sequence of decimals between -1
and 1 was used as their feature sets.
A simple and efficient baseline for our model is to use a linear classifier, e.g. a
logistic regression or a SVM algorithm. However, these classifiers fail to capture
the context of the statement while evaluating entity. Common examples are the
words; book, battle, address, bear etc., which can represent a verb as well as a
noun depending on the context. Also, they cannot generalize well in case of
imbalance in the class distribution in the dataset. To overcome this, we can apply
a multi-layer neural network or a deep neural network which can capture the
composite relation between the words.
5
https://github.com/facebookresearch/fastText/blob/master/pre-trainedvectors.md
        </p>
        <p>Neural network algorithms tend to exploit patterns and structures in datasets
to discover useful representations. So we chose to build a simple artificial neural
network, with one fully connected hidden layer with 8 units using the TensorFlow
library as shown in Fig. 2. The input layer consists of 300 nodes without any
activation function while the following layers have a softmax activation function.
The following equation describes the forward propagation in the first hidden layer
of the neural network :</p>
        <p>0 300
ali+1 = g @X alj
j=1</p>
        <p>1
Wij + blA
(1)
Here al+1 is the activation value of the ith node at the (l + 1)th layer, Wij
i
is the weight from jth node of the lth layer to the ith node, bl is the bias at the
lth layer and g is the softmax activation function. In our case, we have used a
basic neural network with one input layer of 300 units, one fully connected layer
of size 8 followed by the output layer of size 9 units. So l = 0 for the first hidden
layer.</p>
        <p>Finally, the output layer takes the values from the hidden layer nodes as
input, adds a bias value and returns a one hot vector as output which represents
the predicted class of the entity. The number of training epochs was fixed to 20
as above this number we could not observe any improvement in the performance
of the model in terms of accuracy. It takes about 2 hours to execute the model
on each language.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Evaluation</title>
      <p>Section 4.1 describes the data while Section 4.2 provides insight into the
experimental setup used. Section 4.3 lists the evaluation metrics used. Section 4.4
presents the results obtained in detail while Section 4.5 presents Error Analysis.
4.1</p>
      <sec id="sec-4-1">
        <title>Dataset Description</title>
        <p>The corpora consisted of five Indian languages namely, Hindi, Kannada,
Malayalam, Tamil, and Telugu. It was provided by ARNEKT Organization6 for the
purpose of this challenge. The dataset comprised of files for each language in
which the sentences were separated by newline string. Most of the sentences
provided were about 20 to 35 words long although some extended to above 100
words as well. The details about the entity categories and their distribution in
the different languages are shown in Table 2. The entities were tagged into 9
separate classes namely number, event, Datenum, things, organization,
occupation, name, location and other.</p>
        <p>Languages Num Event DNautme/ Things Org
Occ Name</p>
        <p>Loc</p>
        <p>Other
The dataset provided was properly annotated and hence did not need any kind
of pre-processing except the removal of punctuation marks. By using the words
vectors provided by Facebook, these words were then vectorized to a form that
could be directly given as input for our models. The supervised dataset was
divided into training and testing sets in the ratio 8:1.
6 http://iecsil.arnekt.com
4.3</p>
      </sec>
      <sec id="sec-4-2">
        <title>Evaluation Metrics</title>
        <p>Accuracy has been used as the metric for evaluating classification models in our
experiments. Accuracy is defined as the ratio between a total number of correct
predictions Nc upon the total number of predictions Nt.</p>
        <p>In addition to classification accuracy we have also used :
1. Recall (R) : Calculates the fraction of entities identified by the model and the
total number of named entities actually present in the corpus. For instance
let Nt be the total number of named entities present in the corpus and Ni
be the number of entities identified by the model.</p>
        <p>Accuracy =</p>
        <p>N c</p>
        <p>N t
Recall =</p>
        <p>N i</p>
        <p>N t</p>
        <p>Precision =
F1-Score = 2*</p>
        <p>N p
N e
R * P
R + P
2. Precision (P): Calculates the fraction of the named entities that were
correctly identified by the model. For instance let Ni be the total number of
named entities actually present in the corpus that were correctly identified
and Ne be the total number of named entities identified by the model.
3. F1-Score: F1 score is the harmonic mean of precision and recall. A perfect
F1 score of 1 is reached when the model has perfect precision and recall and
worst at 0.
(2)
(3)
(4)
(5)
4.4</p>
      </sec>
      <sec id="sec-4-3">
        <title>Results and Analysis</title>
        <p>The model was initially tested on the basic implementations of Naive Bayes
Classifier and Support Vector Machine(SVM). They yielded an overall accuracy
of 81.42% and 86.50% respectively. The results for all the languages provided in
the dataset have been listed in Table 3. It is evident from the reported figures
that the artificial neural network performs better than the standard SVM and
Naive Bayes models. We observe that the accuracy values of the neural network
lies in the vicinity of 90% and are relatively close to each other. This is because
our model is language independent and thus, it should yield similar values for
any new language. We can also observe that Tamil has comparatively lower
accuracy than other languages. This could be due to large word vectors corpus for
Tamil as compared to other languages.</p>
        <p>The detailed classification results of all 5 languages for each of the 9 classes
on the FIRE 2018 dataset are presented in Table 4. Fig. 3 depicts a graph
comparing recall values for all the languages. Similarly, Fig. 4 and Fig. 5 depict
graphs comparing precision values and F1-Score respectively. The recall rate of
Number class is 100% for all the languages due to the digit recognizer. Many
Datenum entities get tagged as Number entities thereby reducing the precision
value of Number class and recall value of Datenum class. This points to the need
to adopt a different strategy to recognize Datenum class.</p>
      </sec>
      <sec id="sec-4-4">
        <title>Error Analysis</title>
        <p>We did an error analysis to identify the incorrect predictions. In all the languages,
several Name entities were misclassified as Other entities. This is because Name
entities have relatively lower presence in the Word Embeddings leading to a
random vector getting assigned to represent them during the feature
extraction stage. One way to solve this would be the addition of a language-specific
gazetteers list to separate out the Names beforehand. The model also
misclassifies Things and Organization entities at times. This is due to the diverse nature
of these entities and also because they are rarely repeated in the dataset. The
classifier also experiences some problems while differentiating between Location
and Name entities. Along with Number entities, Event and Occupation entities
are also classified with high accuracy and very low misclassifications. This is
because of the limited set of common Events and Occupations entities present
in both the real world and consequently in the dataset.
5</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusion and Future Work</title>
      <p>The increasing amount of unstructured data requires that we develop an
approach to extract useful information in an efficient way. Information Extraction
(IE) from data is an important area of research of which named entity
recognition is a subset. In this paper, we have described the approach we adopted for
developing models capable of performing the task of NER for IECSIL as a part
of the FIRE 2018 challenge. The exact methods followed have been discussed
in detail. We have provided the results obtained along with analysis about the
entities for which the models performed well and reasons for such a behaviour.
The highest accuracy for NER was 90.33% for all languages combined when we
used our simple ANN model, which is an improvement to the baseline accuracy
of 85.73%.</p>
      <p>
        As an extension to the model presented, we can work on deeper models with
a better-suited structure for handling textual data. Recurrent Neural Network
(RNN), sometimes even addressed as Long short-term Memory (LSTM) is one
such example. A further improvement to this model is RNN with added
attention mechanism as demonstrated in Zhou, Peng et.al.[
        <xref ref-type="bibr" rid="ref23">23</xref>
        ], where they have shown
that RNNs with attention mechanism perform well on NLP related problems.
In addition to this, we can work towards adding language specific markers and
special features like adding a Datenum recognizer that can reduce the
misclassifications, enhance the model and increase its robustness.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Amarappa</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sathyanarayana</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Named entity recognition and classification in kannada language</article-title>
          .
          <source>International Journal of Electronics and Computer Science Engineering</source>
          <volume>2</volume>
          (
          <issue>1</issue>
          ),
          <fpage>281</fpage>
          -
          <lpage>289</lpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Amarappa</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sathyanarayana</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Kannada named entity recognition and classification (nerc) based on multinomial nan" ive bayes (mnb) classifier</article-title>
          .
          <source>arXiv preprint arXiv:1509.04385</source>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Babych</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hartley</surname>
            ,
            <given-names>A.:</given-names>
          </string-name>
          <article-title>Improving machine translation quality with automatic named entity recognition</article-title>
          .
          <source>In: Proceedings of the 7th International EAMT workshop on MT and other Language Technology Tools</source>
          ,
          <article-title>Improving MT through other Language Technology Tools: Resources and Tools for Building MT</article-title>
          . pp.
          <fpage>1</fpage>
          -
          <lpage>8</lpage>
          . Association for Computational Linguistics (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>Barathi</given-names>
            <surname>Ganesh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.B.</given-names>
            ,
            <surname>Soman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.P.</given-names>
            ,
            <surname>Reshma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>U.</given-names>
            ,
            <surname>Mandar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Prachi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Gouri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Anitha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Anand</surname>
          </string-name>
          <string-name>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          :
          <article-title>Information extraction for conversational systems in indian languages - arnekt iecsil</article-title>
          .
          <source>In: Forum for Information Retrieval Evaluation</source>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>Barathi</given-names>
            <surname>Ganesh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.B.</given-names>
            ,
            <surname>Soman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.P.</given-names>
            ,
            <surname>Reshma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>U.</given-names>
            ,
            <surname>Mandar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Prachi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Gouri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Anitha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Anand</surname>
          </string-name>
          <string-name>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          :
          <article-title>Overview of arnekt iecsil at fire-2018 track on information extraction for conversational systems in indian languages</article-title>
          .
          <source>In: FIRE (Working Notes)</source>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Borthwick</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grishman</surname>
            ,
            <given-names>R.:</given-names>
          </string-name>
          <article-title>A maximum entropy approach to named entity recognition</article-title>
          .
          <source>Ph.D. thesis</source>
          , Citeseer (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Ekbal</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haque</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bandyopadhyay</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Named entity recognition in bengali: A conditional random field approach</article-title>
          .
          <source>In: Proceedings of the Third International Joint Conference on Natural Language Processing: Volume-II</source>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Gali</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Surana</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vaidya</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shishtla</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sharma</surname>
            ,
            <given-names>D.M.:</given-names>
          </string-name>
          <article-title>Aggregating machine learning and rule based heuristics for named entity recognition</article-title>
          .
          <source>In: Proceedings of the IJCNLP-08 Workshop on Named Entity Recognition for South and South East Asian Languages</source>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Grave</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bojanowski</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gupta</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Joulin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Learning word vectors for 157 languages</article-title>
          . arXiv preprint arXiv:
          <year>1802</year>
          .
          <volume>06893</volume>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Grave</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Joulin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bojanowski</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Bag of tricks for efficient text classification</article-title>
          .
          <source>In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics</source>
          , EACL. pp.
          <fpage>3</fpage>
          -
          <lpage>7</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Kaur</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gupta</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Name entity recognition for punjabi language</article-title>
          .
          <source>Machine translation 2(3)</source>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Kibriya</surname>
            ,
            <given-names>A.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Frank</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pfahringer</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Holmes</surname>
          </string-name>
          , G.:
          <article-title>Multinomial naive bayes for text categorization revisited</article-title>
          .
          <source>In: Australasian Joint Conference on Artificial Intelligence</source>
          . pp.
          <fpage>488</fpage>
          -
          <lpage>499</lpage>
          . Springer (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>J.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Woodland</surname>
          </string-name>
          , P.C.
          <article-title>: A rule-based named entity recognition system for speech input</article-title>
          .
          <source>In: Sixth International Conference on Spoken Language Processing</source>
          (
          <year>2000</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Lafferty</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McCallum</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pereira</surname>
            ,
            <given-names>F.C.</given-names>
          </string-name>
          :
          <article-title>Conditional random fields: Probabilistic models for segmenting and labeling sequence data (</article-title>
          <year>2001</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Malarkodi</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pattabhi</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sobha</surname>
          </string-name>
          , L.D.:
          <article-title>Tamil ner-coping with real time challenges</article-title>
          .
          <source>In: 24th International Conference on Computational Linguistics</source>
          . p.
          <volume>23</volume>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Malouf</surname>
          </string-name>
          , R.:
          <article-title>Markov models for language-independent named entity recognition, proceedings of the 6th conference on natural language learning</article-title>
          .
          <source>August 31</source>
          ,
          <fpage>1</fpage>
          -
          <lpage>4</lpage>
          (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Mollá</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Van Zaanen</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , et al.:
          <article-title>Named entity recognition for question answering (</article-title>
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Nadeau</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sekine</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>A survey of named entity recognition and classification</article-title>
          .
          <source>Lingvisticae Investigationes</source>
          <volume>30</volume>
          (
          <issue>1</issue>
          ),
          <fpage>3</fpage>
          -
          <lpage>26</lpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Pillai</surname>
            ,
            <given-names>A.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sobha</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Named entity recognition for indian languages: A survey</article-title>
          .
          <source>International Journal</source>
          <volume>3</volume>
          (
          <issue>11</issue>
          ) (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Saha</surname>
            ,
            <given-names>S.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chatterji</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dandapat</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sarkar</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mitra</surname>
            ,
            <given-names>P.:</given-names>
          </string-name>
          <article-title>A hybrid approach for named entity recognition in indian languages</article-title>
          .
          <source>In: Proceedings of the IJCNLP-08 Workshop on NER for South and South East Asian languages</source>
          . pp.
          <fpage>17</fpage>
          -
          <lpage>24</lpage>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Saha</surname>
            ,
            <given-names>S.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sarkar</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mitra</surname>
            ,
            <given-names>P.:</given-names>
          </string-name>
          <article-title>A hybrid feature set based maximum entropy hindi named entity recognition</article-title>
          .
          <source>In: Proceedings of the Third International Joint Conference on Natural Language Processing: Volume-I</source>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Salomonsson</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Entity-based information retrieval</article-title>
          . Department of Computer Science, Faculty of Engineering, LTH, Lund University (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Zhou</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shi</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tian</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Qi</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hao</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Xu</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Attention-based bidirectional long short-term memory networks for relation classification</article-title>
          .
          <source>In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume</source>
          <volume>2</volume>
          :
          <string-name>
            <given-names>Short</given-names>
            <surname>Papers</surname>
          </string-name>
          <article-title>)</article-title>
          .
          <source>vol. 2</source>
          , pp.
          <fpage>207</fpage>
          -
          <lpage>212</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>