<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Named Entity-Aware Abstractive Text Summarization for Hindi Language</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Saumay Gupta</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sukomal Pal</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Indian Institute of Technology (BHU)</institution>
          ,
          <addr-line>Varanasi</addr-line>
          ,
          <country country="IN">India</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this study, we introduce a novel approach to text summarization, specifically tailored for the Hindi language, titled Named Entity-Aware Abstractive Text Summarization (NEA-ATS). Our methodology uniquely integrates Named Entity Recognition with advanced pretrained language models, focusing on critical entities such as individuals, locations, and organizations. We use our proposed methodology along with the pretrained models to work on the ILSUM task to provide summaries for Hindi news articles. We secured the first rank for the Hindi summarization task. Our comprehensive evaluation ofers valuable insights into enhancing the NEA-ATS methodology in the future, along with determining eficient methods and model for Hindi summarization.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Indian language summarization</kwd>
        <kwd>Named entity aware summarization</kwd>
        <kwd>Pretrained models</kwd>
        <kwd>NER</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        In the age of information abundance, abstractive text summarization has emerged as a vital
tool to distill vast amounts of textual content eficiently. Unlike extractive summarization that
pulls exact sentences from the original text, abstractive summarization entails creating fresh
sentences to encapsulate the core ideas of the source, demanding an in-depth understanding
of its semantics and context[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>
        The complexities of the Hindi language, including its intricate syntax, rich morphology, and
prevalent code-mixing, present unique challenges for summarization algorithms. To address
these, we propose an innovative Named Entity-Aware Abstractive Text Summarization
(NEAATS) technique for Hindi. Leveraging the prowess of pretrained language models (PLMs) like
mBART-50, mT5, and IndicBART, which excel in capturing semantic and contextual nuances
across languages, our NEA-ATS method employs named entity recognition (NER) to enhance
the substance and clarity of summaries[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. By integrating the latest Hindi NER model,
HiNERoriginal-muril-base-cased, our system can identify and categorize pivotal entities such as
individuals, locations, and organizations, ensuring that summaries focus on salient details while
preserving the original text’s coherence and essence.[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]
      </p>
      <p>
        The incorporation of NER is particularly vital given the tendency of state-of-the-art
abstractive summarizers to omit or incorrectly substitute named entities, which can result in
misleading summaries and the spread of misinformation[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Our method addresses these issues
by initially training the summarization model on the NER task to enhance entity awareness,
thereby reducing hallucinations and inaccuracies in the final summary.
      </p>
      <p>
        The ILSUM 2023 shared task [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ][
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], centered on creating state-of-the-art models for
generating meaningful summaries for new articles, particularly in Indian languages, provides
participants with a collection of news articles and their summaries. This dataset encompasses English
and three major Indian languages: Hindi, Gujarati, and Bengali.
      </p>
      <p>The first section gives us an introduction of the whole paper, Section 2 describes some recent
works related to the fields of summarization and NER. Section 3 briefly explains the task and
describes the dataset we are working on. Section 4 explains our NEA-ATS methodology along
with the PLMs used for our task. While, Section 5 and 6 describe our experimentation method,
the results obtained and the insights we got from our experiment. Section 7 concludes our
work.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related work</title>
      <p>In this section, we describe some recent works and advancements done in the area of Hindi
language summarization and named-entity recognition.</p>
      <p>Text summarization is a constant growing field of research and recently summarization for
Indian languages, particularly Hindi has picked up a lot of pace. As summarization can either
be extractive or abstractive, lots of works can be found that were done in this field.</p>
      <p>
        Chestha et al. in their work [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] present a novel approach where both extractive and
abstractive summarization techniques are proposed and compared for Hindi text documents. The
study introduces a ward hierarchical agglomerative clustering method. The comparison
between the extractive and abstractive methods provides valuable insights into the efectiveness
and applicability of these techniques in processing Hindi text documents.
      </p>
      <p>
        Kumar et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] introduce a Generative Adversarial Network (GAN)-based model for
abstractive text summarization. The model is composed of a generator and two discriminators,
where the Similarity Discriminator ensures that the generated summary maintains a high
degree of similarity with the original text. This innovative approach addresses the challenge of
maintaining coherence and relevance in generated summaries.
      </p>
      <p>
        The work [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] by Rishab et al. tackles the challenge of limited datasets in Indian regional
languages, particularly Hindi. They introduce two innovative deep learning models for text
summarization, following an abstractive methodology. These models utilize attention
mechanisms and are built upon a Stacked LSTM Sequence To Sequence (Seq2Seq) framework. The
research is noteworthy for its emphasis on regional linguistic diversity and for addressing the
issue of dataset shortages in this field.
      </p>
      <p>
        Richa et al. [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] explore the Named Entity Recognition (NER) process in detail,
emphasizing its importance in Information Extraction. It describes a dual-phase procedure for NER,
involving the detection and categorization of named entities into established groups. These
groups encompass a variety of types, including individuals, locations, organizations, numeric
expressions, temporal expressions, among others.. The use of neural language models and
Conditional Random Fields (CRF) for Hindi language NER signifies a notable advancement in
applying machine learning techniques to Indian languages.
      </p>
      <p>The recent rise in Indic language datasets like XL-Sum [10], WikiLingua [11], MassiveSumm
[12] and many others has also helped create Indic language summarization become an active
area of research.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Task Description</title>
      <p>
        Our task involves training a model and creating concise summaries for articles for the Hindi
language, which can be either extractive or abstractive. The dataset employed in this study
consists of article-headline pairs, which have been collected from several leading newspapers
across the country. The dataset provided covers English and major Indian languages such
as Hindi, Gujarati and Bengali. We use the combined Hindi dataset (ILSUM 2022[13] and
2023[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ][
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]) for our work and describe its structure below. Notably, while previous studies
in other languages have utilized news article-headline pairs, the dataset presents a unique
challenge due to the presence of code-mixing (mixing multiple diferent languages) and script
mixing. Very few works have been done till now to summarization code-mixed Hindi articles.
      </p>
      <p>Some examples of such code-mixing and script mixing in both headlines and articles, are
illustrated below:
• 31 िदसंबर 2022 तक SBFC फाइनेंस 16 राज्यों और दो क ें द्र शासत प्रद</p>
      <p>में मौजूद ह।ै
• 1957 के DMC एक्ट यानी Delhi Municipal Corporation Act के अनुसार, DMC एक्ट के अनुसार,
कु त्े का रजस्ट ्रेशन एक साल के लए मान्य होता ह ।ै
ेशों के 105 स े अधक शहरों</p>
      <p>The dataset was checked on various factors mentioned in [14] and checks were done on
cases such as:
1. Empty records
2. Duplicated Entries
3. Classification of summaries into extractive, semi-extractive, and abstractive categories.
4. Identification of article-summary pair where the first sentence of the article is the same
as the summary.
5. Evaluation of summary size to ensure it does not resemble the length of the actual article.</p>
      <p>Of the 21,225 records, one was found to contain an empty article composed solely of a
newline character, which was subsequently removed from the dataset. No duplicated entries were
detected. The distribution of summary types is presented in Table 1, with summaries
categorized as extractive if they precisely match sentences from the article, abstractive if no
sentence matches, and semi-extractive if some sentences match and some do not. Remarkably,
approximately 80% of the summaries fell into the abstractive or semi-extractive categories.
Consequently, the decision was made to employ abstractive summarization techniques for the
dataset.</p>
      <p>Descriptive statistics for the dataset are provided in Table 2 and Table 3. Multiple
tokenizers, IndicNLP[15], AlbertTokenizer[16], T5Tokenizer[17] and mBart50Tokenizer[18] were
employed for dataset analysis, yielding valuable insights into the dataset and tokenization
processes. Additionally, these insights guided the selection of hyperparameters for the models
used to train the dataset.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Methodology</title>
      <p>In news article summarization, two key approaches are extractive and abstractive
summarization. In our experiment, we use the abstractive method of summarization to provide summaries
for the Hindi news article. We also experiment with providing named-entity attention to the
model to improve the summaries. Firstly, we describe the models we employed for the purpose
of text summarization and NER and then we explain the methodology we experimented with
to provide named-entity aware summaries.</p>
      <sec id="sec-4-1">
        <title>4.1. Models used</title>
        <p>In this section, we describe the various PLMs used for our task of named-entity aware Hindi
summarization. PLMs, particularly deep transformer-based ones, have been instrumental in
advancing abstractive text summarization. These models, equipped with extensive knowledge
and vast parameters, have led to significant progress in the field of Natural Language
Processing (NLP). This progress has empowered text summarization by enabling the generation of
summaries that closely resemble human-authored content.[19]
• mBART-50[18] is a multilingual Seq2Seq model that undergoes pre-training with the
‘Multilingual Denoising Pretraining’ objective. This model demonstrates the feasibility
of developing multilingual translation models via multilingual fine-tuning. Unlike
conventional fine-tuning in a single direction, this approach fine-tunes a pretrained model
across multiple directions simultaneously. mBART-50 extends the capabilities of the
original mBART model and spanning a total of 50 languages. We used the mBART-Large-50
model having 610M parameters for fine-tuning.
• mT5[17] a multilingual version of T5. This model has been pretrained on a Common
Crawl-based dataset that has 101 languages. It operates within a unified ‘text-to-text’
framework, making it exceptionally versatile for various language tasks. Renowned for
its exceptional performance in multilingual applications, mT5 stands out for its capacity
to understand and generate text in numerous languages, making it a valuable tool for a
wide range of language-related tasks. We use only the mT5-base version of the model
having 580M parameters. Due to memory constraints, we were unable to use the mT5
large version which had 1.2B parameters.
• IndicBART[16], a multilingual sequence-to-sequence pretrained model, prioritizes
Indic languages and English. It accommodates 11 Indian languages, leveraging the mBART
architecture and orthographic similarities among Indic scripts for improved transfer
learning. Notably, its smaller size(244M) compared to models like mBART and
mT5(base) makes it a computationally eficient choice for fine-tuning and decoding tasks.
Alongside this model we also used the IndiBARTSS[16] model which was specifically
pretrained for short summarization tasks.</p>
        <p>In addition to the above models which provide Hindi text summarization we use another
PLM which provides us with the state-of-the-art ability of named entity recognition.
• HiNER-original-muril-base-cased[20] is a MuRIL based model fine-tuned on the NER
dataset HiNER (Hindi Named Entity Recognition) [20]. The dataset was compiled from
diverse government information webpages, and involved manual annotation of these
sentences. It comprises sentences extracted from ILCI and various other sources. This
model was used to get the named entities in an article.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Named Entity-Aware Summarization</title>
        <p>NER is a task in NLP that focuses on detecting and categorizing essential information (entities)
within a text into predetermined classes, including people’s names, organizations, places, dates,
and more. NER systems are designed to locate named entities in a body of text and classify them
according to a fixed set of categories, providing a way to extract structured information from
unstructured text sources.</p>
        <p>In the domain of abstractive summarization for news articles, the inclusion of NER emerges
as a crucial factor in enhancing the quality and depth of generated summaries. NER allows
the model to identify and prioritize key entities such as individuals, locations, and
organizations within the text. This strategic integration empowers the summarization process not only
to distill information but also to furnish context by highlighting vital elements. The
motivation behind incorporating NER lies in the ambition to create summaries that are both concise
and enriched with essential details, ensuring coherence and capturing critical contextual
nuances from the source news articles. The NER-augmented abstractive summarization approach
aims to produce reader-friendly and comprehensive summaries, encapsulating the
fundamental essence of news stories while underscoring significant entities.</p>
        <p>We experiment with the NEA-ATS method where the named entities are appended to the
article to provide attention to named entities along with the string ‘entity: ’ to show that the
sentence is a named entity sentence. Let us consider the set of all articles as  , for an input
article  ∈ , we use an NER model  () to get a set of unique named entities  = {  ∣   ∈
 (),  ∈  } (article is constrained by the maximum tokens allowed for the summarization
model). We then create a sentence using the named entities and append it to the start of the
article creating the modified article  ′ given by  ′ = { ∶  1,  2,  3....,   | ∣   ∈ } . These
modified articles are then fed to the summarization model along with the summary  to
finetune the model. Adding the named entities in the start of the article makes the model focus
more on named entities as they are considered to be of higher importance (first input to the
model). This process can be visible in Fig. 1.</p>
        <p>An example of applying the above method and getting the modified article is (starting
sentences of the article are shown):
• Original: के रल के एनार्कु लम जले म ें 5 साल क बच्ी स े रपे के बाद गला दबाकर हत्या कर दी
गई। आरोपी ने बच्ी का शव बोरे म ें डालकर डंिपग ग्राउ ंड में फें क िदया था। पुलस ने आरोपी शख्स को
िगरफ्तार कर लया ह ।ै घटना शुक्रवार शाम क ह ।ै पुलस ने शिनवार को मीडया को इसक जानकारी
दी।
• Modified Article: entity: के रल, एनार्कु लम, 5 साल, शुक्रवार शाम, शिनवार, िवव ेक कु मार, रात
9. 30 बजे, िबहार, सुबह, कांग्रेस, िवधानसभा, वीडी सतीशन, सुधाकरण। के रल के एनार्क ु लम जले</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Experiments</title>
      <p>As we only had training dataset for our experiment, we split the dataset randomly into 5 folds.
This splitting is done so in a sense that we utilize 4 folds (80%) of the split for training and the
rest (20%) for validation. In the k-fold cross validation process, the training and validation was
done 5 times, where in each fold, a diferent fold is picked for validation and the rest for training.
As we did not have access to the validation dataset, it was necessary to split the training dataset
as k-fold cross validation helps us in choosing a good split along with generalizing the model.</p>
      <p>We use the PyTorch and HuggingFace libraries to train and test our models, where we trained
the models for a maximum of 10 epochs and selected the epoch giving the best validation
metrics along with checking for overfitting and underfitting. We used the ROUGE[ 21] metrics
(namely: ROUGE-1, ROUGE-2 and ROUGE-L) to compute the scores and evaluate the models.
As the standard ROUGE module does not support Indian languages, we use the Multilingual
ROUGE module[10] for our work.</p>
      <p>The named entities are extracted from the source articles using the
HINER-original-murilbase-cased[20] pretrained model, while we use four diferent models, mBART-50[ 18], mT5[17],
IndicBART[16] and IndicBARTSS[16] to fine-tune on the task of summarization. Each of these
four diferent models are trained on both the original dataset and the modified dataset
having named entities along with 5-fold cross validation. The model versions giving best
validation results were picked to provide the summaries for the test dataset. Hyperparameters for
the models were initially selected by observing the trends in the dataset and how the models
worked internally. These were more finely tuned when we started training the models. The
ifnal parameters used to train the models are given in Table 4. The default learning rate of 5 −5
was used to train all the models, while a beam size of 4 was taken to generate the predictions.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Results and Discussion</title>
      <p>The validation metrics for each fold for all the models are given in Table 5 to Table 8. Each
time the model was separately fine-tuned from scratch for each fold for both the named-entity
based and original articles. The validation results were compared and the top 5 performing
models were chosen to generate the summary (limited to 75 or 100 tokens) for the test dataset.
Some examples of the generated summaries are:
• Aam Aadmi Party (AAP) MP Raghav Chadha Controversy - संसद परसर म ें मगंलवार को
आम आदमी पाट (AAP) सांसद राघव चड्ढा के ऊपर कौआ ब ठै गया। राघव उस समय फोन पर बात
कर रहे थे। वे मानसून सत्र स े वापस लौट रहे थे, तभी ये घटना हुई</p>
      <p>The models chosen to generate test summary and their evaluation metrics are given in Table
9. BERT score(precision, recall and F1) was used along with the ROUGE scores for evalution.</p>
      <p>The evaluation findings indicate that the named entity aware mT5 model yields superior
outcomes, surpassing the original mBART-50 model which further outperforms other models. The
test metrics given in Table 9 also corroborate this claim, where the named entity aware mT5
model outperforms original mBART-50 on BERT scores, although it performs slightly poorly
based on the ROUGE scores. This observation suggests that models like mT5 and mBART-50,
which are fine-tuned on base models such as T5 and BART, are more efective than others like
IndicBART and IndicBARTSS. However, it’s noteworthy that named entity aware models show
diminished performance relative to their standard counterparts. This implies that while
focusing on named entities, the addition of named entity sentences in front of articles may disrupt
the text’s semantic integrity, leading to lower metric scores. This trend is further evident in
the test results presented in Table 9, where the original mBART-50 model surpasses the named
entity aware mT5 model for the ROUGE score albeit slightly.</p>
    </sec>
    <sec id="sec-7">
      <title>7. Conclusion</title>
      <p>This paper detailed the creation of the NEA-ATS method specifically tailored for Hindi. This
method addressed the complexities of Hindi’s syntax and morphology, as well as the
challenges posed by code-mixing. By incorporating advanced pretrained language models, such as
mBART-50[18], mT5[17], and IndicBART[16], the study emphasized the importance of named
entity recognition in improving the accuracy and relevance of the summaries.</p>
      <p>
        The approach involved refining summarization models to prioritize named entities, ensuring
that generated summaries were focused on key information. Extensive testing, including
experiments with datasets in Hindi, illustrated NEA-ATS method’s capability in producing detailed
and context-aware summaries. Although, integration of named entities sometimes afected
textual flow, these instances ofered valuable insights for future enhancements. To address
this issue, future research could explore alternative methods of entity attention that do not
compromise the semantic coherence of text. For ILSUM task [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ][
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], original mBART-50 and
named entity aware mT5 models outperformed other original and named entity aware models.
      </p>
      <p>In summary, this research marks an important advancement in text summarization for Indian
languages, especially Hindi. It underscores the critical role of named entity recognition in
abstractive summarization and sets a foundation for future explorations in this area.
model and crf for hindi language, Computer Speech Language 74 (2022) 101356.
URL: https://www.sciencedirect.com/science/article/pii/S0885230822000055. doi:https:
//doi.org/10.1016/j.csl.2022.101356.
[10] T. Hasan, A. Bhattacharjee, M. S. Islam, K. Mubasshir, Y.-F. Li, Y.-B. Kang, M. S.
Rahman, R. Shahriyar, XL-sum: Large-scale multilingual abstractive summarization for
44 languages, in: Findings of the Association for Computational Linguistics:
ACLIJCNLP 2021, Association for Computational Linguistics, Online, 2021, pp. 4693–4703.</p>
      <p>URL: https://aclanthology.org/2021.findings-acl.413.
[11] C. C. Faisal Ladhak, Esin Durmus, K. McKeown, Wikilingua: A new benchmark dataset
for multilingual abstractive summarization, in: Findings of EMNLP, 2020, 2020.
[12] D. Varab, N. Schluter, MassiveSumm: a very large-scale, very multilingual, news
summarisation dataset, in: Proceedings of the 2021 Conference on Empirical Methods in
Natural Language Processing, Association for Computational Linguistics, Online and Punta
Cana, Dominican Republic, 2021, pp. 10150–10161. URL: https://aclanthology.org/2021.
emnlp-main.797.
[13] S. Satapara, B. Modha, S. Modha, P. Mehta, Fire 2022 ilsum track: Indian language
summarization, in: Proceedings of the 14th Annual Meeting of the Forum for
Information Retrieval Evaluation, FIRE ’22, Association for Computing Machinery, New York,
NY, USA, 2023, p. 8–11. URL: https://doi.org/10.1145/3574318.3574328. doi:10.1145/
3574318.3574328.
[14] A. Urlana, S. M. Bhatt, N. Surange, M. Shrivastava, Indian language summarization using
pretrained sequence-to-sequence models (2022).
[15] A. Kunchukuttan, The IndicNLP Library, https://github.com/anoopkunchukuttan/indic_
nlp_library/blob/master/docs/indicnlp.pdf, 2020.
[16] R. Dabre, H. Shrotriya, A. Kunchukuttan, R. Puduppully, M. M. Khapra, P. Kumar,
Indicbart: A pre-trained model for indic natural language generation, in: Findings of the
Association for Computational Linguistics: ACL 2022, 2022, pp. 1849–1863.
[17] L. Xue, N. Constant, A. Roberts, M. Kale, R. Al-Rfou, A. Siddhant, A. Barua, C. Rafel, mt5:
A massively multilingual pre-trained text-to-text transformer, in: Proceedings of the
2021 Conference of the North American Chapter of the Association for Computational
Linguistics: Human Language Technologies, 2021, pp. 483–498.
[18] Y. Tang, C. Tran, X. Li, P.-J. Chen, N. Goyal, V. Chaudhary, J. Gu, A. Fan,
Multilingual translation with extensible multilingual pretraining and finetuning, arXiv preprint
arXiv:2008.00401 (2020).
[19] A. A. Syed, F. L. Gaol, A. Boediman, T. Matsuo, W. Budiharto, A survey of abstractive
text summarization utilising pretrained language models, in: N. T. Nguyen, T. K. Tran,
U. Tukayev, T.-P. Hong, B. Trawiński, E. Szczerbicki (Eds.), Intelligent Information and
Database Systems, Springer International Publishing, Cham, 2022, pp. 532–544.
[20] R. Murthy, P. Bhattacharjee, R. Sharnagat, J. Khatri, D. Kanojia, P. Bhattacharyya, Hiner: A
large hindi named entity recognition dataset, in: Proceedings of the Thirteenth Language
Resources and Evaluation Conference, 2022, pp. 4467–4476.
[21] C.-Y. Lin, ROUGE: A package for automatic evaluation of summaries, in: Text
Summarization Branches Out, Association for Computational Linguistics, Barcelona, Spain, 2004,
pp. 74–81. URL: https://aclanthology.org/W04-1013.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M.</given-names>
            <surname>Tank</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Thakkar</surname>
          </string-name>
          ,
          <article-title>Abstractive text summarization using adversarial learning and deep neural network, Multimedia Tools and Applications (</article-title>
          <year>2023</year>
          ). URL: https://doi.org/10.1007/ s11042-023-17478-0. doi:
          <volume>10</volume>
          .1007/s11042- 023- 17478- 0.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>D.</given-names>
            <surname>Suleiman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Awajan</surname>
          </string-name>
          ,
          <article-title>Deep learning based abstractive text summarization: Approaches, datasets, evaluation measures, and challenges</article-title>
          ,
          <source>Mathematical Problems in Engineering</source>
          <year>2020</year>
          (
          <year>2020</year>
          )
          <article-title>9365340</article-title>
          . URL: https://doi.org/10.1155/
          <year>2020</year>
          /9365340. doi:
          <volume>10</volume>
          .1155/
          <year>2020</year>
          /9365340.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>S.</given-names>
            <surname>Berezin</surname>
          </string-name>
          , T. Batura,
          <article-title>Named entity inclusion in abstractive text summarization</article-title>
          ,
          <source>in: Third Workshop on Scholarly Document Processing</source>
          ,
          <year>2022</year>
          , p.
          <fpage>158</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>S.</given-names>
            <surname>Satapara</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Mehta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Modha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Ganguly</surname>
          </string-name>
          ,
          <article-title>Indian language summarization at fire 2023, in: Proceedings of the 15th Annual Meeting of the Forum for Information Retrieval Evaluation</article-title>
          ,
          <string-name>
            <surname>FIRE</surname>
          </string-name>
          <year>2023</year>
          , Goa,
          <source>India. December 15-18</source>
          ,
          <year>2023</year>
          , ACM,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>S.</given-names>
            <surname>Satapara</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Mehta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Modha</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Ganguly</surname>
          </string-name>
          ,
          <article-title>Key takeaways from the second shared task on indian language summarization</article-title>
          (ilsum
          <year>2023</year>
          ), in: K. Ghosh,
          <string-name>
            <given-names>T.</given-names>
            <surname>Mandl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Majumder</surname>
          </string-name>
          , M. Mitra (Eds.), Working Notes of FIRE 2023 -
          <article-title>Forum for Information Retrieval Evaluation, Goa, India</article-title>
          .
          <source>December 15-18</source>
          ,
          <year>2023</year>
          , CEUR Workshop Proceedings, CEUR-WS.org,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>C.</given-names>
            <surname>Kwatra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Gupta</surname>
          </string-name>
          ,
          <article-title>Extractive and abstractive summarization for hindi text using hierarchical clustering</article-title>
          ,
          <source>in: 2021 International Conference on Innovative Computing, Intelligent Communication and Smart Electrical Systems (ICSES)</source>
          ,
          <year>2021</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>6</lpage>
          . doi:
          <volume>10</volume>
          .1109/ ICSES52305.
          <year>2021</year>
          .
          <volume>9633789</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>S.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Solanki</surname>
          </string-name>
          ,
          <article-title>An abstractive text summarization technique using transformer model with self-attention mechanism</article-title>
          ,
          <source>Neural Computing and Applications</source>
          <volume>35</volume>
          (
          <year>2023</year>
          )
          <fpage>18603</fpage>
          -
          <lpage>18622</lpage>
          . URL: https://doi.org/10.1007/s00521-023-08687-7. doi:
          <volume>10</volume>
          .1007/ s00521- 023- 08687- 7.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>R.</given-names>
            <surname>Karmakar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Nirantar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Kurunkar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Hiremath</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Chaudhari</surname>
          </string-name>
          ,
          <article-title>Indian regional language abstractive text summarization using attention-based lstm neural network</article-title>
          ,
          <source>in: 2021 International Conference on Intelligent Technologies (CONIT)</source>
          ,
          <year>2021</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>8</lpage>
          . doi:
          <volume>10</volume>
          .1109/CONIT51480.
          <year>2021</year>
          .
          <volume>9498309</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>R.</given-names>
            <surname>Sharma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Morwal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Agarwal</surname>
          </string-name>
          ,
          <article-title>Named entity recognition using neural language</article-title>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>