<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Brasilia,
Brazil
* Corresponding author.
$ senay.kafkas@kaust.edu.sa (Ş. Kafkas); sumyyah.toonsi@kaust.edu.sa (S. Toonsi); sakhaa.alsaedi@kaust.edu.sa
(S. Alsaedi)</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Natural Language Processing Tutorial for Biomedical Text Mining - Abstract</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Şenay Kafkas</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sumyyah Toonsi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sakhaa Alsaedi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Computational Bioscience Research Center, King Abdullah University of Science and Technology (KAUST)</institution>
          ,
          <addr-line>Thuwal, 23955, Kingdom of</addr-line>
          <country country="SA">Saudi Arabia</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Computer, Electrical and Mathematical Sciences &amp; Engineering (CEMSE) Division, King Abdullah University of Science and Technology (KAUST)</institution>
          ,
          <addr-line>Thuwal, 23955, Kingdom of</addr-line>
          <country country="SA">Saudi Arabia</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <volume>000</volume>
      <fpage>0</fpage>
      <lpage>0001</lpage>
      <abstract>
        <p>In this tutorial, we introduce Natural Language Processing (NLP) for text mining (TM) in the biomedical domain. The tutorial is structured such that each main concept is backed by hands-on exercises. We start by introducing the diference between NLP and TM. Then continue with motivating the need for text mining in the biomedical domain. Next, we introduce basic concepts of NLP tasks such as Named Entity Recognition (NER), Named Entity Normalization (NEN), and Relationship Extraction (RE. We also cover in detail the widely used methods being used to implement these tasks. These methods include dictionary/ontology-based, rule-based, and advanced machine/deep learning-based approaches. In particular, we cover language models like Word2Vec and transformers (e.g. BERT) and their applications. Furthermore, we discuss the recent advancements in NLP by focusing on the large language models covering GPT, ChatGPT, and others. We conclude our tutorial by discussing limitations and ethics in NLP where we cover the best practices to develop state-of-the-art NLP and TM tools. The learning objectives of this tutorial are: • Diferences between NLP and TM • The need for biomedical TM • Implementation of fundamental NLP tasks: NER, NEN and RE • Current and future trends in NLP • Limitations and ethics in NLP and biomedical TM The learning outcomes of this tutorial are: • Familiarity with current NLP techniques/tools being used for biomedical TM • Basic skills to use and develop fundamental NLP tools such as NER and RE • Familiarity with the current as well as expected future trends in NLP • Familiarity with the ethics and limitations of biomedical TM PCWrEooUrckResehdoinpgs ISSNc1e6u1r-3w-0s0.o7r3g</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Natural Language Processing</kwd>
        <kwd>Text mining</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Materials of this tutorial are available
ICBO-NLP-for-Biomedical-Text-Mining-tutorial/tree/main
from</p>
    </sec>
  </body>
  <back>
    <ref-list />
  </back>
</article>