<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Exploring Chronic Disease Trends among Adults in the USA: A Statistical Analysis with Visual Insights</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Vuong M. Ngo</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Geetika Sood</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fionnuala Donohue</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Patricia Kearney</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jennifer Pallin</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mark Roantree</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Ho Chi Minh City Open University</institution>
          ,
          <addr-line>Ho Chi Minh City</addr-line>
          ,
          <country country="VN">Vietnam</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Insight, School of Computing, Dublin City University</institution>
          ,
          <country country="IE">Ireland</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>National Health Intelligence Unit, Health Service Executive</institution>
          ,
          <country country="IE">Ireland</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>School of Public Health, University College Cork</institution>
          ,
          <country country="IE">Ireland</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Chronic diseases are some of the most widespread and expensive health conditions in the United States. This study evaluates the prevalence of chronic diseases in the U.S. from 2011 to 2021, using data from the U.S. CDC, along with visualization and descriptive statistical analysis techniques. Despite increased awareness and advancements in prevention, diagnostics, and treatment, the prevalence of certain chronic diseases, such as obesity, diabetes, and asthma, continues to rise each year. Other conditions, such as kidney disease and heavy alcohol consumption, have shown little to no improvement. Notably, the prevalence of these diseases in each state is not directly correlated with population size but is instead influenced by various factors, including living conditions, unhealthy lifestyle habits, socioeconomic status, and access to healthcare services. For instance, Mississippi has an average obesity rate of 37.6% and an average diabetes rate of 12.8%, both of which are partially influenced by an average tobacco use rate of 23%. In comparison, the U.S. averages for obesity, diabetes, and tobacco use are 30.4%, 9.5%, and 18.3%, respectively.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;chronic kidney disease</kwd>
        <kwd>diabetes</kwd>
        <kwd>obesity</kwd>
        <kwd>asthma</kwd>
        <kwd>alcohol consumption</kwd>
        <kwd>U</kwd>
        <kwd>S</kwd>
        <kwd>states</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Chronic diseases like obesity, stroke, kidney disease and diabetes. Many of them can be prevented by
addressing four key risk behaviours: heavy alcohol consumption, tobacco use, physical inactivity and
poor nutrition. These risk behaviours, and associated chronic conditions, lead to reduced quality of life,
premature death, and increased health service use. Efective management of chronic disease requires
focusing on lifestyle changes and addressing social determinants of health that impact well-being [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]
      </p>
      <p>
        Chronic illnesses encompass a wide range of conditions persisting for a year or longer, necessitating
continual medical care and often restricting daily activities [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. They play a significant role in driving
US’s annual healthcare expenditure of $4.1 trillion. The global economic impact of chronic diseases is
projected to reach $47 trillion by 2030.[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. In U.S. approximately 45% of population lives with at least
one chronic disease, and this number continues to rise. Furthermore, one in four adults has two or more
chronic conditions, while more than half have at least one. [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ].
      </p>
      <p>
        To address the questions posed, our paper employs a comprehensive approach combining statistical
methods [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] and visualization techniques [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. We analyzed trends in chronic diseases and
conditions among adults in the US by examining data over time. This includes a variety of conditions
such as alcohol consumption, asthma, diabetes, lung diseases, and kidney conditions. By applying
time-series analysis from 2011 to 2021, we identified patterns and shifts in the prevalence of these
conditions, providing insights into how their impact has evolved.
      </p>
      <p>To determine regional variations, we employed geographical mapping and state-level comparisons.
This analysis enabled us to pinpoint which states exhibit the highest and lowest prevalence of chronic
diseases and conditions. Through the use of heat maps and bar graphs, we visually represented these
disparities, facilitating a clearer understanding of geographic trends and variations. Overall, our solution
integrates rigorous statistical analysis with efective visualization techniques to ofer a detailed and
insightful view of chronic disease trends, regional disparities, and the impact of alcohol consumption
on health outcomes.</p>
      <p>The remainder of this paper is organized as follows. In Section 2, we review the related work. Section
3 provides an overview of the dataset and the methods used to extract and present information related
to chronic diseases. In Section 4, we present and discuss statistical information and visualizations on
alcohol consumption, asthma, diabetes, kidney disease, and obstructive pulmonary conditions. Finally,
in Section 5, we conclude the paper and ofer insights for future research directions.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>
        Chronic diseases place a significant burden on Ireland’s healthcare system, as well as those of other
countries, requiring well-coordinated, long-term management strategies [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. The studies [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]
focus on current research into chronic disease management within Ireland. In [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], algorithms are
being developed to accurately and securely link patient records across multiple systems by utilizing
available data points such as medical history, demographic details and other relevant attributes. This
HL7-FHIR-based approach tackles the challenge of integrating data from diverse sources, while ensuring
compliance with protecting patient information and privacy regulations. In [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ], the authors present a
practical method for integrating four diferent HSE databases in Ireland into a unified system. Specifically,
they employed a Global Schema Layer dependent on a Local Schema Layer. These layers can be shared
and applied across multiple use cases (subject to governance) or used for individual case studies. The
paper demonstrates how 20 queries can extract new insights from the unified dataset—something not
achievable with the separate systems. However, neither paper provides clear information about the
relationship between chronic diseases over time, especially in a time series context.
      </p>
      <p>
        Several studies have applied statistical methods to extract information about chronic diseases, such
as [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] and [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. Beasley et al. [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] used statistical methods to address the risk factors and primary
prevention strategies for childhood asthma, including: (1) demographic, developmental, and lifestyle
factors (family history, genetics, urbanization, and stress); (2) infection-related factors (pertussis); (3)
medication use (antibiotics); (4) diet (salt intake); and (5) inhaled exposures (paternal smoking and
outdoor air pollution). O’Neill et al. [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] conducted a cross-sectional analysis of The Irish Longitudinal
Study on Ageing Wave 1, examining patients with type 2 diabetes. They identified financial barriers
in primary care for individuals with uncomplicated diabetes and noted that those ineligible for the
Cycle of Care program were more likely to manage their condition without medication. Baral et al.
[
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] analyzed mortality trends related to chronic lower respiratory diseases, focusing on gender, race,
and geographic disparities in the U.S. It highlighted a decline in mortality, especially among men and
non-Hispanic White populations, from 1999 to 2020. This paper can help you understand the trends in
respiratory-related chronic diseases and their demographic variations. However, each study addresses
only a specific chronic disease: Beasley et al. [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] on asthma, O’Neill et al. [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] on diabetes, and Baral
et al. [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] on lower respiratory diseases.
      </p>
      <p>
        In studying chronic diseases, the authors in [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] analyzed four critical conditions prevalent among
the elderly: heart disease, diabetes, chronic kidney disease, and hypertension. Unlike our approach,
their primary focus was on developing an advanced prediction system. This system utilized neural
network models to forecast disease outcomes and was designed for integration into mobile platforms,
enhancing accessibility for both patients and healthcare providers. The papers by Bauer et al. [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] and
Boersma et al. [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] both investigate chronic disease in the U.S. However, [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] is a survey paper that
presents strategies to reduce the burden of chronic diseases, whereas [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] is a statistical report showing
that, in 2018, 51.8% of U.S. adults had at least one chronic condition, and 27.2% had multiple chronic
conditions
      </p>
      <p>
        Similar to our study, Raghupathi et al. [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] analyzed the current state of various chronic diseases
across U.S. and its individual states, utilizing data from the Centers for Disease Control and Prevention
(CDC) and employing visualization and analytics techniques. Their study provides an in-depth analysis
of multiple categories of chronic diseases at the state level. By leveraging visual and descriptive analytics,
it sheds light on the relationships between behavioral habits, preventive health measures and chronic
conditions. However, unlike our study, their focus was on mental health, gender, ethnicity, and insurance
coverage. Additionally, their analysis only covered the years 2012 to 2014.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Materials and Methods</title>
      <sec id="sec-3-1">
        <title>3.1. Dataset</title>
        <p>
          The CDC’s National Center for Chronic Disease Prevention and Health Promotion (NCCDPHP) plays a
crucial role in supporting states in the collection of data on chronic diseases and key health indicators
through a variety of surveillance systems. This data is fundamental to the CDC’s ability to fully
understand the impact of chronic diseases on individuals and communities across the United States.
Additionally, it allows for the evaluation of the efectiveness of public health interventions aimed at
addressing these significant health challenges. In 2023, the NCCDPHP released the ’U.S. Chronic Disease
Indicators (CDI)’ dataset, comprising 1,185,676 records [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ].
        </p>
        <p>Description
15 years of collection from 2007 to 2021
50 states, including Alabama, Alaska, California, Guam, Hawaii, Iowa, Oklahoma, Texas,
Virginia, Washington
17 chronic diseases and indicators, such as Alcohol Consumption, Arthritis, Asthma,
Cancer, Cardiovascular Disease, Diabetes, Kidney Disease, Mental Health, Nutrition,
Physical Activity, Weight Status, Tobacco
Each topic includes several questions, such as: number of hospitalizations,
hospitalizations among Medicare-eligible persons aged 65 or older, mortality rate (percentage), and
the prevalence of the topic among adults aged 18 and above (percentage)
Includes Overall, Male, Female
Includes 8 categories: Hispanic, White, Black, Asian or Pacific Islander, American Indian
or Alaska Native, Asian Multiracial, non-Hispanic, Other
Data collected from 18 sources, including American Community Survey, Behavioral
Risk Factor Surveillance System, National Vital Statistics System, the U.S. Renal Data
System, the State Tobacco Activities Tracking and Evaluation System, and the Youth</p>
        <p>Risk Behavior Surveillance System.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Statistics and visualisation</title>
        <p>This study delves into chronic disease characteristics in the U.S., examining how population, behaviors,
and other health factors relate to these conditions at the state level and by years. Using visual analytics,
primarily descriptive, we analyze data sourced from CDC. Visual analytics merges computer analytics
with human insight, enabling real-time analysis and uncovering hidden insights [20, 21]. It transforms
data overload into opportunity and fosters transparent processing for analytical discussions. This
interdisciplinary research encompasses data mining, statistics, visualization, and more, highlighting the
scientific discipline of integrating diverse areas into visual analytics .</p>
        <p>Historically, automatic analysis techniques such as and data mining and statistics developed separately
from interaction and visualization methods. A significant advancement in visual analytics was the shift
from confirmatory data analysis—where charts were primarily used to present results—to exploratory
data analysis, which involves actively interacting with the data [22]. This transition underscored the
importance of interactive methods for data exploration and understanding.</p>
        <p>Combining statistical methods with visualization techniques ofers numerous benefits. Statistical
methods are instrumental in extracting and combining information from complex datasets, providing a
solid foundation for analysis. They enable the identification of patterns, relationships, and trends that
might not be immediately apparent. Visualization techniques then enhance this process by translating
these statistical insights into clear, intuitive graphical representations. This synergy allows for a
more comprehensive understanding of the data, as visualizations can reveal underlying structures and
anomalies that statistics alone might miss.</p>
        <p>Moreover, visualization facilitates the exploration of data by allowing users to interact with it
dynamically. This interaction enables iterative refinement of hypotheses and models, leading to deeper
insights and more informed decision-making. By visualizing statistical results, users can better grasp
the implications of their findings and communicate them more efectively to others</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Extracted Information and Discussion</title>
      <sec id="sec-4-1">
        <title>4.1. Heavy Alcohol Consumption</title>
        <p>Heavy alcohol consumption is a risk factor for many chronic diseases and conditions, including diabetes,
stroke, and cancer [23]. It is also ranked as the seventh global leading risk factor for death and disability
[24]. Figure 1 depicts the prevalence of heavy alcohol consumption among adults (aged ≥ 18) in the
United States over an 11-year period from 2011 to 2021. Additionally, it highlight the states with the
highest and lowest percentages of heavy alcohol consumption during this time frame. The National
Institute on Alcohol Abuse and Alcoholism defines heavy drinking as consuming five or more drinks
per day (male) or four or more drinks per day (female) [25].</p>
        <p>In Figure 1, states having high alcohol consumption are Montana, Alaska, South Carolina, Maine,
Hawaii, Oregon, South Dakota, Illinois, Iowa and Massachusetts. These states share several common
factors that may influence high alcohol consumption. Many, such as Montana, Alaska, Maine, and South
Dakota, have rural, outdoor-focused lifestyles, where social gatherings often center around activities
like hunting, fishing, and community events where alcohol is prevalent. Harsh winters in places like
Alaska, Maine, and South Dakota lead to more indoor socializing, often with alcohol. Tourism-heavy
states like Hawaii and Oregon see increased alcohol consumption due to visitor indulgence. Urban
centers in Illinois and Massachusetts have vibrant nightlife and hospitality industries that contribute to
higher alcohol use in these regions.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Asthma</title>
        <p>
          Asthma is a chronic lung condition that afects individuals of all ages. It is characterized by inflammation
and constriction of the muscles around the airways, which makes breathing more dificult for those
afected [ 26]. It is associated with reduced quality of life, increased hospitalisation, and premature
death[
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. Asthma onset is multi-factorial, with risk factors including smoking, air pollution, and other
airborne pollutants. In adults, obesity has been identified as a risk factor for late onset asthma [27].
        </p>
        <p>As shown in Figure 2, asthma prevalence in the United States increased between 2011 and 2021,
mirroring global trends. High asthma diagnosis rates in states such as New Hampshire, Maine, Rhode
Island, West Virginia, Vermont, Massachusetts, and Kentucky are driven by multiple factors. Poor
air quality due to industrial pollution and coal mining, particularly in West Virginia and Kentucky,
exacerbates respiratory conditions. In New England, cold climates increase indoor allergen exposure,
such as dust and mold, which can trigger asthma. Additionally, Kentucky and West Virginia’s high
smoking rates contribute to increased secondhand smoke exposure, while socioeconomic challenges,
including limited healthcare access, further elevate asthma prevalence, especially in rural and
lowincome communities.</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Diabetes</title>
        <p>Diabetes is a chronic metabolic disease characterised by elevated blood glucose levels [28]. There are
three main types: type 1, type 2 and gestational. Type 1 typically presents in childhood or adolescence,
and it occurs when the body is unable to produce insulin. Type 2 typically presents in adults, accounting
for 90% of all diabetes cases globally [29]. Elevated glucose level in people with diabetes leads to
microvascular and macrovascular complications, damaging the heart, blood vessels, eyes, kidneys, and
nerves [28]. As a result, diabetes is a leading cause of cardiovascular disease, blindness, kidney failure
and lower-limb amputation in almost all high-income countries [30].</p>
        <p>As shown in Figure 3, prevalence rates of diabetes have risen from 2011 to 2021, with Mississippi
consistently exhibiting the highest prevalence. This upward trend aligns with global patterns [31].
The high diabetes rates in states such as Mississippi, Alabama, Louisiana, and Kentucky can be
attributed to several interrelated factors. These states often have high obesity rates—a major risk factor
for diabetes—due to poor dietary habits and limited access to healthy food options. Socioeconomic
challenges, including lower income levels and reduced access to healthcare, further exacerbate the
issue by restricting opportunities for preventive care and early diagnosis. Additionally, socioeconomic
factors and lifestyle patterns in these states contribute to high levels of physical inactivity. Public
health disparities and lower levels of health education also play a role, as individuals may lack access to
resources necessary for efective disease management and prevention.</p>
      </sec>
      <sec id="sec-4-4">
        <title>4.4. Chronic Kidney Disease</title>
        <p>Chronic Kidney disease (CKD) occurs due to disease pathways that irreversibly alter the function and
structure of the kidney. It is typically diagnosed when signs of damage are seen over an at least 3-month
duration [32]. Risk factors for onset include diabetes and hypertension. Socio-economic factors also play
a role, with those within the lowest socioeconomic quarterly having a 60% higher risk of progressive
CKD than do those in the highest quarterly.</p>
        <p>Figure 4 illustrates the prevalence of CKD among adults in the US and its states over an 11-year
period, from 2011 to 2021. The high CKD rates in states such as Hawaii, North Carolina, Kentucky,
Georgia, Mississippi, Oklahoma, Louisiana, West Virginia, Texas, Delaware, Arizona, Nevada, Alabama,
Florida, and Washington. Many of these states experience high rates of diabetes and hypertension, two
leading risk factors for CKD. In particular, Mississippi, Alabama and West Virginia face high levels
of diabetes and obesity, exacerbating CKD prevalence. Additionally, states with high temperatures
and frequent droughts, such as Arizona and Nevada, can lead to dehydration, further stressing kidney
function. Socioeconomic factors, including limited access to healthcare in rural areas like Kentucky and
West Virginia, contribute to delayed diagnosis and treatment of CKD. High rates of obesity and poor
dietary habits in states like Louisiana and Georgia also increase the risk of developing CKD.</p>
      </sec>
      <sec id="sec-4-5">
        <title>4.5. Chronic Obstructive Pulmonary</title>
        <p>Chronic obstructive pulmonary disease (COPD) includes chronic bronchitis and emphysema and is a
long-term lung disease preventing airflow to the lungs, resulting in breathing problems [ 33]. Globally, it
is the fourth leading cause of death. The primary risk factor for onset is smoking, however, second-hand
smoke during pregnancy or early childhood and air pollution have been identified as risk factors. In
addition, those with COPD are also more likely to have other chronic diseases including asthma, heart
disease and diabetes.</p>
        <p>Figure 5 illustrates that Kentucky consistently has the highest prevalence of COPD, while Hawaii
has the lowest. High COPD rates in states like Kentucky, Alabama, Arkansas, and Louisiana are
linked to several key factors. These states often have elevated smoking rates, the primary risk factor
for COPD, with regions like Kentucky having strong tobacco industries that promote widespread
smoking. Additionally, high levels of industrial pollution and poor air quality in some areas contribute
to worsening respiratory conditions. Socioeconomic factors, such as lower income and limited access
to healthcare in rural areas, can lead to delays in diagnosing and treating COPD. Moreover, regions
with high poverty rates may have increased exposure to indoor pollutants, such as secondhand smoke
or poor housing conditions, which further deteriorates respiratory health.</p>
      </sec>
      <sec id="sec-4-6">
        <title>4.6. Correlation between Diseases</title>
        <p>Figure 6 shows the average percentages of populations in all states afected by chronic diseases or
health indicators, such as heavy alcohol consumption, diabetes, kidney disease, and obesity. It also
displays the population sizes of each state. Mississippi (MS) has the highest percentage of diabetes
at 12.8% and the second-highest obesity rate at 37.6%, with a population of approximately 37 million
people. South Carolina (SC) has high percentages of alcohol consumption and diabetes, both at 6.9%
and 11.4%, respectively, with a population of around 41.6 million. Arkansas (AR) and Alabama (AL)
show elevated rates of kidney disease and obesity, both at 3.1% and 36%, respectively.</p>
        <p>Figure 7 highlights trends in various chronic diseases and health behaviors between 2011 and 2021 in
U.S. Alcohol consumption fluctuated slightly, peaking at 7.2% in 2020 before decreasing to 6.5% in 2021.
Asthma rates saw a gradual increase, from 9.1% in 2011 to 10.1% in 2021. COPD prevalence remained
relatively stable, ranging from 6.2% to 6.4% between 2011 and 2020, with a slight decrease to 6.0% in
2021.</p>
        <p>Diabetes rates steadily increased from 9.0% in 2011 to 9.8% in 2021, with a peak of 10.0% in 2018.
Kidney disease prevalence remained relatively stable, rising slightly from 2.3% in 2011 to 2.7% in
2021. Obesity, however, showed a dramatic rise, starting from 27.6% in 2011 and reaching 34% in 2021,
signifying a critical public health concern. In contrast, tobacco use consistently declined, falling from
21.7% in 2011 to 15.0% in 2021.</p>
        <p>Figure 8 presents trends in several health indicators and diseases in California, with an average
population of 57.5 million, over the period from 2011 to 2021. Alcohol consumption showed fluctuations,
starting at 6.2% in 2011 and peaking at 6.4% in 2013, with variations throughout the years. By 2021, the
rate stood at 6.1%, indicating stable but moderate changes over time. Asthma rates remained relatively
steady with minor fluctuations, rising from 8.4% in 2011 to a peak of 9.3% in 2020 before slightly
decreasing to 8.8% in 2021. COPD prevalence stayed fairly stable, ranging between 4.0% and 5.1%, with
the highest rate recorded in 2020 at 5.1%, followed by a slight drop to 4.4% in 2021.</p>
        <p>Diabetes rates showed a steady and concerning upward trend, increasing from 8.5% in 2011 to 10.8%
in 2021. Kidney disease rates fluctuated between 2.3% and 3.2%, with the highest rate observed in 2017
(3.2%), followed by a decline to 2.3% in 2021. Tobacco use saw a significant decline, decreasing from
13.6% in 2011 to 8.8% in 2021, reflecting a positive public health trend in reducing smoking rates. Obesity
showed a notable rise from 23.7% in 2011 to 30.2% in 2020, with a slight decrease to 27.8% in 2021,
highlighting concerns over increasing inactivity throughout the decade.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusion and Future Work</title>
      <p>We utilized the U.S. CDC dataset on chronic diseases covering the years 2011 to 2021 for information
extraction and visualization. This dataset enabled us to analyze and present detailed trends for seven
major chronic diseases and health indicators: alcohol consumption, asthma, COPD, diabetes, CKD,
obesity, and tobacco use. Each indicator was visualized both independently and collectively, providing
clear insights into their prevalence across the U.S. and within individual states.</p>
      <p>Notably, the prevalence of these diseases in each state is not directly correlated with population size.
Instead, the rates are shaped by various factors, including living conditions, unhealthy lifestyle habits,
socio-economic status, and access to healthcare services. These factors contribute to the disparities in
chronic disease rates observed across diferent regions of the U.S. Our analysis underscores the need to
address not only individual health behaviors but also the broader socio-economic and environmental
factors to efectively mitigate the rise of these chronic conditions.</p>
      <p>In the future, we plan to use AI algorithms [34] to classify states based on features like economic
factors, disease rates, racial demographics, healthcare infrastructure, and lifestyle habits. This will help
identify patterns and group states with similar characteristics, ofering insights into the factors driving
chronic disease rates. We will also apply predictive algorithms to forecast the prevalence of specific
chronic diseases in individual states, using historical data and trends to anticipate future public health
challenges.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgement</title>
      <p>The research is part of the RECONNECT project: chRonic disEase: disCOvery, aNalysis aNd prEdiCTive
modelling. This publication has emanated from research conducted with the financial support of
Taighde Éireann – Research Ireland under Grant numbers 22/NCF/DR/11244 and 12/RC/2289_P2. For
the purpose of Open Access, the author has applied a CC BY public copyright licence to any Author
Accepted Manuscript version arising from this submission
about_data, 2023. [Online; accessed 08-May-2024].
[20] J. Heer, M. Bostock, Declarative language design for interactive visualization, in: Proceedings
of the 2010 ACM SIGCHI Conference on Human Factors in Computing Systems, ACM, Atlanta,
Georgia, USA, 2010, pp. 1263–1272. doi:10.1145/1753326.1753506.
[21] V. M. Ngo, M.-T. Kechadi, Electronic farming records – a framework for normalising agronomic
knowledge discovery, Computers and Electronics in Agriculture 184 (2021) 106074. doi:https:
//doi.org/10.1016/j.compag.2021.106074.
[22] D. A. Keim, F. Mansmann, J. Thomas, Visual analytics: How will we get there?, in: Proceedings of
the 2020 IEEE Conference on Visual Analytics Science and Technology (VAST), IEEE, 2020, pp.
1–7.
[23] World Health Organization, Global Status Report on Alcohol and Health 2018, World Health
Organization, 2018. URL: https://www.who.int/publications/i/item/9789241565639, accessed:
202405-20.
[24] M. Barbería-Latasa, A. Gea, M. A. Martínez-González, Alcohol, drinking pattern, and chronic
disease, Nutrients (2022). doi:10.3390/nu14091954.
[25] National Institute on Alcohol Abuse and Alcoholism (NIAAA), Drinking levels
deifned, 2023. URL: https://www.niaaa.nih.gov/alcohol-health/overview-alcohol-consumption/
moderate-binge-drinking, accessed: 2024-05-20.
[26] World Health Organisation, Asthma, 2024. URL: https://www.who.int/news-room/fact-sheets/
detail/asthma, accessed: 2024-05-20.
[27] M. E. Kuruvilla, K. Vanijcharoenkarn, J. A. Shih, F. E. H. Lee, Epidemiology and risk factors for
asthma, Respiratory Medicine 149 (2019) 16–22. doi:10.1016/J.RMED.2019.01.014.
[28] World Health Organisation, Diabetes, 2022. URL: https://www.who.int/health-topics/diabetes#
tab=tab_1, accessed: 2022-02-08.
[29] International Diabetes Federation, What is diabetes, 2020. URL: https://www.idf.org/aboutdiabetes/
what-is-diabetes.html, accessed: 2022-02-08.
[30] International Diabetes Federation, Diabetes complications, 2020. URL: https://idf.org/
aboutdiabetes/complications.html, accessed: 2022-02-08.
[31] J. E. Shaw, R. A. Sicree, P. Z. Zimmet, Global estimates of the prevalence of diabetes for 2010 and
2030, Diabetes Research and Clinical Practice 87 (2010) 4–14. doi:10.1016/J.DIABRES.2009.
10.007.
[32] A. C. Webster, E. V. Nagler, R. L. Morton, P. Masson, Chronic kidney disease, The Lancet 389
(2017) 1238–1252. doi:10.1016/S0140-6736(16)32064-5.
[33] Global Initiative for Chronic Obstructive Lung Disease, Global Strategy for the Diagnosis,
Management, and Prevention of Chronic Obstructive Pulmonary Disease, Global Initiative for Chronic
Obstructive Lung Disease, 2018.
[34] V. M. Ngo, T.-V. T. Duong, T.-B.-T. Nguyen, P. T. Nguyen, O. Conlan, An eficient classification
algorithm for traditional textile patterns from diferent cultures based on structures, J. Comput.
Cult. Herit. 14 (2021). doi:10.1145/3465381.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>K.</given-names>
            <surname>Hacker</surname>
          </string-name>
          ,
          <article-title>The burden of chronic disease</article-title>
          ,
          <source>Mayo Clinic Proceedings. Innovations, Quality &amp; Outcomes</source>
          <volume>8</volume>
          (
          <year>2024</year>
          )
          <fpage>112</fpage>
          -
          <lpage>119</lpage>
          . URL: https://doi.org/10.1016/j.mayocpiqo.
          <year>2023</year>
          .
          <volume>08</volume>
          .005. doi:
          <volume>10</volume>
          .1016/ j.mayocpiqo.
          <year>2023</year>
          .
          <volume>08</volume>
          .005.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>U</given-names>
            <surname>.-C. Team</surname>
          </string-name>
          ,
          <article-title>National Center for Chronic Disease Prevention and Health Promotion (NCCDPHP)</article-title>
          , https://www.cdc.gov/chronicdisease/about/index.htm,
          <year>2022</year>
          . [Online; accessed 08-May-2024].
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>H. K.</surname>
          </string-name>
          ,
          <article-title>The burden of chronic disease</article-title>
          ,
          <source>Mayo Clinic proceedings. Innovations, quality &amp; outcomes 8</source>
          (
          <year>2024</year>
          )
          <fpage>112</fpage>
          -
          <lpage>119</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.mayocpiqo.
          <year>2023</year>
          .
          <volume>08</volume>
          .005.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>B. W.</given-names>
            <surname>Ward</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. S.</given-names>
            <surname>Schiller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. A.</given-names>
            <surname>Goodman</surname>
          </string-name>
          ,
          <article-title>Multiple chronic conditions among us adults: A 2012 update</article-title>
          ,
          <source>Preventing Chronic Disease</source>
          <volume>11</volume>
          (
          <year>2014</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>P.</given-names>
            <surname>Boersma</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. I.</given-names>
            <surname>Black</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. W.</given-names>
            <surname>Ward</surname>
          </string-name>
          ,
          <article-title>Prevalence of multiple chronic conditions among us adults</article-title>
          ,
          <year>2018</year>
          ,
          <source>Preventing Chronic Disease</source>
          <volume>17</volume>
          (
          <year>2020</year>
          )
          <article-title>E106</article-title>
          . doi:
          <volume>10</volume>
          .5888/pcd17.
          <fpage>200130</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>A. E.</given-names>
            <surname>Bardach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. O.</given-names>
            <surname>Alcaraz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ciapponi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O. U.</given-names>
            <surname>Garay</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. P.</given-names>
            <surname>Riviere</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Palacios</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Cremonte</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Augustovski</surname>
          </string-name>
          ,
          <article-title>Alcohol consumption's attributable disease burden and cost-efectiveness of targeted public health interventions: a systematic review of mathematical models</article-title>
          ,
          <source>BMC Public Health</source>
          <volume>19</volume>
          (
          <year>2019</year>
          )
          <article-title>1378</article-title>
          . doi:
          <volume>10</volume>
          .1186/s12889-019-7771-4.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>V. M.</given-names>
            <surname>Ngo</surname>
          </string-name>
          , M.-T. Kechadi,
          <article-title>Crop knowledge discovery based on agricultural big data integration</article-title>
          ,
          <source>in: Proceedings of the 4th International Conference on Machine Learning and Soft Computing, ICMLSC '20</source>
          ,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          ,
          <year>2020</year>
          , p.
          <fpage>46</fpage>
          -
          <lpage>50</lpage>
          . doi:
          <volume>10</volume>
          .1145/3380688.3380705.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <article-title>[8] Institute for Health Metrics and Evaluation, Us health map</article-title>
          , https://vizhub.healthdata.org/ subnational/usa,
          <year>2024</year>
          . Accessed:
          <fpage>2024</fpage>
          -09-20.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>V. M.</given-names>
            <surname>Ngo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Gajula</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Thorpe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Mckeever</surname>
          </string-name>
          ,
          <article-title>Discovering child sexual abuse material creators' behaviors and preferences on the dark web</article-title>
          ,
          <source>Child Abuse &amp; Neglect</source>
          <volume>147</volume>
          (
          <year>2024</year>
          )
          <article-title>106558</article-title>
          . doi:https: //doi.org/10.1016/j.chiabu.
          <year>2023</year>
          .
          <volume>106558</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>Health</given-names>
            <surname>Service</surname>
          </string-name>
          <string-name>
            <surname>Executive</surname>
          </string-name>
          ,
          <article-title>Chronic disease management programme</article-title>
          , https://www.hse.ie/eng/ services/list/2/primarycare/chronic-disease
          <string-name>
            <surname>-</surname>
          </string-name>
          management-programme/,
          <year>2023</year>
          . Accessed:
          <issue>01</issue>
          <year>September 2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>V. M.</given-names>
            <surname>Ngo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Sood</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Donohue</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Kearney</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Buckley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Roantree</surname>
          </string-name>
          ,
          <article-title>Using hl7-fhir as an integration platform for chronic disease services management and planning in the irish healthcare sector</article-title>
          , in: Joint International Conference of ISEH,
          <source>ICEPH &amp; ISEG on Environment and Health</source>
          <year>2024</year>
          ,
          <year>2024</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>8</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>V. M.</given-names>
            <surname>Ngo</surname>
          </string-name>
          , G. Sood,
          <string-name>
            <given-names>P.</given-names>
            <surname>Kearney</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Donohue</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Nie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Roantree</surname>
          </string-name>
          ,
          <article-title>Query based construction of chronic disease datasets</article-title>
          ,
          <year>2024</year>
          . doi:
          <volume>10</volume>
          .48550/arXiv.2410.13880, arXiv-cs.
          <source>DB.</source>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>R.</given-names>
            <surname>Beasley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Semprini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. A.</given-names>
            <surname>Mitchell</surname>
          </string-name>
          ,
          <article-title>Risk factors for asthma: is prevention possible?</article-title>
          ,
          <source>Lancet</source>
          <volume>386</volume>
          (
          <year>2015</year>
          )
          <fpage>1075</fpage>
          -
          <lpage>1085</lpage>
          . doi:
          <volume>10</volume>
          .1016/S0140-
          <volume>6736</volume>
          (
          <issue>15</issue>
          )
          <fpage>00156</fpage>
          -
          <lpage>7</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>K. O'Neill</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>McHugh</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Kearney</surname>
          </string-name>
          ,
          <article-title>Cycle of care for people with diabetes: an equitable initiative?</article-title>
          ,
          <source>HRB Open Research</source>
          <volume>2</volume>
          (
          <year>2019</year>
          )
          <article-title>3</article-title>
          . doi:
          <volume>10</volume>
          .12688/hrbopenres.12890.1.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>N.</given-names>
            <surname>Baral</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. B. A.</given-names>
            <surname>Jabbar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Noor</surname>
          </string-name>
          , et al.,
          <article-title>Demographic and geographical trends in chronic lower respiratory diseases mortality in the united states</article-title>
          ,
          <year>1999</year>
          to 2020, Respiratory Research 25 (
          <year>2024</year>
          )
          <article-title>258</article-title>
          . doi:
          <volume>10</volume>
          .1186/s12931-024-02880-5.
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>J. Wu</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Chang</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Yu</surname>
          </string-name>
          ,
          <article-title>Efective data decision-making and transmission system based on mobile health for chronic disease management in the elderly</article-title>
          ,
          <source>IEEE Systems Journal</source>
          <volume>15</volume>
          (
          <year>2021</year>
          )
          <fpage>5537</fpage>
          -
          <lpage>5548</lpage>
          . doi:
          <volume>10</volume>
          .1109/JSYST.
          <year>2020</year>
          .
          <volume>3024816</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>U. E.</given-names>
            <surname>Bauer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. A.</given-names>
            <surname>Briss</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. A.</given-names>
            <surname>Goodman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. A.</given-names>
            <surname>Bowman</surname>
          </string-name>
          ,
          <article-title>Prevention of chronic disease in the 21st century: elimination of the leading preventable causes of premature death and disability in the usa</article-title>
          ,
          <source>The Lancet</source>
          <volume>384</volume>
          (
          <year>2014</year>
          )
          <fpage>45</fpage>
          -
          <lpage>52</lpage>
          . doi:
          <volume>10</volume>
          .1016/S0140-
          <volume>6736</volume>
          (
          <issue>14</issue>
          )
          <fpage>60648</fpage>
          -
          <lpage>6</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>W.</given-names>
            <surname>Raghupathi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Raghupathi</surname>
          </string-name>
          ,
          <article-title>An empirical study of chronic diseases in the united states: A visual analytics approach</article-title>
          ,
          <source>International Journal of Environmental Research and Public Health</source>
          <volume>15</volume>
          (
          <year>2018</year>
          )
          <article-title>431</article-title>
          . doi:
          <volume>10</volume>
          .3390/ijerph15030431.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>CDC</given-names>
            <surname>Authors</surname>
          </string-name>
          , U.S.
          <source>Chronic Disease Indicators (CDI)</source>
          ,
          <year>2023</year>
          Release, https://data.cdc.gov/ Chronic-Disease-Indicators/
          <article-title>U-S-Chronic-Disease-</article-title>
          <string-name>
            <surname>Indicators-CDI-</surname>
          </string-name>
          2023
          <string-name>
            <surname>-Release/</surname>
          </string-name>
          g4ie-h725/
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>