<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">CatBoost model with self-explanatory capabilities for predicting SLE in OMAN population</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Hamza</forename><surname>Zidoum</surname></persName>
							<email>zidoum@squ.edu.com</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Computer</orgName>
								<orgName type="institution" key="instit1">Science Sultan Qaboos</orgName>
								<orgName type="institution" key="instit2">University</orgName>
								<address>
									<settlement>Muscat</settlement>
									<country key="OM">Oman</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Ali</forename><surname>Alshareedah</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Department of Computer</orgName>
								<orgName type="institution" key="instit1">Science Sultan Qaboos</orgName>
								<orgName type="institution" key="instit2">University</orgName>
								<address>
									<settlement>Muscat</settlement>
									<country key="OM">Oman</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Aliya</forename><surname>Al-Ansari</surname></persName>
							<email>alansari@squ.edu.om</email>
							<affiliation key="aff1">
								<orgName type="department">Department of Biology</orgName>
								<orgName type="institution" key="instit1">Sultan Qaboos</orgName>
								<orgName type="institution" key="instit2">University</orgName>
								<address>
									<settlement>Muscat</settlement>
									<country key="OM">Oman</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Batool</forename><surname>Al Lawati</surname></persName>
							<affiliation key="aff2">
								<orgName type="department">Department of Medicine Sultan Qaboos</orgName>
								<orgName type="institution">University</orgName>
								<address>
									<settlement>Muscat (Oman)</settlement>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">S</forename><surname>Al-Sawafi</surname></persName>
							<email>sksawafi@squ.edu.om</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Computer</orgName>
								<orgName type="institution" key="instit1">Science Sultan Qaboos</orgName>
								<orgName type="institution" key="instit2">University</orgName>
								<address>
									<settlement>Muscat</settlement>
									<country key="OM">Oman</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">CatBoost model with self-explanatory capabilities for predicting SLE in OMAN population</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">F39119F0D8D216EA16E30CFCF70A1C98</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T19:36+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Systemic lupus erythematosus</term>
					<term>CatBoost</term>
					<term>Feature selection</term>
					<term>Model interpretation</term>
					<term>SHAP1</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Systemic lupus erythematosus (SLE) presents as an autoimmune condition influenced by both genetic and environmental factors, showcasing a diverse range of clinical symptoms and often, unpredictable disease flares. Despite advancements in classification methods, the timely diagnosis of SLE remains a challenge for many patients. This research introduces an interpretable disease classification model that combines the robust predictive capabilities of CatBoost with the transparent interpretation tools offered by SHapley Additive exPlanations (SHAP). Trained on a local cohort comprising 219 Omani patients diagnosed with SLE and individuals with other control diseases, the CatBoost model demonstrates high performance. Moreover, utilizing the SHAP library enables the generation of individualized explanations for the model's decisions, highlighting key clinical features such as alopecia, renal disorders, cutaneous lupus, and hemolytic anemia, alongside patient age, which significantly contribute to the prediction process. The model achieved notable metrics, including an AUC score of 0.945 and an F1-score of 0.92, underscoring its efficacy in SLE prediction</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Systemic lupus erythematosus (SLE) stands as a chronic autoimmune disease affecting multiple systems of the body. Its clinical presentation varies across different races, genders, and age groups, rendering diagnosis challenging <ref type="bibr" target="#b0">[1]</ref>. Despite significant strides in SLE treatment strategies, diagnostic and therapeutic hurdles persist <ref type="bibr" target="#b1">[2]</ref>. Early diagnosis remains particularly problematic, given the gradual onset of SLE symptoms over years, alongside the potential for other conditions to mimic its manifestations, including infectious and hematologic diseases <ref type="bibr" target="#b2">[3]</ref>. Data analysis underscores the importance of diagnosing SLE within a narrow window, as delayed diagnosis correlates with increased flare rates, hospitalizations, and the risk of progressive organ damage, ultimately elevating mortality rates <ref type="bibr" target="#b3">[4]</ref>. In Oman, where the mortality rate stands at 5% and the mean prevalence at 38 per 100,000 individuals, limited research exists on the specific clinical and serologic characteristics of the Omani population. <ref type="bibr" target="#b4">[5]</ref>, <ref type="bibr" target="#b5">[6]</ref>. This study contributes significantly on three fronts: firstly, by identifying unique clinical manifestation patterns specific to the Omani population, filling a notable gap in the literature. Secondly, by introducing the CatBoost algorithm, renowned for its rapid computation, strong generalization, and high predictive accuracy, alongside leveraging advanced machine learning techniques such as the SHAP algorithm, RFECV-based feature selection, and GridSearchCV-based hyperparameter optimization. Thirdly, by integrating the model's predictions with interpretability algorithms, this research promotes self-explanatory models that empower physicians to cross-reference model outputs with their expertise, enhancing diagnostic accuracy and fostering the adoption of machine learning in healthcare.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Method</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>DATASET</head><p>The dataset utilized in this study originates from the Rheumatology clinic at Sultan Qaboos University Hospital. Approval for the study was granted by the Ethics Committee of the College of Medicine and Health Science at Sultan Qaboos University (MERC # 1418 and 1650). Data extraction involved both structured and unstructured sources, including the hospital's Electronic Medical Record (EMR) system named TrakCare, which stores patients' demographic information, medical states, and histories. While demographic data were directly obtained from TrakCare, clinical data were unstructured and retrieved from patients' medical histories in the form of clinical notes from each hospital visit. The dataset comprised records of Omani patients from 2006 to 2019 who met the entry criteria outlined by EULAR/ACR, which necessitate a positive Antinuclear Antibodies test (ANA test) followed by the application of additional classification criteria. Non-Omani patients and those with insufficient data were excluded. The dataset encompassed 214 Omani patient records, with 138 diagnosed with SLE, confirmed by rheumatologist assessment on a case-by-case basis, while the remaining 81 patients had other control diseases. Analysis revealed a female predominance of 92% and a mean age of 38. The Al Batinah Governorate accounted for the highest proportion of patients (37.9%), followed by Muscat (23.7%).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>FEATURE SELECTION</head><p>Initial data comprised 20 clinical and demographic variables (referred to as "features" in machine learning), with no missing values detected. Categorical features were encoded using Ordinal encoding, and Min-Max normalization was applied due to variations in feature ranges. To enhance signal-to-noise ratio and select the most informative features, recursive feature elimination (RFE) based on Random Forest (RF) with ten-fold cross-validation (CV) was employed. RFE iteratively builds models, identifies the best feature, selects it, and repeats the process until all features are traversed. We have developed four ML models to predict the presence or absence of SLE (Figure <ref type="figure" target="#fig_0">1</ref>). In addition to three common ML models that are multi-layer perceptron (MLP), support vector machine (SVM), and Random Forest, we introduced CatBoost [11] CatBoost is an ensemble learning algorithm, similar to gradient boosting, but with some unique features. Its implementation of ordered boosting helps it handle categorical variables more effectively, which is particularly useful in real-world datasets where categorical features are common. The Oblivious Tree structure and random permutations are additional techniques that contribute to its robustness and efficiency especially when dealing with categorical data. Ordered boosting calculates leaf values during the selection of the tree structure to reduce overfitting. Oblivious Tree structure is used to construct CatBoosts' model ensembles which means that all the leaves are in the same level and the same splitting criterion is applied to all intermediate nodes within the same level of tree. The use of Oblivious Tree structure greatly improves the performance speed and efficiency. Random permutations of the training examples are also applied to fight the prediction shift caused by a special kind of target leakage present in all existing implementations of gradient boosting algorithms <ref type="bibr" target="#b11">[12]</ref>. To train and validate the performance of our model, the dataset was divided into two parts in a 70:30 ratio (i.e. 70% of the dataset is used for training and 30% for testing). Additionally, a subset of the training data set was used for cross-validation to protect the models from overfitting and optimize the model's parameters. Due to the imbalanced nature of the data set, several parameters are used to evaluate the classification performance such as Recall, Specificity, F1-score, and AUC (Area Under ROC Curve). The problem with using imbalanced data set for classification is that the user is biased to the performance on cases that are poorly represented in the data samples <ref type="bibr" target="#b12">[13]</ref>. Standard evaluation criteria tend to focus the evaluation of the models on the most frequent cases, thus if applied, could lead to sub-optimal classification models. Each of the models undergoes a hyperparameter optimization through grid search with a five-fold cross-validation. Finally, to avoid reporting biased results and limit overfitting, we calculated the average of 10 repetitions for each model.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>MODEL INTERPRETATION</head><p>In clinical applications, the ability to justify the prediction is equally as important as the prediction score itself. This is because of the high sensitivity of the medical environment where misclassification could lead to devastating consequences. It is therefore challenging to trust complex ML models for a number of reasons. First, the models are often designed and rigorously trained on specific diseases in a narrow environment. Second, it depends on the user's technical knowledge of statistics and ML. Third, how the data is labeled affects the results produced by the model <ref type="bibr" target="#b13">[14]</ref>. For these reasons and more, Interpretable ML has thus emerged as an area of research that aims to design transparent and explainable models through developing means to transform a black-box ML model into a white-box ML model. By providing transparent prediction, domain experts can accurately interpret the results meaningfully. In 2017, Lundberg and Lee proposed a unified framework to interpret ML predictions. SHapley Additive exPlanations (SHAP) is derived from 'Shapley values', a concept that is commonly used within the field of cooperative game theory to determine the payout for each player within a cooperative coalition <ref type="bibr" target="#b13">[14]</ref>. Casting this concept onto prediction models, the payout is mapped as the final prediction while players are mapped as the model's features. The contribution of a feature to the final prediction can be determined by looking at the magnitude and sign of the Shapley value. Specifically, the importance of a feature relative to the payout (prediction) is represented by the magnitude of the related Shapley value. More importantly, this framework provides local and global interpretability simultaneously</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Results</head><p>Applying the RFE feature selection algorithm resulted in 13 optimal features (Figure <ref type="figure" target="#fig_1">2</ref>). In Table <ref type="table" target="#tab_1">3</ref>, the features that were selected are indicated with a True value. Overall, three demographic features, as well as 10 clinical features, were selected   Compared with the help of SHAP algorithm, we can break down each prediction individually. As a demonstration, we took two individuals from the testing set: a 40-year-old patient that was predicted to have the disease and a 56-year-old patient that was not. In table 5, the non-normalized features of the two patients are shown. The force plot attributes the positive prediction of patient 1 to renal disorders, and the patient's age (Figure <ref type="figure" target="#fig_3">4</ref>.a). Since the values in Figure <ref type="figure" target="#fig_3">4</ref> are normalized we cross-reference them with table 4, we find that the patient is 40 which falls within the age group SLE is most active in. Additionally, the patient has been diagnosed with Lupus Nephritis a disease that is commonly caused by an auto-immune disorder. In contrast, patient 2 (Figure <ref type="figure" target="#fig_3">4</ref>.b) displays a lack of any autoimmune manifestation, long disease duration, and the age of 56 makes him outside the age group that SLE is most active in. Looking at the waterfall plot in Figure <ref type="bibr">(5.a)</ref>, we find the feature with highest SHAP value for patient 1 is renal disorders by a large margin. Due to its high SHAP value, the presence of renal disorder in patient 1 had the greatest contribution to the positive prediction of SLE. This was followed by the age and province features. Overall, there were four blue features pushing the prediction probability lower toward class 0. The non-existence of alopecia, hemolytic anemia, and ACL in patient 1 profile in table 4 resulted in negative SHAP values. The remaining features had minimal impact on the prediction probability evident by their low SHAP values. The waterfall plot for patient 2 (Figure <ref type="figure" target="#fig_4">5</ref>.b) indicates that age is the largest contribution toward class 0, followed by the absence of any renal disorders. Ranking Features. In Figure <ref type="figure" target="#fig_5">6</ref>, the older the patient is the less likely it is to have SLE, which is evident by the red dots on the negative scale of SHAP values. The same can be said for disease duration, we find that long disease durations without autoimmune manifestation correlated with the absence of SLE. Experts point out, however, that SLE intensity increases and decreases at intervals differently from patient to patient, thus in rare occasions clinical symptoms might not manifest until late phases of the disease <ref type="bibr" target="#b14">[15]</ref>. Our result indicates that the higher the patient's age and disease duration the less likely that SLE is the cause. Renal disorders are ranked the highest in contribution followed by alopecia, Acute Cutaneous Lupus (ACL), and hemolytic anemia. The lowest contributing features are serositis, proteinuria, and leukopenia.</p><p>1. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Conclusions</head><p>In this study, the first SLE prediction model has been developed with our proposed selfexplainable framework that aims at establishing trust in ML prediction. SHAP interpretation tool was implemented to explain and justify individual predictions and thereby eliminate any risk of misclassification. Additionally, a minimum set of 13 early predictors achieved the highest scores of 0.95 AUC and 0.92 F1-score metrics. The dataset features comprise demographic and clinical symptoms available to physicians at early stages. By interpreting Catboost predictions, we found that four clinical features had the highest influence on the prediction in addition to the patient's age. The features were alopecia, renal disorders, cutaneous lupus, and hemolytic anemia. All are considered indicators of lupus activity at varying rates, combined with the patient's age and age-onset the model was able to establish a profile of the disease relative to the Omani population. With such scores, our model can predict with reasonable certainty the presence or absence of SLE. This can alert physicians to investigate further with the help of immunological tests such as antinuclear antibodies test and Anti-dsDNA test. Overall, our framework and its application can aid in providing a more practical introduction of machine learning and interpretation tools to medical diagnosis, thereby increasing the efficiency of medical testing and subsequently maximizing chances of disease mitigation and management. This is expected to reduce the cost, of medical care as well as decrease the cases of unmitigated severe cases of SLE.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Flowchart of the development and evaluation process</figDesc><graphic coords="3,186.45,235.99,222.02,123.45" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: Visualizing RFE's optimal number of features with 10-fold CV</figDesc><graphic coords="4,207.20,422.92,180.58,122.55" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: ROC plots for the 4 classification models considered in this work using the features produced by the RFE algorithm</figDesc><graphic coords="5,203.50,253.17,187.29,143.05" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Figure 4 :</head><label>4</label><figDesc>Figure 4: Force plot of CatBoost model prediction for patient 1 (values are normalized).</figDesc><graphic coords="6,107.45,240.07,380.06,147.50" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Figure 5 :</head><label>5</label><figDesc>Figure 5: Waterfall plot of CatBoost model for patient 1. The waterfall plot displays SHAP values representing feature contribution toward a positive prediction.</figDesc><graphic coords="6,129.10,542.82,336.72,156.75" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_5"><head>Figure 6 :</head><label>6</label><figDesc>Summery plot of CatBoost model. The summary plot combines feature importance with feature effects.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 Characteristics of patients.</head><label>1</label><figDesc></figDesc><table><row><cell>Feature</cell><cell>Category</cell><cell>Occurrence</cell></row><row><cell></cell><cell></cell><cell>(No. %,</cell></row><row><cell></cell><cell></cell><cell>N=219)</cell></row><row><cell>Fever</cell><cell>Yes/No</cell><cell>48 (21.9%)</cell></row><row><cell>Acute cutaneous lupus (ACL)</cell><cell>Yes/No</cell><cell>70 (31.9%)</cell></row><row><cell>Chronic cutaneous lupus</cell><cell>Yes/No</cell><cell>5 (2.28%)</cell></row><row><cell>Oral ulcers</cell><cell>Yes/No</cell><cell>29 (13.2%)</cell></row><row><cell>Alopecia</cell><cell>Yes/No</cell><cell>61 (27.8%)</cell></row><row><cell>Joint Involvement</cell><cell>Yes/No</cell><cell>202 (92.2%)</cell></row><row><cell>Serositis</cell><cell>Yes/No</cell><cell>9 (4%)</cell></row><row><cell>Renal Manifestation</cell><cell>Yes/No</cell><cell>62 (28.3%)</cell></row><row><cell>Lupus Nephritis class</cell><cell>None</cell><cell>112 (51 %)</cell></row><row><cell></cell><cell>Class II</cell><cell>1 (0.4%)</cell></row><row><cell></cell><cell>Class III</cell><cell>4 (1.8%)</cell></row><row><cell></cell><cell>Class IV</cell><cell>16 (7.3%)</cell></row><row><cell></cell><cell>Class V</cell><cell>5 (2 %)</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 3 Comparing between the different classifiers</head><label>3</label><figDesc></figDesc><table><row><cell>Model</cell><cell>Precision</cell><cell>Recall</cell><cell>F1-score</cell><cell>AUC</cell></row><row><cell>SVM</cell><cell>0.85</cell><cell>0.83</cell><cell>0.85</cell><cell>0.91</cell></row><row><cell>Random Forest</cell><cell>0.85</cell><cell>0.96</cell><cell>0.85</cell><cell>0.93</cell></row><row><cell>CatBoost</cell><cell>0.90</cell><cell>0.95</cell><cell>0.90</cell><cell>0.956</cell></row><row><cell>MLP</cell><cell>0.86</cell><cell>0.86</cell><cell>0.86</cell><cell>0.89</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 4 Non-normalized values for test patients 1 and 2</head><label>4</label><figDesc></figDesc><table><row><cell>Feature Name</cell><cell>Patient 1 (positive for SLE)</cell><cell>Patient 2 (Negative for SLE)</cell></row><row><cell>AGE</cell><cell>40</cell><cell>56</cell></row><row><cell>Disease_Duration</cell><cell>21</cell><cell>13</cell></row><row><cell>PROV</cell><cell>Dakhiliyah</cell><cell>Muscat</cell></row><row><cell>Fever</cell><cell>N</cell><cell>N</cell></row><row><cell>ACL</cell><cell>N</cell><cell>N</cell></row><row><cell>Oral_Ul</cell><cell>N</cell><cell>N</cell></row><row><cell>Alopecia</cell><cell>N</cell><cell>N</cell></row><row><cell>Serositis</cell><cell>N</cell><cell>N</cell></row><row><cell>Renal</cell><cell>Nephritis</cell><cell>N</cell></row></table></figure>
		</body>
		<back>

			<div type="funding">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Feature Name</head><p>Selected by RFE AGE True</p></div>
			</div>

			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Comparing between the performance of the different classifiers (Table <ref type="table">4</ref>), CatBoost had the highest AUC score of 0.956 providing a slight edge in performance (Figure <ref type="figure">3</ref>). This superiority in performance was also indicated in benchmarks against other recent classifiers (e.g. XGBoost and LightGBM) on a set of popular publicly available datasets <ref type="bibr" target="#b11">[12]</ref>. In training phase, each set of decision trees is built consecutively with successive trees focusing on minimizing the loss compared to previous trees.</p></div>			</div>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Diagnosis of systemic lupus erythematosus. Importance of antinuclear antibody titers and peripheral staining patterns</title>
		<author>
			<persName><forename type="first">R</forename><surname>Nisengard</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Archives of Dermatology</title>
		<imprint>
			<biblScope unit="volume">111</biblScope>
			<biblScope unit="issue">10</biblScope>
			<biblScope unit="page" from="1298" to="1300" />
			<date type="published" when="1975">1975</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">The history of lupus throughout the ages</title>
		<author>
			<persName><forename type="first">R</forename><surname>Felten</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Lipsker</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Sibilia</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Chasset</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Arnaud</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of the American Academy of Dermatology</title>
		<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">The Main Challenges in Systemic Lupus Erythematosus: Where Do We Stand?</title>
		<author>
			<persName><forename type="first">M</forename><surname>Piga</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Arnaud</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Clinical Medicine</title>
		<imprint>
			<biblScope unit="volume">10</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page">243</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Association between organ damage and mortality in systemic lupus erythematosus: a systematic review and meta-analysis</title>
		<author>
			<persName><forename type="first">I</forename><surname>Murimi-Worstell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Lin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Nab</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Kan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Onasanya</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Tierce</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">BMJ Open</title>
		<imprint>
			<biblScope unit="volume">10</biblScope>
			<biblScope unit="issue">5</biblScope>
			<biblScope unit="page">e031850</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">A multicenter longitudinal study of the prevalence and mortality rate of systemic lupus erythematosus patients in Oman: Oman Lupus Study</title>
		<author>
			<persName><forename type="first">N</forename><surname>Al-Adhoubi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Al-Balushi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Al</forename><surname>Salmi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Ali</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename></persName>
		</author>
		<author>
			<persName><forename type="first">Al</forename><surname>Lawati</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename></persName>
		</author>
		<author>
			<persName><forename type="first">Al</forename><surname>Lawati</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">International Journal of Rheumatic Diseases</title>
		<imprint>
			<biblScope unit="volume">24</biblScope>
			<biblScope unit="issue">6</biblScope>
			<biblScope unit="page" from="847" to="854" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Spectrum of systemic lupus erythematosus in Oman: from childhood to adulthood</title>
		<author>
			<persName><forename type="first">Al</forename><surname>Rasbi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Abdalla</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Sultan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Abdullah</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename></persName>
		</author>
		<author>
			<persName><forename type="first">Al</forename><surname>Kaabi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename></persName>
		</author>
		<author>
			<persName><forename type="first">Al-Zakwani I</forename></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Rheumatology International</title>
		<imprint>
			<biblScope unit="volume">38</biblScope>
			<biblScope unit="issue">9</biblScope>
			<biblScope unit="page" from="1691" to="1698" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">CatBoost for big data: an interdisciplinary review</title>
		<author>
			<persName><forename type="first">J</forename><surname>Hancock</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Khoshgoftaar</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Big Data</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="issue">1</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<title level="m" type="main">Benchmarking and Optimization of Gradient Boosting Decision Tree Algorithms</title>
		<author>
			<persName><forename type="first">A</forename><surname>Anghel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Papandreou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Parnell</surname></persName>
		</author>
		<idno>arXiv:180904559</idno>
		<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">European League Against Rheumatism (EULAR)/American College of Rheumatology (ACR) SLE classification criteria item performance</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">M</forename><surname>Brinks</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Dörner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Daikh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Mosca</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Annals of the Rheumatic Diseases</title>
		<imprint>
			<biblScope unit="volume">80</biblScope>
			<biblScope unit="issue">6</biblScope>
			<biblScope unit="page" from="775" to="781" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<monogr>
		<title level="m" type="main">Categorical variables in developmental research</title>
		<author>
			<persName><forename type="first">A</forename><surname>Eye</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Clogg</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1996">1996</date>
			<publisher>Academic Press</publisher>
			<pubPlace>San Diego</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<monogr>
		<author>
			<persName><forename type="first">L</forename><surname>Prokhorenkova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Gusev</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Vorobev</surname></persName>
		</author>
		<idno>arXiv:170609516</idno>
		<title level="m">CatBoost: unbiased boosting with categorical features</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<monogr>
		<author>
			<persName><forename type="first">A</forename><surname>Dorogush</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Ershov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Gulin</surname></persName>
		</author>
		<idno type="arXiv">arXiv:1810.11363</idno>
		<title level="m">CatBoost: gradient boosting with categorical features support</title>
				<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<title level="m" type="main">A survey of predictive modelling under imbalanced distributions</title>
		<author>
			<persName><forename type="first">P</forename><surname>Branco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Torgo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Ribeiro</surname></persName>
		</author>
		<idno>ArXiv:1505.01658</idno>
		<imprint>
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<monogr>
		<title level="m" type="main">A Unified Approach to Interpreting Model Predictions</title>
		<author>
			<persName><forename type="first">S</forename><surname>Lundberg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S-I</forename><surname>Lee</surname></persName>
		</author>
		<idno>arXiv:170507874</idno>
		<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Clinical Features and Prognosis of Late-onset Systemic Lupus Erythematosus: Results from the 1000 Faces of Lupus Study</title>
		<author>
			<persName><forename type="first">S</forename><surname>Lalani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Pope</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Leon</forename><forename type="middle">F</forename><surname>De</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Peschken</forename></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">The Journal of Rheumatology</title>
		<imprint>
			<biblScope unit="volume">37</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="38" to="44" />
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">When to order an antinuclear antibody test</title>
		<author>
			<persName><forename type="first">A</forename><surname>Binder</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Ellis</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">BMJ</title>
		<imprint>
			<biblScope unit="volume">347</biblScope>
			<biblScope unit="issue">aug21</biblScope>
			<biblScope unit="page" from="5060" to="5060" />
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Sensitivity and specificity of ANA and anti-dsDNA in the diagnosis of systemic lupus erythematosus: A comparison using control sera obtained from healthy individuals and patients with multiple medical problems</title>
		<author>
			<persName><forename type="first">R</forename><surname>Wichainun</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Asian Pacific Journal of Allergy and Immunology</title>
		<imprint>
			<biblScope unit="volume">31</biblScope>
			<biblScope unit="issue">4</biblScope>
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Late-Onset Systemic Lupus Erythematosus. Drugs &amp;amp</title>
		<author>
			<persName><forename type="first">L</forename><surname>Arnaud</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Mathian</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Boddaert</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Amoura</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Aging</title>
		<imprint>
			<biblScope unit="volume">29</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="181" to="189" />
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Rituximab in Systemic Lupus Erythematosus and Lupus Nephritis</title>
		<author>
			<persName><forename type="first">H</forename><surname>Beckwith</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Lightstone</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Nephron Clinical Practice</title>
		<imprint>
			<biblScope unit="volume">128</biblScope>
			<biblScope unit="issue">3-4</biblScope>
			<biblScope unit="page" from="250" to="254" />
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Systemic lupus erythematosus, lupus nephritis and end-stage renal disease: a pragmatic review mapping disease severity and progression</title>
		<author>
			<persName><forename type="first">A</forename><surname>Mahajan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Amelio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Gairy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Kaur</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Levy</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Roth</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Lupus</title>
		<imprint>
			<biblScope unit="volume">29</biblScope>
			<biblScope unit="issue">9</biblScope>
			<biblScope unit="page" from="1011" to="1020" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Clinical and Immunological Biomarkers for Systemic Lupus Erythematosus</title>
		<author>
			<persName><forename type="first">H</forename><surname>Yu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Nagafuchi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Fujio</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Biomolecules</title>
		<imprint>
			<biblScope unit="volume">11</biblScope>
			<biblScope unit="issue">7</biblScope>
			<biblScope unit="page">928</biblScope>
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Anaemia in systemic lupus erythematosus: from pathophysiology to clinical assessment</title>
		<author>
			<persName><forename type="first">S</forename><surname>Giannouli</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Annals of the Rheumatic Diseases</title>
		<imprint>
			<biblScope unit="volume">65</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="144" to="148" />
			<date type="published" when="2006">2006</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Diagnosis of systemic lupus erythematosus. Importance of antinuclear antibody titers and peripheral staining patterns</title>
		<author>
			<persName><forename type="first">R</forename><surname>Nisengard</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Archives of Dermatology</title>
		<imprint>
			<biblScope unit="volume">111</biblScope>
			<biblScope unit="issue">10</biblScope>
			<biblScope unit="page" from="1298" to="1300" />
			<date type="published" when="1975">1975</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
