1Department of Informatics and Applied Mathematics, Russia n-Armenian University, Yerevan 0051, Armenia.
2Department of Computer Science, College of Staten Island (CSI), CUNY, City University of New York, New York, NY 10314, USA.
3Department of Arrhythmia, Yerevan State Medical University, Yerevan 0025, Armenia.
4Department of Information Systems, Ivannikov Institute for System Programming of the Russian Academy of Sciences, Moscow 109004, Russia.
5Department of Mathematics and Mathematical Modelling, Russian-Armenian University, Yerevan 0051, Armenia.
6Department of Cardiology, Yerevan State Medical University, Yerevan 0025, Armenia.
*Corresponding Author: Sos Agaian
Department of Computer Science, College of Staten Island (CSI), CUNY, City University of New York, New York, NY 10314, USA.
Email: sos.agaian@csi.cuny.edu
Received : May 22, 2023
Accepted : Jul 06, 2023
Published : Jul 13, 2023
Archived : www.jclinmedimages.org
Copyright : © Agaian S (2023).
Atrial Fibrillation (AFib) is the most common sustained arrhythmia, which causes significant morbidity and mortality among the population. AFib is strictly associated with other heart problems and can drastically increase the risk of transient ischemic attack, strokes, and heart failure. Various common risk factors, such as diabetes, hypertension, smoking, age, obesity, hyperlipidemia, and others, are common in patients with AFib. If someone has any of these risk factors, they are at risk for AFib heart failure, diabetes, or coronary artery disease. Thus, diagnosing early and treating it is essential. The implications of Ischemic Heart Disease (IHD), arterial hypertension (AH), and AFib have not been satisfactorily studied. This article aims to develop an effective Doctor-in-The-Loop machine learning system to enhance the performance of heart disease diagnosis by (a) extracting the most contributed to AFib risk predictors; (b) weighting the extracted features based on their effect on AFib progression; (c) generating a new dataset containing patients with IHD and AH; and (d) classifying AFib using the newly constructed dataset. Computer simulation results on the RIC AFib Dataset show that (1) the Age, Diastolic Blood Pressure, IL-6, TNF-alpha, and TNF-beta1 features have the highest effect on AFib progression, (2) the weighted risk predictors overperformed the AFib classification results reaching up to 98.36% accuracy. Also, the evaluation of the presented method on the UCI Heart Dataset shows that it achieved 91.67% heart disease classification accuracy, which is 6% higher than the Spencer et al. method.
Keywords: Doctor-in-the-loop; Atrial fibrillation; Weighted risk predictors; Classification.
Atrial fibrillation (AFib) is the most frequent cardiac arrhythmia in clinical practice [1-8]. It is a significant risk factor for ischemic stroke and provokes a significant economic burden, substantial morbidity, and mortality. Atrial fibrillation is a heart rhythm disorder caused by degeneration of the electrical impulses in the upper cardiac chambers (atria), resulting in a change from an organized heart rhythm to a rapid, chaotic rhythm. People with AFib are 5 to 7 times more likely to have a stroke than the general population average. The Ischemic Heart Disease (IHD, which means that the heart is not getting enough blood and oxygen) causes more deaths, morbidity, and financial burden than other diseases in western societies. It has been estimated that 33.5 million people worldwide are diagnosed with AFib [1-11]. AFib is associated with a range of cardiovascular conditions such as Arterial Hypertension (AH), Chronic Heart Failure (CHF), IHD, which are proven risk factors for the development, persistence and progression of AFib. Due to the increase in life expectancy improvement in such patients in survival rates, an increase in the prevalence of AFib is currently observed.
The association of AFib with IHD and AH is frequent. Therefore, many factors are possibly involved in the occurrence and development of AFib and its progression [11-20,22-23]. Practitioners are faced with correlating structural and functional features with symptoms such as inflammation and fibrosis.
In general, traditional patient information, management, and therapy systems suffer from an absence of intelligence. They successfully offer basic patient management capabilities to their end-users, but they do not provide substantial decision support functionalities or automation to lend a helping hand to clinicians. Also, the most commonly used AI systems are “humans-out-of-the-loop” ones that cannot understand the context and hence cannot reason about interventions and retrospections [24].
Additionally, medical machine learning approaches suffer from weakly structured and non-standardized training data, quality of features (clinical characteristics), rare events, or dealing with uncertainty. As mentioned in [19], biomedical data sets are full of uncertainty, incompleteness, etc., which entails severe machine learning performance limitations. For example, the dataset may miss essential data or be incomplete, noisy, dirty, imbalanced, or uninformative. Therefore, fully automated approaches are difficult or even impossible to implement, or at least the quality of results from automated procedures might be questionable [19,20]. To solve these problems, one needs a doctor’s guidance to influence with their experience the work of the machine learning system, including extracting features, decision support, forecasting, and finally, to move from a single-domain AFib classification to comprehensive description of AFib patients (see review on 2020 European Society of Cardiology guidelines) [24].
Recently, a new paradigm know as “Doctor-in-The-Loop” (DTL) has gained attention in the data-information-driven medicine community. It aims to use knowledge discovery to improve medical treatments with the “human-in-the-loop” concept and ensure that machine learning systems routinize the correct information [17,18].
The next question we will investigate in this article is which features we should use to create a predictive model. Quality of Features (QF) has been a critical issue in machine learning. Effectively distinguishing important and inadequately relevant features in terms of their imapct on particular heart diseases can improve the predictive models’ accuracy, particularly in learning performance. Various approaches have been proposed to address this issue for decades. QF is one of the main pieces of feature engineering and can be roughly divided into feature selection and feature weighting algorithms. Feature selection is the process of obtaining a subset from an original feature set according to a specific feature selection criterion. It isolates the most consistent, non-redundant, and relevant features in model construction. Also, feature selection is increasingly important as the size and complexity of the average dataset continue to grow exponentially [25]. Spencer et al. proposed three methods to select a set of features to improve the accuracy of heart disease classification on the UCI Heart Disease Dataset [1]. The best performing model was the BayesNet algorithm on Heart ChiSq features, which achieved an accuracy of 85%.
The goal of feature weighting is to estimate the relative importance of each feature and assign it a corresponding weight [26,27]. Alternatively, feature weighting approximates the optimal degree of an individual feature’s influence with a training set [28]. This study investigates the weight of feature’s impact on AFib progression and generates specialized databases.
The goal of this article is to develop a doctor-in-the-loop machine learning system to identify and weigh the risk predictors that contribute to the onset of AFib associated with IHD and AH, as well as perform AFib classification based on weighted predictors.
The main contributions of the proposed method are:
a) A novel “doctor-in-the-loop” AFib risk predictors identification system;
b) A novel dataset drove feature weights adjustment procedure for AFib progression with IHD and AH patients;
c) Evaluation results using the newly constructed dataset and UCI Heart Dataset.
The remainder of the article is organized as follows. A literature review on interest modeling in multiple and single application environments and weighting methodologies is discussed in Chapter 1. The proposed approach and system design are discussed in Chapter 2. Chapter 3 discusses the results. Chapter 4 presents the evaluation of the proposed method on benchmarking the UCI Heart Dataset. Finally, discussions and conclusions for future work are presented in Chapter 5 and Chapter 6.
Proposed methods
This section introduces the main workflow of the proposed method. Additionally, it presents the methodologies used in the computer simulation.
In this study, we develop a system to identify AFib in patients with IHD and AH using an ML-based computational approach with four primary techniques. These are predated by data collection, which fits into study analysis. The workflow of the proposed study is shown in Figure 1. It can be listed as follows:
Step 1: Expert-driven features (clinical characteristics) selection based on a statistical analysis of correlation coefficients.
Step 2: Risk predictor identification and weighting based on Generalized Linear Model (GLM) results.
Step 3: A new dataset constructed on weighted values of risk predictors.
Step 4: A Fib classification based on a new dataset.
Pearson correlation-based feature selection
Most feature-weighted algorithms do not consider the correlation between features, so the redundancy with interference harms the final classification results. Patients' clinical characteristics usually contain many parameters that may support disease diagnosis [16]. However, sometimes it leads to non-precise results. Hence, we analyzed the factors based on Pearson Correlation Coefficient (PCC; threshold equals 0.4) [29,30]. Sometimes, conclusions drawn from statistical analysis do not include the necessary domain expertise (medical knowledge) to provide the full picture. Thus, the feature selection criterion is based on PCC results and enriched with additional information and doctor know-how. Moreover, the experiments show that the doctor’s knowledge on feature selection improves the classification results (Table 3).
Regression for risk predictors identification
The second step of analyzing the clinical characteristics is done using selected features. A logistic regression model was used from the GLM family to identify the features that contribute the most to the disease’s progression [31-34]. It models how the "odds" of success for a binary response variable 𝑌 depend on a set of predictors. In the case of the logistic regression model, the GLM would have this form:
where 𝜋𝑖=𝑃𝑦𝑖=1,𝛽𝑗 is a regression coefficient, 𝑦𝑖 is an observation, 𝑥𝑖 is a set of corresponding predictors, 𝑃𝑦𝑖=1 is a probability of 𝑖𝑡ℎ patients having AFib.
Elastic Net (ENET) that is a combination of 𝑙1 (Lasso) and 𝑙2(Ridge) penalty functions [46] was used as a regularization method. ENET seeks to find the regression coefficients which can minimize the following representation:
Where 𝜆 ≥ 0 is the tuning (penalty, regularization, or complexity) parameter that regulates the strength of the penalty (linear shrinkage) by determining the relative importance of the data-dependent empirical error and the penalty term. 𝛼 sets the degree of mixing between Ridge and Lasso. The ENET is equivalent to the 𝑙1 when 𝛼=1 and as 𝛼 decreases towards 0, the ENET approaches the 𝑙2 regression. The automated variable selection is being made by 𝑙1, while 𝑙2 improves the prediction [35,36].
To get the best set of ENET optimal parameters, we plan to use the grid-search with the cross-validation method [36,37]. Grid-search is a widely used method that picks the best model parameters from a list of parameter options for a given optimization problem by automating the 'trial-and-error' method [46]. It is an exhaustive parallel search and will find the best way to tune the hyperparameters based on the training set. Cross-validation is the procedure of training learners using one set of data and testing it using a different group [36]. The inner optimization finds model parameters 𝛽, which minimize the training loss 𝐿𝑇𝑟𝑎𝑖𝑛 given hyperparameters 𝜆 and 𝛼 . The optimization procedure chooses 𝜆 and 𝛼 to reduce the validation loss 𝐿𝑉𝑎𝑙 [38].
Statistical significance was set at p< 0.05 to get those risk predictors contributing to the onset of AFib [31,39].
Risk predictors weighting method
As mentioned above, feature weighting aims to estimate the relative importance of each feature and assign it a corresponding weight. Alternatively, feature weighting approximates the optimal degree of individual features’ influence with a training set [28,40]. One may find the survey of the feature weighting method’s advantages and limitations in [40,41]. The authors mentioned that there are three different ways to use the weight feature: (1) setti ng weights for all features, (2) setting weight functions for each feature, named as Per-Feature weighting, and (3) setting weight functions for each feature within each class, named as Per-Class-Per-Feature weighting. We will focus on setting weights for the identified risk predictors, which mathematically can be represented as
where risk predictors’ weight is 𝑊 = {𝑤1,𝑤2,…,𝑤𝑛} and risk predictors are 𝐹={𝑓1,𝑓2,…,𝑓𝑛 }.𝑤𝑖 is being produced by a trained regression of GLM according to each feature impact on AFib progression.
The next technique of the study is the construction a new dataset. The dataset contains the weighted risk predictors that impact AFib progression the most. This way, we calculate the weight of each risk predictor based on its effect on disease progression and use the weight of each risk predicor instead of its clinical/laboratory value. This facilitates building a system that will provide AFib classification more accurately. For the classification purposes, five different supervised ML algorithms have been used: Logistic Regression, Gaussian NB, Decision Tree, Random Forest, LDA, and KNN [42,43]. Additionally, the novel system enables precise identification of AFib patients from low to high risks supporting intermediate risk levels. The performance of the constructed dataset is evaluated using some extensions of Receiver Operating Characteristic (ROC) [44-46,47,48].
Novel RIC AFib dataset
We observed 257 patients with IHD and AH hospitalized at the Department of Cardiac Arrhythmias of the Research Institute of Cardiology (RIC) in Armenia and some on an outpatient basis. This implies that RIC Dataset consists of data that is specific to the Armenian local population and its characteristics (atmospheric conditions, geographical location, social lifestyle). Of the total group of patients examined, the study to determine the stratification of risk factors for progression of AFib included 213 patients with paroxysmal, persistent, and permanent forms of AFib (classification ESC 2016, 2020). Forty-four patients with AH and IHD but without AFib similar in gender and age were examined as a control group. The considered classes are described in Table 1:
Class Type | Description |
---|---|
Paroxysmal | It occurs when a rapid, erratic heart rate begins suddenly and then stops on its own within 7 days. It is also known as intermittent AFib, often lasts less than 24 hours, and does not require treatment. |
Persistent | It begins spontaneously. It lasts at least 7 days and may or may not end on its way. |
Permanent | In people who have had AFib for a long time, the Heart may not be able to return to a normal rhythm. |
Control Group | The patient in this group does not have any kind of AFib symptoms. |
Patients inclusion criteria were a) systolic blood pressure 140 mmHg. and higher, diastolic - 90 mmHg. and higher (ESC 2018), b) unstable angina (Recommendations for the treatment of stable coronary artery disease ESC 2013), c) the presence of recurrent (paroxysmal, persistent) or chronic (permanent) forms of AFib (ESC 2016, 2020 classification).
The patient examination includes general clinical and additional research methods: hemogram, lipidogram, electrocardiogram, EchoCG, 24-hour Holter ECG monitoring, biochemical blood tests (determination of coagulogram, lipid spectrum, fibrinogen), quantitative determination of hs-CRP, cytokines-IL-6 and TNF levels, as well as a marker of fibrosis - TGF-beta1. Detailed information on the patients with IHD and AH containing the study dataset is described in Table 2.
In this study to identify and weight the rsik predictors, 213 patients with 3 forms of AFib were considered Positive AFib, and the rest (control group) as Negative AFib. After identification and weighting, the classification has been done on described 4 classes.
This section presents the computer simulation parts, introducing characteristic clinical analysis with the doctor’s authority, selected features weighting procedure, new dataset construction, and AFib classification.
Expert driven feature (clinical characteristic) selection
Demographic, clinical, and laboratory data of studied ischemic heart disease and arterial fibrillation patients are shown in Table 2. For this study, 257 patients were examined; and the number of clinical characteristics applied to each of them was 27. The suggested methodology for supporting the risk predictors' identification of AFib progression is based on the clinical characteristics collected from patients. Thus, the method of selecting non correlated characteristics as attributes of risk predictors' construction is considered [30,49]. The Pearson Correlation Coefficient (PCC) of all 27 clinical characteristics is shown in Figure 2.
According to the expert analysis and the PCC (a correlation coefficient of over 0.4), the following features have been eliminated: QRS, HF, LAD, LV EVD, LV ESV, PAP, Gender, FIBR, TIA, Systolic blood pressure, Hypertensive crisis, Pulse, and IVST. Figure 2 shows that there are clinical characteristics (LAV, TGF-beta1) which have higher correlation than the specified threshold (0.48), but we still keep them for further analysis. This part is confirmed by doctors with medical statements, as LAV and TGF-beta1 characteristics are crucial for AFib progression. The selection performance was evaluated by some classifiers. Table 3 shows the outcome. As we can see in Table 3, the classification results outperform while adding LAV and TGF-beta1 correlated characteristics.
Abbreviations | Clinicopathologic factors | Units | Groups |
---|---|---|---|
Gender | % | ||
Female Male |
121 136 |
||
HC | Hypertensive crisis | % | |
Yes No |
127 130 |
||
TIA | Transient ischemic attacks | % | |
Yes No |
55 202 |
||
IHD | Ischemic heart disease | % | |
Yes No |
225 32 |
||
MI | Myocardial infarction | % | |
Yes No |
61 196 |
||
Age | years | 59.09 ± 6.11 | |
SBP | Systolic blood pressure | mm/Hg | 161.98 ± 8.05 |
DBP | Diastolic blood pressure | mm/Hg | 96.71 ± 6.46 |
Pulse | Pulse | bpm | 89 ± 11.16 |
HF | Heart failure | % | 0.88 ± 0.81 |
QRS | QRS complex | 102.53 ± 10.58 | |
BMI | Body mass index | kg/m² | 30.54 ± 1.96 |
LAD | Left atrial diameter | mm | 40.21 ± 3.13 |
LAV | Left atrial volume | mL | 65.37 ± 10.81 |
LV EDD | Left ventricular end-diastolic diameter | mm | 54.19 ± 2.25 |
LV EDV | Left ventricular end-diastolic volume | mL | 112.76 ± 9.11 |
LV ESV | Left ventricular end-systolic volume | mL | 48.53 ± 9.58 |
IVST | Interventricular septum thickness | mm | 12.7 ± 1.3 |
LV PVRT | Isovolumetric relaxation time | ms | 12.02 ± 0.97 |
LVWT | Left ventricle posterior wall thickness | mm | 0.46 ± 0.02 |
EF | Ejection fraction | % | 48.41 ± 2.8 |
PAP | Pulmonary arterial pressure | mm/Hg | 25.91 ± 7.2 |
FIBR | Fibrinogen | mcm/l | 13.19 ± 2.23 |
CRP | C-reactive protein | mg/l | 4.36 ± 1.85 |
IL-6 | nterleukine-6 | pg/ml | 26.57 ± 11.58 |
TNF-alpha | Tumor necrosis factor -alpha | pg/ml | 0.43 ± 2.82 |
TGF-beta1 | Transforming growth factor- beta1 | pg/ml | 708.25 ± 220.0 |
Classifier Name | Without Expert & Pearson Correlation | With Expert & Pearson Correlation | ||||||
---|---|---|---|---|---|---|---|---|
Sensitivity (%) | Precision (%) | F1-score (%) | Accuracy (%) | Sensitivity (%) | Precision (%) | F1-score (%) | Accuracy (%) | |
Logistic Regression | 83.93 | 82.62 | 83.17 | 84.62 | 86.20 | 84.25 | 85.00 | 86.15 |
Decision Tree Classifier | 82.76 | 81.00 | 81.67 | 83.08 | 86.25 | 86.25 | 86.25 | 87.69 |
Random Forest Classifier | 81.30 | 81.00 | 81.65 | 83.08 | 88.37 | 84.37 | 84.16 | 84.62 |
Linear Discriminant Analysis | 86.25 | 86.25 | 86.25 | 87.69 | 93.07 | 90.75 | 91.66 | 92.31 |
The performance evaluation criteria are defined in Table 4: where True Positives is the number of correctly classified patients with AFib, True Negatives is the number of correctly classified patients without AFib, False Positives is the number of incorrectly classified patients with AFib, False Negatives is the number of incorrectly classified patients without AFib [42,43].
Risk predictors identification and weighting
A Generalized Linear Model was built to find the predictors contributing to the onset of AFib. The selection of tuning parameters 𝜆 and 𝛼 has been done via an optimum value found through the GridSearchCV minimization approach [36]. The ranges for grid search were 𝛼 values between 0 and 1 with a 0.01 separation and 𝜆 values from 1e-5 to 100. The experimental results show that ENET regularization behaves properly when 𝜆 and 𝛼 hyperparameters are set as follows: 𝛼=0.5 to provide an equal contribution of each penalty to the loss function, and 𝜆=0.004 weight to the regularization. Some results are shown in Table 5.
p< 0.05 statistically significant level was considered for risk predictors’ selection [31,32,39]. The results according to the best hyperparameters are shown in Table 6. This provides us with five features: Age, Diastolic blood pressure (DBP), IL-6, TNF-alpha, and TGF-beta1.
𝜆 | 𝛼 | Generated risk predictors |
---|---|---|
0.5 | 0.0 (Ridge) | No predictor (all p > 0.05) |
0.05 | 0.25 | IL-6, TGF-beta1 |
0.004 | 0.5 | Age, DBP, IL-6, TNF-alpha, TGF-beta1 |
10.0 | 0.75 | No predictor (all p = 1.0) |
0.02 | 1.0 (Lasso) | IL-6, TGF-beta1 |
P value | |
---|---|
LVWT | 0.565 |
LV PWD | 0.426 |
BMI | 0.529 |
MI | 0.074 |
IHD | 0.253 |
AGE | 0.017 |
LV EDD | 0.430 |
CRP | 0.594 |
TNF-alpha | 0.005 |
TGF-beta1 | 0.000 |
IL-6 | 0.000 |
Diastolic Blood Pressure | 0.025 |
LAV | 0.126 |
EF | 0.183 |
These predictors serve as independent risk factors for the onset of AFib. Figure 3 shows the relationship between risk predictors and atrial fibrillation progression. The main intent of the proposed method is to weight each risk predictor based on its effect on disease progression. The weight function applies to each risk predictor using the results of the trained model. AFib progression accelerates at different rates according to each risk predictor’s increasing value.
Age | Weight | DBP | Weight | IL-6 | Weight | TNF-alpha | Weight | TNF-beta1 | Weight | Total Weight | Severity Risk | Class Type |
---|---|---|---|---|---|---|---|---|---|---|---|---|
40 | 216 | 85 | 1124 | 15 | 36 | 6.6 | 7 | 420 | 348 | 2627 | 0.144 | Paroxysmal |
50 | 337 | 90 | 1261 | 20.3 | 64 | 8.3 | 11 | 510 | 494 | 2847 | 0.335 | Persistent |
60 | 485 | 95 | 1405 | 31.6 | 153 | 9.6 | 15 | 650 | 770 | 2948 | 0.455 | Persistent |
70 | 660 | 100 | 1557 | 39.9 | 244 | 11.3 | 20 | 720 | 930 | 3073 | 0.609 | Persistent |
80 | 862 | 105 | 1716 | 45.6 | 318 | 14.5 | 32 | 850 | 1270 | 3269 | 0.806 | Permanent |
90 | 1091 | 110 | 1883 | 55.0 | 461 | 15.9 | 38 | 900 | 1414 | 3634 | 0.962 | Permanent |
New dataset construction and AFib classification
The effect of risk predictors on AFib progression is discussed above. Based on that a new dataset is constructed with weighted values of predictors which are Age, Diastolic blood pressure, IL-6, TNF-alpha, and TGF-beta1. The constructed novel dataset will assist clinical use and further analysis of patients. Table 7 shows an illustration of AFib progression based on the proposed methodology. First, it is presented the individual weight of each risk predictor of some patients. The total weight indicates each patient's estimated individual risk of AFib progression. The less the total weight, the less the probability of disease progression, also called the severity risk. Any severity risk lower than 0.25 presents a good outcome. The new dataset performance is evaluated using some extension of receiver operating characteristics (ROC). The results are presented in Figure 4. The ROC curve for weighted predictors confirmed its good clinical performance (micro-average AUC = 0.85, macro-average AUC = 0.88). Performance micro-average metric is preferred if the possibility of class imbalance is high [47,48].
Table 8 illustrates the multiclass AFib progression classification results based on the constructed dataset according to the proposed methodology. First, it presents the classification results on the initial RIC Dataset considering all 27 clinical characteristics; then identifies risk predictors. The last row shows the classification using weighted risk predictors (DTL - SW).
RIC Afib Dataset | Logistic Regression | Decision Tree | Random Forest | Gaussian NB | LDA | KNN |
---|---|---|---|---|---|---|
Initial 27 features | 78.04 | 92.14 | 76.92 | 91.22 | 88.46 | 70.73 |
Identified risk predictors | 92.68 | 97.56 | 95.12 | 97.22 | 94.44 | 86.11 |
DTL - SW | 97.93 | 98.36 | 97.77 | 97.56 | 97.22 | 94.44 |
UCI Heart Disease Dataset | Logistic Regression | Decision Tree | Random Forest | Gaussian NB | LDA | KNN |
---|---|---|---|---|---|---|
Heart ChiSq | 84.50 | 80.0 | 83.0 | 83.67 | 85.0 | 66.67 |
Heart Ref | 83.0 | 80.0 | 83.5 | 83.67 | 85.0 | 61.67 |
Heart SyUn | 83.33 | 78.33 | 83.0 | 83.0 | 84.67 | 66.67 |
DTL - SW | 91.67 | 85.0 | 85.0 | 90.0 | 88.33 | 88.33 |
Evaluation of suggested approach on the public dataset – UCI Heart Disease
The suggested method has been experimented on a public dataset called UCI Heart Disease [50-57]. The dataset contains 303 samples and 14 clinical characteristics (features). Many of the medical dataset features are irrelevant and uninformative. This can decrease model accuracy. In real-world applications, these features need to be put in context with other patient-related data, which might be unavailable at the time of classification. We aim to identify the predictors (features) that contribute the most to the disease progression through the suggested method. Then the identified risky predictors will be used for a more accurate model building.
The riskiest predictors on UCI Heart Disease Dataset were found to be age, cp, exang, ca, oldpeak, thal. Then, these predictors have been weighted based on their contribution to heart disease progression. We use the weighted values for classification purposes.
In Table 9, we evaluate the results of the suggested approach with existing state-of-the-art feature selection methods [1]. The results are presented in Table 9. The simulation results show that the proposed method obtains an accuracy of 91.67% for heart disease classification and has been improved by up to 6% compared with existing approaches.
The best performing model in the [1], study was the BayesNet algorithm on Heart ChiSq features with an accuracy of 85%. We achieved an accuracy of 89% using BayesNet based on the proposed doctor-in-the-loop feature selection and weighting approach (DTL-SW).
Being able to effectively detect Atrial fibrillation (AFib) and other related cardiovascular problems are crucial for the world. Even though the disease has existed for ages, we still miss the comprehensive framework that can deal with the solution where clinicians and machine learning can have an impact. Multiple studies show various ways of identifying predictors associated with AFib disease progression and classification. Still, very little research has been done to weight the independent risk factors of having a doctor-in-the-loop. This study a) proposed and validated a novel Doctor-In-The-Loop (DTL) machine learning system to identify and classify AFib in ischemic heart disease and arterial hypertension patients, including b) Expert-driven Pearson correlation-based feature (clinical characteristics) selection; c) Regression-based risk predictors’ identification; d) risk predictors weighting based on its impact on AFib progression; and b) analysis of 257 such cases hospitalized in the department of cardiac arrhythmias of the Research Institute of Cardiology (RIC) in Armenia. Extensive computer simulations showed that the Age, Diastolic blood pressure, IL-6, TNF-alpha, and TGF-beta1. features have the highest effect on AFib progression. Quantitive comparisons were based on adopted accuracy, sensitivity, precision, F1-score, and ROC metrics. Patients’ AFib classification showed that the newly constructed dataset based on the risk predictor weighting approach outperforms the significance of the initial RIC Dataset reaching accuracy up to 98.36%. Thus, the novel system assists the proper identification and classification of AFib patients.
The proposed method obtains an accuracy of 91.67% for heart disease classification on the well-known UCI Heart Disease Dataset. It improves by up to 6% compared to Spencer et al. method [1].
However, this study has some limitations. First, we have only used AFib data from a single center in our train, test and validation sets. Furthermore, the patients included in this study were patients who have other cardiovascular problems, i.e., ischemic heart disease and arterial hypertension. This may impact the generalizability of the proposed methodology. Therefore, the suggested methodology should be validated on other Afib datasets.
The identification and incorporation of weighted risk predictors based on their impact on heart disease progression through a doctor-in-the-loop approach significantly improve the identification and classification of heart disease. Thus, the presented framework has a good potential to facilitate the decision-making process in heart disease, particularly Atrial Fibrillation identification and classification. The presented method could be used as an initial screening for heart disease by helping clinicians diagnose three types of atrial fibrillation in real-time, facilitating faster decision-making and reducing costs.
Author contributions: We declare that all authors have contributed to this research paper.
Funding: This work was supported by the Ministry of Science and Higher Education of the Russian Federation, agreement No. 075-15-2022-294 dated 15 April 2022.
Ethical approval of study participants: The database is collected from Atrial fibrillation patients in the Department of Arrhythmia at the Research Institute of Cardiology named after Levon Hovhannisyan (Yerevan, Armenia), approved by the Local Ethical Committee (Protocol no.3 of the 28.11.2019), and with informed consent from the patients.
Informed consent statement: Informed consent was obtained from all subjects involved in the study.
Data availability statement: The RIC Dataset is not available. The UCI Heart Disease Dataset can be found here: http://archive.ics.uci.edu/ml/datasets/Heart+Disease
Guarantor: Not applicable.
Acknowledgments: This work is partially supported by Armenian Engineers and Scientists of America.