Skip to main content

Artificial intelligence in nonalcoholic fatty liver disease



Nonalcoholic fatty liver disease (NAFLD) has led to serious health-related complications worldwide. NAFLD has wide pathological spectra, ranging from simple steatosis to hepatitis to cirrhosis and hepatocellular carcinoma. Artificial intelligence (AI), including machine learning and deep learning algorithms, has provided great advancement and accuracy in identifying, diagnosing, and managing patients with NAFLD and detecting squeal such as advanced fibrosis and risk factors for hepatocellular cancer. This review summarizes different AI algorithms and methods in the field of hepatology, focusing on NAFLD.


A search of PubMed, WILEY, and MEDLINE databases were taken as relevant publications for this review on the application of AI techniques in detecting NAFLD in suspected population


Out of 495 articles searched in relevant databases, 49 articles were finally included and analyzed. NASH-Scope model accurately distinguished between NAFLD and non-NAFLD and between NAFLD without fibrosis and NASH with fibrosis. The logistic regression (LR) model had the highest accuracy, whereas the support vector machine (SVM) had the highest specificity and precision in diagnosing NAFLD. An extreme gradient boosting model had the highest performance in predicting non-alcoholic steatohepatitis (NASH). Electronic health record (EHR) database studies helped the diagnose NAFLD/NASH. Automated image analysis techniques predicted NAFLD severity. Deep learning radiomic elastography (DLRE) had perfect accuracy in diagnosing the cases of advanced fibrosis.


AI in NAFLD has streamlined specific patient identification and has eased assessment and management methods of patients with NAFLD.


Non-alcoholic fatty liver disease (NAFLD) is the most common chronic liver disorder affecting globally. The presence of steatosis in more than 5% of hepatocytes without considerable alcohol intake is used to diagnose it. The term NAFLD refers to conditions ranging from benign nonalcoholic fatty liver (NAFL) to the more severe non-alcoholic steatohepatitis (NASH). Despite the fact that NAFLD may affect patients of any weight, more than 80% of NAFLD patients are obese, with a body mass index (BMI) of greater than 30 kg/m2. The prevalence of NAFLD and many metabolic disorders such as obesity have lately increased globally. In less than a decade, NAFLD is likely to overtake cirrhosis as the leading cause of liver transplantation. It affects anywhere from 6% to 35% of the world’s population, with NAFLD affecting around 30% of Americans [26].

With a global incidence of 25.2%, non-alcoholic fatty liver disease (NAFLD) is one of the most frequent chronic liver disorders. Nonalcoholic steatohepatitis (NASH), cirrhosis, and hepatocellular carcinoma are all possible outcomes of NAFLD [13]. NAFLD is also closely linked to insulin resistance, metabolic syndrome, diabetes, and cardiovascular disease, suggesting that it is a multisystem illness with extra hepatic consequences. NAFLD raises the risk of end-stage liver disease, hepatocellular carcinoma (HCC), and liver-related and all-cause mortality compared to the general population of the same age and gender. However, the bulk of fatalities among people with NAFLD, are due to cardiovascular disease and cancer, according to most experts [17]. The majority of research studies so far have focused on associations between NAFLD and CVD, type 2 diabetes (T2DM), chronic kidney disease (CKD), and colorectal cancer (presented in Table 1).

Table 1 Important studies defining AI in NAFLD

Currently available serological and imaging techniques cannot distinguish between steatosis and NASH, so diagnosing NAFLD is difficult. A liver biopsy allows for an accurate diagnosis. Unfortunately, because of the increased risk of severe bleeding and life-threatening consequences, it is rarely advised in clinical practice. The recently developed imagistic methods of magnetic resonance imaging (MRI) with proton density fat fraction (PDFF) and proton magnetic resonance spectroscopy, which have high diagnostic accuracy, offer an alternative to liver biopsy (1 H-MRS) [9, 18]. According to Chen et al., the presence of NAFLD was linked to an elevated risk of colorectal adenoma and colorectal cancer in later stages [7].

Malignancy is the second-most common cause of mortality in individuals with NAFLD after cardiovascular disease. According to recent research, NAFLD with intermediate or advanced fibrosis remained associated with cardiovascular disease (OR 1.36), extrahepatic cancer (OR 1.24), and chronic kidney disease (OR 1.18) [24].

Cross-sectional study found that individuals with NAFLD, particularly those with NASH, are more likely than healthy controls to acquire advanced colorectal neoplasms. Although earlier research has found a link between NAFLD and the development of colorectal malignancies, no such link has been established over a long period of time. Furthermore, the link between NAFLD and other extra hepatic malignancies has received minimal investigation [34].

Diagnostic imaging, laboratory data, electro-diagnosis, electronic health records, and recordings from wearable devices are the primary fields in which artificial intelligence (AI) technologies are being employed in healthcare. As a result, the goal of our review is to offer a thorough overview of key AI research that may aid clinicians in the management of NAFLD and assess its diagnostic accuracy, diseases screening, specificity, and sensitivity to machine diagnosis methods.

Materials and methods

A search of PubMed, WILEY, and MEDLINE databases was taken as relevant publications for this review on the application of artificial intelligence techniques in detecting NAFLD in association with cancer. The search terms included: NAFLD, Artificial intelligence OR Machine learning OR Deep learning OR electronic health records OR Automated Diagnosis OR Automated Computer Tomography AND NAFLD OR Nonalcoholic Fatty Liver Disease in association with Cancer OR Carcinoma. Exclusion Criteria are Case Reports, Letters to Editors, Abstracts, Power point Presentations presented in conferences, Studies written in languages other than English, Paediatric Studies.

Data extraction and quality assessment

Data extraction and quality assessment was independently performed by two authors. In case of disagreement, discussion was conducted with the third author. Data extracted included the author, publication year, number of NAFLD patients, type of study, objective of study, and outcome or conclusion by author. Different AI techniques were evaluated, and comparison was stated. AI techniques predicting risk factors, diagnosing NAFLD, staging NASH and its severity, AI used in imaging techniques, AI in electronic health records, automated diagnosis, and AI used to in NAFLD and its association with cancer were analyzed.

Results and discussion

The bifurcation of artificial intelligence is mainly into its two divisions, namely machine learning and deep learning (Fig. 1). A total of 475 articles were extracted from the database. The inclusion criteria included original articles, case studies, review articles, and systematic reviews. Pre-clinical/non-clinical articles, case reports, letters to editors, and pediatric studies were excluded. The number of articles included for analysis was 95 (Fig. 2).

Fig. 1
figure 1

Bifurcation of artificial intelligence into two main types—machine learning and its subset, deep learning [19]

Fig. 2
figure 2

Flow diagram of search methodology and literature selection process

AI in predicting risk factors of NAFLD

In a study by Garcia-Carretero et al., the prevalence of NASH in 2239 hypertensive patients and assessed the relevant features related to hypertension and metabolic syndrome (MS) using supervised machine learning algorithms such as least absolute shrinkage and selection operator (LASSO) and random forest classifier was assessed [16]. LASSO is a regression analysis algorithm that uses an L1 regularization technique, that is, it adds a penalty term to the regression function. A random forest algorithm was used to assess feature importance in regression model produced by LASSO. In univariate analyses, it was associated with metabolic syndrome, type 2 diabetes, insulin resistance, and dyslipidemia. Serum ferritin and insulin were selected with high sensitivity and specificity using the LASSO approach with a sensitivity of 70%, specificity of 79%, and area under the curve of 0.79 [16]. Another study by Garcia-Carretero et al. used random forest (RF) models for predicting patients at risk of developing NASH in 1525 patients [15]. The electronic health records were used to assess the presence of NASH. The random forest model correctly classified patients with NASH with an accuracy of 0.87 in the best model and to 0.79 in the worst one. Four features that were the most relevant included insulin resistance, ferritin, serum levels of insulin, and triglycerides. Random forest-based modeling demonstrated that machine learning could be used to improve interpretability, produce an understanding of the modeled behavior, and demonstrate how far certain features can contribute to predictions [15].

Diagnosis of NAFLD using AI

In literature, various diagnostic models for NAFLD were studied. Some of the algorithms were logistic regression (LR), k-nearest neighbor (kNN), support vector machine (SVM), naive Bayes, Bayesian network (BN), and decision tree and K2 algorithm including adaptive boosting (AdaBoost), bootstrap aggregating (bagging), and random forest and extensions to the algorithm like hidden naive Bayes (HNB) and aggregating one-dependence estimators (AODE) [23]. Ma H et al. investigated these 11 machine learning algorithms in 10508 patients to predict the best diagnostic model of NAFLD [23]. They reported that 83.41% accuracy was detected with the logistic regression (LR) model, whereas the highest specificity and precision was achieved by the SVM model with values of 0.946 and 0.725, respectively. AODE model was the most sensitive, with a value of 0.680. In this study, F-measure was used to analyze the classification for building these prediction models, with the highest F-measure being 0.655 for BN model and the lowest was for fatty liver index (FLI) with a value of 0.318. The authors determined that the best performance was shown by the BN model with a 9.17% improvement in the F-measure score [23].

Yip et al. included 922 patients to compare logistic regression, AdaBoost, and ridge regression. Finally, the logistic regression model achieved an accuracy of 87–88% and six relevant features, such as insulin resistance, triglycerides, or alanine aminotransferase [33]. Sorino et al. compared eight different machine learning algorithms, namely Boosting Tree Classifier (using Adaboost Classifier), Decision Tree Classifier, Naive Bayes Classifier, K-Nearest Neighbors Classifier, Neural Network Classifier, Random Forest Classifier, Regularized Multinomial Classifier (use Logistic regression), and Support Vector Machine Classifier. Using the Meta learner approach, three models consisting (1) FLI plus GLUCOSE plus SEX plus AGE, (2) abdominal volume index (AVI) plus GLUCOSE plus gamma-glutamyl transpeptidase (GGT) plus SEX plus AGE, and (3) body roundness index (BRI) plus GLUCOSE plus GGT plus SEX plus AGE were created. The authors reported SVM algorithm (Support Vector Machine in Python) was the most appropriate and had better performance in the analyzed models [28]. Model 3 had the highest accuracy of 77% compared to models 2 and 1, with an accuracy of 68% each. As model 2 had lesser prediction errors, it was considered the best model [28]. Cheng et al. developed several models using KNN, RF, and support vector machines (SVM) to detect NAFLD. They observed that SVM had 86.9% accuracy in men, and RF had 80% in women. Both models selected some relevant features, including cholesterol-related and insulin resistance-related factors [8].

Docherty et al. developed a machine learning (ML) model to predict NASH, using confirmed NASH and non-NASH based on liver histology results in the National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK) dataset to train the model [11]. An extreme gradient boosting model (XGBoost) consisting of 14 features exhibited high performance as measured by area under the curve (0.82), sensitivity (81%), and precision (81%) in predicting NASH [11]. Slightly reduced performance was observed with an abbreviated feature set of 5 variables (0.79, 80%, and 80%, respectively) [11]. The full model demonstrated good performance (AUC 0.76) to predict NASH in Optum data [11]. The proposed model, named NASH map, is the first ML model developed with confirmed NASH and non-NASH cases as determined through liver biopsy and validated on a large, real-world patient dataset [11].

AI in predicting the severity and staging of NASH

For the assessment of severity of nonalcoholic fatty liver disease (NAFLD) and identification of patients with nonalcoholic steatohepatitis (NASH), a novel machine learning approach, ensemble feature selection (EFS), was devised by Canbay et al. [5]. Non-invasive parameters were selected by an ensemble feature selection (EFS) from a retrospectively collected training cohort of 164 obese individuals (age: 43.5 ± 10.3 years; BMI: 54.1 ± 10.1kg/m2) to develop a model able to predict the histological assessed NAFLD activity score (NAS) [5]. Advantages of this score are a continuous distribution allowing disease assessment apart from a dichotomous classification as NAFL or NASH and thus could possibly be used to monitor disease progression or resolution over time. Additional parameters, i.e., transient elastography or controlled attenuation parameter, could be added, given sufficiently large reference datasets [5]. Okanoue et al. developed novel non-invasive test with the help of an AI/neural network system called NASH-Scope, and the model could accurately distinguish between NAFLD and non-NAFLD and between NAFLD without fibrosis and NASH with fibrosis in 398 histologically diagnosed NAFLD patients [25]. Moreover, a systematic review by Li et al. evaluating AI-assisted diagnosis of liver fibrosis and NAFLD demonstrated promising potential and validation of these models in larger cohorts is required before implementing it into clinical practice [21]. AI (artificial intelligence) application in predicting NAFLD is extensively reviewed elsewhere [32]. The NAFLD ridge score is a machine-learning algorithm and is one of the most effective tools to detect NAFLD [33]. It is based on multiple laboratory parameters that include serum levels of ALT, serum triglycerides, HDL, HbA1c, hypertension, and leukocyte count and has an AUROC value of 0.87 [33] . It uses H-MRS (proton magnetic resonance spectroscopy) as a reference and has a negative predictive value (NPV) of 96% [33]. Despite being an effective scoring system to detect NAFLD, its use is limited to the research setting and fails to risk stratify steatosis progression [3].

AI in imaging modalities

Pasdar et al., in a multicenter prospective cohort study of 3029 European-ancestry adults recently diagnosed with T2D (n = 795) or at high risk of developing NAFLD (n = 2234). The analyses applied machine learning methods to data from the deep-phenotyped IMI DIRECT cohorts (n = 1514) to identify sets of highly informative variables to predict NAFLD. The criterion measure was liver fat quantified from MRI. LASSO (least absolute shrinkage and selection operator) was applied to select features from the different layers of omics data and random forest analysis to develop the models. A total of 18 prediction models were developed. The authors reported that the model including all omics and clinical variables yielded a cross-validated receiver operating characteristic area under the curve (ROCAUC) of 0.84 (95% CI 0.82, 0.86; p < 0.001), which compared with a ROCAUC of 0.82 (95% CI 0.81, 0.83; p < 0.001) for a model including 9 clinically accessible variables [4].

In a study by Cao et al., two-dimensional hepatic imaging was analyzed by the envelope signal, grey scale signal, and deep-learning index obtained by 3 image-processing techniques in 240 participants with mild, moderate, and severe NAFLD [6]. The authors reported that the 3 methods showed good ability (AUC > 0.7) to identify NAFLD. Meanwhile, the deep-learning index showed superior diagnostic ability in distinguishing moderate and severe NAFLD (AUC = 0.958) [6].

Rapid MRI techniques could be used to predict nonalcoholic steatohepatitis (NASH) noninvasively by measuring liver stiffness with magnetic resonance elastography (MRE) and liver fat with chemical shift-encoded (CSE) MRI [12]. So, Dzyubak et al. validated an automated image analysis technique to maximize the utility of these methods in eighty-three patients with suspected NAFLD [12]. A logistic regression model to predict pathology-diagnosed NASH was trained based on stiffness and proton density fat fraction. The area under the receiver operating characteristic curve (AUROC) was calculated using 10-fold cross-validation for models based on both automated and manual measurements [12]. Liver stiffness and PDFF were also calculated using an automated method. A separate model was trained to predict the NASH severity score (NAS). The model for predicting biopsy-diagnosed NASH had an AUROC of 0.87, and the NAS-prediction model had a C-statistic of 0.85. The stiffness and PDFF measurements based on automated ROIs had a higher agreement with the expert reader (R2 = 0.87 for stiffness and R2 = 0.99 for PDFF) than the expert and experienced readers had with each other (R2 = 0.85 for stiffness and R2 = 0.98 for PDFF) [12].

In a study by Addeman et al., a novel software package named AdipoQuant for the automated quantification of total adipose tissue (TAT), SAT, and IAAT in the abdomen was used, and similar results were obtained to manual segmentation methods [1].

Electronic health records and NAFLD

Logistic regression, decision trees, RF, extreme gradient boosting (XGBoost), or k-nearest neighbors (KNN) have been used with electronic health records (EHR), while neural networks and deep learning have been used for histology and images [30]. Sowa et al. included EHR of 126 patients to develop a final model with an accuracy of 0.79. However, this model relied on features that are not easily collected or measured, such as apoptosis markers [29]. Genome-wide association studies (GWAS) have identified several risk loci for nonalcoholic fatty liver disease (NAFLD). GWAS of 4761 cases of NAFLD and 373,227 healthy controls without evidence of NAFLD was performed using electronic health records by Fairfield et al. [14]. Loomis et al. conducted large scale electronic health record database studies with The Health Improvement Network (THIN) database (n = 133,525) and Humedica EHR database (n = 148,934) and established the consistent and strong relationships between body mass index (BMI) and prospectively recorded diagnoses of NAFLD/NASH and emphasize the importance of weight reduction strategies for prevention and management of NAFLD [22].

Danford et al. developed and validated an electronic health record (EHR) algorithm to accurately identify cases of NASH cirrhosis in the HER (n = 300) [10]. Recommendations of the Electronic Medical Records and Genomics (eMERGE) network, a network funded by the National Human Genome Research Institute, was followed to construct the algorithm [10]. The algorithm with the highest PPV of 100% on internal validation and 92% on external validation consisted of ≥ 3 counts of cirrhosis, no mention of alcohol (571.5, K74.6), and ≥ 3 counts of nonalcoholic fatty liver (571.8–571.9, K75.81, K76.0) codes in the absence of any diagnosis codes for other common causes of chronic liver disease [10].

Nonalcoholic fatty liver disease in association with advanced fibrosis and cancer

The current machine learning approaches have identified type 2 diabetes mellitus (T2DM) as a strongly correlated feature with some degree of liver fibrosis and adverse hepatic outcomes (cirrhosis, malignancy) [30]. In a study by Aggarwal and Alkhouri, the authors reported that machine learning algorithms such as deep learning radiomic elastography (DLRE) have excellent accuracy in diagnosing cases of advanced fibrosis [2]. This finding was based on another study by Wang et al. where 344 patients with nonalcoholic fatty liver disease (NAFLD) underwent 428 liver biopsies (240 had paired transient elastography examination) [31]. The fibrosis stage was scored using the NASH Clinical Research Network system, and automated quantification of fibrosis-related parameters (q-FPs) was measured by dual photon microscopy using unstained slides. At the best cut-offs, the two q-FPs had 88.3–96.2% sensitivity and 78.1–91.1% specificity for different fibrosis stages in the validation cohort [31]. It was noted that automated quantification of fibrosis-related parameters by dual-photon microscopy has high accuracy in diagnosing fibrosis and cirrhosis in NAFLD patients [31].

Lewinska et al. developed a noninvasive surveillance method for NAFLD-hepatocellular carcinoma (HCC) [20]. Using comprehensive ultra-high-performance liquid chromatography mass-spectrometry, they investigated 1295 metabolites in serum from 249 patients. The area under the receiver operating characteristic curve was calculated for all detected metabolites and used to establish their diagnostic potential, and logistic regression analysis was used to establish the diagnostic score [20]. The diagnostic model was constructed using ROC curves generated by Monte-Carlo cross-validation (MCCV) using balanced sub-sampling, and the linear support vector machine (SVM) method was used for sample classification [20]. The authors reported that the combination of 5 metabolites accurately distinguishes NAFLD-HCC patients from healthy individuals (AUC = 0.989), morbidly obese bariatric surgery NAFLD (OB-NAFLD) patients (AUC = 0.997), and patients with alcohol- and viral-associated HCC (AV-HCC) (AUC = 0.999), and this model performed well against a validation set of NAFLD patients (AUC = 0.905) [20]. With the help of machine learning model, the authors speculated that NAFLD-HCC tumors act as sinks for unsaturated fatty acids from the blood and link between increased transport of fatty acids by CD36 and NAFLD-HCC [20, 27].


Artificial intelligence is a growing field that supports the identification, diagnosing, and management of diseases such as NAFLD. Many machine learning algorithms predict the risk factors of NAFLD, and with a lifestyle change, the disease can be prevented. In conclusion, it would appear that AI in NAFLD has streamlined specific patient identification and eased assessment and management methods of patients with NAFLD. Moreover, it has proven logistic regression (LR), k-nearest neighbor (kNN), support vector machine (SVM), naive Bayes, Bayesian network (BN), and decision tree and K2 algorithm effective for diagnosing NAFLD. New imaging processing techniques and interpretation using machine learning such as LASSO, automated imaging analysis, and AdipoQuant have caused rapid development in the field of diagnostic radiology. Methods to stage liver fibrosis based on measurements of liver stiffness with the help of vibration controlled transient elastography (VCTE) and shear wave elastography (SWE) are gaining increasing importance. In future, AI will sharpen accuracy in diagnosing, staging and managing NAFLD without sequelae and will promote early detection of HCC and promptly help in its management.

Availability of data and materials

Not applicable.


  1. Addeman BT, Kutty S, Perkins TG, Soliman AS, Wiens CN, McCurdy CM, Beaton MD, Hegele RA, McKenzie CA (2015) Validation of volumetric and single-slice MRI adipose analysis using a novel fully automated segmentation method. J Magnet Res Imag 41(1):233–241.

    Article  Google Scholar 

  2. Aggarwal P, Alkhouri N (2021) Artificial intelligence in nonalcoholic fatty liver disease: a new frontier in diagnosis and treatment. Clin Liver Dis 17(6):392–397.

    Article  Google Scholar 

  3. Alqahtani SA, Schattenberg JM (2021) Nonalcoholic fatty liver disease: use of diagnostic biomarkers and modalities in clinical practice. Expert Rev Mol Diagn 21(10):1065–1078.

    Article  CAS  Google Scholar 

  4. Atabaki-Pasdar N, Ohlsson M, Viñuela A, Frau F, Pomares-Millan H, Haid M, Jones AG, Thomas EL, Koivula RW, Kurbasic A, Mutie PM, Fitipaldi H, Fernandez J, Dawed AY, Giordano GN, Forgie IM, McDonald TJ, Rutters F, Cederberg H, Franks PW (2020) Predicting and elucidating the etiology of fatty liver disease: a machine learning modeling and validation study in the IMI DIRECT cohorts. PLoS Med 17(6):e1003149.

    Article  CAS  Google Scholar 

  5. Canbay A, Kälsch J, Neumann U, Rau M, Hohenester S, Baba HA, Rust C, Geier A, Heider D, Sowa J-P (2019) Non-invasive assessment of NAFLD as systemic disease—a machine learning perspective. PLoS One 14(3):e0214436.

    Article  CAS  Google Scholar 

  6. Cao W, An X, Cong L, Lyu C, Zhou Q, Guo R (2020) Application of deep learning in quantitative analysis of 2-dimensional ultrasound imaging of nonalcoholic fatty liver disease. J Ultrasound Med 39(1):51–59.

    Article  Google Scholar 

  7. Chen J, Bian D, Zang S, Yang Z, Tian G, Luo Y, Yang J, Xu B, Shi J (2019) The association between nonalcoholic fatty liver disease and risk of colorectal adenoma and cancer incident and recurrence: a meta-analysis of observational studies. Expert Rev Gastroenterol Hepatol 13(4):385–395.

    Article  CAS  Google Scholar 

  8. Cheng, Y., Chou, C.-Y., & Hsiung, Y. (2017). Application of machine learning methods to predict non-alcohol fatty liver disease in Taiwanese high-tech industry workers.

  9. Cobbina E, Akhlaghi F (2017) Non-alcoholic fatty liver disease (NAFLD)—pathogenesis, classification, and effect on drug metabolizing enzymes and transporters. Drug Metab Rev 49(2):197–211.

    Article  CAS  Google Scholar 

  10. Danford CJ, Lee JY, Strohbehn IA, Corey KE, Lai M (2021) Development of an algorithm to identify cases of nonalcoholic steatohepatitis cirrhosis in the electronic health record. Dig Dis Sci 66(5):1452–1460.

    Article  Google Scholar 

  11. Docherty M, Regnier SA, Capkun G, Balp M-M, Ye Q, Janssens N, Tietz A, Löffler J, Cai J, Pedrosa MC, Schattenberg JM (2021) Development of a novel machine learning model to predict presence of nonalcoholic steatohepatitis. J Am Med Inform Assoc: JAMIA 28(6):1235–1241.

    Article  Google Scholar 

  12. Dzyubak B, Li J, Chen J, Mara KC, Therneau TM, Venkatesh SK, Ehman RL, Allen AM, Yin M (2021) Automated analysis of multiparametric magnetic resonance imaging/magnetic resonance elastography exams for prediction of nonalcoholic steatohepatitis. J Magnet Res Imag 54(1):122–131.

    Article  Google Scholar 

  13. European Association for the Study of the Liver (EASL), European Association for the Study of Diabetes (EASD), & European Association for the Study of Obesity (EASO) (2016) EASL-EASD-EASO clinical practice guidelines for the management of non-alcoholic fatty liver disease. J Hepatol 64(6):1388–1402.

    Article  Google Scholar 

  14. Fairfield CJ, Drake TM, Pius R, Bretherick AD, Campbell A, Clark DW, Fallowfield JA, Hayward C, Henderson NC, Joshi PK, Mills NL, Porteous DJ, Ramachandran P, Semple RK, Shaw CA, Sudlow CLM, Timmers PRHJ, Wilson JF, Wigmore SJ et al (2022) Genome-wide association study of NAFLD using electronic health records. Hepatol Commun 6(2):297–308.

    Article  CAS  Google Scholar 

  15. García-Carretero R, Holgado-Cuadrado R, Barquero-Pérez Ó (2021) Assessment of classification models and relevant features on nonalcoholic steatohepatitis using random forest. Entropy 23(6):763.

    Article  CAS  Google Scholar 

  16. Garcia-Carretero R, Vigil-Medina L, Barquero-Perez O, Ramos-Lopez J (2019) Relevant features in nonalcoholic steatohepatitis determined using machine learning for feature selection. Metab Syndr Relat Disord 17(9):444–451.

    Article  CAS  Google Scholar 

  17. Golabi P, Paik JM, Eberly K, de Avila L, Alqahtani SA, Younossi ZM (2022) Causes of death in patients with non-alcoholic fatty liver disease (NAFLD), alcoholic liver disease and chronic viral hepatitis B and C. Ann Hepatol 27(1):100556.

    Article  CAS  Google Scholar 

  18. Kim G-A, Lee HC, Choe J, Kim M-J, Lee MJ, Chang H-S, Bae IY, Kim H-K, An J, Shim JH, Kim KM, Lim Y-S (2017) Association between non-alcoholic fatty liver disease and cancer incidence rate. J Hepatol S0168-8278(17):32294–32298.

    Article  Google Scholar 

  19. Le Berre C, Sandborn WJ, Aridhi S, Devignes M-D, Fournier L, Smaïl-Tabbone M, Danese S, Peyrin-Biroulet L (2020) Application of artificial intelligence to gastroenterology and hepatology. Gastroenterology 158(1):76–94.e2.

    Article  Google Scholar 

  20. Lewinska M, Santos-Laso A, Arretxe E, Alonso C, Zhuravleva E, Jimenez-Agüero R, Eizaguirre E, Pareja MJ, Romero-Gómez M, Arrese M, Suppli MP, Knop FK, Oversoe SK, Villadsen GE, Decaens T, Carrilho FJ, de Oliveira CP, Sangro B, Macias RIR et al (2021) The altered serum lipidome and its diagnostic potential for non-alcoholic fatty liver (NAFL)-associated hepatocellular carcinoma. EBioMedicine 73:103661.

    Article  CAS  Google Scholar 

  21. Li Y, Wang X, Zhang J, Zhang S, Jiao J (2021) Applications of artificial intelligence (AI) in researches on non-alcoholic fatty liver disease (NAFLD): a systematic review. Rev Endocr Metab Disord.

  22. Loomis AK, Kabadi S, Preiss D, Hyde C, Bonato V, St Louis M, Desai J, Gill JMR, Welsh P, Waterworth D, Sattar N (2016) Body mass index and risk of nonalcoholic fatty liver disease: two electronic health record prospective studies. J Clin Endocrinol Metab 101(3):945–952.

    Article  CAS  Google Scholar 

  23. Ma H, Xu C, Shen Z, Yu C, Li Y (2018) Application of machine learning techniques for clinical predictive modeling: a cross-sectional study on nonalcoholic fatty liver disease in China. Biomed Res Int 2018:e4304376.

    Article  CAS  Google Scholar 

  24. Nabi O, Lacombe K, Boursier J, Mathurin P, Zins M, Serfaty L (2020) Prevalence and Risk Factors of Nonalcoholic Fatty Liver Disease and Advanced Fibrosis in General Population: the French Nationwide NASH-CO Study. Gastroenterology 159(2):791-793.e2.

  25. Okanoue T, Shima T, Mitsumoto Y, Umemura A, Yamaguchi K, Itoh Y, Yoneda M, Nakajima A, Mizukoshi E, Kaneko S, Harada K (2021) Artificial intelligence/neural network system for the screening of nonalcoholic fatty liver disease and nonalcoholic steatohepatitis. Hepatol Res 51(5):554–569.

    Article  CAS  Google Scholar 

  26. Popa SL, Ismaiel A, Cristina P, Cristina M, Chiarioni G, David L, Dumitrascu DL (2021) Non-alcoholic fatty liver disease: implementing complete automated diagnosis and staging. A systematic review. Diagnostics 11(6):1078.

    Article  Google Scholar 

  27. Rada P, González-Rodríguez Á, García-Monzón C, Valverde ÁM (2020) Understanding lipotoxicity in NAFLD pathogenesis: is CD36 a key driver? Cell Death Dis 11(9):1–15.

    Article  CAS  Google Scholar 

  28. Sorino P, Caruso MG, Misciagna G, Bonfiglio C, Campanella A, Mirizzi A, Franco I, Bianco A, Buongiorno C, Liuzzi R, Cisternino AM, Notarnicola M, Chiloiro M, Pascoschi G, Osella AR, Group, M (2020) Selecting the best machine learning algorithm to support the diagnosis of non-alcoholic fatty liver disease: a meta learner study. PLoS One 15(10):e0240867.

    Article  CAS  Google Scholar 

  29. Sowa J-P, Heider D, Bechmann LP, Gerken G, Hoffmann D, Canbay A (2013) Novel algorithm for non-invasive assessment of fibrosis in NAFLD. PLoS One 8(4):e62439.

    Article  CAS  Google Scholar 

  30. Taylor RS, Taylor RJ, Bayliss S, Hagström H, Nasr P, Schattenberg JM, Ishigami M, Toyoda H, Wai-Sun Wong V, Peleg N, Shlomai A, Sebastiani G, Seko Y, Bhala N, Younossi ZM, Anstee QM, McPherson S, Newsome PN (2020) Association between fibrosis stage and outcomes of patients with nonalcoholic fatty liver disease: a systematic review and meta-analysis. Gastroenterology 158(6):1611–1625.e12.

    Article  CAS  Google Scholar 

  31. Wang Y, Wong GL-H, He F-P, Sun J, Chan AW-H, Yang J, Shu SS-T, Liang X, Tse YK, Fan X-T, Hou J, Chan HL-Y, Wong VW-S (2020) Quantifying and monitoring fibrosis in non-alcoholic fatty liver disease using dual-photon microscopy. Gut 69(6):1116–1126.

    Article  Google Scholar 

  32. Wong GL-H, Yuen P-C, Ma AJ, Chan AW-H, Leung HH-W, Wong VW-S (2021) Artificial intelligence in prediction of non-alcoholic fatty liver disease and fibrosis. J Gastroenterol Hepatol 36(3):543–550.

    Article  Google Scholar 

  33. Yip TC-F, Ma AJ, Wong VW-S, Tse Y-K, Chan HL-Y, Yuen P-C, Wong GL-H (2017) Laboratory parameter-based machine learning model for excluding non-alcoholic fatty liver disease (NAFLD) in the general population. Aliment Pharmacol Ther 46(4):447–456.

    Article  CAS  Google Scholar 

  34. Younossi ZM, Otgonsuren M, Henry L, Venkatesan C, Mishra A, Erario M, Hunt S (2015) Association of nonalcoholic fatty liver disease (NAFLD) with hepatocellular carcinoma (HCC) in the United States from 2004 to 2009. Hepatology (Baltimore, Md) 62(6):1723–1730.

    Article  CAS  Google Scholar 

Download references


This work was not supported by any grant. AM is supported by the Al Baha University (Ministry of Higher Education, Saudi Arabia).


Not applicable.

Author information

Authors and Affiliations



The author confirms sole responsibility for the following: study conception and design, data collection, analysis and interpretation of results, and manuscript preparation. The authors read and approved the final manuscript.

Authors’ information

Ali Moosa Mahzari is an assistant professor of Clinical Chemistry and has subspeciality in chronic diseases such as obesity, diabetes, and metabolic syndrome.

Corresponding author

Correspondence to Ali Mahzari.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The author declares no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Mahzari, A. Artificial intelligence in nonalcoholic fatty liver disease. Egypt Liver Journal 12, 69 (2022).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Nonalcoholic fatty liver disease
  • Nonalcoholic steatohepatitis
  • Artificial intelligence
  • Machine learning
  • Deep learning
  • Electronic health records
  • Automated image analysis
  • Elastography
  • Hepatocellular cancer