Using Machine Learning to Predict Response to Image-guided Therapies for Hepatocellular Carcinoma
Abstract
Interventional oncology is a rapidly growing field with advances in minimally invasive image-guided local-regional treatments for hepatocellular carcinoma (HCC), including transarterial chemoembolization, transarterial radioembolization, and thermal ablation. However, current standardized clinical staging systems for HCC are limited in their ability to optimize patient selection for treatment as they rely primarily on serum markers and radiologist-defined imaging features. Given the variation in treatment responses, an updated scoring system that includes multidimensional aspects of the disease, including quantitative imaging features, serum markers, and functional biomarkers, is needed to optimally triage patients. With the vast amounts of numerical medical record data and imaging features, researchers have turned to image-based methods, such as radiomics and artificial intelligence (AI), to automatically extract and process multidimensional data from images. The synthesis of these data can provide clinically relevant results to guide personalized treatment plans and optimize resource utilization. Machine learning (ML) is a branch of AI in which a model learns from training data and makes effective predictions by teaching itself. This review article outlines the basics of ML and provides a comprehensive overview of its potential value in the prediction of treatment response in patients with HCC after minimally invasive image-guided therapy.
© RSNA, 2023
Summary
Artificial intelligence models can accurately predict treatment response to image-guided therapies for hepatocellular carcinoma by automatically extracting quantitative imaging features and analyzing the complex relationships between these features and clinical outcomes.
Essentials
■ Using artificial intelligence (AI) models to automatically extract quantitative imaging features from medical images can improve tumor characterization and offer better treatment prognostication than qualitative imaging data (ie, radiologist-defined semantic features).
■ Most of the research in this area has been focused on transarterial chemoembolization, while data on applications in transarterial radioembolization and ablation remain limited.
■ Image-based AI models require further standardization, testing, and model transparency to facilitate effective integration into clinical practice.
Introduction
Interventional oncology, a subspecialty of interventional radiology, offers various image-guided therapies for the treatment of hepatocellular carcinoma (HCC), including percutaneous ablation, transarterial chemoembolization (TACE), and transarterial radioembolization (TARE). While these treatments have proven efficacious in improving HCC survival rates, treatment responses vary due to differences in baseline tumor burden, tumor biology, and procedural factors (Fig 1). There is an unmet need to triage patients to the most effective therapies and optimize treatment sequencing.
Over the past decade, multiple treatment algorithms and staging systems, such as the Barcelona Clinic Liver Cancer (BCLC) staging system, the Hong Kong Liver Cancer (HKLC) staging system, and the Pre-TACE-Predict model, have been established for patient selection (1–5). However, these models, which rely largely on limited clinical, laboratory, and qualitative imaging features, demonstrate only modest performance in external test sets (6–8). With recent advances in image processing, researchers have turned to image-based methods, such as radiomics and artificial intelligence (AI), for the prediction of HCC treatment response. This review will discuss the state of AI applications in HCC prognostication.
Introduction to AI and Machine Learning: Modeling Steps
AI is a general term that describes the use of technology to mimic human intelligence (Fig 2) (9,10). Machine learning (ML) is a branch of AI in which a model learns from training data and makes effective predictions by teaching itself. For simplicity, we will focus on two main categories of ML: supervised and unsupervised learning. In supervised learning, the programmer directs the learning process by showing the model correct examples. For example, in HCC treatment response analysis, the radiologist first identifies the malignant lesion on pretreatment images, segments the lesion section by section, and reviews posttreatment imaging to determine treatment response. In this example, the labels for the training data are “response” and “nonresponse,” and the inputs are segmented pretreatment lesion images.
In contrast, unsupervised ML works independently to discover patterns within unlabeled data sets. Revisiting the example above, instead of defining treatment labels, investigators can input pre- and posttreatment images and let the algorithm form clusters, elucidating trends that the investigator had not set out to find. This can simultaneously be a strength and a weakness as the algorithm may group the posttreatment lesions as intended into different treatment groups based on pre- and posttreatment lesion size and viability, or it could group by background noise. Due to the unsupervised nature of the process, this type of ML is more prone to biases when trained on nonrepresentative samples. Compared with supervised ML, which is resource intensive due to the need to create labeled data sets, unsupervised learning algorithms do not require manual data labeling.
ML is often coupled with radiomics: the high-throughput mining of quantitative features from image voxel data. Radiologist-defined semantic features are qualitative (ie, visually assessed) imaging features such as lesion size and arterial phase hyperenhancement, whereas radiomic features provide a comprehensive quantification of a lesion's features and include things like shape, intensity, and texture. Coupling ML with quantitative imaging features yields further insight into the complex relationships between radiomic variables and clinical outcomes (11). The processing of imaging features using ML can be broadly categorized into two pathways: conventional ML radiomics analysis and deep learning (DL).
Conventional ML Radiomics Analysis
The conventional ML radiomics workflow is a stepwise process that uses image data to produce models that aid in clinical decision-making (Fig 3). The pipeline begins with image segmentation: manual, semiautomatic, or automatic. In manual segmentation, radiologists contour pretreatment lesions on every section. In semiautomatic segmentation, radiologists identify lesions of interest and set thresholding parameters that best capture them. The program automatically captures the volume of interest, which radiologists then fine-tune. Finally, in automatic segmentation, a DL model previously trained to segment tumors processes raw images and automatically delineates the volume of interest. Automatic segmentation is not only time and resource efficient but also has better accuracy in predicting treatment response (12,13).
During image processing, image quality is improved by applying filters that assist in noise reduction while maintaining crisp lesion edges, after which pixel and voxel features are quantified during feature extraction. Features can be manually selected based on the researcher's prior experience (ie, handcrafted features), automatically extracted through radiomics packages (eg, PyRadiomics) (14), or derived from the second-to-last layer of a neural network (ie, learned features). While learned features can adapt to training data and thus generalize well to unseen data, handcrafted features are set from the start and limited by expert knowledge (15). Next, feature selection and dimension reduction are performed by selecting the most generalizable features to prevent model overfitting (ie, models that work well only for the images on which they were trained). Finally, during model building, highly predictive variables are selected using filter, wrapper, or embedded methods to build robust models (Table 1) (16). The last steps of feature selection and model building constitute an iterative process where different features and algorithms are trialed to achieve the most predictive model.
DL Analysis
DL is a subset of ML that uses neural networks, which are based on cognitive models of the human brain (9,17). Convolutional neural networks can be incorporated into an analysis of large imaging data sets in two ways: in specific tasks within the radiomics workflow, as described earlier, or in end-to-end processing, where images are transformed directly into output signals (Fig 3).
In end-to-end processing, DL can bypass the individual steps within conventional ML radiomics analysis. Among patients with HCC, a DL model takes unprocessed images of HCC lesions (with or without corresponding clinical data) as input and applies many layers of image filtering transformations to calculate an output, such as a prediction of response or nonresponse to treatment. Through an iterative process, convolutional neural networks “learn” to increase their prediction accuracy (18). Unlike conventional ML models based on static images, DL models can incorporate time as a variable by evaluating lesion enhancement patterns on cine images (19,20). While DL models can be less labor intensive, they require more computational power to handle large amounts of data and can be more costly due to hardware requirements. DL models are also more dependent on training set size and diversity, and thus more prone to overfitting and poor generalizability. Due to the multilayer, nonlinear structure of neural networks, results are difficult to interpret and contextualize clinically. Ultimately, the goal of these algorithms is to help predict patient response to therapy using pretreatment images.
Prediction of HCC Treatment Response Using AI
A central challenge of HCC treatment is accurately determining which patients will respond best to which treatment or treatment combinations. A comprehensive literature review was performed using electronic databases including PubMed, MEDLINE, and Embase. Keywords included combinations of “HCC”; “ML” and/or “DL” and/or “AI”; “TACE” and/or “TARE” and/or “ablation”; and their variations. Only human studies published in English before December 2022 were included. Two authors (C.H. and A.L.) independently examined the titles and abstracts, and full texts when necessary, to determine eligible studies (Fig 4). The reference lists of eligible studies were screened to identify further eligible studies. Studies were excluded if they (a) included only qualitative imaging features and not quantitative imaging features, (b) did not use ML techniques in model construction, or (c) used postprocedural images for prediction of recurrence. Details of the included studies involving TACE and ablation are summarized in Tables 2 and 3, respectively. A total of 21 studies were included in the final analysis: 17 involved TACE, and four involved ablation. None of the included studies involved TARE. Study methodological quality was assessed using components of the Quality Assessment of Diagnostic Accuracy Studies (QUADAS) tool for AI (QUADAS-AI) (21), as well as QUADAS-2 (22) (Table S1).
TACE Studies
Included studies involving TACE (n = 17) were published between 2018 and 2022. Most studies (13 of 17, 76%) were conducted in China. Median cohort size for training and testing sets combined was 194 patients (range, 36–789; IQR, 101–371). The majority of studies (11 of 17, 65%) used CT while four (24%) used MRI, one (6%) used US, and one (6%) used digital subtraction angiography (DSA). Seven of 17 (41%) studies used ML techniques, five (29%) used DL techniques, and five (29%) used a combination of both. Modified Response Evaluation Criteria in Solid Tumors (mRECIST) was the most common standard used to assess treatment response (12 of 17 studies, 71%). Six of 17 (35%) studies had training and validation sets, whereas 11 (65%) had separate training and testing sets. Of the 11 studies with separate training and testing sets, three (27%) performed internal testing and eight (73%) performed both internal and external testing of the models.
Seven domains were assessed regarding bias and applicability for each study using components of the QUADAS-AI and QUADAS-2 scores shown in Table S1. Of 119 domains in total for TACE articles, 29 domains were at high risk: 15 for risk of bias and 14 for applicability concerns. Risk of bias stemmed from inappropriate patient selection for six studies (these studies did not use Barcelona Clinic Liver Cancer staging for assessing TACE eligibility) and from index test issues for nine studies (these studies did not perform external testing). Applicability concerns stemmed primarily from patient selection, as most of these models were trained and tested on Chinese populations, where HCC etiology differs from that in the United States.
For most studies, the primary goal was predicting whether patients would respond to TACE based on pre-TACE images. Some studies incorporated survival analysis to determine whether nonresponders had poorer prognosis than TACE responders and might instead benefit from other treatments, such as ablation or systemic therapy combined with TACE (19,23).
Unlike most studies that used pretreatment CT or MRI scans, Zhang et al (19) used DL to develop an automatic tumor segmentation model that integrated clinical variables and preprocedural DSA videos of superior mesenteric and hepatic arteries to predict treatment response or nonresponse to TACE. The authors demonstrated that there was a significant difference in 3-year progression-free survival between responders and nonresponders as classified with the DL model. The authors introduced an innovative approach to determining TACE eligibility using DSA videos. DSA offers a way to directly visualize the arterial supply of target tumors that can be challenging to visualize with pretreatment CT or MRI. However, DSA image quality is limited by motion artifacts, and small, hypovascular lesions are challenging to visualize.
Peng et al (24) developed a conventional ML model to predict initial response to TACE treatment based on pre-TACE CT images. The authors demonstrated that patients predicted to be treatment responders had longer progression-free survival and overall survival than patients predicted to be treatment nonresponders. Of note, the authors then applied the model to 46 patients with HCC from The Cancer Genome Atlas (TCGA) database to analyze differential gene expression across the two cohorts and to explore potential model mechanisms. This approach of incorporating genetic data was made possible with ML, highlighting its versatility when processing large volumes of data.
Image-guided Thermal Ablation Studies
We identified four ML studies dedicated to HCC prognostication after thermal ablation. Three of the four such studies (75%) used US or contrast-enhanced US (CEUS) to evaluate postablation prognosis. Two studies included microwave ablation, one included radiofrequency ablation, and one included both microwave and radiofrequency ablation. All studies had separate training and internal testing cohorts, but none included external testing.
Ma et al (20) compared the performance of a DL model trained on CEUS images with that of a conventional ML model trained on static US images in predicting late HCC recurrence. As expected, the CEUS model outperformed the static US image model, likely because CEUS not only shows morphologic features but also provides real-time dynamic blood perfusion information that is highly correlated with ablation success. Furthermore, to unmask the “black box” nature of the models (ie, lack of transparency), the authors attempted to visualize the model's selected features. They found that the model highlighted certain frames of the CEUS cine that reflected lesion wash-in and washout periods. Study limitations included the lack of standardization of US and CEUS examinations, which were largely operator dependent and involved different machines, parameters, and contrast agent doses and timing (25).
Liu et al (26) used clinical data and imaging features extracted from CEUS images not only to predict 2-year progression-free survival in patients with early-stage HCC treated with radiofrequency ablation and surgical resection, but also to investigate the optimization of treatment selection. The authors concluded that 17.3% and 27.3% of patients who underwent radiofrequency ablation and surgical resection, respectively, would have had a better outcome if they had received the other treatment instead. Even though the study sample was limited to patients with early-stage HCC and a multicenter prospective study is necessary to test the clinical significance of these findings, the authors’ approach to optimizing treatment selection furthers clinical applicability.
TARE Studies
Data regarding ML-based prognostication of tumor response following TARE remain limited. There were no TARE studies that met the inclusion and exclusion criteria of this review. Several studies, however, have used non-ML radiomics models to predict prognosis (27–30).
DL-driven radiation dosimetry models are currently being studied in other cancers for predicting the biodistribution of theragnostic pairs, with the goal of customizing therapy dose based on unique patient characteristics (31). It would be ideal to be able to predict response to TARE using preprocedural imaging alone: While our review did not identify any studies with this approach, some studies have used both baseline images and initial postprocedural images to predict future response (28,29). Wei et al (29) built a multivariate logistic regression model using radiomic features and absorbed dose metrics extracted from post-TARE yttrium 90 PET scans to predict TARE response at first follow-up. Importantly, the mean absorbed dose alone achieved similar predictive ability, suggesting that the actual distribution and dose of the microspheres is a bigger determinant of TARE results than intratumoral hemodynamics extrapolated from preprocedural baseline imaging. Though these results do not assist in patient selection for initial therapy, being able to predict response at an early stage could be helpful in determining when earlier retreatment would be indicated. Limitations included a small sample size (30 patients, 105 lesions) as PET or CT posttherapy imaging is not routinely performed at every institution.
Barriers to Clinical Implementation of Prognostication Models
Though image-based AI models for prognostication of HCC treatment response are promising, several advances are required before widespread clinical use. Just as with medical devices, AI software should be subjected to rigorous approval and regulatory processes to ensure safe and effective patient care. Recently, the U.S. Food and Drug Administration proposed a pathway for clearing ML software as a medical device (32). In this section, we discuss elements of this pathway in the context of AI-based HCC prognostication models.
Good ML Practice
Each step of the modeling process requires standardization by expert consensus and evidence-based practice to ensure reproducibility of results. Standardization of some steps (eg, using a standardized feature extraction algorithm) is already underway as a large majority of studies use PyRadiomics to extract radiomic features (24,33–37). Establishing procedural standards, such as using a specific image acquisition protocol and imaging modality, may be challenging as specific requirements for HCC workup vary between hospitals (38). While most models already incorporate some clinical data and static images, they can be further refined through the incorporation of dynamic images (eg, cine imaging), as well as data (eg, genomic, histopathologic) from preexisting databases. These considerations emphasize the need for prospective research demonstrating the diagnostic utility of said protocols.
It is also worth noting that the assessment of clinical response to HCC treatment lacks consistency, making comparison across studies difficult. Different studies have used different imaging response standards—such as mRECIST, qEASL (quantitative European Association for the Study of the Liver guidelines), or LI-RADS (Liver Imaging Reporting and Data System)—to categorize treatment response, though mRECIST has been used the most. Among the studies using mRECIST, authors have grouped the responses differently: (a) response (partial or complete) versus nonresponse (stable or progressive disease), (b) complete response versus not (partial response or stable or progressive disease), or (c) disease control (complete or partial response or stable disease) versus disease progression. Finally, only some studies specified time points for evaluating treatment response (eg, response assessed after 2 years). Even fewer compared the performance of their model assessed at different time points after treatment to ensure generalizability (39). The inconsistencies in assessing and categorizing treatment responses obfuscate model performance comparison. For this reason, international committees have convened to define technical, clinical, and regulatory standards to improve the consistency of AI analysis and increase operability across clinical centers and geographic regions (40).
Algorithm Bias and Robustness
In terms of standardizing testing processes, studies should be validated on an external data set to confirm generalizability, and ideally in a prospective manner to reduce selection bias. A productive way to facilitate this process is to create open databases where existing imaging data can serve as additional training data to augment existing models or as independent external testing sets to demonstrate the reproducibility of results. Furthermore, investigators must report key details of the model development process to enable model acceptance. Standardized guidelines, such as TRIPOD-AI (Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis–AI) and the Radiomics Quality Score, should be used for reporting imaging AI studies (41–43).
Regarding reproducibility and external testing, it is important to acknowledge that most of these studies were conducted in China, where HCC is predominantly secondary to chronic hepatitis B virus infection, whereas HCC in Western cohorts is often secondary to hepatitis C virus infection, alcohol misuse, or obesity (44). Since radiomic features have been shown to be highly correlated with histopathologic and genetic features, treatment algorithms derived from populations with one predominant HCC etiology may not be applicable to other populations with different etiology of HCC. It is also important to be mindful that AI algorithms trained on data that are imbalanced with respect to ethnicity, geography, and socioeconomics may promote bias and hence exacerbate existing health disparities.
Transparency to Users to Enhance Trust in AI Models
One of the barriers to the integration of these AI models into clinical workflow is the lack of transparency of the models’ logic. Whereas tumor size and the number of lesions are features that are easily understood to be highly correlated with treatment response, quantitative imaging features and their relation to treatment success are harder to interpret and internalize. To successfully facilitate acceptance of these AI models, there must be a high level of transparency and trust, which can be achieved through an explanation of the mechanism behind the AI decision-making process and through training of radiologists so that they can familiarize themselves with a model and the information it produces (45,46). Feature attribution processes that highlight the regions of the medical images that influenced the model decision can assist physicians in decoding nonintuitive imaging features and can provide clinical interpretations that allow for better understanding and acceptance of complex AI models (47,48).
Additional Issues in Real-World Integration
To facilitate clinical adoption, models must be user-friendly and integrate seamlessly into radiologic workflow. In the ideal scenario, when radiologists review imaging with lesions suspicious for HCC or when interventionalists or hepatologists see patients during the initial clinic visit, they would open the AI tool at their workspace and automatically receive a recommendation on the next steps of management based on the prediction of treatment response. The program and its results would be reviewed in the broader clinical context of the patient's serum markers and functional status to determine treatment appropriateness.
Models must undergo clinical trials to evaluate not only robustness and accuracy but also changes in patient outcomes due to changes in management driven by the tool. Before incorporating commercially available AI algorithms into clinical practice, institutions should test models on local data sets to ascertain generalizability to their patient population. Furthermore, it is essential to build a data registry to monitor model performance in clinical workflow, providing developers with a way to identify opportunities for improvement and respond proactively to safety concerns (49).
Overall, the regulatory framework for U.S. Food and Drug Administration approval of AI-based software is still in its inception. Future efforts should strive to develop standardized modeling, testing, and postmarket monitoring processes and offer model transparency to facilitate the integration of effective and reproducible results into clinical practice.
Conclusion
Image-based artificial intelligence (AI) models predict the performance of interventional treatment for hepatocellular carcinoma (HCC) by automatically extracting quantitative imaging features and analyzing the complex relationships between these features and clinical outcomes. Despite the active landscape, there are no applications of AI models approved by the U.S. Food and Drug Administration for image-guided treatments of HCC or interventional radiology overall. This lack of AI models approved by the U.S. Food and Drug Administration highlights the need for clinical validation beyond the development of models (49). Though many obstacles stand before widespread implementation for clinical use, current applications have shown the potential of predictive AI as a tool for personalizing treatment in patients with HCC.
Acknowledgment
The authors thank Thomas Yi, MD (Department of Diagnostic Imaging, Brown University), for comments on the manuscript.
References
- 1. . A simple prognostic scoring system for patients receiving transarterial embolisation for hepatocellular cancer. Ann Oncol 2013;24(10):2565–2570.
- 2. . How to STATE suitability and START transarterial chemoembolization in patients with intermediate stage hepatocellular carcinoma. J Hepatol 2014;61(6):1287–1296.
- 3. . The ART of decision making: retreatment with transarterial chemoembolization in patients with hepatocellular carcinoma. Hepatology 2013;57(6):2261–2273.
- 4. . Prediction of survival among patients receiving transarterial chemoembolization for hepatocellular carcinoma: a response-based approach. Hepatology 2020;72(1):198–212.
- 5. . Validation of the hepatoma arterial embolization prognostic score in European and Asian populations and proposed modification. Clin Gastroenterol Hepatol 2015;13(6):1204–8.e2.
- 6. . Validation of Pre-/Post-TACE-Predict models among patients with hepatocellular carcinoma receiving transarterial chemoembolization. Cancers (Basel) 2021;14(1):67.
- 7. . Predictive performance of the mHAP-II score in a real-life western cohort with hepatocellular carcinoma following trans-arterial chemoembolisation with drug-eluting beads (DEB-TACE). Eur Radiol 2020;30(7):3782–3792.
- 8. . Validation of the risk prediction models STATE-Score and START-Strategy to guide TACE treatment in patients with hepatocellular carcinoma. Cardiovasc Intervent Radiol 2017;40(7):1017–1025.
- 9. . Artificial intelligence in medicine. Metabolism 2017;69S:S36–S40.
- 10. . Interventional radiology ex-machina: impact of artificial intelligence on practice. Radiol Med (Torino) 2021;126(7):998–1006.
- 11. . Statistics versus machine learning. Nat Methods 2018;15(4):233–234.
- 12. . A machine learning model to predict hepatocellular carcinoma response to transcatheter arterial chemoembolization. Radiol Artif Intell 2019;1(5):e180021.
- 13. . Multi-task deep learning approach for simultaneous objective response prediction and tumor segmentation in HCC patients with transarterial chemoembolization. J Pers Med 2022;12(2):248.
- 14. . Computational radiomics system to decode the radiographic phenotype. Cancer Res 2017;77(21):e104–e107.
- 15. . Learned vs. hand-crafted features for pedestrian gender recognition. In:
MM ’15: proceedings of the 23rd ACM international conference on multimedia . Association for Computing Machinery; 2015:1263–1266. - 16. . A review of feature reduction techniques in neuroimaging. Neuroinformatics 2014;12(2):229–244.
- 17. . The role of artificial intelligence in interventional oncology: a primer. J Vasc Interv Radiol 2019;30(1):38–41.e1.
- 18. . Texture analysis as a radiomic marker for differentiating renal tumors. Abdom Radiol (NY) 2017;42(10):2470–2478.
- 19. . Real-time automatic prediction of treatment response to transcatheter arterial chemoembolization in patients with hepatocellular carcinoma using deep learning based on digital subtraction angiography videos. Cancer Imaging 2022;22(1):23.
- 20. . Dynamic contrast-enhanced ultrasound radiomics for hepatocellular carcinoma recurrence prediction after thermal ablation. Mol Imaging Biol 2021;23(4):572–585.
- 21. . A quality assessment tool for artificial intelligence-centered diagnostic test accuracy studies: QUADAS-AI. Nat Med 2021;27(10):1663–1665.
- 22. . QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med 2011;155(8):529–536.
- 23. . Development and validation of a deep learning model for survival prognosis of transcatheter arterial chemoembolization in patients with intermediate-stage hepatocellular carcinoma. Eur J Radiol 2022;156:110527.
- 24. . Development and validation of a pyradiomics signature to predict initial treatment response and prognosis during transarterial chemoembolization in hepatocellular carcinoma. Front Oncol 2022;12:853254.
- 25. . Contrast-enhanced ultrasound of the liver: technical and lexicon recommendations from the ACR CEUS LI-RADS working group. Abdom Radiol (NY) 2018;43(4):861–879.
- 26. . Deep learning radiomics based on contrast-enhanced ultrasound might optimize curative treatments for very-early or early-stage hepatocellular carcinoma patients. Liver Cancer 2020;9(4):397–413.
- 27. . Assessment of hepatocellular carcinoma response to 90y radioembolization using dynamic contrast material-enhanced MRI and intravoxel incoherent motion diffusion-weighted imaging. Radiol Imaging Cancer 2020;2(4):e190094.
- 28. . Prediction of hepatocellular carcinoma response to 90 yttrium radioembolization using volumetric ADC histogram quantification: preliminary results. Cancer Imaging 2019;19(1):29.
- 29. . Tumor response prediction in 90 Y radioembolization with PET-based radiomics features and absorbed dose metrics. EJNMMI Phys 2020;7(1):74.
- 30. . Comparison of MRI-based response criteria and radiomics for the prediction of early response to transarterial radioembolization in patients with hepatocellular carcinoma. Diagn Interv Imaging 2022;103(7-8):360–366.
- 31. . Radiation dosimetry, artificial intelligence and digital twins: old dog, new tricks. Semin Nucl Med 2023;53(3):457–466.
- 32. Artificial intelligence/machine learning (AI/ML)-based software as a medical device (SaMD) action plan. U.S. Food and Drug Administration, 2019.
- 33. . Prediction of prognostic risk factors in hepatocellular carcinoma with transarterial chemoembolization using multi-modal multi-task deep learning. EClinicalMedicine 2020;23:100379.
- 34. . Machine-learning analysis of contrast-enhanced computed tomography radiomics predicts patients with hepatocellular carcinoma who are unsuitable for initial transarterial chemoembolization monotherapy: a multicenter study. Transl Oncol 2021;14(4):101034.
- 35. . Predicting the initial treatment response to transarterial chemoembolization in intermediate-stage hepatocellular carcinoma by the integration of radiomics and deep learning. Front Oncol 2021;11:730282.
- 36. . A Pre-tace radiomics model to predict HCC progression and recurrence in liver transplantation: a pilot study on a novel biomarker. Transplantation 2021;105(11):2435–2444.
- 37. . Radiomics analysis of pretreatment MRI in predicting tumor response and outcome in hepatocellular carcinoma with transarterial chemoembolization: a two-center collaborative study. Abdom Radiol (NY) 2022;47(2):651–663.
- 38. . Prediction of treatment response to transarterial radioembolization of liver metastases: radiomics analysis of pre-treatment cone-beam CT: a proof of concept study. Eur J Radiol Open 2021;8:100375.
- 39. . Machine-learning models for prediction of posttreatment recurrence in early-stage hepatocellular carcinoma using pretreatment clinical and MRI features: a proof-of-concept study. AJR Am J Roentgenol 2023;220(2):245–255.
- 40. . Consensus guidelines for the definition of time-to-event end points in image-guided tumor ablation: results of the SIO and DATECAN Initiative. Radiology 2021;301(3):533–540.
- 41. . Protocol for development of a reporting guideline (TRIPOD-AI) and risk of bias tool (PROBAST-AI) for diagnostic and prognostic prediction model studies based on artificial intelligence. BMJ Open 2021;11(7):e048008.
- 42. . Radiomics: the bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol 2017;14(12):749–762.
- 43. . FUTURE-AI: guiding principles and consensus recommendations for trustworthy artificial intelligence in medical imaging. arXiv 2109.09658 [preprint] https://arxiv.org/abs/2109.09658. Published September 20, 2021. Updated September 29, 2021. Accessed October 14, 2023.
- 44. . Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 2018;68(6):394–424. [Published correction appears in CA Cancer J Clin 2020;70(4):313.]
- 45. . Artificial intelligence decision-making transparency and employees’ trust: the parallel multiple mediating effect of effectiveness and discomfort. Behav Sci (Basel) 2022;12(5):127.
- 46. . Deep learning in medicine-promise, progress, and challenges. JAMA Intern Med 2019;179(3):293–294.
- 47. . Grad-CAM++: generalized gradient-based visual explanations for deep convolutional networks. In:
2018 IEEE Winter Conference on Applications of Computer Vision (WACV) . Lake Tahoe, NV: IEEE; 2018:839–847. - 48. . Artificial intelligence for the prevention and clinical management of hepatocellular carcinoma. J Hepatol 2022;76(6):1348–1361.
- 49. . Proceedings from the Society of Interventional Radiology Foundation Research Consensus Panel on Artificial Intelligence in Interventional Radiology: from code to bedside. J Vasc Interv Radiol 2022;33(9):1113–1120.
- 50. . Predicting treatment response to intra-arterial therapies for hepatocellular carcinoma with the use of supervised machine learning—an artificial intelligence concept. J Vasc Interv Radiol 2018;29(6):850–857.e1.
- 51. . Residual convolutional neural network for predicting response of transarterial chemoembolization in hepatocellular carcinoma from CT imaging. Eur Radiol 2020;30(1):413–424.
- 52. . Accurate prediction of responses to transarterial chemoembolization for patients with hepatocellular carcinoma by using artificial intelligence in contrast-enhanced ultrasound. Eur Radiol 2020;30(4):2365–2376.
- 53. . Prediction of post transarterial chemoembolization MR images of hepatocellular carcinoma using spatio-temporal graph convolutional networks. PLoS One 2021;16(12):e0259692.
- 54. . Prediction of TACE treatment response in a preoperative MRI via analysis of integrating deep learning and radiomics features. J Med Biol Eng 2022;42(2):169–178.
- 55. . Evaluation of therapeutic effects of computed tomography imaging classification algorithm-based transcatheter arterial chemoembolization on primary hepatocellular carcinoma. Comput Intell Neurosci 2022;2022:5639820.
- 56. . Preoperative CECT-based radiomic signature for predicting the response of transarterial chemoembolization (TACE) therapy in hepatocellular carcinoma. Cardiovasc Intervent Radiol 2022;45(10):1524–1533.
- 57. . Radiomics analysis of ultrasound to predict recurrence of hepatocellular carcinoma after microwave ablation. Int J Hyperthermia 2022;39(1):595–604.
- 58. . A radiomics-based model can predict recurrence-free survival of hepatocellular carcinoma after curative ablation. Asian J Surg 2022. https://doi:10.1016/j.asjsur.2022.09.130. Published online November 7, 2022.
Article History
Received: Nov 9 2022Revision requested: Dec 28 2022
Revision received: June 2 2023
Accepted: June 13 2023
Published online: Nov 07 2023