Fundamentals of artificial intelligence
Artificial Intelligence (AI) requires human intelligence and is performed by a computer.
An algorithmic model is developed by humans (programmers)—a code instructing the computer
step by step for necessary acting, reasoning, and learning.
Machine learning (ML) is an algorithm that can learn from data and act according to
this knowledge without extensive prior programming. Here, a specific task can be performed
based on particular patterns and inferences. It does not need any clear instructions
or any programs. Based on “training data,” a mathematical model is created and the
computer can learn to make predictions. The ML model performs better with exposure
to more data over time. The three main types of ML algorithms are:
-
(i) Unsupervised ML: It depends on finding patterns. Unlabeled data are given as an
input, following which the system looks for patterns and classifies the data according
to the identified patterns.
-
(ii) Supervised ML: It predicts algorithms based on past learning. Labeled data are
already given as input; hence, the system categorizes various new given inputs based
on what has been learned with old labeled data. One of the most commonly used forms
of supervised learning is deep learning.
-
(iii) Reinforcement ML: It uses a system of reward and punishment for algorithmic
training.
Deep learning (DL) is a subdivision of machine learning methods created on artificial
neural networks. This model is built based on how a huge data/information is processed
by the human brain. A well-designed and trained DL model can predict and perform classification
tasks with more accuracy, which exceeds human expertise at times. In the field of
medicine, AI can be divided into two branches: virtual and physical.[1]
Virtual branch of AI: It is primarily based on ML, where mathematical algorithms help
improve experience-based learning. The applications of virtual AI in medicine include
electronic health record (EHR) systems and neural network-based guidance in treatment
decisions.
Physical branch of AI: It is the use of machines (robots) in physical form (macroscopic
or microscopic) for assistance in performing daily tasks, surgeries, or delivery of
chemotherapeutic drugs to a specific site.
-
(1) Carebots: These are advanced and sophisticated robots. They are used to assist surgeons
during operation, as intelligent prostheses for handicapped people, and care of the
elderly.
-
(2) Nanobots: They refer to miniature devices (size = 0.1–10 µm) in the form of switches,
motors, shuttles, or cars carrying a particular drug or molecule. One such example
is the targeted drug delivery using nanoliposomes by overcoming the permeation and
diffusion barriers of the conventional therapeutic agents.
Applications of AI in oncology
AI has been and is being explored in various fields of oncology related to histopathology,
imaging, radiation oncology, medical oncology, translational oncology, and clinical
decision-making as depicted in [Fig. 1].
Fig. 1 Applications of artificial intelligence (AI) in oncology.
Cancer Screening
AI has found its vital place in the screening of cancers. Several convolutional neural
network (CNN) models from the 2017 Kaggle Data Science Bowl demonstrated 80 to 95%
accuracy for lung cancer screening by evaluation of suspicious nodules in thoracic
computed tomography (CT) films.[2] In the same year, the digital mammography (DM) DREAM challenge by IBM, tested whether
AI algorithms could be equal to or supplement the radiologists in interpreting mammograms.
AI-based algorithms have reviewed 640,000 digital mammograms with 81% and 80% specificity
and sensitivity, respectively.[3] Scott et al. assessed the performance of AI in predicting breast cancer from a large
dataset of mammograms from both the USA and UK.[4] The false-positive and false-negative rates reduced by 5.7%, 1.2% (USA, UK) and
9.4%, 2.7%, respectively. An independent study was done comparing AI algorithms and
the radiologist's interpretation of screening mammograms for detecting breast cancer.
The AI surpassed all human readers (six radiologists): the area under the receiver
operating characteristic curve (AUROC) for the AI system was greater by an absolute
margin of 11.5% than that for an average radiologist.
Cancer Diagnosis
Within the field of diagnostics, AI is gaining a pivotal role in accurate and speedy
results.
Clinical Images
The CNN-trained algorithm using 130,000 skin images classified malignant lesions with
higher sensitivity and specificity as compared with 21 dermatologists.[5] A 94% sensitivity in polyp detection using colonoscopy images from 1,290 patients
was observed with CNN.[6]
Radiographic Imaging
A study using a DL-based AI algorithm for the prediction of disease based on electronic
health records (EHRs) of patient's clinical history, laboratory investigations, and
imaging findings has been successful in this arena. In this, a novel framework called
“deep patient” was developed based on a large dataset of patients' EHRs. Different
patient sets with common patterns in the given data were identified. The new data
were entered later and tested for its accuracy in predicting novel diseases in the
next 1 year. It predicted the development of a variety of diseases including cancers
of the prostate, rectum, and liver with 93% accuracy.[7] Another CNN-based model showed >85% accuracy in identifying extranodal extension
(ENE) in head and neck cancers on diagnostic, contrast-enhanced CT scans.[8]
The CNNs have been able to predict both IDH mutation and MGMT methylation status with 85% to 95% and 83% accuracy, respectively, based on raw imaging
data alone.[9]
[10] A CNN model successfully predicted the complete response to neoadjuvant chemoradiation
with 80% accuracy in locally advanced rectal cancer.[11] Additionally, a radiomics signature using extracted features from CT data and an
ML algorithm were able to predict underlying CD8 cell tumor infiltration and, remarkably,
response to immunotherapy for a variety of advanced cancer patients in phase 1 trials
treated with anti-PD-1 or anti-PD-L1 monotherapy.[12]
Digital Pathology
The diagnostic rates of lymph node metastasis in resected specimens of carcinoma breast
using DL CNN algorithms were found to be the same as those observed in a group of
pathologists.[13] Similarly, in adenocarcinoma of the prostate, a 75% rate of agreement of Gleason
grading was seen between the DL algorithm and pathologists.[14] A trained CNN was developed to predict six genetic mutations (STK11, EGFR, FAT1, SETBP1, KRAS, and TP53) from lung cancer biopsies. This paved the way for predicting a genotype or mutation
based on histopathologic architectural patterns.[15] These methods will be cost-effective as compared with tests for direct mutational
analysis.
Radiation Therapy
AI has been proposed for image acquisition, tumor segmentation, image registration,
radiation planning, and radiation delivery—the steps during treatment workflow in
radiotherapy.
Image Acquisition
CT scan is the basic imaging for getting electronic density values. It helps in the
planning of radiation therapy using algorithms for the calculation of doses. However,
MRI is advantageous for better soft tissue delineation and acquisition of multi-planar
images. There has been continuous research in the development of methods to generate
a CT scan using MRI data—a synthetic CT scan (sCT). Of these methods, deep embedding
CNN has shown promising results in terms of efficiency, time consumption, high-quality
image resolution, and fewer artifacts.[16] Algorithms for creating sCT from primary MRI images and for generation of radiation
plan have been used in MRI only for prostate radiotherapy.[17]
Segmentation of Tumors and Organs at Risk
Manual contouring of organs at risk (OAR) and the volume of the target is a time-consuming
task with huge interobserver variability.[18] To overcome this hurdle, automatic contouring software using knowledge-based algorithms
(atlas, machine learning, models based on statistical shape and appearance); region-based
(adaptive thresholding, graph cuts, and contouring by watershed); or a combination
of the knowledge and region-based have been used.[19] In a recent study by Lustberg,[20] the deep learning contouring outperformed the atlas-based contouring for the lungs
and spinal cord. With deep learning methods, 79% of the median time was saved in comparison
to manual methods. Men et al.[21] hypothesized a new deep dilated CNN-based method for target auto-segmentation and
volume delineation of OAR. It took on an average 45 seconds for one patient for segmentation
of all clinical target volume (CTV) and OAR which is far less time than that taken
for drawing the structures conventionally.
Image Registration
For image registration (to align an image to the reference image), mathematically
applied transformations are used. The two traditional registration methods used in
radiotherapy are the intensity-based method and the rigid method. In their review
on newer techniques of image registration, Viergever et al[22] found the deep learning process for registration of the images to be easier and
user-friendly. This has been substantiated by studies done later by Yang et al.[23] and Miao et al.[24] using DL- and CNN-based methods, respectively.
Radiation Planning
The planning process for radiotherapy using AI algorithms was pioneered by McIntosh
et al.[25] They used a voxel-based method for predicting the dose and dose-mimicking method
in planning radiation for head and neck cancers. The advantage of this adaptive radiotherapy
planning process is not limited to saving time but also includes dose adjustments
related to age, sex, race, and genetic makeup.[26]
PORTOS is the first clinical radiogenomics assay that uses predictive biomarkers to
determine the tumor sensitivity to radiation.[27]
Radiation Delivery
The recent advances in radiotherapy such as IGRT and IMRT focus more on image-guided
delivery of radiation and less on the patient's position. Positional deviation leads
to dose variation and hence compromises the efficacy of the treatment. Ogunmolu et
al.[28]
[29] developed a soft-robot actuator for head and neck radiotherapy without using a mask
to monitor intra- and interfraction movements during radiation. It showed promising
results in controlling the movement of the head to within the nearest 2 mm (millimeter)
as compared with the trajectory of reference. Park et al.[30] have used the data of breathing patterns for intra- and interfraction fuzzy deep
learning (FDL) to decrease tumor-tracking time and adjust radiation doses in lung
cancer patients according to breathing movements.
Treatment Outcomes
In 2003, an ANN analysis was done on 125 nonsmall cell lung cancer patients to predict
their 5-year survival rates based on the immunohistochemical and clinicopathological
variables.[31] The ANN-based prediction model was superior to logistic regression analysis in estimating
the 5-year NSCLC survival rates (87% vs. 78%). In-silico analysis using clinical and
dosimetric data was also done to predict the genitourinary and lower gastrointestinal
toxicity resulting from radiotherapy to prostate,[32]
[33]
[34] hepatobiliary toxicity after liver irradiation,[35] and rectal toxicity in cervical cancer patients receiving radiation therapy.[36]
Translational Oncology
The utility of DL neural networks has extended even to the field of translational
oncology. With advancements in biotechnology and biochemistry, there is huge information
on protein structure, and protein–protein and drug-protein interactions. The structure
of a protein can be predicted.[37] To speed up the drug development process, CNN and ML methods were used to predict
the side effects of various drug combinations.[38] It saves time and financial resources by halting the drug development process due
to serious toxicities. The cells can be characterized according to different mitotic
stages [39] based on microscopy images. The progenitor cell lineage in the future can be delineated
based on their microscopic images.[40] Deep learning artificial neural networks (DL ANNs) have been trained to predict
the chances of failure of around 200 drugs based on transcriptomic response signatures.[41] By combining both genomic and chemical characteristics, AI can predict the sensitivity
of tumor cells to therapeutics.[42] The role of AI has been extended even to immunologicals by predicting the peptide
major histocompatibility complex (MHC) binding of the immunological drugs using CNNs.[43]
Role of AI in Clinical Decision-Making
With the enormous amount of ongoing research in oncology that includes clinical trials,
new drug development, and the discovery of predictive biomarkers, there is a splendid
opportunity for AI to assist in data synthesis and guide us in making and taking decisions
for cancer patients. More commercial applications are being developed that utilize
DL and natural language processing to utilize this huge data from bench-side to bed-side.[44] International business machines (IBM) in collaboration with Watson for Oncology
(WFO) used AI for linking the patient data to national treatment guidelines. The WFO
demonstrated high concordance rates of the multi-disciplinary team recommendations
with that of AI in breast cancer patients.[45]
[46] However, the same IBM Watson cognitive computing system failed in taking the right
decisions related to cancer treatment. After a huge expenditure of $62 million, the
joint venture of IBM and University of Texas MD Anderson Cancer Center for the development
of an advanced Oncology Expert Advisor had to be called off! Retrospection showed
training with a small dataset as a cause for this failure.
Role of AI in Hemato-Oncology
Several CNN-based models have distinguished between different white blood cells on
peripheral smear with more than 95% AUROC scores.[47] Similar work is underway for the interpretation of bone marrow specimens. In patients
with lymphoma, automated analysis for bone marrow involvement has been successful.[48] The risk stratification for advanced Hodgkin lymphoma based on PET-CT images was
also tried with encouraging results.[49]
In malignant hematology, there are myriad clinical and laboratory-based prognostic
scores for each cancer. However, there is a significant variation on a case-to-case
basis. AI can deal with this complex data and help in personalized therapy. Upfront
AML/MDS risk stratification can be improvised with the help of AI.[50] One of the most important poor prognostic factors after treatment completion in
hematological malignancies is the detectable minimal residual disease (MRD). AI matched
the human-level performance in MRD detection using flow cytometry in patients with
acute myeloid leukemia.[51] A retrospective data analysis by Sasaki et al in CML-treated patients showed a longer
survival rate for ML-informed treatments. Algorithm-based models in MDS predicted
the response to hypomethylating agents.[52]
Role of AI in Resource-Limited Settings
In developing countries such as India, there is a huge disparity between the numbers
of patients and the numbers of health workers. A SWOT (Strengths, Weakness, Opportunities,
Threat) analysis for this cutting-edge technology in developing countries is as follows:
-
Strengths: Availability of large amounts of data, emerging young population with a
lot of talent and interest in this new technology, eco-friendly.
-
Weakness: High cost, lack of trained personnel, lack of standardization of data collection,
storage and processing, data privacy, data security, digital divide (lack of access
to high-speed broadband networks in rural areas).
-
Opportunities: Public–private partnerships, national and international collaborations,
Government funding (NITI–National Institution of Transforming India, DBT–Department
of Biotechnology) for the development of AI in health care settings.
-
Threats: Loss of technological and health care workers' employment, legal and ethical
issues.
Limitations of AI
This groundbreaking technology has its limitations that are hindering AI to be used
in routine clinical settings.
First, the greatest hindrance is the external validation of DL applications and the
generalizability of DL applications to all patients and tumors. The heterogeneous
medical data across institutions and different patient populations will require multiple
validation sets to prove the performance of AI. Second, access to data is a problem.
It would be difficult to provide data for diseases with less prevalence. Contributing
to this data scarcity, there are ethical and legal issues involved in sharing of protected
information about patient's health across different institutions and also data heterogeneity
along with incomplete data collection and competition between institutions. The use
of findable, accessible, interoperable, reusable (FAIR) data,[53] and many other opportunities are being provided for various research groups to address
this issue.[54] Third, tracing the exact logic behind the predictions of DL is a hilarious task.
Hence, it is rightly called the “black box” problem, which means the explanation for
how and why it gave the output based on the input given, cannot be provided by AI.[55] The era of personalized and evidence-based medicine demands a rationale for every
clinical decision being taken. The specific characteristic(s) of the input data that
contributed to the ultimate output cannot be pinpointed in DL algorithms. From the
perspective of both practitioner and regulatory bodies, this interpretability is a
challenge for adopting AI-based algorithms into the healthcare system.[56]
[57] Fourth, AI in radiotherapy and pathology is mostly dependent on the images for auto-segmentation,
hence quality and the number of images used for training dataset matters a lot in
giving the final output. Poor-quality images and fewer images lead to faulty assessments
by AI. Fifth, medical professionals lack the education and expertise required for
dealing with AI-related algorithms. A collaboration between departments of clinical
oncology, bioinformatics, and data science is needed to tide over this problem. Lastly,
being in an initial start-up stage, the cost will be higher and institutions need
financial planning accordingly.
Conclusion
The breakthrough technology of this millennium “artificial intelligence” has a promising
future in all the fields of oncology. It is a necessary tool for handling enormous
clinical data, accurate delivery of treatment, improving personalized treatment selection,
and predicting patient and disease outcomes.
Like any new invention, AI also has had and continues to have some hiccups, but more
research into these areas can help address them. It is not surprising to say that
in near future, AI will become a new “sixth sense” for every oncologist.