Keywords
Nursing informatics - environmental health - standardized nursing terminology - systematized
nomenclature of medicine - artificial intelligence
1 Introduction
Our health is interconnected with the air, water and land in which we live and work.
However, environmental risks are not evenly distributed, as vulnerable populations
may have a higher risk of exposure to air pollution, noise pollution and hazardous
chemicals due to pre-existing health conditions, poor nutritional status and other
environmental risk factors. In 2015, United Nations (UN) Member States formally adopted
17 Sustainable Development Goals (SDGs) to achieve by the year 2030. These goals provided
a shared plan for the health, well-being, and prosperity of people and the planet.
For example, SDG 3.9.1 considers the mortality rate attributed to ambient and household
air pollution. Further SDG indicators consider the intersection of sustainability
and human existence related to health, poverty, pollution, water, education, gender
equality, energy, and economic growth [[1]]. To further these goals, various disciplines have come together to collaborate
on the concept of One Health.
One Health is a collaborative, multisectoral approach which highlights the absolute
interconnectedness between human health and the complex ecosystems in which we live
and share [[2], [3]]. This transdisciplinary approach is an important consideration as human populations
are expanding into new geographic areas which has led to potential opportunities for
diseases to pass between animals and people. Each year, millions of people are impacted
by antimicrobial-resistant germs, vector-borne diseases, and ill health related to
the contamination of water. Weather changes impacting temperature and humidity are
also associated with adverse outcomes related to respiratory, cardiovascular, and
neurological conditions and infectious diseases [[4]
[5]
[6]
[7]].
Environmental sustainability further informed this project. Morelli [[8]] describes environmental sustainability “…as a condition of balance, resilience,
and interconnectedness that allows human society to satisfy its needs while neither
exceeding the capacity of its supporting ecosystems to continue to regenerate the
services necessary to meet those needs nor by our actions diminishing biological diversity.”
The question remains how well healthcare is advancing towards meeting environmental
sustainability, One Health and the SDGs. One such area where healthcare clinician
contributions can be leveraged in this evaluation is through secondary data reuse
of clinical documentation. Clinicians assess, diagnose, provide interventions and
evaluate the care of people and communities affected by environmental impacts. In
many settings, this documentation is captured in electronic format with embedded standardized
clinical terminologies. Standardized clinical terminologies are databases of clinical
terms that have computer-readable codes, are arranged in a polyarchy with defined
relationships, have conceptual uniqueness and permanence, and can expand through multiple
granularities [[9]]. As a mechanism to support data reuse, standardized clinical terminologies can
be used to facilitate the aggregation of clinical data across disparate settings and
applied in larger-scale evaluations. Two healthcare terminology standards often used
in this capacity, are the Systematized Nomenclature of Medicine Clinical Terms (SNOMED
CT), a multilingual clinical health terminology [[10]] and the International Classification for Nursing Practice (ICNP), a classification
of nursing diagnosis, interventions, and outcomes specifically describing nursing
practice [[11]]. These standardized terminologies are systematically organized and are computer-readable
collections of medical terms that are defined, codified, and have synonyms associated
with easier cross-mapping and comparison across systems and practice settings [[12]]. In 2021, the International Council of Nurses (ICN) and SNOMED International announced
an agreement whereby ICNP would be managed and distributed by SNOMED International
while content ownership would remain with ICN [[13]]. To support this work, reference sets and tables have been developed to describe
the mapping equivalence between these two terminologies.
While much work has been advanced to represent healthcare concepts in SNOMED CT and
ICNP, it is unknown if the suite of associated health concepts, represented in these
SDGs, exists in these systems. Therefore, this study aimed to evaluate the representation
of environmental concepts associated with health impacts in SNOMED CT and ICNP.
2 Objectives
The objective of this study was to evaluate the representation of environmental concepts
associated with health impacts, in standardized clinical terminologies.
3 Methods
This study used a descriptive methodology informed by Block et al. [[14]] procedural framework for standardized clinical terminology mapping ([Figure 1]). As per the Canadian Tri-Council Policy Statement for Ethical Conduct for Research
Involving Humans-Chapter 2 [[15]], this research study was excluded from requiring research ethics board approval,
and as such, no formal ethics application was conducted.
Fig. 1 Procedural Framework for Standardized Clinical Terminology Mapping (inspired from
[[14]]).
3.1 Source Concepts
The Global Indicator Framework for the Sustainable Development Goals and Targets of
the 2030 Agenda for Sustainable Development document was chosen as the source reference
material to perform concept extraction [[1]]. Developed by the Inter-Agency and Expert Group on SDG Indicators, it contains
a descriptive list of targets and indicators. To define source concept inclusion,
this research group focused on the Good Health and Well-Being SDG and those SDG indicators
whose custodial agency was, or, in discussion to be, the World Health Organization
[[16]]. The remaining indicators were excluded from this study.
3.2. Source Concept Extraction Protocol
A total of 41 SDG indicators were identified as health-related from the source document
of the Global Indicator Framework document. These indicators were then divided among
five nursing researchers (established & emerging) for concept extraction. For this
paper, a concept targeted for extraction was defined as the identification of unique,
explicit concepts which correspond to a singular meaning and term/phrase [[9]]. The researchers a) reviewed the assigned indicator; b) identified clinical concept(s);
c) categorized the concepts into a domain type (as per SNOMED CT) [[17]]; and d) added possible synonyms or extensions as interpreted by their own clinical
knowledge and expertise. These were then reviewed by the research team for verification
of completeness and accuracy. Disagreement was discussed until consensus was achieved.
3.3 Conceptual Modeling
The researchers assigned domain type(s) to facilitate an understanding of the orientation
and descriptive meaning of the extracted concepts (e.g., the extracted concept was “tuberculosis” and the domain type was “diagnosis”). The
purpose of this step was to embed a dimension of meaning, awareness, and boundary
to the conceptual phenomena the symbolized ‘word' was interpreted to portray [[18], [19]]. The domain types were informed by the conceptual domains used in SNOMED CT and
ICNP to arrange health concepts into the terminologies of polyhierarchical ontology
structures [[20]].
3.4 Target Terminology
The target terminologies were: 1) SNOMED CT International Edition (manual: 2022-09-30
version; automated 2022-07-31 version), and 2) ICNP 2019 Edition. All the extracted
source concepts were prepared for SNOMED CT mapping. Given the scope of ICNP, only
source concepts represented as diagnosis, interventions, and outcomes were included.
3.5 Mapping
The following mapping procedures were sequentially applied in this study:
-
Mapping Coordination: Pre-coordination;
-
Mapping Style: Manual and Automated (SNOMED CT); Manual (ICNP);
-
Hierarchical Awareness: All extracted concepts had potential alignment to existing
hierarchies in SNOMED CT and were therefore candidates for SNOMED CT mapping. All
extracted concepts aligned to diagnosis, intervention and outcomes had potential alignment
to existing hierarchies in ICNP and were therefore candidates for ICNP mapping;
-
Systematic Search Strategy (see below);
-
Cardinality (see below).
3.5.1 Systematic Search Strategy
Both manual and automated methods were used. These methods were chosen as a means
to leverage the strengths of human and computer mapping selection, as well as, control
for possible selection bias [[19], [21], [22]]. During the manual mapping phase, extracted concepts were uploaded into an excel
spreadsheet and divided among four researchers where each concept was independently,
manually mapped twice. [Table 1] outlines the systematic search strategy used in each terminology online browser.
Table 1 Systematic Search Strategy.
During the automated mapping phase, extracted concepts were uploaded into an excel
spreadsheet. All punctuation was removed to facilitate CSV upload to the automated
mapping tool Snap2SNOMED CT [[23]]. All concepts were tasked as an ‘automap', whereby the Snap2SNOMED CT software
automatically assigned candidate matches.
3.5.2 Cardinality
The types of manual mapping matches were defined as:
-
Direct match means that the concept matches exactly to the concept found in SNOMED
CT or ICNP;
-
Broader than match means that the concept found in SNOMED/ICNP was conceptually broader
(more general) than the concept found in the source concept list;
-
Narrower than match means that SNOMED/ICNP was conceptually narrower (more specific)
than the concept found in the source concept list;
-
No match means there were no matches in the target terminology (i.e., SNOMED CT, ICNP).
The cardinality of matches produced through the automated mapping tool did not include
comments of match type, and was, therefore, not applicable.
3.6 Evaluation
The candidate lists, from manual mapping and automated mapping, were presented to
two researchers who were not part of the manual or automated mapping phases. The researchers
were tasked to review the extracted concept, and the candidate mapping matches, and
then choose the ‘best' fit mapping and cardinality match(es). The two team members
took note of any concepts or mapping matches which were ambiguous or required further
review. The results of this work were then reviewed by several members of the researcher
group over three meetings, where fit results, alignment, and ambiguity were discussed.
This discussion iterated, refined and verified the final mapping list.
4 Results
A total of 169 concepts were initially extracted from the 41 source SDG indicators
and used as candidates for SNOMED CT mapping. Of these 169 concepts, 31 were given
a domain related to a diagnosis, intervention, or outcome and were therefore also
mapped to ICNP. Through iterative review and group consensus, a total of 119 concepts
with 133 mapping matches were added to the final SNOMED CT mapping list ([Table 2]). Similarly, a total of 26 concepts with 27 mapping matches were added to the final
ICNP mapping list. Of these 133 SNOMED CT mapping matches, 53 (39.8%) were direct
matches, 37 (27.8%) were narrower than matches, 35 (26.3%) were broader than matches,
and 8 (6%) had no matches. Of these 27 ICNP mapping matches, 8 (29.6%) were direct
matches, 4 (14.8%) were narrower than, 7 (25.9%) were broader than, and 8 (29.6%)
were no matches. Of note, some of these concepts were repeated through the extraction
phase of the 41 SDG indicators (e.g., mortality rate) and for the purposes of reporting
SNOMED CT mapping results, these repeats were counted. When no matches and repeated
concepts were removed, a total of 107 unique concepts were represented in SNOMED CT.
Table 2 Final Mapping Results between UN Sustainability Concepts, SNOMED CT, and ICNP.
5 Discussion
Informed by the principle of One Health, whereby health is linked to the ecosystems
with which we live, this study sought to explore the representation of environmental
concepts associated with health impacts, in standardized clinical terminologies. We
chose to extract concepts described in the UN Global Indicator Framework for the Sustainable
Development Goals and Targets and mapped them using a structured process, to SNOMED
CT and ICNP. The multiple methods used added strength to our approach (e.g., manual and automated mapping). As well, our international research team added a
depth of meaningful interpretation through the lenses of varied clinical nursing,
research, and lived experiences, aiding in the progression of concept extraction,
synonym development, mapping selection, and iterative evaluation.
5.1 Mapping Results
Overall, our mapping processes found that the extracted concepts were moderately well
represented in SNOMED CT and ICNP. Of the 133 SNOMED CT mapping matches found, 53
(39.8%) were direct matches, 37 (27.8%) were narrower than matches, 35 (26.3%) were
broader than matches, and 8 (6%) had no matches. Of those concepts which were direct
matches, 14 (10.5%) were of the disorders (diagnosis) ontological axis. In contrast,
only 1 (0.7%) broader than and 4 (3%) narrower than mapping matches were of the disorders
axis. This strong coverage may be related to the generally well-developed diagnosis
content of SNOMED CT [[21], [35], [36]]. Of the 27 ICNP mapping matches, 8 (29.6%) were direct matches, 4 (14.8%) were
narrower than, 7 (25.9%) were broader than, and 8 (29.6%) were no matches. Of these,
4 (15%) of the no matches were related to specific medical diagnoses (e.g., tuberculosis, malaria, hepatitis B, and HIV) and would unlikely be candidates for
specific ICNP inclusion (i.e., this was an anticipated ICNP mapping finding). One of the ICNP matches was equivalent
to that found in SNOMED CT (i.e., substance abuse). This was expected given the work to cross-reference ICNP in SNOMED
CT. However, there were several concepts chosen during the manual mapping process,
which were different. For example, for the source concept “chronic respiratory disease”,
the mapping match in SNOMED CT was “chronic disease of respiratory system (disorder)”,
while the ICNP match was “Impaired Respiratory System Function”. This difference may
relate to the specificity of using the nursing ICNP browser versus the larger cohort
of concepts delivered through the SNOMED CT browser. This difference may also relate
to specific nursing knowledge and interpretation of the concepts extracted from the
source document (i.e., nursing diagnosis versus medical diagnosis). It also demonstrates the variance in
manual mapping when two browsers, with related content, are used. Overall, we found
value to explore both the browsers as a mechanism to crosswalk the extracted concepts
between a large interdisciplinary terminology and a nursing-specific terminology.
It also highlights an opportunity to further explore the UN Global Indicator Framework
for the Sustainable Development Goals and Targets, for the inclusion, comprehensiveness
and interpretation of nursing care delivery.
For concepts relating to cost expenditures, affordability, and community engagement,
gaps were noted and SNOMED CT did not have extensive concept representation. For example,
SDG 3.9.1 evaluates the proportion of the population with large household expenditures
on health as a share of total household expenditure or income. The concepts of large
household expenditure and health expenditure both had no matches in SNOMED CT. These
concepts require further development in SNOMED CT as they relate to broader implications
of sustainability as well as topics associated with social determinants of health.
For example, researchers have reported on the relationship between large out-of-pocket
health expenditures, poverty [[24]] and poor infant and maternal outcomes [[25]
[26]
[27]]. Further, researchers have demonstrated clinicians do document concepts related
to expenditure and health costs, suggesting opportunities exist to use standardized
terminology thoughtfully, as a way to report on and improve patient health and equity
[[28]].
Finally, through the mapping process, we found many concepts had narrower than and
broader than matches. Several of these concepts were related to water, air and sanitation.
For example, the concepts of domestic wastewater and industrial wastewater were mapped
to the broader than SNOMED CT concept: Waste water (substance); household air pollution
and ambient air pollution concepts were mapped to the broader than SNOMED CT concept:
Air pollution (event); sanitation services was mapped to the narrower than SNOMED
CT concept: Access to toileting facilities (observable entity); and using safely managed
sanitation services was mapped to the broader than SNOMED CT concept: Adequate sanitation
(finding). The nuance and specificity of these source concepts are lost as these direct
matches do not exist in SNOMED CT. Point-of-care clinicians, such as homecare and
public healthcare nurses, interact with patients and observe community settings impacted
by these environmental factors (e.g., household wood-burning stoves used for cooking and heating leading to the exasperation
of upper respiratory conditions or having managed wastewater disposal in remote communities
as a means to protect drinking water) [[29]
[30]
[31]]. Therefore, the evaluation, expansion, and inclusion of these concepts in SNOMED
CT are necessary to advance the clinical reporting of these environmental and sustainability
indicators.
5.2 Concept Representation for Artificial Intelligence and Data Reuse
Standardized clinical terminologies have long been used to provide data for population-based
analytics, including trend analysis and public health surveillance, as well as clinical
research [[32]]. In the context of capturing climate change impacts on health, capturing standardized
environmentally-specific clinical data permits the contextualization of vast amounts
of data to predict service demands and population needs during natural disasters and
climate-related events, health promotion and disease prevention and benchmarking of
any subsequent interventions. Researchers have also demonstrated the use of these
terminologies in developing or training models based on methods broadly defined as
artificial intelligence (AI) [[33], [34]]. SNOMED CT, for example, has been used in natural language processing tasks, utilizing
machine learning methodologies to search and analyze free text clinical documentation
entries [[35]]. Standardized clinical terminologies add to these approaches as a way of providing
sharable and comparable data across settings; thus, helping to produce AI models that
are transferable across settings. As environmental concepts encoded in standardized
terminologies expand and ontological relationships associating these to health impacts
develop, using AI methods could provide additional insights into trends in data and
health conditions, as well as for predictive analytics to support better practice,
research, education, and policy-level decision-making. While large-scale use of standardized
clinical terminologies in advanced clinical AI applications has yet to be realized,
its ability to formally represent healthcare language and its ability to be used in
automatic encoding (e.g., natural language processing) suggests its use will be increasingly valuable as these
technologies advance [[36]].
6 Limitations
This study has noted limitations. While the scope of this study phase modeled a component
of a concept by defining a domain type, the explication of relationships amongst the
set of concepts was not completed (e.g., using Unified Modeling Language to develop a conceptual model) [[12]]. Future researchers may wish to further develop these linkages. This study approached
the extraction and mapping of environmental and sustainability health concepts from
a descriptive, small-group, perspective. The intention was to begin exploring the
representation of these types of concepts in two (SNOMED CT and ICNP) international
standardized clinical terminologies. Ongoing work to expand, develop, and test these
representations is recommended to strengthen, add rigour, and expand the conceptual
nuances of this topic in these codified knowledge systems. As well, we limited our
concept extraction to specific concepts associated to health and whose SDG indicator
was linked to the World Health Organization. Work to expand and explore other SDG
indicators for possible inclusion in clinical standardized terminologies is needed.
7 Conclusions
To track the impacts of climate change on human health and to develop climate-resilient
systems able to respond to public health needs during climate-related events such
as heatwaves, droughts, wildfires and floods, it is essential to have standardized
data. By evaluating the representation of environmental concepts associated with health
impacts, in standardized clinical terminologies, we provided a view into current content
coverages as well as opportunities to expand. This study highlighted the availability,
overlap, and gaps found for SDG terms in SNOMED CT and ICNP. However, future work
is needed to more clearly define and model all possible environmental impacts from
a One Health perspective. The availability of codified terminologies is important
as it can enable clinicians, researchers and public health officials to manage vast
amounts of clinical data in a way that can be easily retrieved, analyzed, related
to the context in which the data was obtained, and embedded in AI applications. This
permits policymakers and leaders to model demands on health services as a function
of time and climate change pressures on the environment and prepares communities to
adapt to these pressures whilst supporting the One Health approach.