Terminologies and ontologies: concrete application examples

Page 2: First example: diagnosis appropriate?

Contents

With the help of ontologies and ECL, a formal definition can now be found and applied. A concrete example: We want to check whether the gender of a patient matches their diagnosis. To do this, we define the diseases that can only occur in male patients(<<*:363698007=<<10052007) and include over 1300 terms with this simple ECL expression. It should be clear that a list will hardly be complete - nor efficient. Try to elicit the complete list or at least a list with more than 100 entries of "male diagnoses" from a current LLM!

An LLM only provides an extract of the diagnoses searched for. Depending on the prompt and the LLM used, a different list is generated.

(Image: Perplexity.ai)

In any ageing population, patient care is characterized by multimorbidity and polypharmacy. Every second elderly patient suffers from more than four chronic diagnoses and takes more than five medications on a permanent basis. Often there are even significantly more medications, as many active ingredients have side effects that compensate for other active ingredients. Patients with arthritis are often prescribed painkillers such as diclofenac, but these can cause stomach problems, particularly heartburn. On the other hand, active ingredients such as omeprazole and pantoprazole are used, which in turn can promote osteoporosis. A chain that can be continued indefinitely.

Explanatory box elements AMTS
Excerpt from the semantic definition of the diagnosis &quot;hepatitis C&quot;. It includes active substances that can be used to treat the diagnosis, but also those that should only be used with caution or may not be used at all.

Excerpt from the semantic definition of the diagnosis "hepatitis C". It includes active substances that can be used to treat the diagnosis, but also those that should only be used with caution or may not be used at all.

(Image: Wingert-Nomenklatur, André Sander)

The specific pharmaceutical knowledge represented in an ontology typically includes indications, contraindications (contraindications), interactions (interactions), cross-allergies and adverse effects (side effects). A very simple example is shown in the figure for the excerpt from the semantic definition of the diagnosis, whereby it should be noted that each of the "outer" boxes has its own semantic description. The end result is a huge network of content-related connections, which is ultimately filtered using the data of a specific patient. This makes it possible to analyze even very complex situations, such as patients with many diagnoses and many medications.

Such automatic analyses are also particularly important for medications that are used in different specialist areas. For example, certain active ingredients used to treat depression interact with those used to treat athlete's foot. In the typical day-to-day treatment of a GP, something like this can easily be overlooked or the doctor does not have all the information. If, in future, all data is stored centrally in an electronic patient file, then there is at least a theoretical possibility of implementing efficient AMTS testing throughout the entire healthcare system.

However, many active ingredients not only have drastic side effects, but also interact with each other. The simultaneous administration of simvastatin (an active ingredient that lowers cholesterol levels) and clarithromycin (used to treat respiratory infections) can lead to the dissolution of the striated muscles (rhabdomyolysis) and consequently to kidney failure. This is a life-threatening situation. Patients may also have intolerances, so the dose must be adapted to the patient's condition and age. The same applies to the duration of use and even the time at which a medication is taken can be decisive. So how can a doctor prescribe a medication safely? What support can he expect?

With ontologies that contain pharmaceutical knowledge and the "CDSS tools", an efficient tool can be implemented that addresses this problem – which is referred to as "drug and therapy safety (AMTS)" in everyday clinical practice.

Complexity is increasing extremely quickly and the knowledge depicted can hardly be grasped by a human being.

(Image: Wingert-Nomenklatur, André Sander)

Terminologies and ontologies can not only generate knowledge, but also transport it. By mapping a textual description to a terminological term, we leave the level of human language and ultimately enter abstract mathematics or algorithms. A term can have many different narrative descriptions and is described in terms of content (semantically) via ontologies. A certain "understanding" of a term can thus be generated at the level of algorithms. The second part of the article series described how terms are derived from human language and the effort involved, but there is also an effort to communicate terms from the outset instead of free text.

Instead of the word "myocardial infarction", for example, the term "I21.9" (ICD-10) or "22298006" (SNOMED CT) is conveyed. With suitable tools, such as terminology servers, these terms can be processed directly: They can be resolved backwards. This means that a display text is generated for a specific language. These can be related to each other and clarify, for example, whether a term belongs to a certain category or not. This is known as semantic interoperability. In a nutshell: This is a form of communication in which the sender and receiver not only understand each other syntactically, but in which the receiver is in principle able to understand the content of the communication.

It is important to distinguish between two levels at which terminologies are used to achieve semantic interoperability. First, there is the meta-level, which describes the data descriptively. And then the data itself must be described. In particular, the content of a table with the term 419076005 (SNOMED CT for allergy) could be described at the meta level.

A system can use this to conclude that, in principle, these are diagnoses and that an allergen must exist. A line in this table can now contain, for example, 91936005 (SNOMED CT for penicillin allergy), which on the one hand defines the allergen (penicillin), but can also be validated by asking a terminology whether a penicillin allergy is an allergy. As I said, on a purely conceptual, mathematical level! However, this also works prospectively: if the term "91936005" is received, terminology servers can be used to check in which table this term must be stored.

Gematik's e-prescription uses terminology very sparingly and only codes the prescribed medication (right) or the active ingredient (left) with the proprietary code systems PZN (pharmaceutical central number) and ASK (pharmaceutical substance catalog). These code systems are partially license-proven and are only used in Germany.

(Image: Simplifier.net)

In Germany, semantic interoperability is seen as a central focus of digitalization in the healthcare sector. This keyword can therefore also be found in numerous laws. Specifications should be defined as "semantically interoperable", which is why terminology such as SNOMED CT can also be found there. The specifications themselves are usually implemented in the HL7 FHIR format.

The vaccination record of the National Association of Statutory Health Insurance Physicians uses standard terminology. Only the pharmaceutical central number is proprietary and unfortunately not consistent in the designation (compare e-prescription from Gematik). Both the vaccine (left) and the diagnosis for which the vaccine is being administered (right) are shown. The SNOMED CT codes are each restricted to the list of eligible codes with a so-called value set and can therefore be checked algorithmically. With this image, it is now possible to evaluate further information about the patient. For example, if the patient has a diagnosis that has been vaccinated against, this is important information for the practitioner. On the other hand, the patient could be allergic to the vaccine.

(Image: Simplifier.net)

Despite all the successes of modern machine learning (ML) approaches, we should not forget tried and tested tools. All technologies have their weaknesses, and the challenge is to bring the respective strengths to the fore by skillfully combining them. Terminologies and ontologies are complex to create and use, but they give us a promising opportunity to further improve ML approaches.

Newer approaches are investigating the possibility of "enriching" data for machine learning in advance with the help of ontologies and even making it comprehensible (keyword "prior knowledge"). It should also be possible to create new ontologies more efficiently with the help of ML models. Rule-based systems are an important backbone for semantic interoperability and ensure the transparent quality that is so important for medical devices - ultimately, we want to be treated by doctors who can rely on their software.

(mma)