Architecture of the ONTOPV system
Up to now, many ontology-based clinical decision support systems for different types of diseases have been developed and achieved convincing results in the clinical practice [13-16]. This valuable work provides us with many inspirations. In this paper we present a prototype towards an expert-system, named ONTOPV, which is based on an ontology of TCM-diagnostic knowledge of psoriasis vulgaris, and which uses a case-database for CBR (Case Based Reasoning). This system is intended to help doctors or other users to diagnose psoriasis related diseases, and to provide clinical decision support if necessary. The prototype of the ONTOPV-system is elaborated to realize at least the following three procedures:
- Diagnosis of PV: The doctor gathers the required information by querying and observing the patient, including age, gender, lifestyle, family history, signs and symptoms. Then, the patient’s main complaint and the acquired information is put into a template, the content of which is transformed as input to the rule basis and diagnosis ontology. The inference engine derives possible diagnoses which is provided to the doctor who receives advice for diagnosis and treatment.
- Development ofintervention plans: according to the recommended diagnostic results, an intervention plan is (semi-) automatically generated for the patient diagnosed. This intervention plan includes some TCM treatments and/or common therapies of Western medicine, like topic- and photo therapy, etc. Further, a personalized care plan on demand will be developed and a solution is recommended for the patients.
- Information Retrieval: with a semantic querying mechanism, users may access the system to ask questions about the Information or required Data that are stored in the Ontology and in the CBR based KB of PV. Then, the answers together with relevant graphic displays will be returned to the user.
The following figure gives an overview of the system. It shows that the diagnostic information collected by the doctor from the patient, such as symptoms, signs, skin lesion area, and other diagnostic information, is input into the knowledge based system as input parameters, compared with the syndrome type, stored in the ontology, and matched against the customized rules in the SWRL rule base. If the given prerequisites are met, one or more inference rules are triggered to obtain the corresponding diagnosis result and the related treatment plan. The input is also compared with the past cases in the case database. If there are matching case records, the cases with higher similarity are extracted as reference case and provided to the user for comparative analysis.
Construction of domain ontology for syndrome differentiation of psoriasis vulgaris
In our work we developed a domain ontology for syndrome differentiation of psoriasis vulgaris. For this purpose, we used the ontology editor Protégé and applied a top-down approach which adopts the framework of general formal ontology (GFO) and its middle-level core ontology GFO-TCM described in [17-20].
1. Basic conceptual hierarchy
For the construction of the basic conceptual hierarchy we used various documents, such as Evidence-Based Clinical Practice Guideline of TCM for psoriasis vulgaris (2013)[21] (hereafter referred to Guideline 2013), the WHO Global Report on Psoriasis(2016)[8], the Guideline for the diagnosis and treatment of psoriasis in China (2018 simplified edition)[22], etc.
Furthermore, we used TCM-related documents about psoriasis vulgaris which provide information about the classification of psoriasis, symptoms (primary symptoms, secondary symptoms and combined symptoms), syndrome types, diagnosis, treatment methods, prescriptions, drug ingredients, drug compatibility, etc. Some basic semantic types have been obtained from the guidelines and other related literature, such as disease name; basic syndrome types (证型): blood heat syndrome, blood dryness syndrome, blood stasis syndrome; simultaneous syndromes; basic symptoms including main symptoms, minor symptoms; methods of treatment; recommended formulas and drugs; pharmaceutical ingredients and so on. The basic semantic types of psoriasis vulgaris are listed below:
-Health Problems: The psoriasis-related health problems are mainly divided into 3 categories, namely, disease, symptom, (incl. main symptoms, minor symptoms, accompanied symptoms), and syndrome or pattern;
-Formula: A formula or prescription usually contains some medicinal components, used for treating an illness, such as psoriasis vulgaris.
-Drug/Medicinal: Usually referring to those medicinal substances recorded in Chinese materia medica but also to some medicaments of western medicine.
-TCM Diagnose for syndrome differentiation: According to the Guideline (2013), each type of syndrome of psoriasis vulgaris has a set of corresponding symptoms, which can provide a guideline for the diagnosis of TCM syndrome differentiation.
-Method of treatment: The method of treatment is composed of the conventional therapies in TCM and western medicine, such as: TCM prescriptions and drugs, medicated bath, biological therapy, topical therapy, systemic therapy, phototherapy, etc.
-Participant/Person: Such as doctors, patients, etc.
-Compatibility and Caution: Prescribe the medicinal composition and dosage of prescriptions, special instructions and contraindications between certain medicinal or chemical ingredients.
2. Instantiation
According to the instructions in the Guideline (2013), the corresponding instantiated entities are established for the basic semantic types from the classification in the ontology, such as instances for major symptoms, minor symptoms, combined symptoms, prescriptions, drugs, pharmaceutical ingredients, treatment methods, etc.
3. Object- and datatype properties
Finally, the relationship between different entities in the TCM diagnosis ontology for psoriasis vulgaris needs to be clarified. About 20 common binary object properties that relate individuals to individuals have been specified. These include: annotate (a disease can be annotated by some syndromes): according to the guideline 2013, PV can be annotated by several zheng/syndromes, including some basic zhengs and combined zhengs, due to its different onset process or observed symptoms. Further object properties are: hasComponent (a formula has certain drug components); hasAdverseReaction (some drug components may cause adverse reaction with each other), diagnosed (a patients could be diagnosed with psoriasis vulgaris); hasPattern (for example, the patient's syndrome type is blood-heat type); hasPrescription (some prescriptions can be proposed according to some kind of certain syndrome); recommended (some treatment methods could be recommended for a given diagnosis); hasSymptom (a syndrome type may have some symptoms). Through these built-in object properties, individuals which belong to different concepts in the ontology are correlated to each other, and stored in the OWL ontology in the form of RDF triple as object property assertions. Then, some basic datatype properties that relate individuals to literal data are built as well, such as hasBSA that means Body Surface Area, i.e. the size of body surface occupied by psoriasis; hasTherapyPeriode; hasDosage; etc.
Construction of the rules base
The rules of diagnosis-based syndrome differentiation and treatment must take into account specific regulations for diagnosis and medication in the evidence-based guidelines of TCM for psoriasis vulgaris[21]. For example, a corresponding treatment mechanism is established for the patients’ observed symptoms as defined by the guidelines to ensure compliance with the standard operation required by the guidelines. In the ONTOPV related TCM diagnosis ontology, the reasoning rules are constructed for diagnosing certain syndrome types in terms of corresponding sets of symptoms. And there are further rules which are used to check the compatibility of drug components and allows to query whether some given components may cause side effects due to drug interactions. Using the rule base, a reasoning process can be performed the results of which provide diagnoses and /or treatment suggestions according to the patient’s symptoms. The followings are examples and explanations of some rules in the SWRL rules case.
1. Rules for the assessment of PV
Patient(?a) ∧ diagnosed(?a, psoriasis_vulgaris) ∧ hasBSA(?a, ?c) ∧ swrlb:lessThanOrEqual(?c, 0.03) -> hasAssessmentOfSeverity(?a, "mild")
A patient (a) with PV who has a skin lesions less or equal than 3% of the Body Surface Area (BSA) should be evaluated as “mild” in terms of degree of severity.
Patient(?a) ∧ diagnosed(?a, pv) ∧ hasBSA(?a, ?c) ∧ swrlb:greaterThan(?c, 0.03) ∧ swrlb:lessThan(?a, 0.1) -> hasAssessmentOfSeverity(?a, "moderate")
A patient (a) with PV who has a skin lesions greater than 3% of BSA and less than or equal to 10% of BSA should be evaluated as “moderate” in terms of degree of severity.
Patient(?a) ∧ diagnosed(?a, pv) ∧ hasBSA(?a, ?c) ∧ swrlb:greaterThanOrEqual(?c, 0.1) -> hasAssessmentOfSeverity(?a, "severe")
A patient (a) with PV who has a skin lesions greater than 10% of BSA should be evaluated as “severe” in terms of degree of severity.
2. Rules for syndrome differentiation of PV
Patient(?a) ∧ diagnosed(?a, pv) ∧ hasSymptom(?a, bright_red_lesions) ∧ hasSymptom(?a, continusously_increased_rashs) ∧ hasSymptom(?a, yellow_urine) -> denote(pv, syndrome_of_heat_in_blood)
A patient (a) with psoriasis vulgaris who has main symptoms of red lesions and continuously increased rash and has a minor symptom of yellow urine, the pattern would be assigned to a "syndrome of blood heat" (血热证)
Patient(?a) ∧ diagnosed(?a, pv) ∧ hasSymptom(?a, pink_lesions) ∧ hasSymptom(?a, dry_scales) ∧ hasSymptom(?a, dry_mouth_and_throat) -> denote(pv, syndrome_of_blood_dryness)
A patient (a) with psoriasis vulgaris who has main symptoms of red lesions and continuously increased rash and has a minor symptom of dry mouth and throat, then the pattern would be assigned to a "syndrome of blood dryness(血燥证)".
3. Rules for prescription:
Patient(?a) ∧ diagnosed(?a, pv) ∧ Pattern(?b) ∧ denote(pv,syndrome_of_heat_in_blood)-> hasPrescription(?b, decoction_of_cooling_blood_and_toxin_resolving)
A patient (a) with psoriasis vulgaris who has been ascribed to has a syndrome of heat in blood would be prescribed a recipe named " blood-cooling and toxin-resolving decoction (凉血解毒汤)".
Patient(?a) ∧ diagnosed(?a, pv) ∧ Pattern(?b) ∧ denote(?b,syndrome_of_blood_dryness) -> hasPrescription(?b, decoction_of_nourishing_blood_and_toxin_resolving)
A patient (a) with psoriasis vulgaris who has been ascribed to a syndrome of blood stasis would be prescribed a recipe named " blood-cooling and toxin-resolving decoction (养血解毒汤)".
4. Rules for treatment of psoriasis:
Patient(?a) ∧ diagnosed(?a, ?b) ∧ Topical_Therapy(?c) ∧ hasAssessmentOfSeverity(?a, "Mild") -> recommend(?b, ?c)
A patient with psoriasis who has been evaluated as “Mild” in terms of severity of skin lesions should be recommended to receive some topical therapies.
Patient(?a) ∧ diagnosed(?a, ?b) ∧ Biologics(?c) ∧ hasAssessmentOfSeverity(?a, "Severe") ∧ hasDLQI(?a, ?d) ∧ swrlb:greaterThan(?d, 20) -> recommend(?b, ?c)
A patient with psoriasis who has been evaluated as “Severe” in terms of the severity of skin lesions, and has a Dermatology Life Quality Index (DLQI), which is greater than 20, should be recommended to receive some biologics.
5. Rules for drugs adverse reaction
drugs(?a) ∧ drugs(?b) ∧ hasComponent(?a, danshen_root) ∧hasComponent(?b,aluminum_hydroxide)-> hasAdverseReaction(?a, ?b)
Two drugs containing danshen_root(丹参) and aluminum hydroxide(氢氧化铝) may cause adverse reactions.
drugs(?a) ∧ drugs(?b) ∧ hasComponent(?a, Sichuan_fritillary_bulb) ∧ hasComponent(?b, prepared_common_monkshood_branched_root)-> hasAdverseReaction(?a, ?b)
Two drugs containing Sichuan_fritillary_bulb (川贝) and prepared_common_monkshood_branched_root(附子) may cause adverse reactions.
Case-based reasoning using fuzzy logic
As mentioned above, a diagnosis of TCM is usually based on the comprehension of manifestation of pulse, tongue coating, complexion, physical constitution and other signs and symptoms, some of which are certain while more of them are fuzzy, ambiguous, nonlinear and uncertain. Hence, in the process of diagnosis, TCM doctors often more rely on their own knowledge and experience rather than logical reasoning to find solutions. Therefore, the difficulty of establishing a TCM expert system lies in the uncertainty of knowledge representation, data acquisition and logic reasoning. The complexity and fuzziness of knowledge faces new challenges and requirements to knowledge representation and reasoning mechanism.
In the case of psoriasis vulgaris, for any possible diagnosis and treatment in view of syndrome differentiation, TCM doctors must firstly identify the patient's symptoms, in order to obtain the corresponding diagnosis results. This could be considered as a kind of pattern recognition. According to M. Friedman and A. Kandel[23], a pattern is an abstract object which is inspected for the recognition process. And we usually regard a pattern as a schematic description of an object which we want to recognize. In most cases, a pattern recognition problem is a problem of discrimination between different populations, namely, a kind of cognitive classification of the object being studied. This classification is performed on the premise of known patterns, that is, the population can be divided into several types, as a set of standard patterns, to identify a specific object to which pattern it belongs, say, which type.
We took an example from the evidence-based clinical guidelines for psoriasis vulgaris in TCM (2013), according to the diagnostic criterion of blood-heat syndrome, which may have two main symptoms of bright red lesions, continuously increased rashes, and four minor symptoms including irritability and upset, yellow urine, red tongue, rapid pulse. As long as the patient has all major symptoms and at least one minor symptom, the syndrome can be determined. The same principle could be applied to the identification of blood dryness syndrome, except that the latter symptoms can be expressed as major symptoms of pink lesions, dry scales and minor symptoms of dry mouth, pale tongue, thready pulse. If the patient is observed with some symptoms of blood heat syndrome, combination with some symptoms of blood dryness syndrome, the question is how to identify this patient. This case can be observed quite often in practice, because the onset of this disease is a gradual process according to the theory in TCM. In the early stage of this disease, it is mostly manifested as blood heat syndrome, and mostly blood dryness syndrome in the middle stage, but mostly blood stasis syndrome in the late stage. It can also be manifested as other types of syndrome simultaneously. In TCM sometimes decisions cannot be easily made on the basis of two-valued logic.[24] Hence, the problem about pattern recognition in a process of syndrome differentiation can be considered as a more-or-less type rather than yes-or-no type which pertains to the topic of fuzzy pattern recognition.
In this paper, inspired by Zadeh's fuzzy set theory [25-27], a mechanism of information retrieval is proposed, which integrates ontology and CBR using fuzzy pattern recognition This integration provides automatic reasoning and semantic retrieval in line with the norms of TCM evidence based clinical guidance for psoriasis vulgaris, and additionally also simulates the thinking mode of experts based on fuzzy logic. Meanwhile, a solution is carried out to find a similar case from the case base using the proposed method of lattice degree of nearness[28][23].. In the ONTOPV prototype, the user can input data, collected from patients as input parameters into the system. For example: main symptoms, minor symptoms, personal information (age, gender, past history, family history, etc.), chief complaint (which means subjective description of the patient), etc.
Then the system processes these data with searching rules defined in the SWRL rules base. If the given preconditions are met, one or more rules are triggered, and subsequently the corresponding standard diagnosis results and treatment scheme will be obtained. At the same time, it can also be compared with the past cases stored in the case database. If there is a matching case record, the case with higher similarity will be extracted as the reference case and provided to the user for comparative analysis. The workflow of case retrieval in ONTOPV is shown in Figure 2.
Firstly, the cases of patients can be considered as a pattern space A = (a1, a2 ... am), where ai ,1≤i≤m are vectors in Rn ,i.e. each pattern is characterized by n features. For example, each case can be characterized by gender, age, patient's chief complaint, symptoms, Western medicine diagnosis, TCM diagnosis, syndrome type, treatment method, recommended medication, and quantitative score of each syndrome type based on the comprehensive score according to the membership function defined.
Suppose the universe of discourse U = {several patient cases}, the diagnostic characteristics of the patient-cases include several symptoms, which need to be diagnosed and classified according to the color of the lesion, the nature of the lesion, tongue- and pulse manifestation, urination, complexion, and chief complaint. Thus, we can get the pattern space of syndrome S= {blood heat syndrome, blood stasis syndrome, blood dryness syndrome, combined syndrome}, and the combined syndromes can be further divided into wind, dampness, and heat poisoning. Among them, the pattern of blood heat-, blood stasis-, blood dryness-, and combined syndrome can be considered as fuzzy set. To determine which type syndrome the patient presents with, a TCM doctor must firstly determine which features, say symptoms go to this classification. This problem can be measured by using the degree of membership and of similarity between two fuzzy sets A and B, which is discussed below.
Using the principle of maximum membership in fuzzy set, a simple membership function can be firstly established to determine which basic syndrome type a patient has. According to the guideline, a basic syndrome type can be defined. If a patient exhibits two major symptoms and at least one minor symptoms, then the diagnosis is true, which means the patient’s symptoms have full membership degree with this syndrome type. Accordingly, a simple membership function can be established.
We have assigned a weight value to each symptom, including the main- and the minor symptom, For example, there are altogether 2 main symptoms and 4 minor symptoms in the blood-heat syndrome. Then a weight value of 0.3 is assigned to each main symptom and 0.1 to each minor symptom. Similarly, we assign a weight value to each symptom in every syndrome type, so that we can establish a simple membership function to judge a given case of a patient, membership degree of this patient’s syndrome type. Then we only need to add the weights of the patient's symptoms to estimate the degree of membership in the class of a syndrome type he may have.
Suppose we have a fuzzy set of symptoms and syndrome of a patient in the case database P= {blood fever, bright red lesions, dry scales, slow pulses, pale tongue, yellow urine, heavy head} corresponding to each basic syndrome of blood heat, blood dryness, blood stasis due to the defined membership function above, a fuzzy vector of the patient p = (1, 0.5, 0.3) which may be interpreted that the patient's symptom set has respective grade of membership: 1(true) for blood heat syndrome of and 0.5 (possible) for blood dry syndrome type of and 0.3 (less possible)for combined syndrome wet.
Let us assume there are two reference cases A and B in the case base, and both correspond to the above-mentioned three syndrome types in terms of a grade of membership:
Given as fuzzy vector a = (0.6, 0.4, 0.2), with the symptom set {Qi deficiency, bright red lesions, upset, red tongue, string pulse, dry scales, dry mouth, dizziness}
And fuzzy vector b = (0.7, 0.1, 0.1) with the symptom set {blood heat accumulation, white scales, red skin lesions, thin and white tongue fur, dry mouth, slippery pulse}
Using the method of lattice degree of nearness proposed by P.Z.Wang[25], the similarity between two fuzzy sets can be defined by the following expression: