Department of Mathematics, National Institute of Technology, Raipur, Chhattisgarh, India
Email: [email protected]
Received: 01 Feb 2015 Revised and Accepted: 01 Mar 2016
ABSTRACT
Objective: Diagnosis process of Thalassemia requires several types of medical test, and results of this test together identify the stage of Thalassemia. The objective of this study is to design a Fuzzy Inference System to diagnose the severity of the Thalassemia disease of a patient by using Fuzzy Logic.
Methods: In this paper, a new approach based on fuzzy inference system was presented for prediction of Thalassemia disease in patients. The proposed Fuzzy model combined the expert’s knowledge and the fuzzy logic approach which is then combined in fuzzy rule base to diagnose the presence of the disease. The performances of the system graphically represented by fuzzy inference system tools in MATLAB8.4.
Results: It was found that our program matched the doctor’s diagnosis in 12 cases perfectly. The other 3 were marginally off. This results with an accuracy of about 80 %.
Conclusion: The result suggests that the model provides the most effective way to identify Thalassemia type in patients. The results in this work can be obtained by a simple and inexpensive method. This would generate, in economic terms, significant savings.
Keywords: CBC Test, Fuzzy Logic, Mamdani Fuzzy Inference System, Thalassemia Disease
© 2016 The Authors. Published by Innovare Academic Sciences Pvt Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/)
INTRODUCTION
Anemia is a condition where the number of healthy RBC in the blood is lower than normal. It is due to low RBC’s, destruction of RBC’s or loss of too many RBC’s. If your blood does not have enough RBC’s, your body doesn’t get enough oxygen it needs. As a result, you may feel tired and other symptoms. But sometimes it is very difficult to detect Thalassemia on the basis of symptoms only. In the domain of Thalassemia, there is no such boundary between what is healthy and what is diseased. Having so many factors to detect Thalassemia makes doctor’s work difficult. So, experts require an accurate tool that considering these risk factors and give some certain result in uncertain terms. Some biochemical tests (HGB, HbF, HbA2, RBC, MCV, and MCH) are useful for identifying carriers of the Thalassemia trait [1-3]. In the presence of Thalassemia parameters in the CBC, an accurate and precise quantification of hemoglobin HbA2 is essential for the diagnosis of the Thalassemia trait. When biochemical tests are not exhaustive, it is necessary to study the molecular globin genes [4]. HPLC and electrophoresis are a gold standard for the diagnosis of β-Thalassemia trait, but it is not available at all places.
Thus, several attempts have been made to diagnose the condition by using red cell indices. In the present study Hemoglobin (HGB), Mean Corpuscular Volume (MCV) and Mean corpuscular hemoglobin (MCH) values were able to detect cases of type of Thalassemia. In this study, we focus on a development of the first knowledge representation corresponding to the CBC test results to create a mathematical model for Thalassemia disease diagnosis using FIS. The developed FIS model for Thalassemia disease will be useful and beneficial for many informatics related Thalassemia tasks in the future. The objective of this study is to create a fuzzy inference system to predict the severity involve in Thalassemia disease. Three steps are used to monitor general health and Thalassemia. But we are focusing only on the Tests and Procedures. Three steps are as follows:
The rest of the paper is organized as follows. An application of Fuzzy expert system in different areas represented in the Introduction section. Materials and Methods section provides the description of fuzzy model and structure of a medical expert system on Thalassemia disease. Finally, the result and discussion are concluded in the Results and Discussion section.
Applications of fuzzy logic in different areas
In 1965, Prof. Lotfi Zadeh developed fuzzy set theory that emerges the concept of fuzzy logics [5, 6]. Fuzzy logic consists of probabilistic logic or many-valued logics. Rather than fixed and exact reasoning, it provides approximate reasoning. In the development of medical systems, since the 1980s, fuzzy logic is being extensively used. In the last few decades, the significant development of control system theory can be witnessed by which development of computers and electronic has outcome into many different applications of control system theory [7].
When the studies in the literature related to this classification application are examined, it can be seen that a great variety of methods were used. Among these, [8] Fuzzy System have been used to diagnose the different types of anemia on the basis of symptoms such as Irritability, tachycardia, Memory weakness, Bleeding and Chronic fatigue. Another, [9] diagnose Liver disease using fuzzy logic on the basis of CBC Test, which uses 4 parameters such as WBC, HGB, HCT and PLT. [10] Adeli and Neshat proposed a system to diagnose the heart disease using fuzzy logic. [11] Lavanya et. al. also develop a fuzzy expert system to diagnose the Lung Cancer.
In this paper, a Fuzzy Inference System is designed to diagnose the severity of the Thalassemia disease of a patient by using Fuzzy Logic.
MATERIALS AND METHODS
We describe the designing of the Fuzzy Inference System (FIS) for Thalassemia Disease Diagnosis.
Design a fuzzy logic system for thalassemia disease diagnosis
Problem Specification and Define linguistic Variables: There are 3 input variables and 1 output variable.
Fig. 1: Mamdani fuzzy inference system for thalassemia diagnosis
Ranges for Input/output fields of the system
Linguistic Variables:
Table 1: Linguistic variable for input variables
S. No. |
Input variables |
Linguistic variables |
1. |
Hemoglobin (g/dl) |
HGB |
2. |
Mean Corpuscular Volume (fl) |
MCV |
3. |
Mean corpuscular hemoglobin (pg) |
MCH |
Table 2: Linguistic variables for output variables
S. No. |
Output variable |
Linguistic variables |
1. |
Thalassemia |
Type_of_Thalassemia |
Define Fuzzy Sets:
Membership function for all input variables
For calculating the membership function (MF) we scale the range of 0-100 and based on the severity we calculate the MF and the range are given as follows:
Output variables and value ranges
The output will be a value within the range [0, 10]. The value, 0, means that no Thalassemia problems exist as of yet. We have divided this range into smaller fuzzy sets to make a cluster of the type of Thalassemia disease. ‘Thalassemia_Minor’ is given to those patients whose output value is in between 0 and 3.4 ‘Thalassemia_Intermedia’ is given to the patients who gets a value between 3.5 and 7 also ‘Thalassemia_Major’ is given to the patients who gets a value between 7.1 and 9.9 and so on as shown in the table 5, The basic relationship is that the higher the severity of Thalassemia disease, the lower the output value.
Fig.2: Fuzzy membership for all input and output variables
Table 3: Values for all input linguistic variables
S. No. |
Linguistic variables |
Ranges |
Values |
1. |
HGB |
<7 grams/deciliter |
Very_Low |
7–10 grams/deciliter |
Low |
||
9–12 grams/deciliter |
Medium |
||
2. |
MCV |
<40 fl |
Very_Low |
40.1–60 fl |
Low |
||
60.1–80 fl |
Medium |
||
3. |
MCH |
0–10 pg |
Very_Low |
10–20 pg |
Low |
||
20–30 pg |
Medium |
[HGB = Hemoglobin, MCV = Mean corpuscular volume, MCH = Mean corpuscular hemoglobin]
Table 4: Membership function for all input variables
S. No. |
Linguistic variables |
Membership function |
Values |
1. |
HGB |
|
Very_Low |
Low |
|||
Medium |
|||
2. |
MCV |
Very_Low |
|
Low |
|||
Medium |
|||
3. |
MCH |
Very_Low |
|
Low |
|||
Medium |
Table 5: Values for all output linguistic variables
S. No. |
Linguistic variables |
Ranges |
Values |
1. |
Type_of_Thalassemia |
HGB is 9–12 g/dl |
Thalassmia_Minor [12, 13] |
MCV is<80 fl |
|||
MCH is<27 pg |
|||
2. |
HGB is 7–10 g/dl |
Thalassemia_Intermedia [14] |
|
MCV is 50–80 fl |
|||
MCH is 16–24 pg |
|||
3. |
HGB is<7 g/dl |
Thalassemia_Major [14] |
|
MCV is>50<70 fl |
|||
MCH is>12<20 pg |
[HGB = Hemoglobin, MCV = Mean corpuscular volume, MCH = Mean corpuscular hemoglobin]
Table 6: Classification of output
S. No. |
Linguistic variable |
Ranges |
Fuzzy set |
1. |
Type_of_Thalassemia |
<3.5 |
Thalassmia_Minor |
3.5-7 |
Thalassemia_Intermedia |
||
10> |
Thalassemia_Major |
Define fuzzy rules
In this section fuzzy inference rules generated; relevant inference rules can be determined by experience human operators well, we use IF ELSE conditions for fuzzy inference rules, as we have three input variables. Also, there are 15 rules conveniently are represented in IF-ELSE Form.
First 3 rules are for Symptoms based testing:
1. If (Symptoms is Absent) and (Family_History is Present) and (Disorder is Thalassemia_Minor) then HGB is Medium.
2. If (Symptoms is Present) and (Family_History is Present) and (Disorder is Thalassemia_Intermedia) then HGB is Low.
3. If (Symptoms is Present) and (Family_History is Present) and (Disorder is Thalassemia_Major) then HGB is Very_Low.
Further, 3 rules are in the classification of Thalassemia on the basis of MCV only:
1. If (HGB is Medium) and (MCV is Very_Low) then Thalassemia_Minor.
2. If (HGB is Low) and (MCV is Low) then Thalassemia_Intermidia.
3. If (HGB is Very_Low) and (MCV is Medium) then Thalassemia_Major.
At last 15 rules are for the further classification of Thalassemia on the basis of all three parameters such as HGB, MCV, and MCH:
The AND operator used in the rule infers that the minimum criterion is used in the resultant and aggregated to indicate the condition of Thalassemia: Thalassemia_Minor, Thalassemia_Intermedia, and Thalassemia_Major. Also, the combination of each rule can be written in the following (table 7) form:
Table 7: Illustration of applied rules with respect to membership function
Rule No. |
Antecedent |
Consequence |
||
Linguistic variable1 (HGB) |
Linguistic variable2 (MCV) |
Linguistic variable3 (MCH) |
Result |
|
1 |
Medium |
Very_Low |
Very_Low |
Thalassemia_Minor |
2 |
Medium |
Very_Low |
Low |
Thalassemia_Minor |
3 |
Medium |
Very_Low |
Medium |
Thalassemia_Minor |
4 |
Medium |
Low |
Very_Low |
Thalassemia_Minor |
5 |
Medium |
Low |
Low |
Thalassemia_Minor |
6 |
Medium |
Low |
Medium |
Thalassemia_Minor |
… |
… |
… |
… |
… |
… |
… |
… |
… |
… |
15 |
Very_Low |
Medium |
Low |
Thalassemia_Major |
Fig. 3: Rule viewer for generated rules
Experimental results
For our system, we use the mamdani type FIS. For each rule, a degree is calculated. For aggregation, the maximum is taken from all the degrees. In the defuzzification process, the centroid method is used.
Now from Matlab2014b using this input values we derive the rule viewer for giving input as demonstrated in fig. 3 and surface plots using the surface viewer as demonstrated in fig. 4.
Fuzzy system is used to obtain the severity level which is the only output variable of the system.
The risk determines the level of severity of risk given the inputs.
Table 8: Input values for patient
S. No. |
Input variable |
Value ranges |
Ranges selected |
1. |
HGB |
7.9 g/dl |
7<7.9<10 |
2. |
MCV |
54.1 fl |
40<54.1<60 |
3. |
MCH |
21.9 pg |
20<21.9<30 |
[HGB = Hemoglobin, MCV = Mean corpuscular volume, MCH = Mean corpuscular hemoglobin].
Fig. 4: A 3D graph showing how the variation in MCH and HGB affects the severity of Thalassemia disease
Table 9: Testing of the system
HGB |
MCV |
MCH |
Type_of_thalassemia |
7.9 (Low) |
54.1 (Low) |
21.9 (Medium) |
5.25 (Thalassemia_Intermedia) |
[HGB = Hemoglobin, MCV = Mean corpuscular volume, MCH = Mean corpuscular hemoglobin]
RESULTS AND DISCUSSION
The patient's health risk was found from the given input of a linguistic variable of HGB, MCV, and MCH. Using Mamdani FIS method to construct the membership function for assigned the linguistic variation in the gasification process. Using If. Then rule and inference strategies are chosen for processing the rule base to determine the type of Thalassemia in a patient by logical decision-making analysis. Through the defuzzification, fuzzy system provides an objective process of the risk factor, also to view the surface view of the risk determination using simulation. We tested our fuzzy expert system against the following input variable values.
To test the accuracy of our system we have taken 15 random patients’ data from our dataset of 40 patients and used it to give their diagnoses. The randomness of the patients was important as that would account for a proper sample to test with, statistically. The idea is to send 15 patients’ Data to the doctor who will give us his diagnosis. We implemented a system which ranges from Thalassemia_Minor (1) to Thalassemia_Major (3) and compared our program results with that of a doctor from registered patients from the Thalassemia Welfare Society, Bhilai, Durg (Chhattisgarh, India). It was found that our program matched the doctor’s diagnosis in 12 cases perfectly. The other 3 were marginally off. This results with an accuracy of about 80 %.
CONCLUSION
Because of uncertainty involve in the diagnosis of Thalassemia disease a new method for Thalassemia disease diagnostic problem solving based on fuzzy inference system is constructed in this paper. According to the fuzzy inference system step by step actions is performed. The constructed system in this paper is an efficient attempt to solve the Thalassemia disease problem. The proposed model detects Thalassemia on the basis of Thalassemia both Symptoms and CBC Test. The results of this work can facilitate laboratory work by reducing the time and cost.
ACKNOWLEDGMENT
I wish to express my deep sense of gratitude to Mr. Pramod Puri, General Secretary of Thalassemia Welfare Society, Bhilai (Chhattisgarh, India), for his excellent guidance, the valuable suggestion that greatly helped me to complete the work successfully.
ABBREVIATION
CBC = Complete blood count, FIS = Fuzzy inference system, HGB = Hemoglobin, HbF = Fetal hemoglobin, HbA2 = Hemoglobin A2, HPLC = High-performance liquid chromatography, MCV = Mean corpuscular volume, MCH = Mean corpuscular hemoglobin, RBC = Red blood cell.
CONFLICT OF INTERESTS
Declared none
REFERENCES