DESIGN OF A FUZZY MODEL FOR THALASSEMIA DISEASE DIAGNOSIS: USING MAMDANI TYPE FUZZY INFERENCE SYSTEM (FIS)

SAPNA THAKUR*, SHARADA NANDAN RAW, RAVINDRA SHARMA

Department of Mathematics, National Institute of Technology, Raipur, Chhattisgarh, India
Email: sapnarajput85@gmail.com

Received: 01 Feb 2015 Revised and Accepted: 01 Mar 2016


ABSTRACT

Objective: Diagnosis process of Thalassemia requires several types of medical test, and results of this test together identify the stage of Thalassemia. The objective of this study is to design a Fuzzy Inference System to diagnose the severity of the Thalassemia disease of a patient by using Fuzzy Logic.

Methods: In this paper, a new approach based on fuzzy inference system was presented for prediction of Thalassemia disease in patients. The proposed Fuzzy model combined the expert’s knowledge and the fuzzy logic approach which is then combined in fuzzy rule base to diagnose the presence of the disease. The performances of the system graphically represented by fuzzy inference system tools in MATLAB8.4.

Results: It was found that our program matched the doctor’s diagnosis in 12 cases perfectly. The other 3 were marginally off. This results with an accuracy of about 80 %.

Conclusion: The result suggests that the model provides the most effective way to identify Thalassemia type in patients. The results in this work can be obtained by a simple and inexpensive method. This would generate, in economic terms, significant savings.

Keywords: CBC Test, Fuzzy Logic, Mamdani Fuzzy Inference System, Thalassemia Disease


INTRODUCTION

Anemia is a condition where the number of healthy RBC in the blood is lower than normal. It is due to low RBC’s, destruction of RBC’s or loss of too many RBC’s. If your blood does not have enough RBC’s, your body doesn’t get enough oxygen it needs. As a result, you may feel tired and other symptoms. But sometimes it is very difficult to detect Thalassemia on the basis of symptoms only. In the domain of Thalassemia, there is no such boundary between what is healthy and what is diseased. Having so many factors to detect Thalassemia makes doctor’s work difficult. So, experts require an accurate tool that considering these risk factors and give some certain result in uncertain terms. Some biochemical tests (HGB, HbF, HbA2, RBC, MCV, and MCH) are useful for identifying carriers of the Thalassemia trait [1-3]. In the presence of Thalassemia parameters in the CBC, an accurate and precise quantification of hemoglobin HbA2 is essential for the diagnosis of the Thalassemia trait. When biochemical tests are not exhaustive, it is necessary to study the molecular globin genes [4]. HPLC and electrophoresis are a gold standard for the diagnosis of β-Thalassemia trait, but it is not available at all places.

Thus, several attempts have been made to diagnose the condition by using red cell indices. In the present study Hemoglobin (HGB), Mean Corpuscular Volume (MCV) and Mean corpuscular hemoglobin (MCH) values were able to detect cases of type of Thalassemia. In this study, we focus on a development of the first knowledge representation corresponding to the CBC test results to create a mathematical model for Thalassemia disease diagnosis using FIS. The developed FIS model for Thalassemia disease will be useful and beneficial for many informatics related Thalassemia tasks in the future. The objective of this study is to create a fuzzy inference system to predict the severity involve in Thalassemia disease. Three steps are used to monitor general health and Thalassemia. But we are focusing only on the Tests and Procedures. Three steps are as follows:

The rest of the paper is organized as follows. An application of Fuzzy expert system in different areas represented in the Introduction section. Materials and Methods section provides the description of fuzzy model and structure of a medical expert system on Thalassemia disease. Finally, the result and discussion are concluded in the Results and Discussion section.

Applications of fuzzy logic in different areas

In 1965, Prof. Lotfi Zadeh developed fuzzy set theory that emerges the concept of fuzzy logics [5, 6]. Fuzzy logic consists of probabilistic logic or many-valued logics. Rather than fixed and exact reasoning, it provides approximate reasoning. In the development of medical systems, since the 1980s, fuzzy logic is being extensively used. In the last few decades, the significant development of control system theory can be witnessed by which development of computers and electronic has outcome into many different applications of control system theory [7].

When the studies in the literature related to this classification application are examined, it can be seen that a great variety of methods were used. Among these, [8] Fuzzy System have been used to diagnose the different types of anemia on the basis of symptoms such as Irritability, tachycardia, Memory weakness, Bleeding and Chronic fatigue. Another, [9] diagnose Liver disease using fuzzy logic on the basis of CBC Test, which uses 4 parameters such as WBC, HGB, HCT and PLT. [10] Adeli and Neshat proposed a system to diagnose the heart disease using fuzzy logic. [11] Lavanya et. al. also develop a fuzzy expert system to diagnose the Lung Cancer.

In this paper, a Fuzzy Inference System is designed to diagnose the severity of the Thalassemia disease of a patient by using Fuzzy Logic.

MATERIALS AND METHODS

We describe the designing of the Fuzzy Inference System (FIS) for Thalassemia Disease Diagnosis.

Design a fuzzy logic system for thalassemia disease diagnosis

Problem Specification and Define linguistic Variables: There are 3 input variables and 1 output variable.



Fig. 1: Mamdani fuzzy inference system for thalassemia diagnosis

Ranges for Input/output fields of the system

Linguistic Variables:


Table 1: Linguistic variable for input variables

S. No.

Input variables

Linguistic variables

1.

Hemoglobin (g/dl)

HGB

2.

Mean Corpuscular Volume (fl)

MCV

3.

Mean corpuscular hemoglobin (pg)

MCH


Table 2: Linguistic variables for output variables

S. No.

Output variable

Linguistic variables

1.

Thalassemia

Type_of_Thalassemia


Define Fuzzy Sets:

Membership function for all input variables

For calculating the membership function (MF) we scale the range of 0-100 and based on the severity we calculate the MF and the range are given as follows:

Output variables and value ranges

The output will be a value within the range [0, 10]. The value, 0, means that no Thalassemia problems exist as of yet. We have divided this range into smaller fuzzy sets to make a cluster of the type of Thalassemia disease. ‘Thalassemia_Minor’ is given to those patients whose output value is in between 0 and 3.4 ‘Thalassemia_Intermedia’ is given to the patients who gets a value between 3.5 and 7 also ‘Thalassemia_Major’ is given to the patients who gets a value between 7.1 and 9.9 and so on as shown in the table 5, The basic relationship is that the higher the severity of Thalassemia disease, the lower the output value.





Fig.2: Fuzzy membership for all input and output variables


Table 3: Values for all input linguistic variables

S. No.

Linguistic variables

Ranges

Values

1.

HGB

<7

grams/deciliter

Very_Low

7–10

grams/deciliter

Low

9–12

grams/deciliter

Medium

2.

MCV

<40 fl

Very_Low

40.1–60 fl

Low

60.1–80 fl

Medium

3.

MCH

0–10 pg

Very_Low

10–20 pg

Low

20–30 pg

Medium

[HGB = Hemoglobin, MCV = Mean corpuscular volume, MCH = Mean corpuscular hemoglobin]


Table 4: Membership function for all input variables

S. No.

Linguistic variables

Membership function

Values

1.

HGB



Very_Low



Low



Medium

2.

MCV



Very_Low



Low



Medium

3.

MCH



Very_Low



Low



Medium


Table 5: Values for all output linguistic variables

S. No.

Linguistic variables

Ranges

Values

1.

Type_of_Thalassemia

HGB is 9–12 g/dl

Thalassmia_Minor [12, 13]

MCV is<80 fl

MCH is<27 pg

2.

HGB is 7–10 g/dl

Thalassemia_Intermedia [14]

MCV is 50–80 fl

MCH is 16–24 pg

3.

HGB is<7 g/dl

Thalassemia_Major [14]

MCV is>50<70 fl

MCH is>12<20 pg

[HGB = Hemoglobin, MCV = Mean corpuscular volume, MCH = Mean corpuscular hemoglobin]


Table 6: Classification of output

S. No.

Linguistic variable

Ranges

Fuzzy set

1.

Type_of_Thalassemia

<3.5

Thalassmia_Minor

3.5-7

Thalassemia_Intermedia

10>

Thalassemia_Major


Define fuzzy rules

In this section fuzzy inference rules generated; relevant inference rules can be determined by experience human operators well, we use IF ELSE conditions for fuzzy inference rules, as we have three input variables. Also, there are 15 rules conveniently are represented in IF-ELSE Form.

First 3 rules are for Symptoms based testing:

1. If (Symptoms is Absent) and (Family_History is Present) and (Disorder is Thalassemia_Minor) then HGB is Medium.

2. If (Symptoms is Present) and (Family_History is Present) and (Disorder is Thalassemia_Intermedia) then HGB is Low.

3. If (Symptoms is Present) and (Family_History is Present) and (Disorder is Thalassemia_Major) then HGB is Very_Low.

Further, 3 rules are in the classification of Thalassemia on the basis of MCV only:

1. If (HGB is Medium) and (MCV is Very_Low) then Thalassemia_Minor.

2. If (HGB is Low) and (MCV is Low) then Thalassemia_Intermidia.

3. If (HGB is Very_Low) and (MCV is Medium) then Thalassemia_Major.

At last 15 rules are for the further classification of Thalassemia on the basis of all three parameters such as HGB, MCV, and MCH:

  1. If (HGB is Medium) and (MCV is Very_Low) and (MCH is Very_Low) then Type_of_Thalassemia is Thalassemia_Minor.
  2. If (HGB is Medium) and (MCV is Very_Low) and (MCH is Low) then Type_of_Thalassemia is Thalassemia_Minor.
  3. If (HGB is Medium) and (MCV is Very_Low) and (MCH is Medium) then Type_of_Thalassemia is Thalassemia_Minor.
  4. If (HGB is Medium) and (MCV is Low) and (MCH is Very_Low) then Type_of_Thalassemia is Thalassemia_Minor.
  5. If (HGB is Medium) and (MCV is Low) and (MCH is Low) then Type_of_Thalassemia is Thalassemia_Minor.
  6. If (HGB is Medium) and (MCV is Low) and (MCH is Medium) then Type_of_Thalassemia is Thalassemia_Minor.
  7. If (HGB is Medium) and (MCV is Medium) and (MCH is Very_Low) then Type_of_Thalassemia is Thalassemia_Minor.
  8. If (HGB is Medium) and (MCV is Medium) and (MCH is Low) then Type_of_Thalassemia is Thalassemia_Minor.
  9. If (HGB is Medium) and (MCV is Medium) and (MCH is Medium) then Type_of_Thalassemia is Thalassemia_Minor.
  10. If (HGB is Low) and (MCV is Low) and (MCH is Low) then Type_of_Thalassemia is Thalassemia_Intermedia.
  11. If (HGB is Low) and (MCV is Low) and (MCH is Medium) then Type_of_Thalassemia is Thalassemia_Intermedia.
  12. If (HGB is Low) and (MCV is Medium) and (MCH is Low) then Type_of_Thalassemia is Thalassemia_Intermedia.
  13. If (HGB is Low) and (MCV is Medium) and (MCH is Medium) then Type_of_Thalassemia is Thalassemia_Intermedia.
  14. If (HGB is Very_Low) and (MCV is Low) and (MCH is Low) then Type_of_Thalassemia is Thalassemia_Major.
  15. If (HGB is Very_Low) and (MCV is Medium) and (MCH is Low) then Type_of_Thalassemia is Thalassemia_Major.

The AND operator used in the rule infers that the minimum criterion is used in the resultant and aggregated to indicate the condition of Thalassemia: Thalassemia_Minor, Thalassemia_Intermedia, and Thalassemia_Major. Also, the combination of each rule can be written in the following (table 7) form:


Table 7: Illustration of applied rules with respect to membership function

Rule No.

Antecedent

Consequence

Linguistic

variable1 (HGB)

Linguistic

variable2 (MCV)

Linguistic

variable3 (MCH)

Result

1

Medium

Very_Low

Very_Low

Thalassemia_Minor

2

Medium

Very_Low

Low

Thalassemia_Minor

3

Medium

Very_Low

Medium

Thalassemia_Minor

4

Medium

Low

Very_Low

Thalassemia_Minor

5

Medium

Low

Low

Thalassemia_Minor

6

Medium

Low

Medium

Thalassemia_Minor

15

Very_Low

Medium

Low

Thalassemia_Major



Fig. 3: Rule viewer for generated rules

Experimental results

For our system, we use the mamdani type FIS. For each rule, a degree is calculated. For aggregation, the maximum is taken from all the degrees. In the defuzzification process, the centroid method is used.

Now from Matlab2014b using this input values we derive the rule viewer for giving input as demonstrated in fig. 3 and surface plots using the surface viewer as demonstrated in fig. 4.

Fuzzy system is used to obtain the severity level which is the only output variable of the system.

The risk determines the level of severity of risk given the inputs.


Table 8: Input values for patient

S. No.

Input variable

Value ranges

Ranges selected

1.

HGB

7.9 g/dl

7<7.9<10

2.

MCV

54.1 fl

40<54.1<60

3.

MCH

21.9 pg

20<21.9<30

[HGB = Hemoglobin, MCV = Mean corpuscular volume, MCH = Mean corpuscular hemoglobin].



Fig. 4: A 3D graph showing how the variation in MCH and HGB affects the severity of Thalassemia disease


Table 9: Testing of the system

HGB

MCV

MCH

Type_of_thalassemia

7.9 (Low)

54.1 (Low)

21.9 (Medium)

5.25 (Thalassemia_Intermedia)

[HGB = Hemoglobin, MCV = Mean corpuscular volume, MCH = Mean corpuscular hemoglobin]


RESULTS AND DISCUSSION

The patient's health risk was found from the given input of a linguistic variable of HGB, MCV, and MCH. Using Mamdani FIS method to construct the membership function for assigned the linguistic variation in the gasification process. Using If. Then rule and inference strategies are chosen for processing the rule base to determine the type of Thalassemia in a patient by logical decision-making analysis. Through the defuzzification, fuzzy system provides an objective process of the risk factor, also to view the surface view of the risk determination using simulation. We tested our fuzzy expert system against the following input variable values.

To test the accuracy of our system we have taken 15 random patients’ data from our dataset of 40 patients and used it to give their diagnoses. The randomness of the patients was important as that would account for a proper sample to test with, statistically. The idea is to send 15 patients’ Data to the doctor who will give us his diagnosis. We implemented a system which ranges from Thalassemia_Minor (1) to Thalassemia_Major (3) and compared our program results with that of a doctor from registered patients from the Thalassemia Welfare Society, Bhilai, Durg (Chhattisgarh, India). It was found that our program matched the doctor’s diagnosis in 12 cases perfectly. The other 3 were marginally off. This results with an accuracy of about 80 %.

CONCLUSION

Because of uncertainty involve in the diagnosis of Thalassemia disease a new method for Thalassemia disease diagnostic problem solving based on fuzzy inference system is constructed in this paper. According to the fuzzy inference system step by step actions is performed. The constructed system in this paper is an efficient attempt to solve the Thalassemia disease problem. The proposed model detects Thalassemia on the basis of Thalassemia both Symptoms and CBC Test. The results of this work can facilitate laboratory work by reducing the time and cost.

ACKNOWLEDGMENT

I wish to express my deep sense of gratitude to Mr. Pramod Puri, General Secretary of Thalassemia Welfare Society, Bhilai (Chhattisgarh, India), for his excellent guidance, the valuable suggestion that greatly helped me to complete the work successfully.

ABBREVIATION

CBC = Complete blood count, FIS = Fuzzy inference system, HGB = Hemoglobin, HbF = Fetal hemoglobin, HbA2 = Hemoglobin A2, HPLC = High-performance liquid chromatography, MCV = Mean corpuscular volume, MCH = Mean corpuscular hemoglobin, RBC = Red blood cell.

CONFLICT OF INTERESTS

Declared none

REFERENCES

  1. Mosca A, Paleari R, Ivaldi G, Galanello R, Giordano PC. The role of hemoglobin A2 testing in the diagnosis of thalassemias and related hemoglobinopathies. J Clin Pathol 2009;62:13-7.
  2. Yousafzai YM, Khan S, Raziq F. Beta thalassaemia trait: hematological parameters.J Ayub Med Coll Abbottabad 2010;22:84-6.
  3. Dell'Edera D, Pacella E, Epifania AA, Benedetto M, Tinelli A, et al. Importance of molecular biology in the characterization of β-thalassemia carriers. Eur Rev Med Pharmacol Sci 2011;15:79-86.
  4. Edera DD, Benedetto M, Leo M, Santacesaria C, Arianna Allegretti A, Lupo MG, et al. Identification of patients with defects in the globin genes by analyzing blood parameters and genetic study: report of five cases. J Hematol Malignancies 2013;3:29-36.
  5. Zadeh LA, Fuzzy Sets. This week’s citation classic. Information Control 1965;8:338-53.
  6. Zadeh LA. Outline of a new approach to the analysis of complex systems and decision processes. IEEE Trans Syst Man Cybern 1973;3:28-44.
  7. Mahfouf M, Abbod MF, Linkens DA. A survey of fuzzy logic monitoring and control utilization in medicine. Artificial Intelligence Med 2001;21:27-42.
  8. Aramideh J, Jelodar H. Application of fuzzy logic for the presentation of an fuzzy expert system to diagnose Anemia. Indian J Sci Technol 2014;7:933-8.
  9. Hashmia A, Khan MS. Diagnosis blood test for liver disease using fuzzy logic. Int J Sci: Basic Appl Res 2015;20:151-83.
  10. Adeli A, Neshat M. A fuzzy expert system for heart disease diagnosis. proceedings of the international multi-conference of engineers and computer scientists 2010;1:134-9.
  11. Lavanya K, Saleem DMA, Sriman NA. Fuzzy rule-based inference system for detection and diagnosis of lung cancer. Int J Latest Trends Computing 2011;165:2045-5364.
  12. Grow K, Vashist M, Abrol P, Sharma S, Yadav R. Beta thalassemia in India: current status and the challenges ahead. Int J Pharm Pharm Sci 2014;6:28-33.
  13. Anunchai A. Chalorthamh N, Ruangrajitpakornt T, Limwongseh C, Supnithit T, Tongsimai S. A development of knowledge representation for thalassemia prevention and control program.
    Natural Language Processing and Knowledge Engineering (NLP-KE), 2011. 7th International Conference IEEE; 2011. p. 190-3.
  14. Galanello R, Melis MA, Ruggeri R, Addis M, Scalas MT, Maccioni L, et al. Beta °thalassemia trait in Sardinia. Hemoglobin 1979;3:33–46.