Disease Prediction Using Machine Learning In Python Using GUI

Post Views: 5,066

Hi, guys Today We will do a project which will predict the disease by taking symptoms from the user.

Let us start the project, we will learn about the three different algorithms in machine learning.

The first algorithm is a Decision Tree, second is a Random Forest and the last one is Naive Bayes.

We are going to import Pandas for manipulating the CSV file, Numpy, Sklearn for the algorithms and Tkinter for our GUI stuff.

Because If we use a single algorithm for our project then how we come to know that the prediction is correct.

So that’s why we use three algorithms.

Now our first step is to make a list or dataset of the symptoms and diseases.

The dataset is given below:

Prototype.csv

Prototype1.csv

Disease Prediction GUI Project In Python Using ML

from tkinter import *
import numpy as np
import pandas as pd

#List of the symptoms is listed here in list l1.

l1=['back_pain','constipation','abdominal_pain','diarrhoea','mild_fever','yellow_urine',
'yellowing_of_eyes','acute_liver_failure','fluid_overload','swelling_of_stomach',
'swelled_lymph_nodes','malaise','blurred_and_distorted_vision','phlegm','throat_irritation',
'redness_of_eyes','sinus_pressure','runny_nose','congestion','chest_pain','weakness_in_limbs',
'fast_heart_rate','pain_during_bowel_movements','pain_in_anal_region','bloody_stool',
'irritation_in_anus','neck_pain','dizziness','cramps','bruising','obesity','swollen_legs',
'swollen_blood_vessels','puffy_face_and_eyes','enlarged_thyroid','brittle_nails',
'swollen_extremeties','excessive_hunger','extra_marital_contacts','drying_and_tingling_lips',
'slurred_speech','knee_pain','hip_joint_pain','muscle_weakness','stiff_neck','swelling_joints',
'movement_stiffness','spinning_movements','loss_of_balance','unsteadiness',
'weakness_of_one_body_side','loss_of_smell','bladder_discomfort','foul_smell_of urine',
'continuous_feel_of_urine','passage_of_gases','internal_itching','toxic_look_(typhos)',
'depression','irritability','muscle_pain','altered_sensorium','red_spots_over_body','belly_pain',
'abnormal_menstruation','dischromic _patches','watering_from_eyes','increased_appetite','polyuria','family_history','mucoid_sputum',
'rusty_sputum','lack_of_concentration','visual_disturbances','receiving_blood_transfusion',
'receiving_unsterile_injections','coma','stomach_bleeding','distention_of_abdomen',
'history_of_alcohol_consumption','fluid_overload','blood_in_sputum','prominent_veins_on_calf',
'palpitations','painful_walking','pus_filled_pimples','blackheads','scurring','skin_peeling',
'silver_like_dusting','small_dents_in_nails','inflammatory_nails','blister','red_sore_around_nose',
'yellow_crust_ooze']

#List of Diseases is listed in list disease.

disease=['Fungal infection','Allergy','GERD','Chronic cholestasis','Drug Reaction',
'Peptic ulcer diseae','AIDS','Diabetes','Gastroenteritis','Bronchial Asthma','Hypertension',
' Migraine','Cervical spondylosis',
'Paralysis (brain hemorrhage)','Jaundice','Malaria','Chicken pox','Dengue','Typhoid','hepatitis A',
'Hepatitis B','Hepatitis C','Hepatitis D','Hepatitis E','Alcoholic hepatitis','Tuberculosis',
'Common Cold','Pneumonia','Dimorphic hemmorhoids(piles)',
'Heartattack','Varicoseveins','Hypothyroidism','Hyperthyroidism','Hypoglycemia','Osteoarthristis',
'Arthritis','(vertigo) Paroymsal  Positional Vertigo','Acne','Urinary tract infection','Psoriasis',
'Impetigo']

l2=[]

for i in range(0,len(l1)):
    l2.append(0)

Now the main part of machine learning comes here i.e the training and testing of the code or model.

So the training file is named as prototype.csv in our program and the testing file is named as prototype 1.csv.

In the last of the article, there is a link to the files.

Import those files by using pandas and replace the items in the files as shown in the code.

Traverse the file as shown in the code and store them into an x_test and y_test. Then Ravel the y_text using the Numpy module.

df=pd.read_csv("Prototype.csv")

#Replace the values in the imported file by pandas by the inbuilt function replace in pandas.

df.replace({'prognosis':{'Fungal infection':0,'Allergy':1,'GERD':2,'Chronic cholestasis':3,'Drug Reaction':4,
'Peptic ulcer diseae':5,'AIDS':6,'Diabetes ':7,'Gastroenteritis':8,'Bronchial Asthma':9,'Hypertension ':10,
'Migraine':11,'Cervical spondylosis':12,
'Paralysis (brain hemorrhage)':13,'Jaundice':14,'Malaria':15,'Chicken pox':16,'Dengue':17,'Typhoid':18,'hepatitis A':19,
'Hepatitis B':20,'Hepatitis C':21,'Hepatitis D':22,'Hepatitis E':23,'Alcoholic hepatitis':24,'Tuberculosis':25,
'Common Cold':26,'Pneumonia':27,'Dimorphic hemmorhoids(piles)':28,'Heart attack':29,'Varicose veins':30,'Hypothyroidism':31,
'Hyperthyroidism':32,'Hypoglycemia':33,'Osteoarthristis':34,'Arthritis':35,
'(vertigo) Paroymsal  Positional Vertigo':36,'Acne':37,'Urinary tract infection':38,'Psoriasis':39,
'Impetigo':40}},inplace=True)

#check the df 
#print(df.head())

X= df[l1]

#print(X)

y = df[["prognosis"]]
np.ravel(y)

#print(y)

#Read a csv named Testing.csv

tr=pd.read_csv("Prototype 1.csv")

#Use replace method in pandas.

tr.replace({'prognosis':{'Fungal infection':0,'Allergy':1,'GERD':2,'Chronic cholestasis':3,'Drug Reaction':4,
'Peptic ulcer diseae':5,'AIDS':6,'Diabetes ':7,'Gastroenteritis':8,'Bronchial Asthma':9,'Hypertension ':10,
'Migraine':11,'Cervical spondylosis':12,
'Paralysis (brain hemorrhage)':13,'Jaundice':14,'Malaria':15,'Chicken pox':16,'Dengue':17,'Typhoid':18,'hepatitis A':19,
'Hepatitis B':20,'Hepatitis C':21,'Hepatitis D':22,'Hepatitis E':23,'Alcoholic hepatitis':24,'Tuberculosis':25,
'Common Cold':26,'Pneumonia':27,'Dimorphic hemmorhoids(piles)':28,'Heart attack':29,'Varicose veins':30,'Hypothyroidism':31,
'Hyperthyroidism':32,'Hypoglycemia':33,'Osteoarthristis':34,'Arthritis':35,
'(vertigo) Paroymsal  Positional Vertigo':36,'Acne':37,'Urinary tract infection':38,'Psoriasis':39,
'Impetigo':40}},inplace=True)

X_test= tr[l1]
y_test = tr[["prognosis"]]

#print(y_test)

np.ravel(y_test)

Now here comes the algorithm part of our program.

Before going through the algorithms, I advise you to go through the algorithms we are going to use in this project.

After understanding the basics of algorithms now apply them in our project.

If the prediction by any of the two algorithms is the same the user might have that disease.

But all the algorithms predict different diseases then the user is trying to make the machine fool.

def DecisionTree():

    from sklearn import tree

    clf3 = tree.DecisionTreeClassifier() 
    clf3 = clf3.fit(X,y)

    from sklearn.metrics import accuracy_score
    y_pred=clf3.predict(X_test)
    print(accuracy_score(y_test, y_pred))
    print(accuracy_score(y_test, y_pred,normalize=False))

    psymptoms = [Symptom1.get(),Symptom2.get(),Symptom3.get(),Symptom4.get(),Symptom5.get()]

    for k in range(0,len(l1)):
        for z in psymptoms:
            if(z==l1[k]):
                l2[k]=1

    inputtest = [l2]
    predict = clf3.predict(inputtest)
    predicted=predict[0]

    h='no'
    for a in range(0,len(disease)):
        if(predicted == a):
            h='yes'
            break


    if (h=='yes'):
        t1.delete("1.0", END)
        t1.insert(END, disease[a])
    else:
        t1.delete("1.0", END)
        t1.insert(END, "Not Found")


def randomforest():
    from sklearn.ensemble import RandomForestClassifier
    clf4 = RandomForestClassifier()
    clf4 = clf4.fit(X,np.ravel(y))

    # calculating accuracy 
    from sklearn.metrics import accuracy_score
    y_pred=clf4.predict(X_test)
    print(accuracy_score(y_test, y_pred))
    print(accuracy_score(y_test, y_pred,normalize=False))
    
    psymptoms = [Symptom1.get(),Symptom2.get(),Symptom3.get(),Symptom4.get(),Symptom5.get()]

    for k in range(0,len(l1)):
        for z in psymptoms:
            if(z==l1[k]):
                l2[k]=1

    inputtest = [l2]
    predict = clf4.predict(inputtest)
    predicted=predict[0]

    h='no'
    for a in range(0,len(disease)):
        if(predicted == a):
            h='yes'
            break

    if (h=='yes'):
        t2.delete("1.0", END)
        t2.insert(END, disease[a])
    else:
        t2.delete("1.0", END)
        t2.insert(END, "Not Found")


def NaiveBayes():
    from sklearn.naive_bayes import GaussianNB
    gnb = GaussianNB()
    gnb=gnb.fit(X,np.ravel(y))

    from sklearn.metrics import accuracy_score
    y_pred=gnb.predict(X_test)
    print(accuracy_score(y_test, y_pred))
    print(accuracy_score(y_test, y_pred,normalize=False))

    psymptoms = [Symptom1.get(),Symptom2.get(),Symptom3.get(),Symptom4.get(),Symptom5.get()]
    for k in range(0,len(l1)):
        for z in psymptoms:
            if(z==l1[k]):
                l2[k]=1

    inputtest = [l2]
    predict = gnb.predict(inputtest)
    predicted=predict[0]

    h='no'
    for a in range(0,len(disease)):
        if(predicted == a):
            h='yes'
            break

    if (h=='yes'):
        t3.delete("1.0", END)
        t3.insert(END, disease[a])
    else:
        t3.delete("1.0", END)
        t3.insert(END, "Not Found")

Now we have done with all the computation part.

We are going to use Tkinter for our GUI because of different IDE. In Anaconda we can use a simple drag and drop.

But in other IDE we have to write the big codes.

Now design the GUI for our project.

# GUI stuff..............................................................................
        
root = Tk()
root.configure(background='black')

Symptom1 = StringVar()
Symptom1.set("Select Here")

Symptom2 = StringVar()
Symptom2.set("Select Here")

Symptom3 = StringVar()
Symptom3.set("Select Here")

Symptom4 = StringVar()
Symptom4.set("Select Here")

Symptom5 = StringVar()
Symptom5.set("Select Here")

Name = StringVar()

w2 = Label(root, justify=LEFT, text="Disease Predictor using Machine Learning", fg="Red", bg="White")
w2.config(font=("Times",30,"bold italic"))
w2.grid(row=1, column=0, columnspan=2, padx=100)
w2 = Label(root, justify=LEFT, text="A Project by Shrimad Mishra", fg="Pink", bg="Blue")
w2.config(font=("Times",30,"bold italic"))
w2.grid(row=2, column=0, columnspan=2, padx=100)

NameLb = Label(root, text="Name of the Patient", fg="Red", bg="Sky Blue")
NameLb.config(font=("Times",15,"bold italic"))
NameLb.grid(row=6, column=0, pady=15, sticky=W)

S1Lb = Label(root, text="Symptom 1", fg="Blue", bg="Pink")
S1Lb.config(font=("Times",15,"bold italic"))
S1Lb.grid(row=7, column=0, pady=10, sticky=W)

S2Lb = Label(root, text="Symptom 2", fg="White", bg="Purple")
S2Lb.config(font=("Times",15,"bold italic"))
S2Lb.grid(row=8, column=0, pady=10, sticky=W)

S3Lb = Label(root, text="Symptom 3", fg="Green",bg="white")
S3Lb.config(font=("Times",15,"bold italic"))
S3Lb.grid(row=9, column=0, pady=10, sticky=W)

S4Lb = Label(root, text="Symptom 4", fg="blue", bg="Yellow")
S4Lb.config(font=("Times",15,"bold italic"))
S4Lb.grid(row=10, column=0, pady=10, sticky=W)

S5Lb = Label(root, text="Symptom 5", fg="purple", bg="light green")
S5Lb.config(font=("Times",15,"bold italic"))
S5Lb.grid(row=11, column=0, pady=10, sticky=W)


lrLb = Label(root, text="DecisionTree", fg="white", bg="red")
lrLb.config(font=("Times",15,"bold italic"))
lrLb.grid(row=15, column=0, pady=10,sticky=W)

destreeLb = Label(root, text="RandomForest", fg="Red", bg="Orange")
destreeLb.config(font=("Times",15,"bold italic"))
destreeLb.grid(row=17, column=0, pady=10, sticky=W)

ranfLb = Label(root, text="NaiveBayes", fg="White", bg="green")
ranfLb.config(font=("Times",15,"bold italic"))
ranfLb.grid(row=19, column=0, pady=10, sticky=W)

OPTIONS = sorted(l1)

NameEn = Entry(root, textvariable=Name)
NameEn.grid(row=6, column=1)

S1 = OptionMenu(root, Symptom1,*OPTIONS)
S1.grid(row=7, column=1)

S2 = OptionMenu(root, Symptom2,*OPTIONS)
S2.grid(row=8, column=1)

S3 = OptionMenu(root, Symptom3,*OPTIONS)
S3.grid(row=9, column=1)

S4 = OptionMenu(root, Symptom4,*OPTIONS)
S4.grid(row=10, column=1)

S5 = OptionMenu(root, Symptom5,*OPTIONS)
S5.grid(row=11, column=1)


dst = Button(root, text="Prediction 1", command=DecisionTree,bg="Red",fg="yellow")
dst.config(font=("Times",15,"bold italic"))
dst.grid(row=8, column=3,padx=10)

rnf = Button(root, text="Prediction 2", command=randomforest,bg="White",fg="green")
rnf.config(font=("Times",15,"bold italic"))
rnf.grid(row=9, column=3,padx=10)

lr = Button(root, text="Prediction 3", command=NaiveBayes,bg="Blue",fg="white")
lr.config(font=("Times",15,"bold italic"))
lr.grid(row=10, column=3,padx=10)


t1 = Text(root, height=1, width=40,bg="Light green",fg="red")
t1.config(font=("Times",15,"bold italic"))
t1.grid(row=15, column=1, padx=10)

t2 = Text(root, height=1, width=40,bg="White",fg="Blue")
t2.config(font=("Times",15,"bold italic"))
t2.grid(row=17, column=1 , padx=10)

t3 = Text(root, height=1, width=40,bg="red",fg="white")
t3.config(font=("Times",15,"bold italic"))
t3.grid(row=19, column=1 , padx=10)

root.mainloop()

Here is the output

Disease Prediction Using Machine Learning In Python Using GUI machine learning

Disease Prediction Using Machine Learning In Python Using GUI

disease prediction in python using tkinter

41 responses to “Disease Prediction Using Machine Learning In Python Using GUI”

Honey says:

January 8, 2020 at 7:25 am

Really an amazing and wonderful project…. you deserve the appreciation. Keep the same spirit brother. Hope it would help countless people in future.

Reply
- Shrimad Mishra says:
  
  January 8, 2020 at 3:52 pm
  
  Thanks
  
  Reply
- Akhilesh says:
  
  May 18, 2021 at 11:32 am
  
  I am unable to run the code pls help me out sir
  
  Reply
Pushp says:

January 8, 2020 at 7:38 am

Thanks Sir. It is easy to understand and very helpful for beginners.

Reply
Sarth says:

January 8, 2020 at 8:37 am

Amazing stuff. Very well coded .

Reply
Vinay says:

January 8, 2020 at 9:53 am

Very good initiative

Reply
Gopi Raman Mishra says:

January 8, 2020 at 9:03 pm

Really marvellous work. GUI is easy to focus and very complex. It is really helpful for New programmers. Well done. Thumbs up.

Reply
jeffrey says:

March 6, 2020 at 9:30 am

where is the prototype file!! unable to find it… ….

Reply
- Saruque Ahamed Mollick says:
  
  April 6, 2020 at 8:24 pm
  
  The post has been updated and the prototype file link is given. You can download it now.
  
  Reply
jeffrey says:

March 6, 2020 at 9:48 am

link for the prototype files please!……

Reply
- Rohan agrawal says:
  
  April 6, 2020 at 12:11 pm
  
  did you find the dataset?
  please provide me link
  
  Reply
- Saruque Ahamed Mollick says:
  
  April 6, 2020 at 8:24 pm
  
  The post has been updated and the prototype file link is given. You can download it now.
  
  Reply
Rohan agrawal says:

April 6, 2020 at 12:33 pm

Can you provide the link for prototype.csv and prototype 1.csv datasets?

Reply
- Saruque Ahamed Mollick says:
  
  April 6, 2020 at 8:24 pm
  
  The post has been updated and the prototype file link is given. You can download it now.
  
  Reply
  - NOWSHEEN RASHID says:
    
    April 10, 2020 at 9:07 pm
    
    No link to dataset is present in this post.
    
    Reply
    - Saruque Ahamed Mollick says:
      
      April 10, 2020 at 9:33 pm
      
      Kindly check carefully. The dataset link is given above the code
      
      Reply
Eslem says:

April 9, 2020 at 1:40 am

Can you provide the link for prototype 1.csv datasets, please?

Reply
Itishree says:

April 20, 2020 at 5:08 pm

Only prototype.csv is there.where is prototype 1.csv ?

Reply
- Saruque Ahamed Mollick says:
  
  May 30, 2020 at 9:37 pm
  
  We have added that file. Kindly check the updated post.
  
  Reply
- Rushikesh says:
  
  December 3, 2021 at 12:41 pm
  
  Can you please send me the documentation file. I m currently doing this project?
  
  Reply
rabiya fathima says:

May 4, 2020 at 6:08 pm

is this a classification or regression project?

Reply
shivang saxena says:

May 30, 2020 at 4:01 pm

where is the prototype1.csv file please provide the link of that file so we can proceed further.

Reply
- Saruque Ahamed Mollick says:
  
  May 30, 2020 at 9:36 pm
  
  We have added that protype1.csv
  
  Reply
Himalay says:

June 2, 2020 at 8:38 pm

can you please give us the explanation of the code for the algo you have used(mainly the decision tree part)

Reply
Drishthi says:

June 21, 2020 at 10:30 pm

i got a tcl error …can someone help me with it…i tried all ways i got from net

Reply
Yash Vibhute says:

October 7, 2020 at 5:10 pm

Exception in Tkinter callback
Traceback (most recent call last):
File “C:\Users\Yash\AppData\Local\Programs\Python\Python38\lib\tkinter\__init__.py”, line 1883, in __call__
return self.func(*args)
File “C:/Users/Yash/PycharmProjects/pythonProject1/codeyash.py”, line 42, in randomforest
clf4 = clf4.fit(X,np.ravel(y))
File “C:\Users\Yash\AppData\Local\Programs\Python\Python38\lib\site-packages\sklearn\ensemble\_forest.py”, line 330, in fit
y, expanded_class_weight = self._validate_y_class_weight(y)
File “C:\Users\Yash\AppData\Local\Programs\Python\Python38\lib\site-packages\sklearn\ensemble\_forest.py”, line 558, in _validate_y_class_weight
check_classification_targets(y)
File “C:\Users\Yash\AppData\Local\Programs\Python\Python38\lib\site-packages\sklearn\utils\multiclass.py”, line 172, in check_classification_targets
raise ValueError(“Unknown label type: %r” % y_type)
ValueError: Unknown label type: ‘unknown’

This is the error i am getting please help

Reply
- Rose says:
  
  March 27, 2023 at 8:57 pm
  
  Im getting the same error and not able to solve
  
  Reply
sahana k says:

November 12, 2020 at 10:41 am

please help me out ..im getting the error in GUI Stuff that “StringVar() is not defined” what shall I do ?

Reply
Sahith Reddy says:

February 10, 2021 at 6:54 am

Can you please send me the documentation file. I m currently doing this project?

Reply
Lalitha says:

April 21, 2021 at 9:18 pm

Could you please tell me how to execute I mean in which ide .. plz help mee

Reply
- Durga prasad says:
  
  January 6, 2023 at 8:00 pm
  
  Try with google colab or jupiter notebook
  
  Reply
zereabrham mulugeta says:

April 24, 2021 at 12:55 pm

good project.my project also healthcare support bot.can you help me by sending source code and documentation.

Reply
Suhasihi says:

June 27, 2021 at 4:37 pm

hey nice project but why is the accuracy for all the three algorithms same?
it shows “0.9512195121951219” accuracy score always. shouldn’t the accuracy score vary for each algorithm each time?

Reply
- Bhavya says:
  
  March 28, 2023 at 1:13 am
  
  In l1 list only some of the symptoms are included. After including all the symptoms I got 1.0 accuracy for all the three algoritms.
  
  Reply
Pallavi says:

August 2, 2021 at 5:33 pm

I am encountering a value error which is
ValueError: could not convert string to float: ‘x’
could you please help me with this.

Reply
Azanu says:

January 23, 2022 at 4:27 pm

I tried to develop this GUI but the prediction button is not working, please help me?

Reply
nicoslin says:

April 6, 2022 at 8:27 am

I’m yet to start a project on this topic… any advice please

Reply
G Durga prasad says:

January 23, 2023 at 8:11 pm

Can you please share the diseases and their symptoms list sir ..I want the list to proceed into the project so please send asap..Thank you

Reply
Sreeja says:

January 30, 2023 at 12:36 pm

I am getting the same error….could anyone pls help us with this

Reply
Shrimad Mishra says:

April 29, 2023 at 8:18 pm

Please check again there might be some issue with the accuracy calculation

Reply
Shraddha says:

October 12, 2023 at 8:00 pm

There is an error when we take our dataset.that error is connor convert string to float.what we do

Reply

Disease Prediction Using Machine Learning In Python Using GUI

Disease Prediction GUI Project In Python Using ML

41 responses to “Disease Prediction Using Machine Learning In Python Using GUI”

Leave a Reply Cancel reply

Related Posts