Card

About Dataset

Pneumonia is an infection that causes inflammation in one or both of the lungs and may be caused by a virus, bacteria, fungi or other germs.

Your doctor may conduct a physical exam and use chest x-ray, chest CT, chest ultrasound, or needle biopsy of the lung to help diagnose your condition. Your doctor may further evaluate your condition and lung function using thoracentesis, chest tube placement or image-guided abscess drainage.

In this dataset you can use CNN to help classify whether patients have Pneumonia or not. It contains "train" folder which will be used to train the model on. "test", which will help us with the validation and finally we can pick some images from the "pred" folder to predict based on the model that we have trained.

For the sake of simplicity, I have balanced the images in the dataset. For a more challenging experience, you can add other data sources and create imbalance. And further improve your CNN model using Data Augmentation.

A sample notebook has been created by myself for your reference.

I do not own this dataset. I have just collected the data from various different sources. Please check the Data Collection Methodology for more information. Credits to the original authors.