|
a |
|
b/README.md |
|
|
1 |
Computer-Vision-Lung-Cancer-Detection |
|
|
2 |
===================================== |
|
|
3 |
|
|
|
4 |
This code is part of the 2013 REU with Depaul University and University of Chicago. |
|
|
5 |
The image processing code was lead by Patrick Stein. |
|
|
6 |
The machine learning code was lead by Ethan Smith. |
|
|
7 |
The results of this research were published at the 2013 International Conference on Machine Learning Applications. |
|
|
8 |
Paper can be found at http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnumber=6786163&url=http%3A%2F%2Fieeexplore.ieee.org%2Fxpls%2Fabs_all.jsp%3Farnumber%3D6786163 |
|
|
9 |
|
|
|
10 |
This code makes an alteration in the process of diagnosing lung cancer nodules. The typical process is as follows: |
|
|
11 |
-The human expert (radiologist) reviews the CT scan and identifies the presence/absense of nodules. |
|
|
12 |
-The human draws a border around each nodule |
|
|
13 |
-The computer uses image processing to turn the contents of that border into data |
|
|
14 |
-The computer uses that data to make predictions about the nature of the nodule (malignant, etc.) |
|
|
15 |
-The human performs his own evaluation of the nodule |
|
|
16 |
-The human uses the computer's predictions as a second opinion, and re examines the CT scan. |
|
|
17 |
-The human makes his final decision and writes a report |
|
|
18 |
-The report is sent to an oncologist who uses it to make the final diagnosis |
|
|
19 |
|
|
|
20 |
The step where the radiologist draws the pixel by pixel border around the nodule is expensive due to the time required by a specialist doctor. |
|
|
21 |
The process used in this research speeds up this process, by replacing this 'hard' border with a simple dot identifying the center of the nodule, and then uses several imaging processing methods to form several 'weak segmentations'. |
|
|
22 |
Then the machine learning process learns which combination of these weak segmentations tends to be the best at identifying nodules. |
|
|
23 |
This weak segmentation is used, processed into data, and then used by machine learning classifiers to make predictions. |
|
|
24 |
|
|
|
25 |
In our experiments this method performed quite well and actually performed better than using the hard borders. |
|
|
26 |
This can greatly aid research in automated cancer detection by reducing the cost needed to obtain data to train on. |
|
|
27 |
This can also reduce the final cost of healthcare. |