## About Dataset

### Relevant information:

Features are computed from a digitized image of a fine needle aspirate (FNA) of a breast mass. They describe characteristics of the cell nuclei present in the image. The separating plane was obtained using the Multisurface Method-Tree (MSM-T) [K. P. Bennett, "Decision Tree Construction Via Linear Programming." Proceedings of the 4th Midwest Artificial Intelligence and Cognitive Science Society, pp. 97-101, 1992], a classification method that uses linear programming to construct a decision tree. Relevant features were selected using an exhaustive search in the space of 1-4 features and 1-3 separating planes. The actual linear program used to obtain the separating plane in the 3-dimensional space is described in: [K. P. Bennett and O. L. Mangasarian: "Robust Linear Programming Discrimination of Two Linearly Inseparable Sets", Optimization Methods and Software 1, 1992, 23-34].

Number of instances: 569

Number of attributes: 32 (ID, diagnosis, 30 real-valued input features)

Diagnosis (M = malignant, B = benign)

Ten real-valued features are computed for each cell nucleus:

a) radius (mean of distances from center to points on the perimeter)
b) texture (standard deviation of gray-scale values)
c) perimeter
d) area
e) smoothness (local variation in radius lengths)
f) compactness (perimeter^2 / area - 1.0)
g) concavity (severity of concave portions of the contour)
h) concave points (number of concave portions of the contour)
i) symmetry
j) fractal dimension ("coastline approximation" - 1)
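
To make definitions a), c), d) and f) concrete, here is a minimal Python sketch that computes them for a nucleus outline given as a closed polygon of (x, y) points. The function name `nucleus_features`, the use of the mean boundary point as the "center", and the polygon-based perimeter and area are assumptions made for illustration. The original feature extraction may measure these quantities differently.

```python
import math

def nucleus_features(contour):
    """Shape features for a closed contour given as a list of (x, y) points.

    Hypothetical helper: the center is approximated by the mean of the
    boundary points, which is an assumption of this sketch.
    """
    n = len(contour)
    cx = sum(x for x, _ in contour) / n
    cy = sum(y for _, y in contour) / n

    # a) radius: mean distance from the center to the points on the perimeter
    radius = sum(math.hypot(x - cx, y - cy) for x, y in contour) / n

    perimeter = 0.0
    twice_area = 0.0
    for i in range(n):
        x0, y0 = contour[i]
        x1, y1 = contour[(i + 1) % n]  # wrap around to close the polygon
        # c) perimeter: sum of the edge lengths
        perimeter += math.hypot(x1 - x0, y1 - y0)
        # d) area: shoelace formula accumulates twice the signed area
        twice_area += x0 * y1 - x1 * y0
    area = abs(twice_area) / 2.0

    # f) compactness: perimeter^2 / area - 1.0
    compactness = perimeter ** 2 / area - 1.0

    return {
        "radius": radius,
        "perimeter": perimeter,
        "area": area,
        "compactness": compactness,
    }

# Toy outline: a 2 x 2 square, not a real nucleus.
square = [(0.0, 0.0), (2.0, 0.0), (2.0, 2.0), (0.0, 2.0)]
features = nucleus_features(square)
```

For the toy square, the perimeter is 8 and the area is 4, so the compactness is 8^2 / 4 - 1.0 = 15.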

Missing attribute values: none

Class distribution: 357 benign, 212 malignant
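
The figures above (instance count, attribute count, class distribution, missing values) can be checked against a local copy of the data. The sketch below uses only the Python standard library. The function name `summarize` and the column name `diagnosis` are assumptions about how the CSV is laid out, and the sample rows are made-up toy values, not records from the dataset.

```python
import csv
import io
from collections import Counter

def summarize(csv_text):
    """Return (row count, column count, class counts, missing-cell count)."""
    reader = csv.reader(io.StringIO(csv_text))
    header = next(reader)
    diag_col = header.index("diagnosis")  # assumed column name
    rows = [row for row in reader if row]  # skip completely blank lines
    classes = Counter(row[diag_col] for row in rows)
    # A cell counts as missing when it is empty or whitespace only.
    missing = sum(1 for row in rows for cell in row if cell.strip() == "")
    return len(rows), len(header), classes, missing

# Made-up toy sample with one deliberately empty cell.
sample = (
    "id,diagnosis,feature_1,feature_2\n"
    "1,M,1.0,2.0\n"
    "2,B,3.0,4.0\n"
    "3,B,5.0,\n"
)
n_rows, n_cols, classes, missing = summarize(sample)
```

Run on the text of the full data file, this should report 569 rows, 32 columns, 357 `B` and 212 `M` labels, and no missing cells, if the file matches the description above.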

### Creators:

Dr. William H. Wolberg, General Surgery Dept., University of Wisconsin.

W. Nick Street, Computer Sciences Dept., University of Wisconsin.

Olvi L. Mangasarian, Computer Sciences Dept., University of Wisconsin.