Diff of /papers/mee.tex [000000] .. [171cba]

Switch to unified view

a b/papers/mee.tex
1
% Template for PLoS
2
% Version 1.0 January 2009
3
%
4
% To compile to pdf, run:
5
% latex plos.template
6
% bibtex plos.template
7
% latex plos.template
8
% latex plos.template
9
% dvipdf plos.template
10
11
\documentclass[10pt]{article}
12
13
% amsmath package, useful for mathematical formulas
14
\usepackage{amsmath}
15
% amssymb package, useful for mathematical symbols
16
\usepackage{amssymb}
17
18
% graphicx package, useful for including eps and pdf graphics
19
% include graphics with the command \includegraphics
20
\usepackage{graphicx}
21
\usepackage[utf8]{inputenc}
22
\usepackage[T1]{fontenc}
23
\usepackage[icelandic,english]{babel}
24
\usepackage{float}
25
26
% cite package, to clean up citations in the main text. Do not remove.
27
\usepackage{cite}
28
29
\usepackage{color} 
30
31
% Use doublespacing - comment out for single spacing
32
%\usepackage{setspace} 
33
%\doublespacing
34
35
36
% Text layout
37
\topmargin 0.0cm
38
\oddsidemargin 0.5cm
39
\evensidemargin 0.5cm
40
\textwidth 16cm 
41
\textheight 21cm
42
43
% Bold the 'Figure #' in the caption and separate it with a period
44
% Captions will be left justified
45
\usepackage[labelfont=bf,labelsep=period,justification=raggedright]{caption}
46
47
% Use the PLoS provided bibtex style
48
\bibliographystyle{plos2009}
49
50
% Remove brackets from numbering in List of References
51
\makeatletter
52
\renewcommand{\@biblabel}[1]{\quad#1.}
53
\makeatother
54
55
56
% Leave date blank
57
\date{}
58
59
\pagestyle{myheadings}
60
%% ** EDIT HERE **
61
62
63
%% ** EDIT HERE **
64
%% PLEASE INCLUDE ALL MACROS BELOW
65
66
%% END MACROS SECTION
67
68
\begin{document}
69
70
% Title must be 150 characters or less
71
\begin{flushleft}
72
{\Large
73
\textbf{Robust extraction of quantitative structural and textural information from histological images}
74
}
75
% Insert Author names, affiliations and corresponding author email.
76
\\
77
Q. Caudron$^{1,\ast}$, 
78
R. Garnier$^{1}$, 
79
K. A. Watt$^{2}$, 
80
J. G. Pilkington$^{2}$,
81
J. M. Pemberton$^{2}$,
82
T. A. Aboellail$^3$,
83
B. T. Grenfell$^{1,4}$
84
A. L. Graham$^1$,
85
\\
86
\bf{1} Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ, USA
87
\\
88
89
\bf{2} Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, EH9 3FL, UK
90
\\
91
92
\bf{3} Department of Microbiology, Immunology, and Pathology, Colorado State University, Fort Collins, CO, USA
93
\\  
94
\bf{4} Fogarty International Center, National Institutes of Health, Bethesda, MD, USA
95
\\
96
$\ast$ E-mail: qcaudron@princeton.edu
97
\end{flushleft}
98
99
100
101
102
103
104
105
106
107
108
109
110
111
% Please keep the abstract between 250 and 300 words
112
\section*{Abstract}
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
\section*{Introduction}
129
130
%The structural properties and visual appearance of organ tissue can provide valuable information about the condition of health of an individual, yielding insights into disease prevalence or revealing evidence of a particular pathology. 
131
Histopathology is widely practiced in medical and veterinary fields, where it is an essential tool in the diagnosis and discovery of infections, cancers, damaged tissue, and other conditions. In clinical settings, interpretation of histological samples is typically performed via microscopy by a pathologist. This results in a descriptive analysis of several aspects of the tissue's presentation, such as nuclear density, general cell morphology, or the presence of tumours. This time-intensive and expensive process is sensitive to intra- and inter-operator variation and subjectivity in interpretation. In addition, information in pathology reports is qualitative or semi-quantitative, which restricts the use of statistical tools for classification or quantitative analysis.
132
133
Recent advances in imaging technology and computational power have led to the emergence of a quantitative approach to histopathological analysis. Computer vision algorithms are applied to digitised histological preparations, extracting quantitative information about structural features in the tissue. These methods have been successfully applied to the high-throughput analysis of tissue banks for disease classification and phenotyping (Peng2008), and, increasingly, to aid in the detection and identification of disease in hospital settings (Tang2009). Computer-aided diagnosis combines image processing and artificial intelligence methods to aid pathologists in detecting and quantifying irregularities in the tissue, and is currently demonstrating extremely promising results in the fields of diagnostic and intervention radiology. 
134
135
In both high-throughput and laboratory settings, a certain level of image quality is almost guaranteed, allowing image processing algorithms to successfully and reproducibly identify structures of interest in digital images. However, little progress has been made in applying these methods to field ecology, where on-site conditions often constrain the state of tissue samples or capture of images. 
136
137
Here, we demonstrate that quantitative information can be extracted from high-variance images in a robust manner. We apply our algorithms to measures of liver state in the Soay sheep, from 4306 images of histological slides prepared from the liver necropsies of 141 individuals. We begin by presenting the Soay sheep system and associated data. We then describe the image capture process and preprocessing applied to images with the aim of reducing some of the variance in their colour, luminosity, and contrast. Then, we present various measures of structure and texture, which we compare with a traditional pathologist's analysis of the slides. We finish by comparing these measures to a traditional histological analysis of these slides by pathologist grading.
138
139
140
141
142
143
144
145
146
147
148
149
150
151
\section*{Methods}
152
153
154
\subsection*{Soay Sheep}
155
156
Background on the Soay. 
157
158
The data we have : physical measurements, necropsies.
159
160
Histology preparation ( slice, stain, slides per individual, ... )
161
162
163
164
165
166
167
168
\subsection*{Image Acquisition and Processing}
169
170
Imaging was performed on a Nikon Eclipse 80i bright-field microscope with a Nikon CFI Plan Achromat 4x objective. Fields of view measuring $2\times3$mm were captured in RAW format on a Canon 600D SLR camera, attached to the microscopy with an Amscope DSLR adaptor. For each image, the field of view and focus were found manually. Up to five 18-megapixel images were captured from each slide, at 14 bits per channel, in the three-channel sRGB colour space. Fields of view were selected to ensure they did not contain tears, folds, or other imperfections in the tissue caused by sample preparation.
171
172
Images were treated with a preprocessing stage upon capture. Saturation was increased globally, but also preferentially in low-saturation pixels. Yellows were reduced, to compensate for the microscope's lamp, and luminosity was increased in the highlights in order to improve separation of the tissue from the background. Finally, local midtone contrast was enhanced to increase sharpness without amplifying noise.
173
174
After acquisition and preprocessing, images were processed in Python with the scikit-image package. Images were first sigmoid-corrected for contrast, and converted to relative luminance greyscale. Thresholding was performed adaptively, where the threshold is defined as the Gaussian-weighted mean in a 301 pixel block centred around each pixel. Finally, connected components under 100 pixels in size were removed.
175
176
177
178
\subsection*{Measures}
179
180
Seven measures were calculated from processed images.
181
182
183
184
185
 In order to characterise aspects of the structure, texture, and spatial distribution of the tissue, 
186
187
 : lacunarity, entropy, directionality, characteristic scale, the number and mean size of inflammatory foci, and the tissue to sinusoid ratio.
188
189
190
191
1 Gabor filter directionality
192
193
2 Gabor filter scale
194
195
NOTE GABOR - phase-insensitive
196
197
3 normalised Lacunarity
198
199
4 Shannon entropy
200
201
5 Deconvolution foci count and size
202
203
6 Tissue to sinusoid ratio
204
205
206
207
208
209
\section*{Results}
210
211
\subsection*{Measure Distributions}
212
213
Plots of distributions ( include bighorn ? )
214
215
Intra- and inter-individual variances
216
217
PCA
218
219
220
221
222
223
224
225
\subsection*{Pathologist Analysis}
226
227
Correlation with image measures
228
229
230
231
232
233
234
235
236
237
\section*{Discussion}
238
239
240
241
242
243
244
245
246
     
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
% Do NOT remove this, even if you are not including acknowledgments
286
\section*{Acknowledgements}
287
288
QC and BTG were supported by funding from the US Department of Homeland Security contract HSHQDC-12-C-00058. BTG acknowledges support from the Bill \& Melinda Gates Foundation. BTG was funded by the RAPIDD program of the Science and Technology Directorate, Department of Homeland Security, and the Fogarty International Center, National Institutes of Health. 
289
290
291
292
293
294
295
296
297
%\section*{References}
298
% The bibtex filename
299
\bibliography{qbib}
300
301
\section*{Figure Legends}
302
%\begin{figure}[!ht]
303
%\begin{center}
304
%%\includegraphics[width=4in]{figure_name.2.eps}
305
%\end{center}
306
%\caption{
307
%{\bf Bold the first sentence.}  Rest of figure 2  caption.  Caption 
308
%should be left justified, as specified by the options to the caption 
309
%package.
310
%}
311
%\label{Figure_label}
312
%\end{figure}
313
314
315
\section*{Tables}
316
%\begin{table}[!ht]
317
%\caption{
318
%\bf{Table title}}
319
%\begin{tabular}{|c|c|c|}
320
%table information
321
%\end{tabular}
322
%\begin{flushleft}Table caption
323
%\end{flushleft}
324
%\label{tab:label}
325
% \end{table}
326
327
328
329
\vspace{0.4cm}
330
\begin{table}[!h]
331
\centering
332
\begin{tabular}{ l c c c }
333
\hline \\[-0.9em]
334
\textbf{Locality} & Population & Birth rate & {$\mathbf{\tau}$} \\[0.1em]
335
  \hline \\[-0.9em]        
336
  Bornholm & 47100 & 19.4 & 15 \\[0.1em]
337
  Faroe Islands & 28200 & 29.4 & 15 \\[0.1em]
338
  Reykjav\'{i}k & 47100 & 24.1 & 18 \\[0.1em]
339
  Hafnarfj\"{o}r\dh{}ur & 6000 & 22.4 &  8 \\[0.1em]           
340
  Akureyri & 7000 & 22.7 & 19 \\[0.1em]
341
  Vestmannaeyjar \hspace{0.2cm} & 3600 & 23.5 & 7 \\[0.1em]
342
  \hline  
343
\end{tabular}
344
\caption{Mean population sizes, birth rates, and sensitivity thresholds $\tau$ for each locality. Population sizes and annual birth rates per thousand are given as the mean over the study period. Thresholds were fit by maximising the correlation between the mean simulated epidemic time-series and the reported incidence data.}
345
\label{tableTau}
346
\end{table}
347
348
349
350
351
352
\section*{Figures}
353
354
\begin{figure}[!h]
355
\centering
356
%\includegraphics[width=\textwidth]{figures/q1.pdf}
357
\caption{\textbf{Reported and predicted biweekly incidence for Bornholm, the Faroe Islands, and four localities in Iceland.} The observed data is in blue. For the predicted time-series, the mean value of incidence simulations is plotted as a dark red line, with 95\% confidence intervals given in light red. Bornholm~: $R^2=0.78$; Faroe Islands~: $R^2=0.55$; Reykjav\'{i}k~: $R^2=0.73$; Hafnarfj\"{o}r\dh{}ur~: $R^2=0.86$; Akureyri~: $R^2=0.80$; Vestmannaeyjar~: $R^2=0.77$.}
358
\label{figIncidence}
359
\end{figure}
360
361
362
363
\begin{figure}[!h]
364
\centering
365
%\includegraphics[width=\textwidth]{figures/q2.pdf}
366
\caption{\textbf{Reporting rates and seasonalities.} Seasonality is plotted as a function of the biweek, with 95\% confidence intervals in light blue.}
367
\label{figSims}
368
\end{figure}
369
370
371
\begin{figure}[!h]
372
\centering
373
%\includegraphics[width=\textwidth]{figures/q3.pdf}
374
\caption{\textbf{Predictability of epidemic sizes.} The mean predicted size of each epidemic as a function of its observed size, from ten thousand simulations. Red lines are the regression lines with the follow coefficients of determination and slopes~-- Bornholm~: $R^2=0.76$, gradient~$=1.07$; Faroe Islands~: $R^2=0.77$, gradient~$=0.60$; Reykjav\'{i}k~: $R^2=0.64$, gradient~$=0.96$; Hafnarfj\"{o}r\dh{}ur~: $R^2=0.88$, gradient~$=1.18$; Akureyri~: $R^2 = 0.49$, gradient~$=0.72$; Vestmannaeyjar~: $R^2=0.76$, gradient~$=1.23$. The green line is the zero-intercept, gradient-one line representing a one-to-one match between observation and prediction.}
375
\label{fig_sizes}
376
\end{figure}
377
378
379
380
381
382
\end{document}
383