|
a |
|
b/mit-bih-arrhythmia-database-1.0.0/mitdbdir/intro.htm |
|
|
1 |
<html> |
|
|
2 |
<head> |
|
|
3 |
<title>MIT-BIH Arrhythmia Database Directory (Introduction)</title> |
|
|
4 |
</head> |
|
|
5 |
<body bgcolor="#FFFFFF"> |
|
|
6 |
|
|
|
7 |
<a href="mitdbdir.htm"><h1 align=center>MIT-BIH Arrhythmia Database Directory |
|
|
8 |
</h1></a> |
|
|
9 |
|
|
|
10 |
<p> |
|
|
11 |
<b>Next:</b> <a href="records.htm">Records</a> |
|
|
12 |
<b>Up:</b> <a href="mitdbdir.htm#toc">Contents</a> |
|
|
13 |
<b>Previous:</b> <a href="foreword.htm">Foreword</a> |
|
|
14 |
|
|
|
15 |
<a name="intro"><h1>Introduction</h1></a> |
|
|
16 |
<p> |
|
|
17 |
This introduction describes how the database records were |
|
|
18 |
obtained, and discusses the characteristics of the recorded signals. |
|
|
19 |
Following these notes are annotated ``full disclosure'' plots of the entire |
|
|
20 |
database. These can be useful for obtaining an overall |
|
|
21 |
impression of the contents of individual records. Following the |
|
|
22 |
``full disclosure'' plots are sample ECG strips. These strips |
|
|
23 |
were chosen to illustrate the salient features of each record. |
|
|
24 |
Next are notes on the important features of each record. |
|
|
25 |
These notes also include background information on the subjects, |
|
|
26 |
including their medications. |
|
|
27 |
At the end of the book are tables of rhythms and annotations, which |
|
|
28 |
summarize the contents of the database. |
|
|
29 |
These tables can be helpful in finding a record with a specific |
|
|
30 |
set of characteristics. |
|
|
31 |
|
|
|
32 |
<a name="selection"><h2>Selection criteria</h2> |
|
|
33 |
<p> |
|
|
34 |
The source of the ECGs included in the MIT-BIH Arrhythmia Database is a |
|
|
35 |
set of over 4000 long-term Holter recordings that were obtained |
|
|
36 |
by the Beth Israel Hospital Arrhythmia Laboratory between 1975 and 1979. |
|
|
37 |
Approximately 60% of these recordings were obtained from inpatients. |
|
|
38 |
The database contains 23 records |
|
|
39 |
(numbered from 100 to 124 inclusive with some numbers missing) |
|
|
40 |
chosen at random from this set, and 25 records |
|
|
41 |
(numbered from 200 to 234 inclusive, again with some numbers missing) |
|
|
42 |
selected from the same set to include a variety of rare but clinically |
|
|
43 |
important phenomena that would not be well-represented |
|
|
44 |
by a small random sample of Holter recordings. |
|
|
45 |
Each of the 48 records is slightly over 30 minutes long. |
|
|
46 |
<p> |
|
|
47 |
The first group is intended to serve as a representative sample of the |
|
|
48 |
variety of waveforms and artifact that an arrhythmia detector might |
|
|
49 |
encounter in routine clinical use. A table of random numbers was used |
|
|
50 |
to select tapes, and then to select half-hour segments of them. |
|
|
51 |
Segments selected in this way were excluded only if neither of the two |
|
|
52 |
ECG signals was of adequate quality for analysis by human experts. |
|
|
53 |
<p> |
|
|
54 |
Records in the second group were chosen to |
|
|
55 |
include complex ventricular, junctional, |
|
|
56 |
and supraventricular arrhythmias and conduction abnormalities. Several |
|
|
57 |
of these records were selected because features of the rhythm, QRS |
|
|
58 |
morphology variation, or signal quality may be expected to present |
|
|
59 |
significant difficulty to arrhythmia detectors; these records have |
|
|
60 |
gained considerable notoriety among database users. |
|
|
61 |
<p> |
|
|
62 |
The subjects were 25 men aged 32 to 89 years, and 22 women aged 23 to 89 |
|
|
63 |
years. (Records 201 and 202 came from the same male subject.) |
|
|
64 |
|
|
|
65 |
<a name="leads"><h2>ECG lead configuration</h2></a> |
|
|
66 |
<p> |
|
|
67 |
In most records, the upper signal is a modified limb lead II (MLII), |
|
|
68 |
obtained by placing the electrodes on the chest. The lower signal is |
|
|
69 |
usually a modified lead V1 (occasionally V2 or V5, and in one instance V4); |
|
|
70 |
as for the upper signal, the electrodes are also placed on the chest. |
|
|
71 |
This configuration is routinely used by the BIH Arrhythmia Laboratory. |
|
|
72 |
Normal QRS complexes are usually prominent in the upper signal. |
|
|
73 |
The lead axis for the lower signal may be nearly orthogonal to the mean |
|
|
74 |
cardiac electrical axis, however (i.e., normal beats are usually |
|
|
75 |
biphasic and may be nearly isoelectric). |
|
|
76 |
Thus normal beats are frequently difficult to discern in the lower signal, |
|
|
77 |
although ectopic beats will often be more prominent (see, for example, record |
|
|
78 |
106). |
|
|
79 |
A notable exception is record 114, for which the signals were reversed. |
|
|
80 |
Since this happens occasionally in clinical practice, arrhythmia detectors |
|
|
81 |
should be equipped to deal with this situation. |
|
|
82 |
In records 102 and 104, it was not possible to use modified lead II because of |
|
|
83 |
surgical dressings on the patients; |
|
|
84 |
modified lead V5 was used for the upper signal in these records. |
|
|
85 |
|
|
|
86 |
<a name="analog"><h2>Analog recording and playback</h2></a> |
|
|
87 |
<p> |
|
|
88 |
The original analog recordings were made using nine Del Mar Avionics |
|
|
89 |
model 445 two-channel recorders, designated <i>A</i> through <i>I</i>: |
|
|
90 |
<table border> |
|
|
91 |
<tr><th><i>Recorder</i></th><th><i>Records</i></th></tr> |
|
|
92 |
<tr><td align=center><i>A</i></td><td>102, 107, 111, 115, 121</td></tr> |
|
|
93 |
<tr><td align=center><i>B</i></td><td>212</td></tr> |
|
|
94 |
<tr><td align=center><i>C</i></td><td>203</td></tr> |
|
|
95 |
<tr><td align=center><i>D</i></td><td>118, 124, 217</td></tr> |
|
|
96 |
<tr><td align=center><i>E</i></td> |
|
|
97 |
<td>101, 103, 106, 108, 112, 117, 119, 122, 209, 219, 220, 223, 233</td></tr> |
|
|
98 |
<tr><td align=center><i>F</i></td> |
|
|
99 |
<td>104, 109, 123, 205, 207, 210, 215, 221</td></tr> |
|
|
100 |
<tr><td align=center><i>G</i></td> |
|
|
101 |
<td>100, 105, 114, 116, 213, 214, 222, 228</td></tr> |
|
|
102 |
<tr><td align=center><i>H</i></td><td>113, 201, 202, 231</td></tr> |
|
|
103 |
<tr><td align=center><i>I</i></td><td>200, 230, 232, 234</td></tr> |
|
|
104 |
</table> |
|
|
105 |
<br> |
|
|
106 |
(It is not known which recorder was used for record 208.) |
|
|
107 |
<p> |
|
|
108 |
During the digitization process, the analog recordings were played back |
|
|
109 |
on a Del Mar Avionics model 660 unit. The analog tapes used for records |
|
|
110 |
112, 115 through 124, 205, 220, 223, and 230 through 234 were played back |
|
|
111 |
and digitized at twice real time; the rest were played back at real time |
|
|
112 |
using a specially constructed capstan for the model 660 unit. |
|
|
113 |
Skew between the two signals was found to be as great as 40 |
|
|
114 |
milliseconds for some of the analog recorders. |
|
|
115 |
In addition to the fixed skew that results from extremely small differences |
|
|
116 |
in the orientations of the tape heads on the recorder and the playback unit, |
|
|
117 |
microscopic vertical wobbling of the tape, either during recording or playback, |
|
|
118 |
introduces a variable skew, which may be comparable in magnitude to the fixed |
|
|
119 |
skew. |
|
|
120 |
This problem (which also |
|
|
121 |
occurs on the AHA database) may present difficulties for certain two-channel |
|
|
122 |
analysis methods designed for real-time applications. |
|
|
123 |
<p> |
|
|
124 |
Minor tape speed variations should not pose problems for typical arrhythmia |
|
|
125 |
detectors. It is difficult to avoid tape sticking or slippage during low-speed |
|
|
126 |
playback, and several episodes of tape slippage were noted and marked with |
|
|
127 |
comment annotations. Wow and flutter should be studied carefully in the |
|
|
128 |
context of heart-rate variability studies, since flutter compensation |
|
|
129 |
was not possible in these recordings. A number of frequency-domain artifacts |
|
|
130 |
have been identified and related to specific mechanical components of the |
|
|
131 |
recorders and the playback unit: |
|
|
132 |
<table border> |
|
|
133 |
<tr><th><i>Frequency (Hz)</i></th><th><i>Source</i></th></tr> |
|
|
134 |
<tr><td align=center>0.042</td><td>Recorder pressure wheel</td></tr> |
|
|
135 |
<tr><td align=center>0.083</td><td>Playback unit capstan (for twice real-time playback)</td></tr> |
|
|
136 |
<tr><td align=center>0.090</td><td>Recorder capstan</td></tr> |
|
|
137 |
<tr><td align=center>0.167</td> |
|
|
138 |
<td>Playback unit capstan (for real-time playback)</td></tr> |
|
|
139 |
<tr><td align=center>0.18-0.10</td> |
|
|
140 |
<td>Takeup reel (frequency decreases over time)</td></tr> |
|
|
141 |
<tr><td align=center>0.20-0.36</td> |
|
|
142 |
<td>Supply reel (frequency increases over time)</td></tr> |
|
|
143 |
</table> |
|
|
144 |
<br> |
|
|
145 |
The most significant of these artifacts by far is the 0.167 Hz artifact on |
|
|
146 |
recordings that were played back at real time. The next largest is the |
|
|
147 |
0.090 Hz artifact; the 0.083 Hz artifact on recordings that were played back |
|
|
148 |
at twice real-time is of roughly the same magnitude as the 0.090 Hz artifact. |
|
|
149 |
The 0.042 Hz artifact is of much lower magnitude. Other frequencies |
|
|
150 |
related to the drive train (at 0.42 Hz, 1.96 Hz, 9.1 Hz, and |
|
|
151 |
42 Hz) do not appear as noticeable artifacts. |
|
|
152 |
The frequencies of the last two artifacts listed in the table depend on |
|
|
153 |
how much tape is on the supply and takeup reels; the supply reel causes |
|
|
154 |
a much more noticeable artifact than does the takeup reel. Other |
|
|
155 |
frequency-domain artifacts generated by the supply reel appear in the |
|
|
156 |
0.10-0.18 Hz and 0.30-0.54 Hz bands. |
|
|
157 |
<p> |
|
|
158 |
Four of the 48 records (102, 104, 107, and 217) include paced beats. |
|
|
159 |
The original analog recordings do not represent the pacemaker artifacts |
|
|
160 |
with sufficient fidelity to permit them to be recognized by pulse amplitude |
|
|
161 |
(or slew rate) and duration alone, the method commonly used for real-time |
|
|
162 |
processing. The database records reproduce the analog recordings with |
|
|
163 |
sufficient fidelity to permit use of pacemaker artifact detectors designed for |
|
|
164 |
tape analysis, however. |
|
|
165 |
|
|
|
166 |
<a name="digitization"><h2>Digitization</h2></a> |
|
|
167 |
<p> |
|
|
168 |
The analog outputs of the playback unit |
|
|
169 |
were filtered to limit analog-to-digital converter (ADC) |
|
|
170 |
saturation and for anti-aliasing, |
|
|
171 |
using a passband from 0.1 to 100 Hz relative to real time, well |
|
|
172 |
beyond the lowest and highest frequencies recoverable from the recordings. |
|
|
173 |
The bandpass-filtered signals were |
|
|
174 |
digitized at 360 Hz per signal relative to real time |
|
|
175 |
using hardware constructed at the MIT Biomedical Engineering Center and |
|
|
176 |
at the BIH Biomedical Engineering Laboratory. |
|
|
177 |
The sampling frequency was chosen to facilitate implementations of 60 Hz |
|
|
178 |
(mains frequency) digital notch filters in arrhythmia detectors. |
|
|
179 |
Since the recorders were battery-powered, most of the 60 Hz noise present |
|
|
180 |
in the database arose during playback. |
|
|
181 |
In those records that were digitized at twice real time, this noise appears |
|
|
182 |
at 30 Hz (and multiples of 30 Hz) relative to real time. |
|
|
183 |
<p> |
|
|
184 |
Samples were acquired from each signal almost simultaneously (the intersignal |
|
|
185 |
sampling skew was on the order of a few microseconds). As noted above, analog |
|
|
186 |
tape skew was several orders of magnitude larger. The ADCs were |
|
|
187 |
unipolar, with 11-bit resolution over a ±5 mV range. Sample values thus |
|
|
188 |
range from 0 to 2047 inclusive, with a value of 1024 corresponding to zero |
|
|
189 |
volts. |
|
|
190 |
<p> |
|
|
191 |
The 11-bit samples were originally recorded in 8-bit first difference format |
|
|
192 |
(this was necessary because of limited mass storage capacity). Given the |
|
|
193 |
sampling frequency and the resolution of the ADC, the difference encoding |
|
|
194 |
implies a maximum recordable slew rate of ±225 mV/s. In practice, this |
|
|
195 |
limit was exceeded by the input signals very infrequently, only during severe |
|
|
196 |
noise on a small number of records. The effect on the quality of the recorded |
|
|
197 |
signals is totally negligible. On this CD-ROM, the samples have been |
|
|
198 |
reconstructed from the first differences and stored as pairs of 12-bit |
|
|
199 |
amplitudes packed in triplets of consecutive bytes (for details on the storage |
|
|
200 |
format, see <a href="/physiotools/wag/signal-5.htm">signal(5)</a>). |
|
|
201 |
|
|
|
202 |
<a name="annotations"><h2>Annotations</h2></a> |
|
|
203 |
<p> |
|
|
204 |
An initial set of beat labels was produced by a simple slope-sensitive |
|
|
205 |
QRS detector, which marked each detected event as a normal beat. Two |
|
|
206 |
identical 150-foot chart recordings were printed for each 30-minute record, |
|
|
207 |
with these initial beat labels in the margin. |
|
|
208 |
For each record, the two charts were given to two cardiologists, who worked |
|
|
209 |
on them independently. The cardiologists added additional beat labels |
|
|
210 |
where the detector missed beats, deleted false detections as necessary, |
|
|
211 |
and changed the labels for all abnormal beats. They also added rhythm |
|
|
212 |
labels, signal quality labels, and comments. |
|
|
213 |
<p> |
|
|
214 |
The annotations were transcribed from the paper chart recordings. |
|
|
215 |
Once both sets of cardiologists' annotations for a given record |
|
|
216 |
had been transcribed and verified, they were automatically compared |
|
|
217 |
beat-by-beat, and another chart recording was printed. This chart showed |
|
|
218 |
the cardiologists' annotations in the margin, with all discrepancies |
|
|
219 |
highlighted. Each discrepancy was reviewed and resolved by consensus. |
|
|
220 |
The corrections were transcribed, and the annotations were then analyzed |
|
|
221 |
by an auditing program, which checked them for consistency and which |
|
|
222 |
located the ten longest and shortest R-R intervals in each record (to |
|
|
223 |
identify possible missing or falsely detected beats). |
|
|
224 |
<p> |
|
|
225 |
In early copies of the database, most beat labels were placed |
|
|
226 |
at the R-wave peak, but manually inserted labels were not always |
|
|
227 |
located precisely at the peak. |
|
|
228 |
In copies of the database made since 1983, the beat labels have been shifted |
|
|
229 |
from their original locations. |
|
|
230 |
The ECG (usually the upper signal) was digitally bandpass-filtered to |
|
|
231 |
emphasize the QRS complexes, and each beat label was moved to the major local |
|
|
232 |
extremum, after correction for phase shift in the filter. |
|
|
233 |
A few noisy beats were manually realigned. |
|
|
234 |
This process was applied to all records except record 117 in 1983; the |
|
|
235 |
beat labels for record 117 were not realigned until March 1998, however. |
|
|
236 |
The result is that annotations generally appear at the R-wave peak, and |
|
|
237 |
are located with sufficient accuracy to make the reference annotation |
|
|
238 |
files usable for studies requiring waveform averaging and for |
|
|
239 |
heart rate variability studies (but note the comments |
|
|
240 |
with respect to analog tape wow and flutter above). |
|
|
241 |
In the annotated ECG plots produced by <tt>psfd</tt> and <tt>pschart</tt>, |
|
|
242 |
and in printed copies of this directory, each label is placed so that the |
|
|
243 |
fiducial mark for the annotation corresponds to the left edge of the label. |
|
|
244 |
<p> |
|
|
245 |
The database contains approximately 109,000 beat labels. Sixteen |
|
|
246 |
were corrected in the first seven years after the database was released in 1980 |
|
|
247 |
(in records 104, 108, 114, 203, 207, 217, and 222); in addition, |
|
|
248 |
all of the left bundle branch block beats in record 214 were originally |
|
|
249 |
labelled as normal beats. The rhythm labels |
|
|
250 |
have been more substantially revised and now include notations for paced |
|
|
251 |
rhythm, bigeminy, and trigeminy, which were missing in early copies. |
|
|
252 |
<p> |
|
|
253 |
In October 1998, a rhythm label in record 203 was corrected. In |
|
|
254 |
October 2001, a seventeenth error in the beat labels was discovered |
|
|
255 |
and corrected (in record 209). In April 2003, 26 PVC annotations in |
|
|
256 |
record 119 were manually realigned by small amounts (up to 74 ms). In |
|
|
257 |
May 2003, an eighteenth error in the beat labels was discovered and |
|
|
258 |
corrected (in record 214). In April 2005, many of the episodes |
|
|
259 |
previously labelled as atrial fibrillation in record 222 were |
|
|
260 |
partially or completely relabelled as atrial flutter. In April 2008, |
|
|
261 |
three beat labels were corrected (two in record 108, and one in record |
|
|
262 |
215). In June 2010, the 22nd and 23rd errors in the beat labels were |
|
|
263 |
found and corrected (both in record 203). Thanks to Bob Bruce, Pat |
|
|
264 |
Hamilton, Yin Dengfeng, Roger Mark, Sebastian Vasquez, and Mariano |
|
|
265 |
Llamedo Soria for finding and reporting these errors. |
|
|
266 |
|
|
|
267 |
<hr> |
|
|
268 |
|
|
|
269 |
<a name="symbols"><h1>Symbols used in plots</h1></a> |
|
|
270 |
|
|
|
271 |
<p> |
|
|
272 |
[An expanded and updated version of the table below can be found at |
|
|
273 |
<a href="/physiobank/annotations.shtml"> |
|
|
274 |
<tt>http://www.physionet.org/physiobank/annotations.shtml</tt></a>.] |
|
|
275 |
|
|
|
276 |
<p> |
|
|
277 |
<table border> |
|
|
278 |
<tr><th><i>Symbol</i></th><th><i>Meaning</i></th></tr> |
|
|
279 |
<tr><td><b>·</b> <i>or</i> N</td><td>Normal beat</td></tr> |
|
|
280 |
<tr><td>L</td><td>Left bundle branch block beat</td></tr> |
|
|
281 |
<tr><td>R</td><td>Right bundle branch block beat</td></tr> |
|
|
282 |
<tr><td>A</td><td>Atrial premature beat</td></tr> |
|
|
283 |
<tr><td>a</td><td>Aberrated atrial premature beat</td></tr> |
|
|
284 |
<tr><td>J</td><td>Nodal (junctional) premature beat</td></tr> |
|
|
285 |
<tr><td>S</td><td>Supraventricular premature beat</td></tr> |
|
|
286 |
<tr><td>V</td><td>Premature ventricular contraction</td></tr> |
|
|
287 |
<tr><td>F</td><td>Fusion of ventricular and normal beat</td></tr> |
|
|
288 |
<tr><td>[</td><td>Start of ventricular flutter/fibrillation</td></tr> |
|
|
289 |
<tr><td>!</td><td>Ventricular flutter wave</td></tr> |
|
|
290 |
<tr><td>]</td><td>End of ventricular flutter/fibrillation</td></tr> |
|
|
291 |
<tr><td>e</td><td>Atrial escape beat</td></tr> |
|
|
292 |
<tr><td>j</td><td>Nodal (junctional) escape beat</td></tr> |
|
|
293 |
<tr><td>E</td><td>Ventricular escape beat</td></tr> |
|
|
294 |
<tr><td>/</td><td>Paced beat</td></tr> |
|
|
295 |
<tr><td>f</td><td>Fusion of paced and normal beat</td></tr> |
|
|
296 |
<tr><td>x</td><td>Non-conducted P-wave (blocked APB)</td></tr> |
|
|
297 |
<tr><td>Q</td><td>Unclassifiable beat</td></tr> |
|
|
298 |
<tr><td>|</td><td>Isolated QRS-like artifact</td></tr> |
|
|
299 |
<tr><td colspan=2 align=center>Rhythm annotations appear <i>below</i> the |
|
|
300 |
level used for beat annotations:</td></tr> |
|
|
301 |
<tr><td>(AB</td><td>Atrial bigeminy</td></tr> |
|
|
302 |
<tr><td>(AFIB</td><td>Atrial fibrillation</td></tr> |
|
|
303 |
<tr><td>(AFL</td><td>Atrial flutter</td></tr> |
|
|
304 |
<tr><td>(B</td><td>Ventricular bigeminy</td></tr> |
|
|
305 |
<tr><td>(BII</td><td>2° heart block</td></tr> |
|
|
306 |
<tr><td>(IVR</td><td>Idioventricular rhythm</td></tr> |
|
|
307 |
<tr><td>(N</td><td>Normal sinus rhythm</td></tr> |
|
|
308 |
<tr><td>(NOD</td><td>Nodal (A-V junctional) rhythm</td></tr> |
|
|
309 |
<tr><td>(P</td><td>Paced rhythm</td></tr> |
|
|
310 |
<tr><td>(PREX</td><td>Pre-excitation (WPW)</td></tr> |
|
|
311 |
<tr><td>(SBR</td><td>Sinus bradycardia</td></tr> |
|
|
312 |
<tr><td>(SVTA</td><td>Supraventricular tachyarrhythmia</td></tr> |
|
|
313 |
<tr><td>(T</td><td>Ventricular trigeminy</td></tr> |
|
|
314 |
<tr><td>(VFL</td><td>Ventricular flutter</td></tr> |
|
|
315 |
<tr><td>(VT</td><td>Ventricular tachycardia</td></tr> |
|
|
316 |
<tr><td colspan=2 align=center>Signal quality and comment annotations appear <i>above</i> |
|
|
317 |
the level used for beat annotations:</td></tr> |
|
|
318 |
<tr><td><i>qq</i></td> |
|
|
319 |
<td> |
|
|
320 |
Signal quality change: the first character (`c' or `n') indicates the quality |
|
|
321 |
of the upper signal (clean or noisy), and the second character indicates the |
|
|
322 |
quality of the lower signal</td></tr> |
|
|
323 |
<tr><td>U</td><td>Extreme noise or signal loss in both signals: ECG is unreadable</td></tr> |
|
|
324 |
<tr><td>M (<i>or</i> MISSB)</td><td>Missed beat</td></tr> |
|
|
325 |
<tr><td>P (<i>or</i> PSE)</td><td>Pause</td></tr> |
|
|
326 |
<tr><td>T (<i>or</i> TS)</td><td>Tape slippage</td></tr> |
|
|
327 |
</table> |
|
|
328 |
|
|
|
329 |
<HR> |
|
|
330 |
<P><ADDRESS> |
|
|
331 |
<I><A HREF="mailto:george@mit.edu">George B. Moody (<tt>george@mit.edu</tt>)</A></ADDRESS></I><BR> |
|
|
332 |
24 May 1997 |
|
|
333 |
<br> |
|
|
334 |
<i>Revised 24 June 2010</i> |
|
|
335 |
</body> |
|
|
336 |
</html> |