[5f0e3e]: / README.md

Download this file

75 lines (67 with data), 8.0 kB

About Dataset

see the introduction kernel on how to access data and use annotation labels

CONTEXT

This dataset contains Lead II signal (with annotations) of 201 records collected from following 3 databases available on PhysioNet under open access:

  1. MIT-BIH Arrhythmia Database [mitdb]
  2. MIT-BIH Supraventricular Arrhythmia Database [svdb]
  3. St Petersburg INCART 12-lead Arrhythmia Database [incartdb]

NOTE

  1. All signals have been resampled to 128Hz and gain has been removed.
  2. Baseline wander has been removed using Median Filtering.
  3. Denoising was NOT used.
  4. Signal data and annotation labels have been saved in numpy (.npy) format.
  5. All Signals are nearly 30 mins long.

CONTENT

Data has been organised as follows:

parent directory db_npy contains 3 sub-directories each of which represent one database

mitdb_npy has 48 records

svdb_npy has 78 records

incartdb_npy has 75 records

Each of these database directory contains a 'RECORDS' file that lists the ecg records available in that database.

Each record has 3 files associated with it:

  1. rec_BEAT.npy: contains 'beat' annotations (R-peaks and its label) for the record.
    each record may have variable number of beats based on heart rate
    usually we shall be interested in beat annotation labels only. Each beat label represents one R-peak and hence one beat

  2. rec_NBEAT.npy: contains 'non-beat' annotations (Other than R-peaks) for the record

  3. rec_SIG_II.npy: contains the Lead 2 signal data of the record as a single numpy array

(* see the introduction kernel on how to access data and use annotation labels *)

Understanding Annotations:

There are two types of annotations: Beat and Non-Beat annotations.
Beat annotations are associated with each heart-beat. If you are working with heart-beat classifications then only Beat annotations shall be useful and Non-Beat annotations can be ignored.

Standard PhysioNet Annotations are described in db_npy/annotations.txt file. These are common across all databases.
This file has 3 columns
Column 1: Label
Column 2: Type of label [ b=beat annotation; n=non-beat annotation ]
Column 3: Description

There are 19 Beat annotations and 22 Non-Beat annotations. However, not all annotations may occur in data files. For example, the Label 'r' does not occur even once in any three of the database but yet its the part of standard PhysioNet labels. (might be in use in some other database). It's advised to do a full annotation count before working with data.

According to AAMI recommendation, each beat is classified into one of the 5 types [ N, V, S, F, Q ]. However, you are free to choose any classification strategy.

IMPORTANT

  1. mitdb's record '102' and '104' DO NOT have lead 2 signal available hence files '102_SIG_II.npy' and '104_SIG_II.npy' are not present. However, they have BEAT and NBEAT files present. Its advised not to use those two records

  2. mitdb's record '102', '104', '107' and '217' are paced records

  3. mitdb's record '207' is the only record with 'Flutter' waves that are not marked by beat-annotations (no R-peaks marked). However, they are marked by non-beat annotations.

Acknowledgements

**PhysioNet **[https://physionet.org/]
MLA Goldberger, A., et al. "PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220." (2000).
APA Goldberger, A., Amaral, L., Glass, L., Hausdorff, J., Ivanov, P. C., Mark, R., … & Stanley, H. E. (2000). PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220.
Chicago Goldberger, A., L. Amaral, L. Glass, J. Hausdorff, P. C. Ivanov, R. Mark, J. E. Mietus, G. B. Moody, C. K. Peng, and H. E. Stanley. "PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220." (2000).
Harvard Goldberger, A., Amaral, L., Glass, L., Hausdorff, J., Ivanov, P.C., Mark, R., Mietus, J.E., Moody, G.B., Peng, C.K. and Stanley, H.E., 2000. PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220.
Vancouver Goldberger A, Amaral L, Glass L, Hausdorff J, Ivanov PC, Mark R, Mietus JE, Moody GB, Peng CK, Stanley HE. PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220.

MIT-BIH Arrhythmia Database
Moody GB, Mark RG. The impact of the MIT-BIH Arrhythmia Database. IEEE Eng in Med and Biol 20(3):45-50 (May-June 2001). (PMID: 11446209) : Hosted at https://physionet.org/content/mitdb/1.0.0/

MIT-BIH Supraventricular Arrhythmia Database
Greenwald SD. Improved detection and classification of arrhythmias in noise-corrupted electrocardiograms using contextual information. Ph.D. thesis, Harvard-MIT Division of Health Sciences and Technology, 1990 : Hosted at https://physionet.org/content/svdb/1.0.0/

St Petersburg INCART 12-lead Arrhythmia Database
Hosted at https://physionet.org/content/incartdb/1.0.0/

DOI
https://doi.org/10.13026/C2F305
https://doi.org/10.13026/C2V30W
https://doi.org/10.13026/C2V88N