Lucie Maršánová, Andrea Nemcova, Radovan Smisek, Lukas Smital, Martin Vitek
Published: Jan. 19, 2021. Version: 1.0.0
Maršánová, L., Nemcova, A., Smisek, R., Smital, L., & Vitek, M. (2021). Brno University of Technology ECG Signal Database with Annotations of P Wave (BUT PDB) (version 1.0.0). PhysioNet. https://doi.org/10.13026/hwvj-5b53.
Goldberger, A., Amaral, L., Glass, L., Hausdorff, J., Ivanov, P. C., Mark, R., ... & Stanley, H. E. (2000). PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220.
Brno University of Technology ECG Signal Database with Annotations of P Wave (BUT PDB) is an ECG signal database with marked peaks of P waves created by the cardiology team at the Department of Biomedical Engineering, Brno University of Technology. The database consists of 50 2-minute 2-lead ECG signal records with various types of pathology. The ECGs were selected from three existing databases of ECG signal - the MIT-BIH Arrhythmia Database, the MIT-BIH Supraventricular Arrhythmia Database, and the Long Term AF Database. The P waves positions were manually annotated by two ECG experts in all 50 records. Each record contains also annotation of positions and types of QRS complexes (from original database) and dominant diagnosis (pathology) present in record. This database is created for the development, evaluation and objective comparison of P wave detection algorithms.
Accurate detection of P waves and subsequent cardiac pathological events is an important part of electrocardiogram (ECG) evaluation. Currently, methods for P wave detection in physiological conditions are well-described and efficient. However, results of testing on pathological signals are insufficient. A key reason for this is the limited availability of public ECG databases with annotated P waves and pathologies. There are 3 publicly available databases with manually annotated P waves - the QT database [1-3] (contains only physiological signals), MIT-BIH Arrhythmia Database P-Wave Annotations [4-6] (contains only few types of pathologies) and the Lobachevsky University Electrocardiography Database [7]. Here we introduce a new database with annotated P waves in signals with 23 different types of pathology.
The ECGs were selected from three existing databases of ECG signal - the MIT-BIH Arrhythmia Database [8], the MIT-BIH Supraventricular Arrhythmia Database [9,10], and the Long Term AF Database [11, 12]. Two-minute sections of these records were selected by ECG experts, who sought to identify "interesting" pathological signals. The database therefore has a higher incidence of pathologies than typically expected.
The P waves positions were manually annotated by two ECG experts for each beat in all 50 records. The first expert provided manual annotations, and the second checked them. Unclear parts of records were checked by both experts until a consensus was reached. These tasks were conducted manually, without the use of automated software. To facilitate the work of the ECG experts, a free software tool, SignalPlant [13], was used for manual annotation of P waves.
The database consists of 50 2-minute 2-lead ECG signals with various types of pathology with annotated P waves, selected from 3 existing databases of ECG signals [8, 9, 11]. The P waves were manually annotated by two ECG experts for each beat in all 50 records. Each record also contains annotation of positions and types of QRS complexes (from original databases) and dominant diagnosis (pathology).
In BUT PDB, there are 5437 P waves, 7638 QRS complexes of which 2201 are without P wave. In this database, 23 different types of pathology are present. Types of pathologies with their abbreviations used in database are listed in Table below. The exact types of pathologies in each signal are described in the "README".
All data are provided in the WaveForm Database (WFDB) format. The names (IDs) of the recordings are numbers from 01 to 50. The ECG signals are in files: *.dat, *.hea, and the annotations of P waves are in files with names *.pwave, the positions of QRS complexes and their types and sampling frequency of signal are in files with name *.qrs.
Abb. | Type of pathology | Number of cases | Name of signals with the pathology |
---|---|---|---|
A | Atrial premature beat | 14 | 4,5,9,16,17,18,26,28,35,38,39,40,41,43 |
AFIB | Atrial fibrillation | 9 | 7,49,50,8,44,45,46,47,48 |
AFL | Atrial flutter | 2 | 8,38 |
B | Ventricular bigeminy | 3 | 2,14,27 |
BI | Atrioventricular block 1st degree | 1 | 22 |
BII | Atrioventricular block 2nd degree | 2 | 1,13 |
BIII | Atrioventricular block 3rd degree | 1 | 3 |
E | Ventricular escape beat | 1 | 9 |
F | Fusion of ventricular and normal beat | 3 | 30,31,32 |
IVR | Idioventricular rhythm | 1 | 30 |
J | Nodal beat | 3 | 6,7,15 |
L | Left bundle branch block beat | 4 | 21,22,36,41 |
NA | Sinus arrhythmia | 1 | 24 |
NOD | Nodal rhythm | 3 | 6,7,15 |
P | Paced rhythm | 2 | 19,3 |
PREX | Pre-excitation | 1 | 12 |
R | Right bundle branch block beat | 4 | 1,13,26,34 |
SVTA | Supraventricular tachyarrhythmia | 3 | 9,11,43 |
T | Ventricular trigeminy | 2 | 27,29 |
V | Ventricular premature beat | 20 | 5,10,14,19,20,21,25,27,28,29,30,31,32,33,35,36,39,40,41,42 |
VFL | Ventricular flutter | 1 | 33 |
VP | Ventricular pair | 1 | 25 |
a | Aberrated atrial premature beat | 1 | 23 |
There are few publicly available databases of ECG signals that are accompanied by well-curated annotations of P waves. Our goal in creating the dataset was to help address this issue. The database may be valuable for the development, evaluation and objective comparison of P wave detection algorithms.
This work was funded by the United States Office of Naval Research (ONR) Global, award number N62909-19-1-2006.
The authors declare that there are no known conflicts of interest.