The Indian Institute of Science Fetal Heart Sound Database (IIScFHSDB) has 60 fetal phonocardiography (fPCG) recordings obtained from expecting mothers (age 18–27 years) with gestation weeks 30–40. The recordings were made at St. John’s Hospital, Bangalore, using an electronic stethoscope. Data were collected from the antenatal and labor wards by placing the stethoscope on the lower abdomen of each subject. Each fPCG recording was acquired with a gain of 500X, has an average duration of 8 minutes, and was sampled at 2 kHz.
Fetal monitoring during pregnancy and labor helps in identifying high-risk fetuses. The most commonly monitored parameter is the Fetal Heart Rate (FHR), from which clinically relevant features—such as baseline FHR, accelerations, and decelerations—can be derived. Various techniques exist for FHR determination, including fetal Doppler ultrasound, direct and indirect fetal electrocardiography (fECG), fetal magnetocardiography (fMCG), and fetal phonocardiography (fPCG). Devices based on fPCG are non-invasive, passive, simple, and user-friendly. However, extracting reliable FHR measurements from fPCG is challenging due to noise. This database provides fPCG recordings from a noisy clinical setting, enabling the development and evaluation of signal processing algorithms for fPCG denoising and FHR determination.
The fPCG recordings were obtained using an electronic stethoscope (SS30LA) connected to a data acquisition system (MP36, Biopac Systems Inc.) with a gain of 500X and a sampling frequency of 2 kHz. The gain of 500X indicates that the input signal is amplified 500 times before digitization. Recordings were made using Biopac Student Lab software and exported as .wav audio files for MATLAB analysis. Data were collected from pregnant women with gestation greater than 30 weeks and singleton pregnancies, without known maternal complications, at a location advised by a trained doctor.
A total of 60 fPCG recordings were collected from 60 subjects, with an average recording time of around 8 minutes. The data files are stored in .wav format and can be analyzed using MATLAB. No filtering was applied to the data except for the Biopac’s built-in notch filter to remove line frequency interference.
The demographic and clinical information of the subjects is shown in the table below:
Age | Gestation Week | Weight (kg) | Height (cm) |
---|---|---|---|
Mean ± SD: 25 ± 3.7 | Mean ± SD: 37 ± 2.0 | Mean ± SD: 68.3 ± 10.9 | Mean ± SD: 155 ± 7.3 |
Range: 18–37 | Range: 31–40 | Range: 49–103 | Range: 144–171 |
Each recording is labeled as subject_x. A .csv file is provided that contains information on each subject’s height, weight, gravida, age, clinical conditions (if present), and FHR values when available.
The fPCG dataset was developed to encourage the development and evaluation of algorithms for monitoring fetal cardiac activity, especially under noisy conditions. In prior work [2], this dataset was used to develop an FHR extraction algorithm that employs autocorrelation and validation steps to accurately measure FHR for varying signal qualities. An example MATLAB code (example_code.m
) is provided to demonstrate reading the audio file and applying a comb filter.
Version 1.0: Initial Release
The study was approved by the Institutional Ethics Committee, St. John’s Hospital, Bangalore (IEC code No. 07/2018). Data were collected after obtaining informed consent.
We would like to thank Dr. Shirley George and the staff at the Department of Obstetrics and Gynaecology, St. John’s Hospital, Bangalore, for their support in data acquisition.
The authors declare no conflict of interest.