[66cc6c]: / README.md

Download this file

18 lines (12 with data), 2.1 kB

About Dataset

Introduction

This is the keystroke dataset for the study titled 'High-accuracy detection of early Parkinson's Disease using multiple characteristics of finger movement while typing'. This research report is currently under review for publication by PLOS ONE.

The dataset contains keystroke logs collected from over 200 subjects, with and without Parkinson's Disease (PD), as they typed normally on their own computer (without any supervision) over a period of weeks or months (having initially installed a custom keystroke recording app, Tappy). This dataset has been collected and analyzed in order to indicate that the routine interaction with computer keyboards can be used to detect changes in the characteristics of finger movement in the early stages of PD.

Data

The participants, from the U.S., Canada, UK and Australia, had visited the project website and agreed to participate in the study. The research was approved by the Human Research Ethics Committee at Charles Sturt University, Australia, protocol number H17013.

Each data file collected includes the timing information from typing activity as the participants used their various Windows applications (such as email, word processing, web searches and the like). The keystroke acquisition software ('Tappy') provided timing accuracy of key press and release timestamps to within several milliseconds.

The data files comprise two Zip archives, one with the participant detail files and the other with the keystroke data files for each user.

Acknowledgements

This dataset is from the research article "High-accuracy detection of early Parkinson's Disease using multiple characteristics of finger movement while typing" by Warwick R. Adams. Read the article here: http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0188226#sec008

Inspiration

While this is a difficult dataset to work with, there is a rich trove of information. It is a great set to practice preprocessing, attempt to replicate the results of the article, or do your own analysis of keystroke data.