Skip to main content

Language Independent Gender Identification Through Keystroke Analysis.

Tsimperidis, I., Katos, V. and Clarke, N., 2015. Language Independent Gender Identification Through Keystroke Analysis. Information and Computer Security, 23 (3), 286 -301.

Full text available as:

s1-ln1758457095844769-1939656818Hwf-815345003IdV167036923417584570PDF_HI0001.pdf - Accepted Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.


DOI: 10.1108/ICS-05-2014-0032


Purpose – In this work we investigate the feasibility of iden tifying the gender of an author by measuring the keystroke duration when typing a message. Design/methodology/approach – Three classifiers were constructed and tested. We empirically evaluated the effectiveness of the classifiers by using empirical data. We used primary data as well as a publicly available dataset containing keystrokes from a diff erent language to validate the language independence assumption. Findings – The results of this work indicate that it is possible to identify the gender of an author by analyzing keystroke durations with a probability of success in the region of 70%. Research limitations/implications – The proposed approach was validated with a limited number of participants and languages, yet the statistical tests show the significance of the results. However, t his approach will be further tested with other languages. Practical implications – Having the ability to identify the gender of an aut hor of a certain piece of text has value in digital forensics, as the proposed method will be a source of circumstantial evidence for “putting fingers on keyboard” and for arbitrating cases where the true origin of a message needs to be identified. Social implications – If the proposed method is included as part of a text composing system (such as email, and instant messaging applications) it could increase trust toward the applications that use it and may also work as a deterrent for crimes involving forgery. Originality/value – The proposed approach combines and adapts techniques from the domains of biometric authentication and data classification.

Item Type:Article
Uncontrolled Keywords:keystroke dynamics; keystroke duration; gender recognition; user classification
Group:Faculty of Science & Technology
ID Code:24545
Deposited By: Symplectic RT2
Deposited On:26 Aug 2016 10:15
Last Modified:14 Mar 2022 13:57


Downloads per month over past year

More statistics for this item...
Repository Staff Only -