GB2552035A - ECG Authentication method and apparatus - Google Patents

ECG Authentication method and apparatus Download PDF

Info

Publication number
GB2552035A
GB2552035A GB1611963.8A GB201611963A GB2552035A GB 2552035 A GB2552035 A GB 2552035A GB 201611963 A GB201611963 A GB 201611963A GB 2552035 A GB2552035 A GB 2552035A
Authority
GB
United Kingdom
Prior art keywords
input
feature vector
feature
user
filtering
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
GB1611963.8A
Other versions
GB201611963D0 (en
Inventor
Condon Adrian
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
B-Secur Ltd
Original Assignee
B-Secur Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by B-Secur Ltd filed Critical B-Secur Ltd
Priority to GB1611963.8A priority Critical patent/GB2552035A/en
Publication of GB201611963D0 publication Critical patent/GB201611963D0/en
Priority to PCT/GB2017/052023 priority patent/WO2018007835A1/en
Publication of GB2552035A publication Critical patent/GB2552035A/en
Withdrawn legal-status Critical Current

Links

Classifications

    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/117Identification of persons
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2218/00Aspects of pattern recognition specially adapted for signal processing
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/24Detecting, measuring or recording bioelectric or biomagnetic signals of the body or parts thereof
    • A61B5/316Modalities, i.e. specific diagnostic methods
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/24Detecting, measuring or recording bioelectric or biomagnetic signals of the body or parts thereof
    • A61B5/316Modalities, i.e. specific diagnostic methods
    • A61B5/318Heart-related electrical modalities, e.g. electrocardiography [ECG]
    • A61B5/346Analysis of electrocardiograms
    • A61B5/349Detecting specific parameters of the electrocardiograph cycle
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/72Signal processing specially adapted for physiological signals or for diagnostic purposes
    • A61B5/7203Signal processing specially adapted for physiological signals or for diagnostic purposes for noise prevention, reduction or removal
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/72Signal processing specially adapted for physiological signals or for diagnostic purposes
    • A61B5/7235Details of waveform analysis
    • A61B5/7246Details of waveform analysis using correlation, e.g. template matching or determination of similarity
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/72Signal processing specially adapted for physiological signals or for diagnostic purposes
    • A61B5/7235Details of waveform analysis
    • A61B5/725Details of waveform analysis using specific filters therefor, e.g. Kalman or adaptive filters
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/72Signal processing specially adapted for physiological signals or for diagnostic purposes
    • A61B5/7235Details of waveform analysis
    • A61B5/7264Classification of physiological signals or data, e.g. using neural networks, statistical classifiers, expert systems or fuzzy systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B5/00Measuring for diagnostic purposes; Identification of persons
    • A61B5/72Signal processing specially adapted for physiological signals or for diagnostic purposes
    • A61B5/7225Details of analog processing, e.g. isolation amplifier, gain or sensitivity adjustment, filtering, baseline or drift compensation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/30Authentication, i.e. establishing the identity or authorisation of security principals
    • G06F21/31User authentication
    • G06F21/32User authentication using biometric data, e.g. fingerprints, iris scans or voiceprints
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/15Biometric patterns based on physiological signals, e.g. heartbeat, blood flow
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/20ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Public Health (AREA)
  • Molecular Biology (AREA)
  • Veterinary Medicine (AREA)
  • General Health & Medical Sciences (AREA)
  • Animal Behavior & Ethology (AREA)
  • Surgery (AREA)
  • Medical Informatics (AREA)
  • Heart & Thoracic Surgery (AREA)
  • Biophysics (AREA)
  • Pathology (AREA)
  • Biomedical Technology (AREA)
  • Signal Processing (AREA)
  • Artificial Intelligence (AREA)
  • Cardiology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Psychiatry (AREA)
  • Physiology (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Fuzzy Systems (AREA)
  • Measurement And Recording Of Electrical Phenomena And Electrical Characteristics Of The Living Body (AREA)
  • Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)

Abstract

A method of performing electrocardiogram recognition comprising: receiving input from a user; filtering 408 the input; performing feature extraction 418 on the input by autocorrelation 416 to provide a first feature set; performing spectral analysis (e.g. using Pan-Tompkins 412) of the input to provide a second feature set in a frequency domain; combining the first and second feature sets to provide a combined feature vector 420; performing dimensionality reduction 422 on the combined feature vector to give a reduced feature vector; performing classification of the reduced feature vector to give a recognition decision, and/or storing the reduced feature vector for future recognition. By performing dimensionality reduction on the combined feature vector, the method is not constrained to any particular feature set nor vector size in either the time domain or in the frequency domain, but is able to optimally extract useful distinguishing features in each domain while resulting in a feature vector of a manageable length. Preferably the spectral analysis is performed on a representative PQRST curve, or PQRST curves that are time-shifted and superimposed to provide an average PQRST curve.

Description

(71) Applicant(s):
B-Secur Limited
Anderson House, 409 Holywood Road, BELFAST, BT4 2GU, United Kingdom (72) Inventor(s):
Adrian Condon (74) Agent and/or Address for Service:
Maucher Jenkins
Caxton Street, LONDON, SW1H 0RJ,
United Kingdom (51) INT CL:
G06K 9/00 (2006.01) A61B 5/0452 (2006.01)
G06F 21/32 (2013.01) (56) Documents Cited:
EP 3056138 A2 WO 2014/170897 A1
CN 105468951 A US 20140188770 A1
Wang, Y., Agrafioti, F., Hatzinakos, D., & Plataniotis, K. N. (2007). Analysis of human electrocardiogram for biometric recognition. EURASIP journal on Advances in Signal Processing, 2008(1), 148658 (58) Field of Search:
INT CLA61B, G06F, G06K
Other: WPI, EPODOC, TXTE, INSPEC (54) Title of the Invention: ECG Authentication method and apparatus
Abstract Title: Using ECG data in recognition and verification applications (57) A method of performing electrocardiogram recognition comprising: receiving input from a user; filtering 408 the input; performing feature extraction 418 on the input by autocorrelation 416 to provide a first feature set; performing spectral analysis (e.g. using Pan-Tompkins 412) of the input to provide a second feature set in a frequency domain; combining the first and second feature sets to provide a combined feature vector 420; performing dimensionality reduction 422 on the combined feature vector to give a reduced feature vector; performing classification of the reduced feature vector to give a recognition decision, and/or storing the reduced feature vector for future recognition. By performing dimensionality reduction on the combined feature vector, the method is not constrained to any particular feature set nor vector size in either the time domain or in the frequency domain, but is able to optimally extract useful distinguishing features in each domain while resulting in a feature vector of a manageable length. Preferably the spectral analysis is performed on a representative PQRST curve, or PQRST curves that are time-shifted and superimposed to provide an average PQRST curve.
Figure GB2552035A_D0001
At least one drawing originally filed was informal and the print reproduced here is taken from a later filed formal copy.
1/5
Fig. 1
CD
J?
Figure GB2552035A_D0002
Figure GB2552035A_D0003
N, subjects fo subjects
Figure GB2552035A_D0004
ID
C, dimensions
C, dimensions
C2 dimension (Cj< <C,)
Fig.2
2/5
10 17
Figure GB2552035A_D0005
227
Fig. 3
3/5
Fig.4
Figure GB2552035A_D0006
Filtering
408 _
Butterworth's filter 406
Normalization
404
JL ___410
10 17
Feature extraction
418
Dimensionality reduction
Classification --<!
i r
Autocorrelation 416
420
Pan-Tompkins 412
V
Spectral
Features 414
1
Figure GB2552035A_D0007
Figure GB2552035A_D0008
10 17
Figure GB2552035A_D0009
5/5
Fig. 6
17
O^imensionality O reduction—< model training
612
Classifier training
622
Figure GB2552035A_D0010
ECG AUTHENTICATION METHOD AND APPARATUS
Field of the Invention
This invention relates to biometric security, more particularly electrocardiogram authentication.
Background
As more and more electronic devices are used around the world, the more secure they have to be. User authentication is a process that verifies that a person in front of the device has rights (is authorized) to use the device's resources. A standard approach to user authorization is password verification (and its derivatives, including two-factor authorization, one-use tokens, etc.). A good password should be long enough to provide general security from brute-force attacks and unique so if it is compromised (i.e. known to third parties) it cannot be used in other systems.
Unfortunately remembering a great number of long passwords can be hard for most people. Other ways of user authorization are required. Promising approaches are biometrics-based authentication, which involve novel ways of verifying a user's identity using his or her natural characteristics. Most wide-spread methods include using information from fingerprints, iris or voice, as these are regarded unique to each user. Bioimpedance is also proposed for biometrics-based authentication, as described in W02001/20538, whereby different impedance measurements are made between different points on a user's hand or body at different frequencies.
Heartbeat characteristics appear to be unique as well. Research effort is put into developing a system that can identify users using their heartbeat characteristics.
There are many ways to analyse one's heartbeat, but the most practical approach is analysing the patterns gathered by Electrocardiograph, which records a heart's electric potential changes in time. A longer recording of heartbeat activity is called an electrocardiogram or ECG and is recorded using one or more pairs of electrodes. Each pair measures the change of electrical potential between the points of contact of electrodes. That change is strongly correlated with heart and muscle activity of the subject as the heart beat activity of the human body is stimulated through electrical impulses.
Fig. 1 shows an electrocardiogram signal depicting the electrical potential of a heart over time. The basic elements of a single heart beat are: (i) a P wave generated when the right and left atria of the heart are depolarized; a QRS complex reflecting the depolarization of the right and left ventricles; and a T wave corresponding to the ventricular repolarization. Existing methodologies attempt to characterize an individual by these different elements and their respective sizes, shapes and positions. WO 2008/107684 of Intellisense is an example of such an approach.
ECG Based Recognition Using Second Order Statistics by F. Agrafioti describes an autocorrelation based feature extraction approach illustrated in Fig. 2. The particular approach describes involves four stages: 1) preprocessing, in which noise and artefacts are removed, 2) optional template matching, in which a large-class number problem is transformed to a small-class number problem to reduce the possible number of classes and improve the efficiency of the system by pruning the search space, 3) feature extraction in which personalized signatures are created; and 4) classification, where every individual is identified. The feature extraction uses autocorrelation of ECG segments rather than fiducial detection. It is also described that the particular autocorrelation method has four stages: (i) windowing, in which the preprocessed ECG signal is subjected to segmentation into non overlapping windows; (ii) normalized autocorrelation computation for every window; (iii) dimensionality reduction with the Discrete Cosine Transform (DCT) or Linear Discriminant Analysis (LDA); and (iv) classification based on features obtained from the DCT or LDA. Template Matching with the correlation coefficient is performed on the autocorrelated ECG signals, before dimensionality reduction.
For a practical process, the system should be easy to use and be robust. In real life scenarios, the user cannot be expected to lie down and attach 12 electrodes to his or her body to provide a oneminute long ECG recording, just to authorize himself at an ATM (for example).
There is a need for an improved method of robust user authorization using short ECG samples from just two electrodes (placed on a subject's fingers or another relevant body part).
Summary of the Invention
In accordance with the present invention, a method of performing electrocardiogram recognition is provided comprising: receiving input from a user; filtering the input; performing feature extraction on the input by autocorrelation to provide a first feature set; performing spectral analysis of the input to provide a second feature set in a frequency domain; combining the first and second feature sets to provide a combined feature vector; performing dimensionality reduction on the combined feature vector to give a reduced feature vector; performing classification of the reduced feature vector to give a recognition decision, and/or storing the reduced feature vector for future recognition.
In accordance with another aspect of the invention, a method of model training is provided comprising: receiving a plurality of inputs; creating samples from the inputs; filtering and normalizing the samples; performing feature extraction on the samples; performing dimensionality reduction model training; reducing the length of a plurality of feature vectors; performing classifier training; transforming the feature vectors into a score; finding a threshold for the score; and combining and storing the model.
Brief Description of the Drawings
Fig. 1 is a diagram of an ECG sample.
Fig. 2 is a flow diagram of autocorrelation based feature extraction.
Fig. 3 is a circuit diagram of an embodiment of the invention, showing a user device, a terminal and a server.
Fig. 4 is a flow diagram showing the operation of the devices of Fig. 3 in authentication mode starting with a digitally sampled input after hardware filtering.
Fig. 5 is a flow diagram showing the operation of the devices of Fig. 3 in enrolment mode.
Fig. 6 is a flow diagram showing the operation of the devices of Fig. 3 in model training mode.
Glossary of terms
User, subject - a person using the system. Depending on the context it can be someone who is enrolled, owner of a model, trying to authorize or whose information was used during other user's enrolment process
ECG recording - a longer ECG recording (of about 20 seconds to one minute) of a single user, representing his or her heart activity.
ECG sample - a short part of an ECG recording. This part is extracted by cutting the ECG recording. In tests it is assumed that one ECG sample means one authorization attempt.
Verification/Authorization - process of checking if the user is authorized to use a device. It is performed using a short ECG sample, extracting its features and comparing it with an earlier prepared trained model.
Enrolment - process of preparing a user's model using a longer ECG recording. The recording is cut into smaller ECG samples and those samples are used to generate a user's model.
ECG features - numeric values representing an ECG sample's features. Those features are calculated using various methods and one feature can be represented either as a single number (i.e. mean value of sample) or a set of values (i.e. sample's autocorrelation). Multiple features can be combined to create a feature vector.
Feature vector - ordered structure combining multiple features of one ECG sample.
User model - a mathematical representation of multiple users' feature vectors that allow comparison of a provided feature vector (extracted from an ECG sample) against the user's original feature vectors. This process is called a binary classification, as we try to distinguish whether the provided feature vector belongs to the user (positive classification) or not (negative classification). Depending on the variant of the classifying algorithm the representations can vary. From simple storing of all users' feature vectors, through storing all feature vectors in the database, to storing set of support vectors that separate a user's feature vector from the rest of the population.
Detailed Description
In high level terms, an ECG device can perform an authentication check in two ways: (i) by comparing ECG data acquired with an enrolment template stored on a server, or (ii) checking against a template stored in encrypted form on the end user device, for example a smart card. The following description applies to both scenarios.
Fig. 3 illustrates suitable hardware for capturing and processing a user's ECG sample for (a) enrolment and (b) authentication. Fig. 3 shows a user device 100 having: sensors 105, 106; a power supply 218; an amplifier 220; hardware filters 221; a memory module 224; a microprocessor 226 and an optional display 214. Fig. 3 also shows a terminal 200, a memory 230, a microprocessor 232 and an optional display 228. Fig. 3 also shows a server 300, with which the user device 100 and terminal 200 optionally communicate 236, 238 with via a radio, antenna, base station and network (not shown). The user device 100 and the terminal 200 also optionally communicate 234 with each other.
The user device 100 has a power source 227 from the power supply 218 to the amplifier 220, memory 224 and microprocessor 226. The sensors 105, 106 are connected to the amplifier 220 and the filters 221. The amplifier 220, filters 221 and memory 224 are connected to the microprocessor 226. The sensors 105, 106 can be used to collect an electrocardiograph (ECG) signal and are positioned in a manner suitable for the user to put two thumbs or a finger from both hands on them simultaneously.
The filters 221 preferably comprise, in sequence, a high pass filter of about 0.5Hz cut-off and a low pass filter of about 180Hz cut-off, plus (optionally) a notch filter at 50Hz for filtering any noise from mains supply (60Hz for USA).
The processes of enrolment and authentication are now described. Authentication is described first, on the assumption that a user has already enrolled and the system has a sample of that user's feature vectors.
When a user wants to authorize in the system he/she must provide a short ECG sample. This sample is used at least once and a hard decision - accept or reject - is given.
Verification times can be 1, 2, 3, 5 or 10 seconds depending on the use case and signal quality. In this description we assume that this time is less than 2 seconds, this being the most likely time that will be used in an end product.
Because the window may be very short, a user can be allowed multiple such windows to authorize. If a user's sample is rejected he/she is given at least two more chances. From the user perspective, the verification time could be longer (anywhere from minimum 2 seconds, to any reasonable time).
The description below shows the lower level of the authorization process that analyses one ECG sample and gives an answer if that sample belongs to the user or an imposter.
When the user touches the electrodes, the hardware starts to record the signal. After it collects a 2second sample 402, that has been subject to preprocessing (amplifying in amplifier 220 and filtering in filter 221), that sample 402 is put through and analogue-to-digital converter (222) at the input of the processor 226 and is sent along a processing pipeline that transforms it into a binary response: true if the sample 402 matches the user's model, or false if an imposter is trying to authenticate. The processing pipeline has a number of steps to perform the transformations. These are illustrated in Fig. 4. The authentication process may run using the memory and microprocessor of just the user device 100 or the terminal 200, or a combination of both when they are in communication 234 with each other. The user device 100 and terminal 200 may communicated 236, 238 with a server 300 the results of the authentication.
At a high level, the sample 402 is first normalized 404 and filtered 408 and is then transformed 418 into a template representing the sample features. Then, based on some prior known statistics, insignificant features are removed in a process called dimensionality reduction 422. Finally, the reduced feature vector is tested using a binary classifier (support vector machine) 424 that was prepared during the enrolment process and can analyse the user's feature vector to give an answer - acceptance or rejection. The support vector machine 424 receives the owner feature vector 428 and has the population feature vector 430 already stored within it.
The algorithm assumes that it receives an ECG sample to verify - i.e. to check if it belongs to the model's owner. If the received input 402 has no discernible ECG signal, an error is displayed on a display 214, 228 of the user device 100 or terminal 200. The user may be asked to try again by means of a message on a display 214, 228.
The input signal 402 is first cut into samples. Each sample undergoes simple filtering steps to reduce noise and normalize the output. First, normalization 404 is performed, which means the signal is standardized (scaled) so that all values are between -1 and +1 and the mean is equal to zero. This processed is performed in two steps: first (eq. 1) the mean of the samples is calculated and all values are shifted by subtracting the mean, secondly (eq. 2) the resulting vector is divided by its maximum value.
For each signal ^KLn,x=(xi,x2,...,afc) we obtain its normalized version ye<»,y=(yi,y2,...,yn) , (1) where:
yi=xi—mean(x) yt.=yi/max(y) (2)
The normalized signal is then processed by a Butterworth's bandpass filter 406 of order 4 in the 0.5Hz-40Hz band.
Depending on what features will be extracted, the range and/or shape of the filters may vary. Good results are also achieved if the band is extended from a 0.5Hz high pass software filter (e.g. a 4th order Butterworth filter) to a 85Hz low-pass software filter - e.g. an HR Butterworth filter of 10th order. This provides a flat baseline with a burst of noise at each heartbeat. Other filters achieving this purpose can be used.
Note that the software filter has a narrower band than the hardware filter 221. Both are used and each complements the other. The hardware filter limits the bandwidth and reduces the amount of processing required by the processor. The software filter filters out noise that may have bypassed the hardware filter through the groundplane or even through the user's own fingers, and it further limits the bandwidth to the frequencies of particular interest.
The output of this step is a normalized signal (i.e. values between -1 and 1, with sample's mean equal to 0). From this sample all features are calculated.
The next phase is feature extraction 418. In this phase the filtered and normalized samples are analysed and features are extracted 418. Two types of features are used- temporal 416 (based on time domain) and spectral 412 (based on statistical features of different sub-bands). Those features are calculated independently and later combined 420 to produce one output feature vector, i.e. an ordered list of numbers. By using two (or more) features we calculate one vector for each (one vector with AC features, and one vector with Spectral features 414). Then those vectors are concatenated 420 into one, longer feature vector. The algorithm does not have to know which of the values in the feature vector correlate to which feature, but the ordering should be consistent between the enrolment and authentication phases.
The output of this step is long string of numbers - a feature vector. For each ECG sample this vector should have the same length.
The calculations of the two vectors are now described in greater detail.
The first feature calculated is normalized autocorrelation 416. This is done by performing a cross correlation of the signal x with maximum time lag M set to a number from 0.1 - 0.4 seconds (eq. 3). I.e. the time lag is fixed at a set value within this range. The signal is normalized by dividing it with the cross correlation value for time lag equal to 0 (eq. 4). Because cross correlation is a symmetrical function, we discard the negative part and the first dimension of the positive part (result for time lag equal to 0). The autocorrelation descriptor of signal x is defined as (x)=y, where y is defined as:
Τ=Χ£ΰΤΓ(χ,ΡΓ) (3) r= r/r0 (4) y=(n :i>0) (5)
The spectral approach analyses different sub bands of the ECG sample. Because the spectral features are not time shift invariant, the input signal should first be normalized in time.
As the building blocks of each ECG recordings are heartbeats, the sample is processed to extract positions and lengths of the heartbeats. A Pan-Tompkins algorithm 412 is used for that purpose. The Pan-Tompkins algorithm 412 recognizes Q.RS components of each heartbeat based upon digital analysis of slope, amplitude and width of each pulse.
When all of the beats are found, spectral features 414 are extracted from them (e.g. peaks in the frequency domain are identified) and a feature vector is created. (A representative PQ.RST curve can be selected, e.g. from the centre of the set of beats found, or a set of PQ.RS curves can be timeshifted and superimposed on one another to provide an average PQ.RS curve.)
The cycle's signal is filtered multiple times in different sub bands (sub bands are extracted using a narrow Butterworth's filter 406). From each sub band we calculate: mean of power, standard deviation of power, maximum amplitude, amplitude's deviation, power's kurtosis, power's skewness. Additionally we append to the feature statistical features of the whole ECG cycle: maximum value, standard deviation, kurtosis and skewness. We analyse multiple sub bands in the frequency range of interest. E.g. the range of interest may be set at 0.5Hz to 40Hz or 85Hz and divided into k bands of about 5Hz - 8Hz. About 6 to 10 bands are preferred. The bands may be narrower in the range 8Hz - 35Hz than at the extremes of the frequency range of interest. By way of example, in the range 0.5Hz to 50Hz, the following bands are suitable:
0.5 - 8 Hz; 8 - 13 Hz; 13 - 18 Hz; 18 - 25 Hz; 25 - 30 Hz; 30 - 35 Hz; 35 - 50 Hz.
We denote the signal of filtered sub band k to be si.
The feature vectors can be very long and they typically carry a lot of excess information. They are preferably subjected to a process of dimensionality reduction 422. There are many dimensionality reduction algorithms 422 and they can be classified into multiple groups. In general the dimensionality reduction algorithm knows how to find feature vectors in a collection of excessive data. The instructions on which parts of the vector to remove, which to transform, and which ones should be combined together is generated during the model training phase, as we have information through whole population. More on that process is described in the section on model preparation in relation to Fig. 6.
When we reduce the number of features, we are performing a projection (from higher dimensional space to lower one). This way we take a long feature vector, project it onto a smaller feature space, and get a shorter, more compressed feature vector. The only requirement of this step is that it has to produce vectors of constant length, similar to the feature extraction step.
The process of user authentication can be seen as a binary classification problem. We have two classes of samples (user's samples and other people's samples). The classifier 424 used in our approach relates to multiple ways of determining if a given feature vector resembles the owner's feature vectors. One of those ways is direct comparison of those vectors (using different vector metrics) or usage of statistical tools that separate the user's vectors from all the rest.
In the latter approach a mathematical model is generated and parameters defining the model are stored. The parameters describe what calculations should be performed to receive a binary answer - authorize the sample or reject the sample. Depending on the chosen approach and its implementation different classifier produce different outcomes.
The preferred model is a support vector machine (SVM) sometimes referred as a support vector network), which is a supervised learning model in which examples of a feature vector are classified into the category match subject or the category mismatch (i.e. more similar to general population). It is a non-probabilistic binary-linear classifier. The SVM model is a representation of input feature vectors as points in space, mapped so that the examples of the separate categories are divided by a clear gap that is as wide as possible. New examples are mapped into that same space and are identified as belonging to a category based on which side of the gap they fall on. The space can be multi-dimensional (having as many dimensions as the feature vectors). Feature vectors representing the two categories are stored and (preferably) the threshold, line or multi-dimensional surface defining their separation is also stored.
Unsupervised learning can be used to find natural clustering of feature vectors of many users into groups. When a population of users has been classified into a satisfactory number of groups (e.g. 4 to 20 clusters), these groups can be represented by representative feature vectors, and a newly input feature vector can be assigned to the subject (a match) or one of the groups (a mismatch). Such a clustering algorithm is called support vector clustering. In the embodiment, the feature vectors for each of the respective categories are stored.
Some models (like vector comparison) return the distance between the provided sample and the owner's feature vectors stored in the model. In that case additional number - a threshold - is calculated in that model. That threshold is the decision boundary that tells if the tested sample is close enough to the model saved sample. If the distance is smaller than the threshold this means the sample belongs to the owner.
Probabilistic classifiers, on the other hand, return a probability that given sample belongs to the model's owner. This number is a floating point value from 0 to 1. 1 means that it's certain that provided sample belongs to the owner, 0 means that there is no resemblance between the sample and the owner's model. Such classifier use thresholds as well, as it is hard to get a probability score of 1 each time the owner provides a sample. The threshold works the same way is in the similarity approach with the only change that scores above thresholds are accepted, and those below are rejected.
In order to first store a model for an owner, the owner must first undergo an enrolment process. This is illustrated in Fig. 5. During this process, the user provides a sample 502 of ECG so the model can be built for the user and consequently various patterns can be found to distinguish him or her from rest of the population.
Enrolment can be performed with the same device/hardware as authentication. During enrolment, a user is presented with instructions to provide an ECG recording which may be longer than the verification time (e.g. 20 or 30 seconds or up to 1 minute). A message is displayed on a display 214, 228 instructing the user to provide this recording. The message may simply instruct the user to place two fingers (or thumbs) of opposite hands on the sensors and hold them there while a graphical timer counts down. If necessary, the message may further specify other conditions (e.g. to control the environment to reduce noise and reduce variance between beats).
The device performs ECG recording over the enrolment period. If there are signal quality issues, the user may be instructed, by means of a message displayed on a display 214, 228, to repeat the process. The repeat message may instruct the user to make additional preparations (e.g. to first wipe the sensors with a clean dry cloth and/or to take three deep breaths).
An acquired ECG signal is processed (contemporaneously or at a later time) by the filtering 408, feature extraction 418 and dimensionality reduction 422 elements of Fig 3, but not the classification element 424. The result is a feature vector 510 for the newly enrolled user. This is stored for later use in the classification step of Fig. 4. Optionally, the feature vector may be put through a classifier model 504 (described below) that will identify an appropriate classifier threshold 508 for the particular feature vector.
To explain how an enrolment recording is transformed into a user's ECG representation, the process called model training will be described with reference to Fig. 6. The same model is used each time a user tries to authenticate or another user tries to enrol.
The process of model training includes the filtering 408 and feature extraction 418 elements of Fig. 3 but it uses a larger dataset 502, 602, 604 and is correspondingly more complex. Whereas during authentication we only use information gathered in the model (and processing parameters) the process of calculating those is much more complex and requires ECG samples from different subjects, as is now described.
As described above, the system analyses each subject's heartbeat in samples 606. To create samples 606 (e.g. 2s long) from the enrolment recording it is necessary to cut it into smaller parts. Each recording (subject's and the background's) is cut (blindly) into 2-second samples 606 and those samples 606 are used in the training process. The length of those samples 606 is dependent on the minimal verification time.
The output of this phase is a collection of pairs: (subject id, ECG sample), where the subject's id denotes who is the owner of the sample (it is later used in the dimensionality reduction 612 and classifier training 622 stages).
Features are extracted 418 using the same algorithm and parameters that will later be used in the authentication step. After this transformation the result is a collection of feature vectors.
To distinguish the subject's ECG parameters from other users, recordings of other subjects 616 are also required. This data is used as background information, especially in the dimensionality reduction step. This is known as supervised learning.
Because a feature vector obtained during the previous phase can be very long, we need to perform dimensionality reduction to reduce its length. Usually, gathered data has some regularities that can be easily found using some statistical reasoning. We could, for example, find out that in one particular dimension (e.g. some identified characteristic in time or frequency that has regularity across different subjects) one feature vector does not change across a whole population and therefore it does not carry any information. In this case the dimension is de-emphasised as it is of no use in the comparison between a user feature vector and population feature vectors. Another case is that one dimension can be expressed as interrelating with (or dependent on) other dimensions and therefore it also does not carry any more information than that encoded in other parts of the feature vector, so it can be de-emphasised. The de-emphasis of dimensions such as these enables the size of the feature vector to be reduced, without losing data vital for comparison.
This is the reason why other subjects' samples must be present during the training. They (together with the model owner's samples) statistically represent a sample population. From this data we can deduce the characteristics of the feature space.
There are two main approaches in dimensionality reduction. One blindly tries to compress the data to a smaller number of dimensions (e.g. Discrete Cosine Transform (DCT), or Principal Component Analysis (PCA)). Another takes into account that each point can belong to a different class/subject (e.g. Linear Discriminant Analysis (LDA), Partial Least Squares (PLS)). The first category is called unsupervised dimensionality reduction, the latter one is called supervised dimensionality reduction.
The unsupervised methods 608 try to find similarities in the data and minimize the variance in the final set of dimensions so the data can be compressed into a smaller number of dimensions, while supervised dimensionality reduction methods 610 reduce the dimensionality of the signal while trying to also increase a distance measure between the different classes in the new set of dimensions so the classes can be identified more easily.
DCT or PCA may perform unsupervised dimensionality reduction. A DCT is a Fourier-related transform that uses only real numbers to expresses the dataset in terms of a sum of cosine functions of different frequencies.
PCA uses orthogonal transformation to convert the dataset into a reduced set of principal components. Principal components account for as much of the variability in the data as possible, while reducing the number of dimensions. This method is also appropriate when the variables in the dataset are noisy. PCA concentrates the majority of the signal into the first few principal components, while the later principle components may be dominated by noise and so may be excluded without much loss of useful data.
LDA or PLS may perform supervised dimensionality reduction.
LDA is a method that attempts to model the difference between the classes of data. It finds a linear combination of features that characterizes or separates two or more classes of data. The resulting combination may be used for dimensionality reduction. A fundamental assumption of the LDA method is that the independent variables are normally distributed.
PLS is a statistical linear regression method which projects vectors to new spaces based on the dataset. The new space is chosen in a way that maximises covariance between extracted factors from the datasets.
Both approaches can be used in our system, often with the unsupervised methods taking place before the supervised methods are carried out. For example (as shown in Fig. 6) there can be a degree of unsupervised dimensionality reduction 608 followed by supervised dimensionality reduction 610. This has the advantage of reducing the complexity of the latter. The output is a recipe (usually a matrix) on how to reduce the length of a feature vector and (as a by-product) each input feature vector of the subject and each other user, transformed to the new (smaller) feature space.
Because the system is aimed at authorization of single users and because it is meant to be scalable and easily updateable, it focusses on a binary classification problem. What is needed is a reliable recipe to distinguish the user's feature vectors against feature vectors of others (population). By reliable is meant a yes/no classifier that has a low equal error rate (the rate at which the probability of a false positive is equal to the probability of a false negative.
Note that the error rates can be adjusted as required. For example, it may be preferable to have the probability of a false positive be lower than the probability of a false negative (if security against falsely authorising an unauthorised user is more important than customer satisfaction in being able to correctly identify an authorized user) or vice versa (e.g. for low value transactions such as a transport ticket or a turnstile).
Multiple approaches can be used generally based on a supervised classifier. Depending on the selected method the training process runs differently, but a uniform factor is that the classifier training 622 algorithm receives a list of all feature vectors that are grouped into two classes - model owners ECG 614, population ECG 616. The output of this step is a classifier model - i.e. a recipe (classifier matrix 620) for the classification process on how to transform the feature vector into a score. The score is a measure of whether the given ECG is more like the heartbeat template of the user or heartbeat templates of a wider user population represented by one or several templates for the wider population. For example, a population may be represented between 8 and 50 (or preferably between 8 and 20) representative templates.
After all features run through the whole pipeline we evaluate each model by presenting a set of new (unseen) ECG samples from various users. Each sample goes through the whole authentication process. For each sample we get an associated score (from the classifier), so we know who is the owner of the model, to whom the sample belongs in reality, and what was the classifier response. Using that information we find a threshold 618 for the score that will optimally separate any owner's samples from other subjects' samples. This threshold 618 is later stored together with the model and its use during the full authorization procedure.
All the components of the system: parameters for feature extraction, dimensionality reduction parameters and projection matrix, classifier's model and parameters and the threshold are combined together and saved. The entire model can be saved and can be later read and used in the authorization system.
Returning to the authorization process, the user puts his or her fingers on the electrodes and the system records a short (2 second) sample ECG sample. This sample is converted (via multiple stages of the process as described above in relation to Fig. 4) into a feature vector 428 - its representation.
This representation is compared (as described in Fig. 4) with the data stored in the enrolment phase (Fig. 5) in accordance with the model (created in accordance with Fig. 6). The effect of the comparison is a score value, i.e. an indicator showing how closely the provided sample fits the user's ECG characteristics. If the subject's sample belongs to the owner of the model, the score is high. When an imposter tries to authorize, the score is much lower. If the score exceeds some threshold (e.g. established during the enrolment phase) the user is successfully authenticated and can use the system. Otherwise he or she is rejected and cannot access the system.
As an optional additional feature in authentication, multiple authentication attempts may be permitted and combined. In this process, a user provides multiple ECG samples and authorization is based on an aggregate results of each sample's verification result.
As an optional additional feature in enrolment, multiple enrolment attempts may be permitted and combined. In this process, a user provides multiple ECG signals (e.g. at different times and under different conditions of relative rest and exertion), and enrolment is based on an average result of the resulting feature vectors.

Claims (19)

Claims
1. A method of performing electrocardiogram recognition comprising:
receiving input (402) from a user;
filtering (408) the input;
5 performing feature extraction (418) on the input by autocorrelation (416) to provide a first feature set;
performing spectral analysis (412, 414) of the input to provide a second feature set in a frequency domain;
combining (420) the first and second feature sets to provide a combined feature vector;
10 performing dimensionality reduction (422) on the combined feature vector to give a reduced feature vector;
performing classification (424) of the reduced feature vector to give a recognition decision, and/or storing the reduced feature vector (426) for future recognition.
15
2. The method of claim 1, wherein performing the spectral analysis is based on statistical features of different sub-bands.
3. The method of claim 2, wherein the statistical features of different sub-bands comprise calculating one or more of:
20 mean of power, standard deviation of power, maximum amplitude, amplitude's deviation, power's kurtosis and
25 skewness of power.
4. The method of claim 3, wherein, in addition to the statistical features of different sub-bands, statistical features of the whole ECG cycle are appended.
5.
The method of claim 1, wherein the input is first normalized in time and amplitude.
6. The method of claim 1, wherein performing spectral analysis comprises selecting a
5 representative PQ.RST curve or a set of PQ.RST curves and time-shifting and superimposing these on one another to provide an average PQ.RST curve.
7. The method of any one of the preceding claims wherein the filtering comprises first bandpass filtering the input in a hardware filter to limit to a first bandwidth and then converting to
10 digital form and further filtering to a second bandwidth, narrower than the first bandwidth.
8. A device for performing electrocardiogram recognition comprising:
one or more sensors for receiving input (402) from a user;
means for filtering (408) the input;
15 processing means for performing feature extraction (418) on the input by autocorrelation (416) to provide a first feature set, performing spectral analysis (412, 414) of the input to provide a second feature set in a frequency domain, combining (420) the first and second feature sets to provide a combined feature vector; performing dimensionality reduction (422) on the combined feature vector to give a reduced feature vector; and
20 means for performing classification (424) of the reduced feature vector to give a recognition decision, and/or for storing the reduced feature vector (426) for future recognition.
9. A method of enrolling a user in an electrocardiogram recognition system comprising:
receiving input (502) from a user;
25 filtering (408) the input;
performing feature extraction (418) on the input by autocorrelation (416) to provide a first feature set performing spectral analysis (412, 414) of the input to provide a second feature set in a frequency domain;
combining (420) the first and second feature sets to provide a combined feature vector;
performing dimensionality reduction (422) on the combined feature vector to give a reduced 5 feature vector (428); and/or storing the resulting feature vector (510) for the newly enrolled user.
10. The method of claim 9, wherein the spectral analysis is based on statistical features of different sub-bands.
11. The method of claim 9, wherein the input is first normalized in time and amplitude.
12. The method of claim 9, further comprising passing the feature vector through a classifier model (504) to identify an appropriate classifier threshold (508).
13. The method of claim 9, further comprising first displaying a message to prompt the user to provide input (502).
14. The method of claim 13, wherein a display (214, 228) provides instructions to the user to 20 provide the input (502) in a controlled environment for an extended period.
15. The method of claim 13, wherein a display 214, 228 provides instructions to the user to repeat providing the input (502) if the signal quality is poor or if heartbeat variability or other interbeat variations exceed a threshold.
16. The method of claim 9, wherein multiple enrolment attempts are permitted and combined.
17. The method of any one of claims 9 to 16 wherein the filtering comprises first bandpass filtering the input in a hardware filter to limit to a first bandwidth and then converting to digital form and further filtering to a second bandwidth, narrower than the first bandwidth.
18. A device for enrolling a user in an electrocardiogram recognition system comprising:
one or more sensors for receiving input (502) from a user;
means for filtering (408) the input;
processing means for performing feature extraction (418) on the input by autocorrelation 10 (416) to provide a first feature set, performing spectral analysis (412, 414) of the input to provide a second feature set in a frequency domain, and combining (420) the first and second feature sets to provide a combined feature vector; and means for performing dimensionality reduction (422) on the combined feature vector to give a reduced feature vector (428); and/or for storing the resulting feature vector (510) for the newly
15 enrolled user.
19. The device of claim 18, further comprising a display for displaying a message to prompt the user to provide input (502) to the sensors according to prompt instructions stored in the device.
Intellectual
Property
Office
Application No: GB1611963.8 Examiner: Alan Phipps
GB1611963.8A 2016-07-08 2016-07-08 ECG Authentication method and apparatus Withdrawn GB2552035A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
GB1611963.8A GB2552035A (en) 2016-07-08 2016-07-08 ECG Authentication method and apparatus
PCT/GB2017/052023 WO2018007835A1 (en) 2016-07-08 2017-07-10 Ecg authentication method and apparatus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
GB1611963.8A GB2552035A (en) 2016-07-08 2016-07-08 ECG Authentication method and apparatus

Publications (2)

Publication Number Publication Date
GB201611963D0 GB201611963D0 (en) 2016-08-24
GB2552035A true GB2552035A (en) 2018-01-10

Family

ID=56890743

Family Applications (1)

Application Number Title Priority Date Filing Date
GB1611963.8A Withdrawn GB2552035A (en) 2016-07-08 2016-07-08 ECG Authentication method and apparatus

Country Status (2)

Country Link
GB (1) GB2552035A (en)
WO (1) WO2018007835A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140188770A1 (en) * 2011-05-10 2014-07-03 Foteini Agrafioti System and method for enabling continuous or instantaneous identity recognition based on physiological biometric signals
WO2014170897A1 (en) * 2013-04-14 2014-10-23 Yissum Research Development Company Of The Hebrew University Of Jerusalem Ltd. Classifying eeg signals in response to visual stimulus
CN105468951A (en) * 2015-11-17 2016-04-06 安徽华米信息科技有限公司 Method and device for identity recognition through electrocardiographic feature and wearable device
EP3056138A2 (en) * 2015-02-11 2016-08-17 Samsung Electronics Co., Ltd. Electrocardiogram (ecg)-based authentication apparatus and method thereof, and training apparatus and method thereof for ecg-based authentication

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7333850B2 (en) * 2004-05-28 2008-02-19 University Of Florida Research Foundation, Inc. Maternal-fetal monitoring system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140188770A1 (en) * 2011-05-10 2014-07-03 Foteini Agrafioti System and method for enabling continuous or instantaneous identity recognition based on physiological biometric signals
WO2014170897A1 (en) * 2013-04-14 2014-10-23 Yissum Research Development Company Of The Hebrew University Of Jerusalem Ltd. Classifying eeg signals in response to visual stimulus
EP3056138A2 (en) * 2015-02-11 2016-08-17 Samsung Electronics Co., Ltd. Electrocardiogram (ecg)-based authentication apparatus and method thereof, and training apparatus and method thereof for ecg-based authentication
CN105468951A (en) * 2015-11-17 2016-04-06 安徽华米信息科技有限公司 Method and device for identity recognition through electrocardiographic feature and wearable device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Wang, Y., Agrafioti, F., Hatzinakos, D., & Plataniotis, K. N. (2007). Analysis of human electrocardiogram for biometric recognition. EURASIP journal on Advances in Signal Processing, 2008(1), 148658 *

Also Published As

Publication number Publication date
GB201611963D0 (en) 2016-08-24
WO2018007835A1 (en) 2018-01-11

Similar Documents

Publication Publication Date Title
Komeili et al. Liveness detection and automatic template updating using fusion of ECG and fingerprint
US7689833B2 (en) Method and apparatus for electro-biometric identity recognition
US9646261B2 (en) Enabling continuous or instantaneous identity recognition of a large group of people based on physiological biometric signals obtained from members of a small group of people
JP4782141B2 (en) Method and apparatus for electronic biometric identification recognition
Sufi et al. Polynomial distance measurement for ECG based biometric authentication
US20060136744A1 (en) Method and apparatus for electro-biometric identity recognition
WO2018152711A1 (en) Electrocardiographic authentication-based door control system and authentication method therefor
Abdeldayem et al. ECG-based human authentication using high-level spectro-temporal signal features
JP5642210B2 (en) Method and apparatus for electronic biometric identification recognition
El_Rahman Biometric human recognition system based on ECG
Choi et al. User Authentication System Based on Baseline-corrected ECG for Biometrics.
JP2012176106A (en) Device and method for authentication, electronic device, and computer program
Nait-Ali Hidden biometrics: Towards using biosignals and biomedical images for security applications
Smitha et al. Online Electroencephalogram (EEG) based biometric authentication using visual and audio stimuli
Matos et al. Biometric recognition system using low bandwidth ECG signals
CN111444489B (en) Double-factor authentication method based on photoplethysmography sensor
GB2552035A (en) ECG Authentication method and apparatus
Yeen et al. Development of heartbeat based biometric system using wavelet transform
Santos et al. Eigen heartbeats for user identification
Canento et al. On real time ECG segmentation algorithms for biometric applications
Fatimah et al. Analysis of ECG for biometric identification
Alariki et al. A Review Study of Heartbeat Biometric Authentication.
NS et al. An efficient score level multimodal biometric system using ECG and fingerprint
Agrafioti Robust subject recognition using the Electrocardiogram
Walia et al. PPG and fingerprint: robust bimodal biometric system

Legal Events

Date Code Title Description
WAP Application withdrawn, taken to be withdrawn or refused ** after publication under section 16(1)