CN110236550B - Human gait prediction device based on multi-modal deep learning - Google Patents


Info

Publication number
CN110236550B
Authority
CN
China
Prior art keywords
data
neural network
deep neural
module
gait
Prior art date
Legal status: Active
Application number
CN201910464986.0A
Other languages
Chinese (zh)
Other versions
CN110236550A (en)
Inventor
孙富春
方斌
王明
吕钦
Current Assignee
Tsinghua University
Original Assignee
Tsinghua University
Priority date
Filing date
Publication date
Application filed by Tsinghua University filed Critical Tsinghua University
Priority to CN201910464986.0A
Publication of CN110236550A
Application granted
Publication of CN110236550B


Classifications

    • A61B5/1038 Measuring plantar pressure during gait
    • A61B5/112 Gait analysis
    • G01C21/00 Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00
    • G01C21/08 Navigation by terrestrial means involving use of the magnetic field of the earth
    • G01C21/165 Inertial navigation combined with non-inertial navigation instruments
    • G01C21/18 Stabilised platforms, e.g. by gyroscope
    • G06N3/045 Combinations of networks
    • G06N3/08 Learning methods
    • A61B2562/0219 Inertial sensors, e.g. accelerometers, gyroscopes, tilt switches


Abstract

The invention provides a human gait prediction device based on multi-modal deep learning, belonging to the fields of gait prediction and deep learning. The device comprises: an inertial sensor module, a pressure sensor module, a sound sensor module, an inertial sensor data acquisition and preprocessing module, a pressure sensor data acquisition and preprocessing module, a sound sensor data acquisition and preprocessing module, and a deep neural network processing module. The device uses inertial sensors, plantar pressure sensors and sound sensors to collect the acceleration, angular velocity, angle and geomagnetic field component signals of human lower-limb movement, together with plantar pressure and walking sound data; the collected data are preprocessed and input into the deep neural network processing module, which outputs the human gait prediction result. The device is simple and convenient to wear, can meet the needs of different human bodies, and can be applied to gait prediction for exoskeleton robots in the medical rehabilitation and military fields in the future.

Description

Human gait prediction device based on multi-modal deep learning
Technical Field
The invention relates to a human gait prediction device based on multi-modal deep learning, belonging to the fields of gait prediction and deep learning.
Background
With the development of artificial intelligence, and especially the rise of deep learning in recent years, intelligent collaboration between humans and machines has become an important field of artificial intelligence. The exoskeleton robot is an important representative of human-machine intelligent cooperation: it combines human intelligence with robot strength and has great development potential in the medical rehabilitation and military fields. An exoskeleton robot captures human motion gait in real time through a sensor sensing system, and a controller generates control signals to drive the mechanical skeleton to move with the human body. However, because data acquisition, signal processing and actuator response all take time, the mechanical skeleton's motion gait lags behind the human motion gait, which degrades the wearing comfort and human-machine coordination of the wearer. To solve this problem, the exoskeleton robot needs to predict human gait accurately and in real time, so that the control system's reference signal leads the human motion gait and the exoskeleton follows the wearer's gait in real time.
The essence of gait prediction is to use historical data to predict gait data and trends over the next period of time; it is a time-series signal prediction problem. Exoskeleton robots are usually equipped with wearable sensors, so gait prediction devices based on wearable sensors need to be studied. Currently, most gait prediction devices are image-based or rely on a single-modality sensor, such as an inertial sensor. Image-based prediction devices often struggle to obtain accurate human gait and are unsuitable for high-precision exoskeleton gait control. Most existing single-modality prediction devices require manually extracted gait features, and their algorithms suffer from low computational efficiency, low prediction accuracy and poor robustness.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provides a human gait prediction device based on multi-modal deep learning. The device uses three kinds of sensors (inertial sensors, plantar pressure sensors and sound sensors) to collect the acceleration, angular velocity, angle and geomagnetic field component signals of human lower-limb movement, together with plantar pressure and walking sound data, and realizes human gait prediction with a multi-modal deep learning algorithm. It is simple and convenient to wear, can meet the needs of different human bodies, and can be applied to gait prediction for exoskeleton robots in the medical rehabilitation and military fields in the future.
The invention provides a human gait prediction device based on multi-modal deep learning, which is characterized by comprising: an inertial sensor module, a pressure sensor module, a sound sensor module, an inertial sensor data acquisition and preprocessing module, a pressure sensor data acquisition and preprocessing module, a sound sensor data acquisition and preprocessing module, and a deep neural network processing module;
the system comprises an inertial sensor module, a pressure sensor module, a deep neural network processing module, a pressure sensor data acquisition and preprocessing module, a deep neural network processing module and a deep neural network processing module, wherein the inertial sensor module comprises 7 inertial sensors, each inertial sensor is connected with the inertial sensor data acquisition and preprocessing module in a wired parallel mode respectively, the pressure sensor module comprises 12 pressure sensors, each pressure sensor is connected with the pressure sensor data acquisition and preprocessing module in a wired parallel mode respectively, the sound sensor module comprises 2 sound sensors, each sound sensor is connected with the sound sensor data acquisition and preprocessing module in a wired parallel mode respectively, and the inertial sensor data acquisition and preprocessing module, the pressure sensor data acquisition and preprocessing module and the sound sensor data acquisition and preprocessing module are connected with the deep neural network processing module in a wired parallel mode respectively;
the 7 inertial sensors are respectively arranged at the positions of the waist back, the left thigh, the right thigh, the left calf, the right calf, the left instep and the right instep of a user, and each inertial sensor is respectively used for acquiring 3-dimensional acceleration data, 3-dimensional angular velocity data, 3-dimensional angle data and 3-dimensional magnetic field data of a corresponding part and sending the acquired data to the inertial sensor data acquisition and preprocessing module;
the inertial sensor acquisition and preprocessing module is used for receiving data acquired by each inertial sensor, performing data preprocessing of filtering and normalization, and sending the preprocessed inertial sensor data to the deep neural network processing module;
the 12 pressure sensors are distributed in an insole mode, 1 insole is respectively placed on the left sole and the right sole, 6 pressure sensors are respectively arranged on each insole, each pressure sensor collects sole pressure at a corresponding position, and the collected data are sent to the pressure sensor data collecting and preprocessing module;
the pressure sensor data acquisition and preprocessing module is used for receiving data acquired by each pressure sensor, performing data preprocessing of filtering and normalization, and sending the preprocessed pressure sensor data to the deep neural network processing module;
the 2 sound sensors are respectively arranged on the left instep and the right instep and used for collecting the sole sound data of the walking of the human body and sending the collected data to the sound sensor data collecting and preprocessing module;
the sound sensor data acquisition and preprocessing module is used for receiving data sent by each sound sensor, performing data preprocessing of filtering and normalization, and sending the preprocessed sound sensor data to the deep neural network processing module;
the deep neural network processing module is used for receiving the preprocessed inertial sensor data, pressure sensor data and sound sensor data, predicting the gait of the received data by using the deep neural network and outputting a gait prediction result;
1) enabling a tester to wear different sensors to acquire multi-modal data, preprocessing the multi-modal data, establishing a data sample set, and dividing the data sample set into a training data set, a verification data set and a test data set; the method comprises the following specific steps:
1-1) a tester wears an inertial sensor module consisting of 7 inertial sensors, a pressure sensor module consisting of 12 pressure sensors, and a sound sensor module consisting of 2 sound sensors; the 7 inertial sensors are arranged at 7 positions of the tester (the back, the left thigh, the right thigh, the left calf, the right calf, the left instep and the right instep) and collect 3-dimensional acceleration data, 3-dimensional angular velocity data, 3-dimensional angle data and 3-dimensional magnetic field data of different parts of the human lower limb; the 12 pressure sensors are distributed in insole form, with 1 insole for each of the left and right soles, and each insole contains 6 pressure sensor data acquisition points, so plantar pressure data are collected at 12 data points; the sound sensors are worn on the insteps, 1 on each of the left and right insteps, to collect the sole sound of human walking;
1-2) after wearing is complete, the tester performs 5 human gait behaviors in each of 5 walking environments; the walking environments are: tile ground, cement ground, asphalt ground, sand ground and grass; the gait behaviors are: slow walking on flat ground, fast walking on flat ground, going up and down stairs, going up and down slopes, and turning left and right; going up and down stairs is performed only in the tile-ground walking environment and going up and down slopes only in the asphalt-ground walking environment, giving 17 environment-gait combinations; the duration of a single environment-gait combination is 10-60 minutes;
1-3) under each environment gait combination, at each sampling moment, 84-dimensional data including 7 groups of 3-dimensional acceleration, 3-dimensional angular velocity, 3-dimensional angle and 3-dimensional magnetic field are acquired by 7 inertial sensors and sent to an inertial sensor acquisition and preprocessing module, 12 pressure sensors acquire 12-dimensional plantar pressure data and send to a pressure sensor data acquisition and preprocessing module, and 2 sound sensors acquire 2-dimensional walking sound data and send to a sound sensor data acquisition and preprocessing module; the sampling frequency of each sensor is 20-100 Hz;
all data at a single sampling instant constitute one 1 × 98 raw data sample x^Raw_{i,j} = [x^Raw_{i,j,1}, x^Raw_{i,j,2}, …, x^Raw_{i,j,98}], i = 1, 2, …, 17, j = 1, 2, 3, …, where x^Raw_{i,j,k} is the k-th dimension of raw data in the j-th raw data sample under the i-th environment-gait combination, k = 1, 2, …, 98; the 98 dimensions are arranged in the order: 21-dimensional acceleration, 21-dimensional angular velocity, 21-dimensional angle, 21-dimensional magnetic field, 12-dimensional pressure, 2-dimensional sound; all raw data samples x^Raw_{i,j} collected under a single environment-gait combination form the set X^Raw_i, and the X^Raw_i of all 17 environment-gait combinations form the raw data sample set X^Raw = {X^Raw_1, …, X^Raw_17}; the total number of data samples in X^Raw is N;
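As an illustration of this layout, the following sketch (hypothetical NumPy code, not part of the patent; array names and the zero-filled placeholders are assumptions) assembles one 1 × 98 raw sample from the per-sensor readings:

```python
import numpy as np

# Hypothetical per-sensor readings at one sampling instant.
# imu_data: 7 IMUs x (3 acc + 3 gyro + 3 angle + 3 mag) = 7 x 12 values
imu_data = np.zeros((7, 12))      # filled by the inertial acquisition module
pressure_data = np.zeros(12)      # 12 insole pressure points (6 per foot)
sound_data = np.zeros(2)          # left / right instep microphones

def assemble_raw_sample(imu, pressure, sound):
    """Arrange one sampling instant into the 1 x 98 raw sample:
    21-dim acceleration, 21-dim angular velocity, 21-dim angle,
    21-dim magnetic field, 12-dim pressure, 2-dim sound."""
    acc   = imu[:, 0:3].ravel()   # 7 x 3 = 21
    gyro  = imu[:, 3:6].ravel()
    angle = imu[:, 6:9].ravel()
    mag   = imu[:, 9:12].ravel()
    sample = np.concatenate([acc, gyro, angle, mag, pressure, sound])
    assert sample.shape == (98,)
    return sample
```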
1-4) each data acquisition and preprocessing module filters and normalizes its corresponding data in all raw data samples of X^Raw; the Kalman filter is selected as the filtering method, and each dimension x^Raw_{i,j,k}, k = 1, 2, …, 98, of a single raw data sample x^Raw_{i,j} is normalized as follows:

x^Norm_{i,j,k} = (x^Raw_{i,j,k} − mean_k) / (max_k − min_k)

where x^Norm_{i,j,k} is the normalized value of the k-th dimension of raw data in the j-th raw data sample under the i-th environment-gait combination, x^Raw_{i,j,k} is the corresponding raw value, max_k is the maximum of all k-th-dimension raw data, min_k is the minimum of all k-th-dimension raw data, and mean_k is the mean of all k-th-dimension raw data;

after all raw data samples are preprocessed, the data sample set X^Norm is obtained and sent to the deep neural network processing module;
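A minimal sketch of this per-dimension normalization, assuming the samples have been collected into an (N, 98) NumPy array (the function name is illustrative):

```python
import numpy as np

def normalize_dataset(x_raw):
    """Per-dimension normalization of an (N, 98) raw sample matrix,
    following the definitions above: subtract each dimension's mean
    and divide by that dimension's (max - min) range."""
    x_max = x_raw.max(axis=0)
    x_min = x_raw.min(axis=0)
    x_mean = x_raw.mean(axis=0)
    return (x_raw - x_mean) / (x_max - x_min)

# Usage: X_norm = normalize_dataset(X_raw), with X_raw of shape (N, 98).
```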
1-5) the deep neural network processing module divides X^Norm into a training data set X^Train, a validation data set X^Validate and a test data set X^Test according to set proportions, where the training data set accounts for not less than 75%, the validation data set for not less than 5%, and the test data set for not less than 5%;
2) constructing a deep neural network based on a time convolution network in a deep neural network processing module; the method comprises the following specific steps:
2-1) determining a deep neural network structure;
adopting a time convolution network to construct a deep neural network, wherein the deep neural network is divided into a transition time prediction network and a target time prediction network;
let times 0 < t_1 < t_2 < t_3 < t_4 < t_5; from the data sample set X^Norm, the data samples from time t_1 to time t_2 are selected as the input data x(t_1)…x(t_2) of the deep neural network, the data samples from time t_3 to time t_4 are made into the transition-moment sample labels y(t_3)…y(t_4), and the data sample at time t_5 is made into the target-moment sample label z(t_5);

the input of the transition-moment prediction network is the data samples x(t_1)…x(t_2) from time t_1 to time t_2, and its output is the predicted values ŷ(t_3)…ŷ(t_4) of the data samples from time t_3 to time t_4; the target-moment prediction network takes all or part of x(t_1)…x(t_2), denoted x′(t_1)…x′(t_2), together with ŷ(t_3)…ŷ(t_4) as input, and outputs the predicted value ẑ(t_5) for time t_5;

let t_2 = t_1 + 7·T_sample, t_3 = t_2 + T_sample, t_4 = t_3 + T_sample, t_5 = t_4 + T_sample, where T_sample is the data sampling interval; that is, the transition-moment prediction network takes the data sequence x(t_1)…x(t_2) of 8 sampling moments as input and predicts the data ŷ(t_3), ŷ(t_4) of 2 sampling moments, and the target-moment prediction network takes the 8-sampling-moment data sequence x′(t_1)…x′(t_2) together with the transition-moment prediction data ŷ(t_3), ŷ(t_4) of 2 sampling moments as input and predicts the data ẑ(t_5) of 1 sampling moment;
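The two-stage structure can be outlined as follows (a hypothetical PyTorch sketch, not the patent's exact network: the backbone here is a plain stack of dilated causal convolutions standing in for the full TCN Block of fig. 5, and all layer sizes are assumptions):

```python
import torch
import torch.nn as nn

class CausalStack(nn.Module):
    """Placeholder temporal backbone: a plain stack of dilated causal
    1-D convolutions standing in for the full TCN Block of fig. 5."""
    def __init__(self, in_dim, channels=64, levels=3, k=3):
        super().__init__()
        layers, c_in = [], in_dim
        for i in range(levels):
            d = 2 ** i                                        # dilation 1, 2, 4
            layers += [nn.ConstantPad1d(((k - 1) * d, 0), 0.0),  # causal pad
                       nn.Conv1d(c_in, channels, k, dilation=d),
                       nn.ReLU()]
            c_in = channels
        self.net = nn.Sequential(*layers)

    def forward(self, x):                                     # x: (batch, dim, time)
        return self.net(x)

class TwoStagePredictor(nn.Module):
    """Transition network: 8 input frames -> 2 predicted frames (t3, t4).
    Target network: the 8 frames plus the 2 predictions -> 1 frame (t5)."""
    def __init__(self, dim=98, channels=64):
        super().__init__()
        self.transition = CausalStack(dim, channels)
        self.trans_head = nn.Linear(channels, 2 * dim)
        self.target = CausalStack(dim, channels)
        self.target_head = nn.Linear(channels, dim)

    def forward(self, x):                                     # x: (batch, 98, 8)
        b, dim, _ = x.shape
        y_hat = self.trans_head(self.transition(x)[:, :, -1]).view(b, dim, 2)
        xy = torch.cat([x, y_hat], dim=2)                     # (batch, 98, 10)
        z_hat = self.target_head(self.target(xy)[:, :, -1])
        return y_hat, z_hat               # y_hat: (b, 98, 2); z_hat: (b, 98)
```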
2-2) determining a loss function of the deep neural network;
the loss function L of the deep neural network is:

L = w_y·L_y + w_z·L_z

where L_y and L_z are the loss functions of the transition-moment prediction network and the target-moment prediction network respectively, ŷ and y are the predicted value and label value of the transition-moment prediction network output, ẑ and z are the predicted value and label value of the target-moment prediction network output, and w_y and w_z are the weight coefficients of L_y and L_z; each of L_y and L_z is either the L1 loss function or the L2 loss function:

L1 = (1/N_B) · Σ_j |û_j − u_j|
L2 = (1/N_B) · Σ_j (û_j − u_j)²

where N_B is the number of samples in a batch, taking a value from {32, 64, 128, 256}, û_j is the j-th predicted value output by the network, and u_j is the corresponding label value;
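A direct transcription of this combined loss into PyTorch might look like the following sketch (the weight values shown are illustrative; the patent does not fix w_y and w_z):

```python
import torch.nn.functional as F

def combined_loss(y_hat, y, z_hat, z, w_y=0.5, w_z=1.0, use_l1=True):
    """L = w_y * L_y + w_z * L_z, with L_y and L_z each an L1 or L2 loss
    averaged over the batch. The weights here are illustrative only."""
    if use_l1:
        loss_y, loss_z = F.l1_loss(y_hat, y), F.l1_loss(z_hat, z)
    else:
        loss_y, loss_z = F.mse_loss(y_hat, y), F.mse_loss(z_hat, z)
    return w_y * loss_y + w_z * loss_z
```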
2-3) determining parameters and structural hyper-parameters of the deep neural network;
the transition-moment prediction network parameters comprise the convolutional-layer weights W_yc and biases B_yc, and the fully connected-layer weights W_yf and biases B_yf;
the target-moment prediction network parameters comprise the convolutional-layer weights W_zc and biases B_zc, and the fully connected-layer weights W_zf and biases B_zf;
The structural hyper-parameters of the deep neural network comprise Block number, channel number, node number, convolution kernel length, void coefficient and Dropout coefficient;
the value range of the Block number is an integer in the range of [5,10], the value of the channel number is an integer in the range of [30,200], the value of the node number is an integer in the range of [50,500], the value of the convolution kernel length is 3 or 5, the value of the void coefficient is 1 or 2, and the value range of Dropout is [0,1 ];
3) training the deep neural network constructed in the step 2) to obtain the trained deep neural network and corresponding optimal parameters; the method comprises the following specific steps:
3-1) training a deep neural network;
determining the training parameters of the deep neural network, including the number of training epochs N_Epochs and the learning rate α, where one epoch trains on all data samples of the training data set, N_Epochs ≥ 100, and α takes values in (0, 1];
initializing the deep neural network parameters W_yc, B_yc, W_yf, B_yf, W_zc, B_zc, W_zf, B_zf by a random method, training them on the training data set X^Train, and updating W_yc, B_yc, W_yf, B_yf, W_zc, B_zc, W_zf, B_zf with the standard stochastic gradient descent method; every N_V training epochs, validating the deep neural network once on the validation data set X^Validate, automatically saving the network parameters with the minimum error on X^Validate as the current network parameters;
if the validation data set error no longer decreases, or the number of training epochs reaches the specified number N_Epochs, ending the training and entering step 3-2);
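A hedged sketch of this training loop, assuming PyTorch data loaders that yield (x, y, z) batches and the combined_loss and model sketches above:

```python
import copy
import torch

def train(model, train_loader, val_loader, n_epochs=100, lr=0.01, n_v=5):
    """SGD training with periodic validation, keeping the parameters
    with the lowest validation error; combined_loss is the sketch
    above, and loader batches are assumed to yield (x, y, z)."""
    opt = torch.optim.SGD(model.parameters(), lr=lr)
    best_err, best_state = float("inf"), None
    for epoch in range(n_epochs):
        model.train()
        for x, y, z in train_loader:
            opt.zero_grad()
            y_hat, z_hat = model(x)
            combined_loss(y_hat, y, z_hat, z).backward()
            opt.step()
        if (epoch + 1) % n_v == 0:               # validate every N_V epochs
            model.eval()
            err = 0.0
            with torch.no_grad():
                for x, y, z in val_loader:
                    y_hat, z_hat = model(x)
                    err += combined_loss(y_hat, y, z_hat, z).item()
            if err < best_err:                   # keep the best parameters
                best_err, best_state = err, copy.deepcopy(model.state_dict())
    if best_state is not None:
        model.load_state_dict(best_state)
    return model
```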
3-2) testing the trained deep neural network on the test data set X^Test and evaluating the optimal deep neural network parameters;
the evaluation criterion is the mean error value p, computed as:

p = (1/N_Test) · Σ_i |(ẑ_i − z_i) / z_i|

where N_Test is the number of samples in the test data set, and ẑ_i and z_i are the i-th predicted value and label value output by the target-moment prediction network;
if the evaluated mean error value p < 3%, the evaluation is finished, the current network parameters are saved as the optimal deep neural network parameters W_yc*, B_yc*, W_yf*, B_yf*, W_zc*, B_zc*, W_zf*, B_zf*, and step 4) is entered; if the evaluated mean error value p ≥ 3%, return to step 3-1) and retrain the deep neural network;
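The evaluation step might be sketched as follows (a mean relative error is assumed here, matching the p < 3% criterion; the exact expression was not legible in the source):

```python
import torch

def mean_error(model, test_loader):
    """Mean relative error p over the test set; training is repeated
    until p < 3%. The relative-error form is an assumption."""
    model.eval()
    total, n = 0.0, 0
    with torch.no_grad():
        for x, _, z in test_loader:
            _, z_hat = model(x)
            total += ((z_hat - z).abs() / z.abs().clamp_min(1e-8)).sum().item()
            n += z.numel()
    return total / n
```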
4) predicting human gait by using the trained deep neural network; the method comprises the following specific steps:
4-1) selecting a new tester, and repeating the step 1-1), so that the tester wears the inertial sensor module, the pressure sensor module and the sound sensor module respectively;
4-2) randomly selecting 1 walking environment from the 5 walking environments of step 1-2) and 1 human gait behavior from the 5 human gait behaviors of step 1-2), where going up and down stairs is performed only in the tile-ground walking environment and going up and down slopes only in the asphalt-ground walking environment; repeating step 1-3): the raw data samples of the tester wearing the three sensor modules under this environment-gait combination are collected in real time and sent to the corresponding data acquisition and preprocessing modules, and all data of one sampling are arranged into one 1 × 98 raw data sample x^Raw = [x^Raw_1, …, x^Raw_98], where x^Raw_k is the k-th dimension of raw data in the raw data sample x^Raw, k = 1, 2, …, 98;
4-3) repeating step 1-4) to preprocess x^Raw; the preprocessed data sample is denoted x^Norm and sent to the deep neural network processing module;
4-4) in the deep neural network processing module, the data samples of the 7 sampling moments preceding the sampling moment of x^Norm, together with x^Norm, form new input data for times t_1 to t_2, which are input into the deep neural network trained in step 3); the network outputs the tester's gait prediction ẑ(t_5) = [ẑ_1, …, ẑ_98] at time t_5 in real time, where ẑ_k is the k-th dimension of the gait prediction result data, k = 1, 2, …, 98.
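The real-time use of the trained network can be sketched as a sliding 8-sample window (hypothetical PyTorch code; the class and method names are illustrative):

```python
from collections import deque
import torch

class GaitPredictor:
    """Sliding 8-sample window for real-time prediction: each new
    preprocessed 98-dim sample extends the window, and the trained
    two-stage network predicts the gait at time t5."""
    def __init__(self, model, window=8):
        self.model = model.eval()
        self.buf = deque(maxlen=window)

    def step(self, x_norm):              # x_norm: tensor of shape (98,)
        self.buf.append(x_norm)
        if len(self.buf) < self.buf.maxlen:
            return None                  # warm-up: window not yet full
        x = torch.stack(list(self.buf), dim=1).unsqueeze(0)  # (1, 98, 8)
        with torch.no_grad():
            _, z_hat = self.model(x)
        return z_hat.squeeze(0)          # predicted 98-dim gait at t5
```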
The invention has the characteristics and beneficial effects that:
1. the human gait prediction device based on multi-modal deep learning of the invention can effectively collect sensor data of three modalities (inertial sensors, plantar pressure sensors and sound sensors) and perform human gait prediction with a deep neural network algorithm; it can predict the 3-dimensional acceleration, 3-dimensional angular velocity, 3-dimensional angle, 3-dimensional magnetic field, 12-dimensional plantar pressure and 2-dimensional walking sound of 5 human gait behaviors (slow walking on flat ground, fast walking on flat ground, going up and down stairs, going up and down slopes, turning left and right) in 5 walking environments (tile ground, cement ground, asphalt ground, sand ground and grass).
2. The human gait prediction device based on multi-modal deep learning of the invention constructs the deep neural network for gait prediction with a temporal convolutional network; no hand-designed feature extractor is needed to extract gait features, and instead feature learning and gait prediction are integrated automatically, which improves the accuracy and robustness of human gait prediction.
3. The human gait prediction device based on multi-modal deep learning of the invention is simple and convenient to wear, can meet the needs of different human bodies, is suitable for gait prediction of most people, and can be applied to gait prediction for exoskeleton robots in the medical rehabilitation and military fields in the future.
Drawings
FIG. 1 is a schematic view of the structure of the apparatus of the present invention.
Fig. 2 is a schematic view of the sensor wear of the device of the present invention.
Fig. 3 is a schematic diagram of left sole pressure acquired by the insole type sole pressure sensor of the device of the invention.
Fig. 4 is a diagram of a deep neural network in the apparatus of the present invention.
FIG. 5 is a Block diagram of the deep neural network of the device of the present invention.
In the figure, 1-7 are inertial sensors, 8-9 are sound sensors, 10-11 are insole type plantar pressure sensors, and ① - ⑥ are distribution positions of the plantar pressure sensors.
Detailed Description
The invention provides a human gait prediction device based on multi-modal deep learning, which is described in further detail below with reference to the accompanying drawings and a specific embodiment.
The invention provides a human gait prediction device based on multi-mode deep learning, which has a structure shown in figure 1 and comprises: the device comprises an inertial sensor module, a pressure sensor module, a sound sensor module, an inertial sensor data acquisition and preprocessing module, a pressure sensor data acquisition and preprocessing module, a sound sensor data acquisition and preprocessing module and a deep neural network processing module.
The inertial sensor module comprises 7 inertial sensors, each inertial sensor is connected with an inertial sensor data acquisition and preprocessing module in a wired parallel mode respectively, the pressure sensor module comprises 12 pressure sensors, each pressure sensor is connected with a pressure sensor data acquisition and preprocessing module in a wired parallel mode respectively, the sound sensor module comprises 2 sound sensors, each sound sensor is connected with a sound sensor data acquisition and preprocessing module in a wired parallel mode respectively, and the inertial sensor data acquisition and preprocessing module, the pressure sensor data acquisition and preprocessing module and the sound sensor data acquisition and preprocessing module are connected with a deep neural network processing module in a wired parallel mode respectively.
The inertial sensor is a conventional sensor which simultaneously integrates a three-axis gyroscope, a three-axis accelerometer, a three-axis angle meter and a three-axis electronic compass. The 7 inertial sensors are respectively arranged at the positions of the waist back, the left thigh, the right thigh, the left calf, the right calf, the left instep and the right instep of a user, as shown in fig. 2, each inertial sensor is respectively used for collecting 3-dimensional acceleration data, 3-dimensional angular velocity data, 3-dimensional angle data and 3-dimensional magnetic field data of different parts of the lower limb of a human body, and sending the collected data to the inertial sensor data collecting and preprocessing module.
The inertial sensor acquisition and preprocessing module is used for receiving the data acquired by each inertial sensor, performing data preprocessing of filtering and normalization, and sending the preprocessed inertial sensor data to the deep neural network processing module. The inertial sensor data acquisition and preprocessing module adopts a processing framework based on a conventional MCU and can be installed at any position of a human body. In this embodiment, the inertial sensor data acquisition and preprocessing module is installed at the back position.
The pressure sensors are conventional film-type pressure sensors, capable of static and dynamic measurement of the pressure on any contact surface. The 12 pressure sensors are distributed in insole form, as shown in fig. 3: 1 insole is placed on each of the left and right soles, and each insole has 6 data acquisition points. The 12 data acquisition points collect plantar pressure at their corresponding positions, yielding 12-dimensional plantar pressure data, which are sent to the pressure sensor data acquisition and preprocessing module.
The pressure sensor data acquisition and preprocessing module is used for receiving data acquired by each pressure sensor, performing data preprocessing of filtering and normalization, and sending the preprocessed pressure sensor data to the deep neural network processing module. The pressure sensor data acquisition and preprocessing module adopts a processing framework based on a conventional MCU and can be installed at any position of a human body. In this embodiment, the pressure sensor data acquisition and preprocessing module is installed at the back position.
The sound sensors are conventional sound-sensitive condenser electret microphone sensors. The 2 sound sensors are worn on the insteps, 1 on each of the left and right insteps as shown in fig. 2, to collect the sole sound data of human walking and send the collected data to the sound sensor data acquisition and preprocessing module.
The sound sensor data acquisition and preprocessing module is used for receiving data sent by each sound sensor, performing data preprocessing of filtering and normalization, and sending the preprocessed sound sensor data to the deep neural network processing module. The data acquisition and preprocessing module of the sound sensor adopts a processing framework based on a conventional MCU and can be installed at any position of a human body. In this embodiment, the sound sensor data acquisition and preprocessing module is installed at the back position.
The deep neural network processing module is used for receiving the preprocessed inertial sensor data, pressure sensor data and sound sensor data, predicting the gait of the received data by using the deep neural network and outputting a gait prediction result. The deep neural network processing module adopts a processing framework based on a conventional GPU or FPGA and is used for improving the calculation efficiency of the deep neural network; meanwhile, an output interface of a USB or a serial port is adopted for outputting a gait prediction result for interactive use with external equipment or an exoskeleton system.
The deep neural network module uses a temporal convolutional network (TCN) to construct the deep neural network, whose structure is divided into a transition-moment prediction network and a target-moment prediction network, as shown in fig. 4, where fig. 4(a) is the transition-moment prediction network and fig. 4(b) is the target-moment prediction network. The preprocessed inertial sensor data, pressure sensor data and sound sensor data received by the deep neural network form the data sample set X^Norm.
Let times 0 < t_1 < t_2 < t_3 < t_4 < t_5. From the data sample set X^Norm, the data samples from time t_1 to time t_2 are selected as the input data x(t_1)…x(t_2) of the deep neural network, the data samples from time t_3 to time t_4 are made into the transition-moment sample labels y(t_3)…y(t_4), and the data sample at time t_5 is made into the target-moment sample label z(t_5).
The input data of the transition-moment prediction network is x(t_1)…x(t_2), and its output prediction data is ŷ(t_3)…ŷ(t_4), whose dimensions may be the same as or different from those of x(t_1)…x(t_2). The target-moment prediction network takes all or part of x(t_1)…x(t_2), denoted x′(t_1)…x′(t_2), together with ŷ(t_3)…ŷ(t_4) as input, and outputs the prediction data ẑ(t_5). The sensor data types and dimensions of x′(t_1)…x′(t_2) are the same as those of ŷ(t_3)…ŷ(t_4), while the data type and dimension of ẑ(t_5) may be the same as or different from those of x′(t_1)…x′(t_2) and ŷ(t_3)…ŷ(t_4). In gait prediction, the usual approach is to predict ẑ(t_5) directly from x(t_1)…x(t_2); the invention adds the transition process ŷ(t_3)…ŷ(t_4), so the network can learn more of the variation trend, prediction inaccuracy caused by random errors at individual moments is reduced, and the prediction effect is improved.
The Block in the deep neural network adopts a residual structure: dilated causal convolution, weight normalization, ReLU and Dropout operations are executed in sequence, and then the sequence is repeated once; the specific operation flow is shown in fig. 5. The 1 × 1 convolution in the TCN Block structure is an optional module: it is performed when the input and output dimensions of the residual differ, and when they are the same it is replaced by an identity mapping. The residual structure effectively reduces the loss of information in the convolutional network and makes the program easier to extend.
The calculation formula of the cavity causal convolution operation F acting on the s-th output neuron is as follows:
Figure GDA0002440178280000099
in the formula: x is the input layer sequence x (t)1)…x(t2),xs-d*iCorresponding s-d x i inputs in the input layer sequence are shown, f is a convolution kernel, d is a hole coefficient, and k is the length of the convolution kernel.
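In code, causality is usually obtained by left-padding the sequence with (k − 1)·d zeros before an ordinary dilated convolution; a minimal PyTorch sketch (channel counts are assumptions):

```python
import torch
import torch.nn as nn

class DilatedCausalConv1d(nn.Module):
    """Dilated causal convolution: output s depends only on inputs
    x[s], x[s-d], ..., x[s-d*(k-1)], matching F(s) above. Causality
    comes from left-padding with (k-1)*d zeros."""
    def __init__(self, channels, k=3, d=1):
        super().__init__()
        self.pad = nn.ConstantPad1d(((k - 1) * d, 0), 0.0)
        self.conv = nn.Conv1d(channels, channels, kernel_size=k, dilation=d)

    def forward(self, x):                 # x: (batch, channels, time)
        return self.conv(self.pad(x))     # same time length as input
```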
The ReLU (Rectified Linear Unit) function is computed as:

f(u) = max(0, u)

where u is the input to the ReLU function; the derivative of the function is 1 when u > 0 and 0 when u < 0, which gives the function its nonlinearity.
The Dropout operation is to randomly discard the activation values of some neurons in the input to avoid overfitting and improve the generalization capability of the convolutional neural network. Dropout has a value range of [0,1 ].
The weight normalization operation re-parameterizes each weight vector w of the neural network through a vector parameter v and a scalar parameter g, and performs stochastic gradient descent on the newly introduced parameters to accelerate the convergence of the optimization process. The weight vector w is expressed as:

w = (g / ‖v‖) · v

where v is a k-dimensional vector, g is a scalar, and ‖·‖ denotes the Euclidean norm. This re-parameterization fixes the Euclidean norm of the weight vector: ‖w‖ = g, independent of the parameter v.
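Putting the pieces of this section together, one Block of the described structure might be sketched as follows (a hypothetical PyTorch rendering of fig. 5; channel counts and the Dropout value are assumptions):

```python
import torch
import torch.nn as nn
from torch.nn.utils import weight_norm

class TCNBlock(nn.Module):
    """One residual Block per fig. 5: (dilated causal conv -> weight
    normalization -> ReLU -> Dropout) executed twice, plus a residual
    connection; the 1x1 convolution is used only when the input and
    output channel counts differ."""
    def __init__(self, c_in, c_out, k=3, d=1, dropout=0.2):
        super().__init__()
        self.net = nn.Sequential(
            nn.ConstantPad1d(((k - 1) * d, 0), 0.0),
            weight_norm(nn.Conv1d(c_in, c_out, k, dilation=d)),
            nn.ReLU(), nn.Dropout(dropout),
            nn.ConstantPad1d(((k - 1) * d, 0), 0.0),
            weight_norm(nn.Conv1d(c_out, c_out, k, dilation=d)),
            nn.ReLU(), nn.Dropout(dropout),
        )
        # optional 1x1 conv: identity when the dimensions already match
        self.downsample = nn.Conv1d(c_in, c_out, 1) if c_in != c_out else None
        self.relu = nn.ReLU()

    def forward(self, x):                 # x: (batch, c_in, time)
        res = x if self.downsample is None else self.downsample(x)
        return self.relu(self.net(x) + res)
```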
The working principle of the device is as follows:
1) enabling a tester to wear different sensors to acquire multi-modal data, preprocessing the multi-modal data, establishing a data sample set, and dividing the data sample set into a training data set, a verification data set and a test data set; the method comprises the following specific steps:
1-1) a tester wears an inertial sensor module consisting of 7 inertial sensors, a pressure sensor module consisting of 12 pressure sensors, and a sound sensor module consisting of 2 sound sensors; the 7 inertial sensors are arranged at 7 positions of the tester (the back, the left thigh, the right thigh, the left calf, the right calf, the left instep and the right instep) and collect 3-dimensional acceleration data, 3-dimensional angular velocity data, 3-dimensional angle data and 3-dimensional magnetic field data of different parts of the human lower limb; the 12 pressure sensors are distributed in insole form, with 1 insole for each of the left and right soles, and each insole contains 6 pressure sensor data acquisition points, so plantar pressure data are collected at 12 data points; the sound sensors are worn on the insteps, 1 on each of the left and right insteps, to collect the sole sound of human walking;
1-2) after wearing is complete, the tester performs 5 human gait behaviors in each of 5 walking environments; the walking environments are: tile ground, cement ground, asphalt ground, sand ground and grass; the gait behaviors are: slow walking on flat ground, fast walking on flat ground, going up and down stairs, going up and down slopes, and turning left and right; going up and down stairs is performed only in the tile-ground walking environment and going up and down slopes only in the asphalt-ground walking environment, giving 17 environment-gait combinations; the duration of a single environment-gait combination is 10-60 minutes;
1-3) under each environment gait combination, at each sampling moment, 84-dimensional data including 7 groups of 3-dimensional acceleration, 3-dimensional angular velocity, 3-dimensional angle and 3-dimensional magnetic field are acquired by 7 inertial sensors and sent to an inertial sensor acquisition and preprocessing module, 12 pressure sensors acquire 12-dimensional plantar pressure data and send to a pressure sensor data acquisition and preprocessing module, and 2 sound sensors acquire 2-dimensional walking sound data and send to a sound sensor data acquisition and preprocessing module; the sampling frequency of each sensor is 20-100 Hz;
all data at a single sampling instant constitute one 1 × 98 raw data sample x^Raw_{i,j} = [x^Raw_{i,j,1}, x^Raw_{i,j,2}, …, x^Raw_{i,j,98}], i = 1, 2, …, 17, j = 1, 2, 3, …, where x^Raw_{i,j,k} is the k-th dimension of raw data in the j-th raw data sample under the i-th environment-gait combination, k = 1, 2, …, 98; the 98 dimensions are arranged in the order: 21-dimensional acceleration, 21-dimensional angular velocity, 21-dimensional angle, 21-dimensional magnetic field, 12-dimensional pressure, 2-dimensional sound; all raw data samples x^Raw_{i,j} collected under a single environment-gait combination form the set X^Raw_i, and the X^Raw_i of all 17 environment-gait combinations form the raw data sample set X^Raw = {X^Raw_1, …, X^Raw_17}; the total number of data samples in X^Raw is N;
1-4) each data acquisition and preprocessing module filters and normalizes its corresponding data in all raw data samples of X^Raw; the standard Kalman filter is selected as the filtering method, and each dimension x^Raw_{i,j,k}, k = 1, 2, …, 98, of a single raw data sample x^Raw_{i,j} is normalized as follows:

x^Norm_{i,j,k} = (x^Raw_{i,j,k} − mean_k) / (max_k − min_k)

where x^Norm_{i,j,k} is the normalized value of the k-th dimension of raw data in the j-th raw data sample under the i-th environment-gait combination, x^Raw_{i,j,k} is the corresponding raw value, max_k is the maximum of all k-th-dimension raw data, min_k is the minimum of all k-th-dimension raw data, and mean_k is the mean of all k-th-dimension raw data;

after all raw data samples are preprocessed, the data sample set X^Norm is obtained and sent to the deep neural network processing module;
1-5) the deep neural network processing module divides X^Norm into a training data set X^Train, a validation data set X^Validate and a test data set X^Test according to set proportions, where the training data set accounts for not less than 75%, the validation data set for not less than 5%, and the test data set for not less than 5%;
2) constructing a deep neural network based on a time convolution network in a deep neural network processing module; the method comprises the following specific steps:
2-1) determining a deep neural network structure;
adopting a time convolution network to construct a deep neural network, wherein the deep neural network is divided into a transition time prediction network and a target time prediction network;
let times 0 < t_1 < t_2 < t_3 < t_4 < t_5; from the data sample set X^Norm, the data samples from time t_1 to time t_2 are selected as the input data x(t_1)…x(t_2) of the deep neural network, the data samples from time t_3 to time t_4 are made into the transition-moment sample labels y(t_3)…y(t_4), and the data sample at time t_5 is made into the target-moment sample label z(t_5);

the input sequence data of the transition-moment prediction network is the data samples x(t_1)…x(t_2) from time t_1 to time t_2, and its output is the predicted values ŷ(t_3)…ŷ(t_4) of the data samples from time t_3 to time t_4; the target-moment prediction network takes all or part of x(t_1)…x(t_2), denoted x′(t_1)…x′(t_2), together with ŷ(t_3)…ŷ(t_4) as input, and outputs the predicted value ẑ(t_5) for time t_5;

let t_2 = t_1 + 7·T_sample, t_3 = t_2 + T_sample, t_4 = t_3 + T_sample, t_5 = t_4 + T_sample, where T_sample is the data sampling interval; that is, the transition-moment prediction network takes the data sequence x(t_1)…x(t_2) of 8 sampling moments as input and predicts the data ŷ(t_3), ŷ(t_4) of 2 sampling moments, and the target-moment prediction network takes the 8-sampling-moment data sequence x′(t_1)…x′(t_2) together with the transition-moment prediction data ŷ(t_3), ŷ(t_4) of 2 sampling moments as input and predicts the data ẑ(t_5) of 1 sampling moment;
2-2) determining a loss function of the deep neural network;
the loss function L of the deep neural network is:

L = w_y·L_y + w_z·L_z

where L_y and L_z are the loss functions of the transition-moment prediction network and the target-moment prediction network respectively, ŷ and y are the predicted value and label value of the transition-moment prediction network output, ẑ and z are the predicted value and label value of the target-moment prediction network output, and w_y and w_z are the weight coefficients of L_y and L_z; each of L_y and L_z is either the L1 loss function or the L2 loss function:

L1 = (1/N_B) · Σ_j |û_j − u_j|
L2 = (1/N_B) · Σ_j (û_j − u_j)²

where N_B is the number of samples in a batch, taking a value from {32, 64, 128, 256}, û_j is the j-th predicted value output by the network, and u_j is the corresponding label value;
2-3) determining parameters and structural hyper-parameters of the deep neural network;
the parameters to be optimized in the transition-moment prediction network comprise the convolutional-layer weights W_yc and biases B_yc, and the fully connected-layer weights W_yf and biases B_yf;
the parameters to be optimized in the target-moment prediction network comprise the convolutional-layer weights W_zc and biases B_zc, and the fully connected-layer weights W_zf and biases B_zf;
The structural hyper-parameters of the deep neural network comprise Block number, channel number, node number, convolution kernel length, void coefficient and Dropout coefficient;
the value range of the Block number is an integer in the range of [5,10], the value of the channel number is an integer in the range of [30,200], the value of the node number is an integer in the range of [50,500], the value of the convolution kernel length is 3 or 5, the value of the void coefficient is 1 or 2, and the value range of Dropout is [0,1 ];
3) training the deep neural network constructed in the step 2) to obtain the trained deep neural network and corresponding optimal parameters; the method comprises the following specific steps:
3-1) training a deep neural network;
determining the training parameters of the deep neural network, including the number of training epochs N_Epochs and the learning rate α, where one epoch trains on all data samples of the training data set, N_Epochs ≥ 100, and α takes values in (0, 1];
initializing the deep neural network parameters W_yc, B_yc, W_yf, B_yf, W_zc, B_zc, W_zf, B_zf by a random method, training them on the training data set X^Train, and updating W_yc, B_yc, W_yf, B_yf, W_zc, B_zc, W_zf, B_zf with the standard stochastic gradient descent method; every N_V training epochs, validating the deep neural network once on the validation data set X^Validate, automatically saving the network parameters with the minimum error on X^Validate as the current network parameters;
if the validation data set error no longer decreases, or the number of training epochs reaches the specified number N_Epochs, ending the training and entering step 3-2);
3-2) testing the trained deep neural network on the test data set X^Test and evaluating the optimal deep neural network parameters;
the evaluation criterion is the mean error value p, computed as:

p = (1/N_Test) · Σ_i |(ẑ_i − z_i) / z_i|

where N_Test is the number of samples in the test data set, and ẑ_i and z_i are the i-th predicted value and label value output by the target-moment prediction network;
if the evaluated mean error value p < 3%, the evaluation is finished, the current network parameters are saved as the optimal deep neural network parameters W_yc*, B_yc*, W_yf*, B_yf*, W_zc*, B_zc*, W_zf*, B_zf*, and step 4) is entered; if the evaluated mean error value p ≥ 3%, return to step 3-1) and retrain the deep neural network;
4) predicting human gait by using the trained deep neural network; the method comprises the following specific steps:
4-1) selecting a new tester, and repeating the step 1-1), so that the tester wears the inertial sensor module, the pressure sensor module and the sound sensor module respectively;
4-2) randomly selecting 1 walking environment from the 5 walking environments of step 1-2) and 1 human gait behavior from the 5 human gait behaviors of step 1-2), where going up and down stairs is performed only in the tile-ground walking environment and going up and down slopes only in the asphalt-ground walking environment; repeating step 1-3): the raw data samples of the tester wearing the three sensor modules under this environment-gait combination are collected in real time and sent to the corresponding data acquisition and preprocessing modules, and all data of one sampling are arranged into one 1 × 98 raw data sample x^Raw = [x^Raw_1, …, x^Raw_98], where x^Raw_k is the k-th dimension of raw data in the raw data sample x^Raw, k = 1, 2, …, 98;
4-3) repeating step 1-4) to preprocess x^Raw; the preprocessed data sample is denoted x^Norm and sent to the deep neural network processing module;
4-4) in the deep neural network processing module, the data samples of the 7 sampling moments preceding the sampling moment of x^Norm, together with x^Norm, form new input data for times t_1 to t_2, which are input into the deep neural network trained in step 3); the network outputs the tester's gait prediction ẑ(t_5) = [ẑ_1, …, ẑ_98] at time t_5 in real time, where ẑ_k is the k-th dimension of the gait prediction result data, k = 1, 2, …, 98.
The gait prediction result output by the human gait prediction device based on multi-modal deep learning can be transmitted directly to an exoskeleton robot or other systems for gait control.
The above description is only one embodiment of the present invention, but the scope of the present invention is not limited thereto; any modification or substitution that a person skilled in the art can readily conceive within the technical scope disclosed by the present invention shall fall within the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (1)

1. A human gait prediction device based on multi-modal deep learning, characterized by comprising: an inertial sensor module, a pressure sensor module, a sound sensor module, an inertial sensor data acquisition and preprocessing module, a pressure sensor data acquisition and preprocessing module, a sound sensor data acquisition and preprocessing module, and a deep neural network processing module;
the inertial sensor module comprises 7 inertial sensors, each connected to the inertial sensor data acquisition and preprocessing module in a wired parallel manner; the pressure sensor module comprises 12 pressure sensors, each connected to the pressure sensor data acquisition and preprocessing module in a wired parallel manner; the sound sensor module comprises 2 sound sensors, each connected to the sound sensor data acquisition and preprocessing module in a wired parallel manner; and the inertial sensor, pressure sensor and sound sensor data acquisition and preprocessing modules are each connected to the deep neural network processing module in a wired parallel manner;
the 7 inertial sensors are respectively arranged at the lower back, left thigh, right thigh, left calf, right calf, left instep and right instep of the user; each inertial sensor acquires 3-dimensional acceleration, 3-dimensional angular velocity, 3-dimensional angle and 3-dimensional magnetic field data of its body part and sends the acquired data to the inertial sensor data acquisition and preprocessing module;
the inertial sensor data acquisition and preprocessing module receives the data acquired by each inertial sensor, performs filtering and normalization preprocessing, and sends the preprocessed inertial sensor data to the deep neural network processing module;
the 12 pressure sensors are distributed in the form of insoles: one insole is placed under each of the left and right soles and each insole carries 6 pressure sensors; each pressure sensor collects the plantar pressure at its position and sends the collected data to the pressure sensor data acquisition and preprocessing module;
the pressure sensor data acquisition and preprocessing module receives the data acquired by each pressure sensor, performs filtering and normalization preprocessing, and sends the preprocessed pressure sensor data to the deep neural network processing module;
the 2 sound sensors are mounted on the left and right insteps, respectively, collect footstep sound data while the user walks, and send the collected data to the sound sensor data acquisition and preprocessing module;
the sound sensor data acquisition and preprocessing module receives the data sent by each sound sensor, performs filtering and normalization preprocessing, and sends the preprocessed sound sensor data to the deep neural network processing module;
the deep neural network processing module receives the preprocessed inertial sensor data, pressure sensor data and sound sensor data, uses a deep neural network to predict gait from the received data, and outputs the gait prediction result; the deep neural network is obtained and applied through the following steps:
1) a tester wears the different sensors to acquire multi-modal data; the multi-modal data are preprocessed to establish a data sample set, and the data sample set is divided into a training data set, a validation data set and a test data set; the specific steps are as follows:
1-1) the tester wears the inertial sensor module consisting of 7 inertial sensors, the pressure sensor module consisting of 12 pressure sensors and the sound sensor module consisting of 2 sound sensors; the 7 inertial sensors are arranged at 7 positions of the tester (lower back, left thigh, right thigh, left calf, right calf, left instep and right instep) and acquire 3-dimensional acceleration, 3-dimensional angular velocity, 3-dimensional angle and 3-dimensional magnetic field data of the different parts of the lower limb; the 12 pressure sensors are distributed as insoles, with one insole under each of the left and right soles and 6 pressure acquisition points per insole, collecting plantar pressure data at the 12 data points; one sound sensor is worn on each of the left and right insteps to collect the footstep sounds of the walking human body;
1-2) after the sensors are worn, the tester performs 5 human gait behaviors in each of 5 walking environments; the walking environments are: tile, cement, asphalt, sand and grass; the gait behaviors are: slow level walking, fast level walking, going up and down stairs, going up and down slopes, and turning left and right; going up and down stairs is performed only in the tile walking environment and going up and down slopes only in the asphalt walking environment, yielding 17 environment-gait combinations; the duration of a single environment-gait combination is 10-60 minutes;
1-3) under each environment-gait combination, at each sampling moment, the 7 inertial sensors acquire 84-dimensional data comprising 7 groups of 3-dimensional acceleration, 3-dimensional angular velocity, 3-dimensional angle and 3-dimensional magnetic field and send them to the inertial sensor data acquisition and preprocessing module, the 12 pressure sensors acquire 12-dimensional plantar pressure data and send them to the pressure sensor data acquisition and preprocessing module, and the 2 sound sensors acquire 2-dimensional walking sound data and send them to the sound sensor data acquisition and preprocessing module; the sampling frequency of each sensor is 20-100 Hz;
all data at a single sampling moment constitute one 1 × 98 raw data sample $x_{Raw}^{i,j} = (x_{Raw}^{i,j,1}, x_{Raw}^{i,j,2}, \dots, x_{Raw}^{i,j,98})$, where $x_{Raw}^{i,j,k}$ is the k-th dimension raw datum of the j-th raw data sample under the i-th environment-gait combination, k = 1, 2, …, 98, and the 98 dimensions are arranged in the order of 21-dimensional acceleration, 21-dimensional angular velocity, 21-dimensional angle, 21-dimensional magnetic field, 12-dimensional pressure and 2-dimensional sound; all raw data samples $x_{Raw}^{i,j}$ collected under a single environment-gait combination form the set $X_{Raw}^{i}$, and the sets of all 17 environment-gait combinations together form the raw data sample set $X_{Raw}$; the total number of data samples in $X_{Raw}$ is N;
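As an illustration of how one such 1 × 98 sample could be assembled from the three module outputs (the array names and buffer layout are hypothetical; only the dimension ordering follows the claim):

```python
import numpy as np

def assemble_raw_sample(imu, pressure, sound):
    """Build one 1x98 raw data sample for a single sampling moment.

    imu:      (7, 12) array, one row per inertial sensor, ordered as
              [acc(3), gyro(3), angle(3), mag(3)]
    pressure: (12,) plantar pressure values
    sound:    (2,) footstep sound values
    """
    acc   = imu[:, 0:3].reshape(-1)   # 21-dim acceleration
    gyro  = imu[:, 3:6].reshape(-1)   # 21-dim angular velocity
    angle = imu[:, 6:9].reshape(-1)   # 21-dim angle
    mag   = imu[:, 9:12].reshape(-1)  # 21-dim magnetic field
    sample = np.concatenate([acc, gyro, angle, mag, pressure, sound])
    assert sample.shape == (98,)
    return sample
```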
1-4) each data acquisition and preprocessing module filters and normalizes its corresponding data in all raw data samples of $X_{Raw}$; the filtering method is Kalman filtering; each dimension $x_{Raw}^{i,j,k}$ of a single raw data sample $x_{Raw}^{i,j}$ is normalized as:

$$x_{Norm}^{i,j,k} = \frac{x_{Raw}^{i,j,k} - \overline{x_{Raw}^{k}}}{x_{Raw,\max}^{k} - x_{Raw,\min}^{k}}$$

where $x_{Norm}^{i,j,k}$ is the normalized value of the k-th dimension raw datum of the j-th raw data sample under the i-th environment-gait combination, $x_{Raw}^{i,j,k}$ is the k-th dimension raw datum of the j-th raw data sample under the i-th environment-gait combination, $x_{Raw,\max}^{k}$ is the maximum of all k-th dimension raw data, $x_{Raw,\min}^{k}$ is the minimum of all k-th dimension raw data, and $\overline{x_{Raw}^{k}}$ is the mean of all k-th dimension raw data;
after all raw data samples are preprocessed, the data sample set $X_{Norm}$ is obtained and sent to the deep neural network processing module;
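A minimal sketch of the normalization in step 1-4), assuming the Kalman filtering has already been applied; the zero-range guard is an added safeguard not stated in the claim:

```python
import numpy as np

def normalize(X_raw):
    """Mean-normalize each of the 98 dimensions of the raw sample set.

    X_raw: (N, 98) array of Kalman-filtered raw samples.
    Returns X_norm of the same shape, using per-dimension
    (x - mean) / (max - min) as in the formula above.
    """
    x_max = X_raw.max(axis=0)
    x_min = X_raw.min(axis=0)
    x_mean = X_raw.mean(axis=0)
    span = np.where(x_max > x_min, x_max - x_min, 1.0)  # guard zero range
    return (X_raw - x_mean) / span
```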
1-5) the deep neural network processing module divides $X_{Norm}$ into a training data set $X_{Train}$, a validation data set $X_{Validate}$ and a test data set $X_{Test}$ according to set proportions, wherein the training data set accounts for not less than 75%, the validation data set for not less than 5%, and the test data set for not less than 5%;
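The split of step 1-5) could be realized, for example, as a contiguous 80/10/10 partition (any proportions satisfying the ≥75%/≥5%/≥5% constraints work); contiguous segments are used here because the 8-moment input windows of step 2-1) need consecutive samples:

```python
def split_dataset(X_norm, train=0.8, val=0.1):
    """Partition X_norm into contiguous train/validation/test segments.

    Contiguous (unshuffled) splits keep consecutive sampling moments
    together, which the 8-step input windows require.
    """
    n = len(X_norm)
    n_train = int(train * n)
    n_val = int(val * n)
    return (X_norm[:n_train],
            X_norm[n_train:n_train + n_val],
            X_norm[n_train + n_val:])
```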
2) constructing a deep neural network based on temporal convolutional networks in the deep neural network processing module; the specific steps are as follows:
2-1) determining the deep neural network structure;
the deep neural network is constructed from temporal convolutional networks and is divided into a transition-moment prediction network and a target-moment prediction network;
let $0 < t_1 < t_2 < t_3 < t_4 < t_5$; from the data sample set $X_{Norm}$, the data samples from time $t_1$ to time $t_2$ are taken as the network input data $x(t_1)\dots x(t_2)$, the data samples from time $t_3$ to time $t_4$ are taken as the transition-moment sample labels $y(t_3)\dots y(t_4)$, and the data sample at time $t_5$ is taken as the target-moment sample label $z(t_5)$;
the input of the transition-moment prediction network is the data samples $x(t_1)\dots x(t_2)$ from time $t_1$ to time $t_2$, and its output is the predicted data samples $\hat{y}(t_3)\dots\hat{y}(t_4)$ for times $t_3$ to $t_4$; the target-moment prediction network takes all or part $x'(t_1)\dots x'(t_2)$ of the data $x(t_1)\dots x(t_2)$, together with $\hat{y}(t_3)\dots\hat{y}(t_4)$, as input, and outputs the predicted value $\hat{z}(t_5)$ at time $t_5$;
let $t_2 = t_1 + 7T_{sample}$, $t_3 = t_2 + T_{sample}$, $t_4 = t_3 + T_{sample}$, $t_5 = t_4 + T_{sample}$, where $T_{sample}$ is the data sampling interval; that is, the transition-moment prediction network takes the data sequence $x(t_1)\dots x(t_2)$ of 8 sampling moments as input and predicts the data $\hat{y}(t_3), \hat{y}(t_4)$ of 2 sampling moments, and the target-moment prediction network takes the 8-sampling-moment data sequence $x'(t_1)\dots x'(t_2)$ and the 2 sampling moments of transition prediction data $\hat{y}(t_3), \hat{y}(t_4)$ as input and predicts the data $\hat{z}(t_5)$ of 1 sampling moment;
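The following PyTorch sketch (PyTorch is an assumed framework; the patent names none) shows one way the two-stage temporal-convolutional structure of step 2-1) could look. The layer sizes are placeholder choices within the hyperparameter ranges of step 2-3), and `StagePredictor` is a hypothetical name:

```python
import torch
import torch.nn as nn

class TCNBlock(nn.Module):
    """One residual block: dilated causal 1-D convolutions + dropout."""
    def __init__(self, ch_in, ch_out, kernel=3, dilation=1, dropout=0.1):
        super().__init__()
        self.pad = (kernel - 1) * dilation  # left padding => causal
        self.conv1 = nn.Conv1d(ch_in, ch_out, kernel, dilation=dilation)
        self.conv2 = nn.Conv1d(ch_out, ch_out, kernel, dilation=dilation)
        self.drop = nn.Dropout(dropout)
        self.skip = nn.Conv1d(ch_in, ch_out, 1) if ch_in != ch_out else nn.Identity()

    def forward(self, x):                      # x: (batch, ch, time)
        y = nn.functional.pad(x, (self.pad, 0))
        y = self.drop(torch.relu(self.conv1(y)))
        y = nn.functional.pad(y, (self.pad, 0))
        y = self.drop(torch.relu(self.conv2(y)))
        return torch.relu(y + self.skip(x))

class StagePredictor(nn.Module):
    """TCN backbone + fully connected head mapping a window to m samples."""
    def __init__(self, in_len, out_steps, dims=98, channels=64,
                 blocks=5, nodes=200):
        super().__init__()
        layers, ch = [], dims
        for b in range(blocks):
            layers.append(TCNBlock(ch, channels, dilation=2 ** min(b, 1)))
            ch = channels
        self.tcn = nn.Sequential(*layers)
        self.head = nn.Sequential(
            nn.Flatten(), nn.Linear(channels * in_len, nodes),
            nn.ReLU(), nn.Linear(nodes, out_steps * dims))
        self.out_steps, self.dims = out_steps, dims

    def forward(self, x):                      # x: (batch, time, 98)
        h = self.tcn(x.transpose(1, 2))        # -> (batch, ch, time)
        return self.head(h).view(-1, self.out_steps, self.dims)

# transition network: 8 moments in -> 2 moments out
transition_net = StagePredictor(in_len=8, out_steps=2)
# target network: 8 + 2 = 10 moments in -> 1 moment out
target_net = StagePredictor(in_len=10, out_steps=1)
```

Concatenating the 8-moment window with the 2 transition predictions along the time axis (`torch.cat([x_part, y_hat], dim=1)`) yields the 10-moment input to `target_net`.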
2-2) determining a loss function of the deep neural network;
the loss function L of the deep neural network is:

$$L = w_y L_y + w_z L_z$$

where $L_y$ and $L_z$ are the loss functions of the transition-moment prediction network and the target-moment prediction network respectively; $\hat{y}$ and $y$ are the predicted value and the label value of the transition-moment prediction network output; $\hat{z}$ and $z$ are the predicted value and the label value of the target-moment prediction network output; and $w_y$ and $w_z$ are the weight coefficients of $L_y$ and $L_z$; $L_y$ and $L_z$ are each chosen as either the $L_1$ loss function or the $L_2$ loss function:

$$L_1 = \frac{1}{N_B}\sum_{j=1}^{N_B}\left|\hat{u}_j - u_j\right|$$

$$L_2 = \frac{1}{N_B}\sum_{j=1}^{N_B}\left(\hat{u}_j - u_j\right)^2$$

where $N_B$ is the number of samples in a batch, taking the value 32, 64, 128 or 256; $\hat{u}_j$ is the j-th predicted value output by the network and $u_j$ the corresponding label value;
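Continuing the PyTorch sketch, the combined loss of step 2-2) with $L_1$ chosen for both terms and illustrative weights $w_y = w_z = 0.5$:

```python
import torch
import torch.nn as nn

l1 = nn.L1Loss()     # L1 loss; nn.MSELoss() would give the L2 variant
w_y, w_z = 0.5, 0.5  # illustrative weight coefficients

def combined_loss(y_hat, y, z_hat, z):
    """L = w_y * L_y + w_z * L_z, averaged over the batch
    (PyTorch's default mean reduction)."""
    return w_y * l1(y_hat, y) + w_z * l1(z_hat, z)
```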
2-3) determining parameters and structural hyper-parameters of the deep neural network;
the transition-moment prediction network parameters comprise the convolutional-layer weights $W_{yc}$ and biases $B_{yc}$ and the fully-connected-layer weights $W_{yf}$ and biases $B_{yf}$;
the target-moment prediction network parameters comprise the convolutional-layer weights $W_{zc}$ and biases $B_{zc}$ and the fully-connected-layer weights $W_{zf}$ and biases $B_{zf}$;
the structural hyperparameters of the deep neural network comprise the number of Blocks, the number of channels, the number of nodes, the convolution kernel length, the dilation coefficient and the Dropout coefficient;
the number of Blocks is an integer in [5,10], the number of channels an integer in [30,200], the number of nodes an integer in [50,500], the convolution kernel length 3 or 5, the dilation coefficient 1 or 2, and the Dropout coefficient a value in [0,1];
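For reference, the structural hyperparameters of step 2-3) can be gathered into a configuration object; the concrete values below are arbitrary picks inside the claimed ranges:

```python
# Structural hyperparameters, each within the claimed range
HPARAMS = {
    "blocks": 6,          # integer in [5, 10]
    "channels": 64,       # integer in [30, 200]
    "nodes": 200,         # integer in [50, 500]
    "kernel_length": 3,   # 3 or 5
    "dilation": 2,        # 1 or 2
    "dropout": 0.1,       # in [0, 1]
}
```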
3) training the deep neural network constructed in the step 2) to obtain the trained deep neural network and corresponding optimal parameters; the method comprises the following specific steps:
3-1) training the deep neural network;
the training parameters of the deep neural network comprise the number of training epochs $N_{Epochs}$ and the learning rate $\alpha$, where one epoch trains on all data samples of the training data set; $N_{Epochs} \ge 100$ and $\alpha \in (0,1]$;
the deep neural network parameters $W_{yc}$, $B_{yc}$, $W_{yf}$, $B_{yf}$, $W_{zc}$, $B_{zc}$, $W_{zf}$, $B_{zf}$ are initialized randomly and trained on the training data set $X_{Train}$, with parameter updates performed by standard stochastic gradient descent; every $N_V$ training epochs the deep neural network is validated once on the validation data set $X_{Validate}$, and the network parameters with the minimum error on $X_{Validate}$ are automatically saved as the current network parameters;
if the validation error no longer decreases, or the number of training epochs reaches the specified $N_{Epochs}$, training ends and the method proceeds to step 3-2);
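A condensed sketch of the training procedure of step 3-1), assuming the networks and `combined_loss` above and the `mean_error` helper shown after step 3-2); the `train_loader` yielding (x, x', y, z) batches and the default values of `n_epochs`, `lr` and `n_v` are illustrative:

```python
import copy
import torch

def train(transition_net, target_net, train_loader, val_loader,
          n_epochs=150, lr=0.01, n_v=5):
    """SGD training with periodic validation-based checkpointing.

    n_epochs >= 100 and lr in (0, 1] per step 3-1); every n_v epochs the
    parameters with the lowest validation error so far are kept.
    """
    params = list(transition_net.parameters()) + list(target_net.parameters())
    opt = torch.optim.SGD(params, lr=lr)
    best_err, best_state = float("inf"), None
    for epoch in range(n_epochs):
        for x, x_part, y, z in train_loader:   # batches of windows/labels
            y_hat = transition_net(x)
            z_hat = target_net(torch.cat([x_part, y_hat], dim=1))
            loss = combined_loss(y_hat, y, z_hat, z)
            opt.zero_grad()
            loss.backward()
            opt.step()
        if (epoch + 1) % n_v == 0:
            err = mean_error(transition_net, target_net, val_loader)
            if err < best_err:
                best_err = err
                best_state = (copy.deepcopy(transition_net.state_dict()),
                              copy.deepcopy(target_net.state_dict()))
    return best_state, best_err
```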
3-2) testing the trained deep neural network with the test data set $X_{Test}$ and evaluating the optimal deep neural network parameters;
the evaluation criterion is the mean error value p, computed as:

$$p = \frac{1}{N_{Test}}\sum_{i=1}^{N_{Test}}\left|\frac{\hat{z}_i - z_i}{z_i}\right|$$

where $N_{Test}$ is the number of samples in the test data set, and $\hat{z}_i$ and $z_i$ are the i-th predicted value output by the target-moment prediction network and the corresponding label value;
if the evaluated mean error value p < 3%, the evaluation is complete, the current network parameters are saved as the optimal deep neural network parameters $W_{yc}^*$, $B_{yc}^*$, $W_{yf}^*$, $B_{yf}^*$, $W_{zc}^*$, $B_{zc}^*$, $W_{zf}^*$, $B_{zf}^*$, and the method proceeds to step 4); if p ≥ 3%, the method returns to step 3-1) and the deep neural network is retrained;
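The mean error value p of step 3-2) could be computed as below; the relative-error form is a reconstruction consistent with the percentage threshold p < 3%, averaged here over every output element in the test set:

```python
import torch

@torch.no_grad()
def mean_error(transition_net, target_net, loader, eps=1e-8):
    """Mean relative error p between predictions z_hat(t5) and labels z,
    averaged over all output elements of the test set."""
    total, count = 0.0, 0
    for x, x_part, _, z in loader:
        y_hat = transition_net(x)
        z_hat = target_net(torch.cat([x_part, y_hat], dim=1))
        total += (torch.abs(z_hat - z) / (torch.abs(z) + eps)).sum().item()
        count += z.numel()
    return total / count

# p < 0.03 -> save current parameters as optimal; otherwise retrain (step 3-1)
```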
4) predicting human gait by using the trained deep neural network; the method comprises the following specific steps:
4-1) selecting a new tester, and repeating the step 1-1), so that the tester wears the inertial sensor module, the pressure sensor module and the sound sensor module respectively;
4-2) randomly selecting 1 walking environment from the 5 walking environments in step 1-2) and 1 human gait behavior from the 5 human gait behaviors in step 1-2), wherein going up and down stairs is performed only in the tile walking environment and going up and down slopes only in the asphalt walking environment; repeating step 1-3): with the tester wearing the three sensor modules, the raw data samples under this environment-gait combination are collected in real time and sent to the corresponding data acquisition and preprocessing modules, and all data from one sampling moment are arranged into one 1 × 98 raw data sample $x_{Raw}^{new} = (x_{Raw}^{new,1}, \dots, x_{Raw}^{new,98})$, where $x_{Raw}^{new,k}$ is the k-th dimension raw datum of $x_{Raw}^{new}$, k = 1, 2, …, 98;
4-3) repeating step 1-4): $x_{Raw}^{new}$ is preprocessed, and the preprocessed data sample, denoted $x_{Norm}^{new}$, is sent to the deep neural network processing module;
4-4) in the deep neural network processing module, $x_{Norm}^{new}$ and the data samples of the 7 sampling moments preceding it form a new input sequence from time $t_1$ to time $t_2$, which is input into the deep neural network trained in step 3); the network outputs in real time the tester's gait prediction $\hat{z}^{new}(t_5)$ at time $t_5$, where $\hat{z}^{new,k}(t_5)$ is the k-th dimension of the gait prediction result data, k = 1, 2, …, 98.
CN201910464986.0A 2019-05-30 2019-05-30 Human gait prediction device based on multi-mode deep learning Active CN110236550B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910464986.0A CN110236550B (en) 2019-05-30 2019-05-30 Human gait prediction device based on multi-mode deep learning


Publications (2)

Publication Number Publication Date
CN110236550A (en) 2019-09-17
CN110236550B (en) 2020-07-10

Family

ID=67885473

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910464986.0A Active CN110236550B (en) 2019-05-30 2019-05-30 Human gait prediction device based on multi-mode deep learning

Country Status (1)

Country Link
CN (1) CN110236550B (en)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110893100A (en) * 2019-12-16 2020-03-20 广东轻工职业技术学院 Device and method for monitoring posture change based on plantar pressure sensor
CN111590544A (en) * 2020-04-10 2020-08-28 南方科技大学 Method and device for determining output force of exoskeleton
CN111820530B (en) * 2020-07-23 2021-07-27 东莞市喜宝体育用品科技有限公司 Shoes with bradyseism braced system
CN113576467A (en) * 2021-08-05 2021-11-02 天津大学 Wearable real-time gait detection system integrating plantar pressure sensor and IMU
CN113658707A (en) * 2021-08-26 2021-11-16 华南理工大学 Foot varus angle detection modeling method and system
CN114343617A (en) * 2021-12-10 2022-04-15 中国科学院深圳先进技术研究院 Patient gait real-time prediction method based on edge cloud cooperation
CN114176577A (en) * 2021-12-30 2022-03-15 北京航空航天大学 Method and device for detecting motor nerve diseases and readable storage medium
CN115227238A (en) * 2022-08-04 2022-10-25 河北工业大学 Gait recognition system based on wearable strain sensor and construction method thereof

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2556795A1 (en) * 2011-08-09 2013-02-13 Nederlandse Organisatie voor toegepast -natuurwetenschappelijk onderzoek TNO Method and system for feedback on running style
CN103976739B (en) * 2014-05-04 2019-06-04 宁波麦思电子科技有限公司 It is wearable to fall down dynamic realtime detection method and device
CN106175778B (en) * 2016-07-04 2019-02-01 中国科学院计算技术研究所 A kind of method that establishing gait data collection and gait analysis method
CN106344031A (en) * 2016-08-29 2017-01-25 常州市钱璟康复股份有限公司 Sound feedback-based gait training and estimating system
CN109431510A (en) * 2018-11-08 2019-03-08 华东师范大学 A kind of flexible gait monitoring device calculated based on artificial intelligence
CN109784412A (en) * 2019-01-23 2019-05-21 复旦大学 The multiple sensor signals fusion method based on deep learning for gait classification

Also Published As

Publication number Publication date
CN110236550A (en) 2019-09-17

Similar Documents

Publication Publication Date Title
CN110236550B (en) Human gait prediction device based on multi-mode deep learning
CN110232412B (en) Human gait prediction method based on multi-mode deep learning
CN110334573B (en) Human motion state discrimination method based on dense connection convolutional neural network
CN106156524A (en) A kind of online gait planning system and method for Intelligent lower limb power assisting device
CN110659677A (en) Human body falling detection method based on movable sensor combination equipment
CN104656112B (en) Based on surface electromyogram signal and the used personal localization method and devices combined of MEMS
CN106874874A (en) Motion state identification method and device
CN108958482B (en) Similarity action recognition device and method based on convolutional neural network
CN110755085B (en) Motion function evaluation method and equipment based on joint mobility and motion coordination
CN109846487A (en) Thigh measuring method for athletic posture and device based on MIMU/sEMG fusion
CN110193830B (en) Ankle joint gait prediction method based on RBF neural network
CN114495267A (en) Old people falling risk assessment method based on multi-dimensional data fusion
Wang et al. A2dio: Attention-driven deep inertial odometry for pedestrian localization based on 6d imu
Wang et al. Inertial odometry using hybrid neural network with temporal attention for pedestrian localization
Tao et al. Attention-based sensor fusion for human activity recognition using imu signals
CN110705599B (en) Human body action recognition method based on online transfer learning
CN116597940A (en) Modeling method of movement disorder symptom quantitative evaluation model
CN111419237A (en) Cerebral apoplexy hand motion function Carroll score prediction method
Yang et al. Inertial sensing for lateral walking gait detection and application in lateral resistance exoskeleton
CN113229806A (en) Wearable human body gait detection and navigation system and operation method thereof
Qian et al. A Pedestrian Navigation Method Based on Construction of Adapted Virtual Inertial Measurement Unit Assisted by Gait Type Classification
CN116502066A (en) Exoskeleton swing period prediction system and method based on BP neural network
CN115904086A (en) Sign language identification method based on wearable calculation
CN116206358A (en) Lower limb exoskeleton movement mode prediction method and system based on VIO system
CN115615432A (en) Indoor pedestrian inertial navigation method based on deep neural network

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant