CN111329498A - Multi-modal driver emotion auxiliary adjusting method

Multi-modal driver emotion auxiliary adjusting method

Info

Publication number
CN111329498A
CN111329498A (application CN202010157896.XA)
Authority
CN
China
Prior art keywords
driver
road rage
information
emotion
signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010157896.XA
Other languages
Chinese (zh)
Inventor
李翠霞
李英豪
王亚博
吴卫东
周元元
杨珊珊
闫凯波
许书宁
叶帅
赵若琰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhengzhou University
Original Assignee
Zhengzhou University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhengzhou University
Priority to CN202010157896.XA
Publication of CN111329498A
Legal status: Pending

Classifications

    • A: HUMAN NECESSITIES
    • A61: MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61B: DIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B 5/00: Measuring for diagnostic purposes; identification of persons
    • A61B 5/74: Details of notification to user or communication with user or patient; user input means
    • A61B 5/7405: Notification using sound
    • A61B 5/0059: Measuring using light, e.g. diagnosis by transillumination, diascopy, fluorescence
    • A61B 5/0077: Devices for viewing the surface of the body, e.g. camera, magnifying lens
    • A61B 5/16: Devices for psychotechnics; testing reaction times; devices for evaluating the psychological state
    • A61B 5/18: Devices for evaluating the psychological state of vehicle drivers or machine operators
    • A61B 5/48: Other medical applications
    • A61B 5/4803: Speech analysis specially adapted for diagnostic purposes
    • A61B 5/746: Alarms related to a physiological condition, e.g. details of setting alarm thresholds or avoiding false alarms

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Veterinary Medicine (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Heart & Thoracic Surgery (AREA)
  • Medical Informatics (AREA)
  • Molecular Biology (AREA)
  • Surgery (AREA)
  • Animal Behavior & Ethology (AREA)
  • Physics & Mathematics (AREA)
  • Public Health (AREA)
  • Pathology (AREA)
  • Child & Adolescent Psychology (AREA)
  • Developmental Disabilities (AREA)
  • Educational Technology (AREA)
  • Hospice & Palliative Care (AREA)
  • Psychiatry (AREA)
  • Psychology (AREA)
  • Social Psychology (AREA)
  • Physiology (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Traffic Control Systems (AREA)

Abstract

The invention discloses a multi-modal driver emotion auxiliary adjusting method, which relates to the field of artificial intelligence and machine learning. The method comprises: collecting multi-modal data information of the driver; identifying the driver's emotional state in each single modality from the multi-modal data and judging whether each single-modality emotional state is a road rage signal; fusing the single-modality judgment results, judging whether the fused result is an overall road rage signal, and grading the overall road rage signal; and outputting different voice reminders according to the grade of the overall road rage signal to adjust the driver's emotion. The invention can monitor the driver's emotion in real time and, when the emotional state is abnormal, take corresponding measures to help adjust it, thereby reducing accidents and protecting driving safety.

Description

Multi-modal driver emotion auxiliary adjusting method
Technical Field
The invention relates to the field of artificial intelligence and machine learning, in particular to a multi-modal driver emotion auxiliary adjusting method.
Background
The concept of road rage was first proposed abroad in the field of psychology. The medical community classifies road rage as intermittent explosive disorder: recurrent, explosive outbursts of anger whose intensity is grossly out of proportion to the provocation. For example, when a driver encounters a traffic jam or other mood-affecting events, he may erupt in anger and show aggressive operating behaviors such as frequent honking and frequent braking.
Traffic accidents caused by road rage rise year after year and seriously affect public traffic safety. Although some driver-assistance systems are already on the market, they neither consider the influence of the driver's emotion on driving safety nor match an appropriate adjustment strategy to the result of emotion recognition.
Disclosure of Invention
To solve the problems described in the background art, the invention provides a multi-modal driver emotion auxiliary adjusting method that collects emotion-related data on the driver through multiple channels, analyzes and judges the emotion with deep learning methods, and then takes corresponding measures to help adjust the driver's emotion, thereby reducing accidents.
The technical solution of the invention is as follows:
a multi-modal driver emotion auxiliary adjusting method comprising the following steps:
S1, collecting multi-modal data information of the driver, the multi-modal data comprising image information, voice information and driving behavior information;
S2, identifying the driver's emotional state in each single modality from the multi-modal data, and judging whether each single-modality emotional state is a road rage signal;
S3, fusing the judgment results for the single-modality emotional states, judging whether the fused result is an overall road rage signal, and grading the overall road rage signal;
S4, outputting different voice reminders according to the grade of the overall road rage signal to adjust the driver's emotion.
Preferably, S2 includes image information recognition, specifically:
the driver image information captured by a camera is split into frames, each frame is sent to an OpenCV face detector, and the face image of the driver in the vehicle is cropped out in real time;
the face image is resized to 224 × 224 pixels and sent to a VGG16 deep convolutional neural network model for feature extraction, and is classified as a road rage signal or a non-road rage signal.
Preferably, S2 includes voice information recognition, specifically:
a VAD algorithm captures the speech of the driver in the vehicle in real time, and the speech is sent to an ASR model to be converted into text;
whether the driver's speech contains a wake-up word is then judged; if it does, the speech is sent to the voice interaction module for normal human-computer interaction; if it does not, sentiment analysis is performed and the driver's speech is classified as a road rage signal or a non-road rage signal.
Preferably, S2 includes driving behavior information recognition, specifically:
driving behavior data of the driver are collected by film sensors mounted on the steering wheel and the brake pedal;
the collected driving behavior data are sent to a random forest model, which classifies them as a road rage signal or a non-road rage signal;
the driving behavior data comprise the driver's average warm-up time after starting, in-place idling ratio, hard-acceleration count and hard-braking count over the past three days.
Preferably, in S3, a driver emotional state in a single modality that is a road rage signal is recorded as "1", and one that is a non-road rage signal is recorded as "0". The overall road rage signal is graded as follows:
when the single-modality emotional states sum to 0, there is no road rage;
when the single-modality emotional states sum to 1, the driver has mild road rage;
when the single-modality emotional states sum to 2, the driver has moderate road rage;
when the single-modality emotional states sum to 3, the driver has severe road rage.
Compared with the prior art, the invention has the following beneficial effects: the invention can monitor the driver's emotion in real time from the driver's image information, voice information and driving behavior information, and when the driver's emotional state is abnormal it can take voice-reminder measures to help adjust that emotion, thereby reducing accidents and protecting driving safety.
Drawings
FIG. 1 is a flow chart of the present invention;
FIG. 2 is a working principle diagram of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention are described below clearly and completely with reference to the drawings. Obviously, the described embodiments are only some, rather than all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative effort shall fall within the protection scope of the present invention.
As shown in FIG. 1, the invention provides a multi-modal driver emotion auxiliary adjusting method, which includes:
s1, collecting multi-modal data information of the driver; the multi-modal data information comprises image information, voice information and driving behavior information;
s2, according to the multi-modal data information, respectively identifying the emotional states of the drivers in a plurality of single modes, and judging whether the emotional state of each driver in the single mode is a road rage signal;
s3, fusing judgment results of emotional states of the drivers in the single modes, judging whether the fusion result is a general road rage signal, and grading the general road rage signal;
and S4, correspondingly outputting different voice reminding information according to the grade of the total road rage signal, and adjusting the emotion of the driver.
Specifically, as shown in FIG. 2, the single-modality emotional state data of the driver comprise image information, voice information and driving behavior information.
Image information recognition, specifically: the driver image information captured by a camera is split into frames, each frame is sent to an OpenCV face detector, and the face image of the driver in the vehicle is cropped out in real time; the face image is resized to 224 × 224 pixels and sent to a VGG16 deep convolutional neural network model for feature extraction, and is classified as a road rage signal or a non-road rage signal.
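A minimal Python sketch of this per-frame pipeline is given below for illustration only. The patent does not name a specific face detector, training procedure or weights, so the Haar cascade (one plausible reading of the "OpenCV face detector"), the ImageNet-initialized VGG16 backbone and the sigmoid binary head are all assumptions; a deployed system would load weights trained on labeled road-rage faces.

```python
# Hypothetical sketch of the per-frame image pipeline: a Haar-cascade face
# detector crops the driver's face, and a VGG16-based binary classifier
# labels it road rage (1) or non-road rage (0). The weights here are
# assumptions; a real system would load a model trained on road-rage faces.
import cv2
import numpy as np
from tensorflow.keras import layers, models
from tensorflow.keras.applications import VGG16

def build_rage_classifier() -> models.Model:
    # VGG16 backbone for feature extraction plus a sigmoid binary head.
    base = VGG16(weights="imagenet", include_top=False, input_shape=(224, 224, 3))
    pooled = layers.GlobalAveragePooling2D()(base.output)
    out = layers.Dense(1, activation="sigmoid")(pooled)
    return models.Model(base.input, out)

face_cascade = cv2.CascadeClassifier(
    cv2.data.haarcascades + "haarcascade_frontalface_default.xml")
model = build_rage_classifier()  # stand-in for the trained model

def classify_frame(frame: np.ndarray) -> int:
    """Return 1 (road rage) or 0 (non-road rage) for one BGR video frame."""
    gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
    faces = face_cascade.detectMultiScale(gray, scaleFactor=1.1, minNeighbors=5)
    if len(faces) == 0:
        return 0  # no face found in this frame: treat as non-road rage
    x, y, w, h = faces[0]
    face = cv2.resize(frame[y:y + h, x:x + w], (224, 224))
    batch = face[np.newaxis].astype("float32") / 255.0
    return int(model.predict(batch, verbose=0)[0, 0] > 0.5)
```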
Voice information recognition, specifically: a VAD algorithm captures the speech of the driver in the vehicle in real time, and the speech is sent to an ASR model to be converted into text. Whether the driver's speech contains a wake-up word is then judged; if it does, the speech is sent to the voice interaction module for normal human-computer interaction, such as sending voice messages, querying weather and road-condition information by voice, playing music and skipping songs.
If it does not contain the wake-up word, sentiment analysis is performed and the driver's speech is classified as a road rage signal or a non-road rage signal.
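For illustration only, the following sketch shows how such routing might look, assuming webrtcvad for voice activity detection on 16 kHz, 16-bit mono PCM. The patent names neither the VAD implementation, the ASR engine, the wake-up word, nor the text sentiment model, so transcribe(), text_is_road_rage() and the wake word "xiaozhi" are hypothetical placeholders.

```python
# Hedged sketch of the speech branch: VAD gates the microphone stream, ASR
# transcribes the captured speech, and the text is routed by wake-up word.
import webrtcvad

SAMPLE_RATE = 16000        # 16 kHz, 16-bit mono PCM assumed
WAKE_WORD = "xiaozhi"      # hypothetical wake-up word, not from the patent
vad = webrtcvad.Vad(2)     # aggressiveness 0 (least) to 3 (most)

def transcribe(pcm_bytes: bytes) -> str:
    # Placeholder: a real system would call its ASR model here.
    return ""

def text_is_road_rage(text: str) -> bool:
    # Placeholder: a real system would run its text sentiment model here.
    return False

def handle_utterance(frames):
    """frames: iterable of 30 ms PCM chunks from the cabin microphone."""
    speech = b"".join(f for f in frames if vad.is_speech(f, SAMPLE_RATE))
    if not speech:
        return None                   # silence: nothing to do
    text = transcribe(speech)
    if WAKE_WORD in text:
        return ("interaction", text)  # hand off to the voice-interaction module
    # No wake-up word: classify the utterance instead.
    return ("road_rage", 1 if text_is_road_rage(text) else 0)
```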
Driving behavior information recognition, specifically: driving behavior data of the driver are collected by film sensors mounted on the steering wheel and the brake pedal; the collected driving behavior data are sent to a random forest model, which classifies them as a road rage signal or a non-road rage signal; the driving behavior data comprise the driver's average warm-up time after starting, in-place idling ratio, hard-acceleration count and hard-braking count over the past three days.
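As a sketch of the shape of this branch, the snippet below fits scikit-learn's RandomForestClassifier on the four named features. The training data is random and stands in for real labeled driving records, which the patent does not disclose; the feature names and hyperparameters are likewise illustrative.

```python
# Illustrative sketch of the driving-behaviour branch: a random forest over
# the four features the description names (tracked over the past three days).
# X_train / y_train are random stand-ins for real labeled driving records.
import numpy as np
from sklearn.ensemble import RandomForestClassifier

FEATURES = ["avg_warmup_time_s", "in_place_idle_ratio",
            "hard_accel_count", "hard_brake_count"]  # illustrative names

rng = np.random.default_rng(0)
X_train = rng.random((200, len(FEATURES)))   # one row per observation window
y_train = rng.integers(0, 2, size=200)       # 1 = road rage, 0 = non-road rage

forest = RandomForestClassifier(n_estimators=100, random_state=0)
forest.fit(X_train, y_train)

def behaviour_signal(warmup_s, idle_ratio, hard_accel, hard_brake) -> int:
    """Classify one feature vector as road rage (1) or non-road rage (0)."""
    x = np.array([[warmup_s, idle_ratio, hard_accel, hard_brake]])
    return int(forest.predict(x)[0])
```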
Wherein a driver emotional state in a single modality that is a road rage signal is recorded as "1", and one that is a non-road rage signal is recorded as "0". The overall road rage signal is graded as follows:
(1) When the single-modality emotional states sum to 0, i.e. the driver's emotion score is 0, the driver has no road rage and no voice prompt is given;
(2) When the single-modality emotional states sum to 1, the driver has mild road rage: the level-1 prompt is activated and a voice reminder may be played, for example "Are you feeling unhappy? Let's play some relaxing music." The soothing music in the song library may be collected in advance by the driver according to personal preference, or chosen from featured songs.
(3) When the single-modality emotional states sum to 2, the driver has moderate road rage: the level-2 prompt is activated and a voice reminder may be played, for example "You are showing signs of road rage; your mother has something to say to you." A message recorded in advance by the driver's mother is then played from the voice library.
(4) When the single-modality emotional states sum to 3, the driver has severe road rage: the level-3 prompt is activated and a voice reminder may be played, for example "This is dangerous; here comes your child." A message recorded in advance by the driver's child is then played from the voice library.
The voices used for the level-2 and level-3 prompts may also be selected and set by the driver according to whose voice carries the most weight with him or her.
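Taken together, the fusion in S3 and the prompt selection in S4 reduce to summing three binary signals and indexing a four-level table, as in the minimal sketch below. The prompt strings merely paraphrase the examples above; the exact wording is not fixed by the patent.

```python
# Minimal sketch of decision-level fusion (S3) and prompt selection (S4):
# three unimodal road-rage signals (each 0 or 1) are summed into a 0-3 grade.
PROMPTS = {
    0: None,  # no road rage: stay silent
    1: "Are you feeling unhappy? Let's play some relaxing music.",         # level 1
    2: "You are showing signs of road rage; a message from your mother.",  # level 2
    3: "This is dangerous; here is a message from your child.",            # level 3
}

def fuse_and_grade(image_sig: int, speech_sig: int, behaviour_sig: int):
    """Each input is 1 (road rage) or 0 (non-road rage); returns (grade, prompt)."""
    grade = image_sig + speech_sig + behaviour_sig
    return grade, PROMPTS[grade]

# Example: image and behaviour branches both fire -> moderate road rage (level 2).
print(fuse_and_grade(1, 0, 1))
```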
The above disclosure describes only preferred embodiments of the present invention, but the embodiments of the present invention are not limited thereto; any variation that a person skilled in the art can readily conceive shall fall within the protection scope of the present invention.

Claims (5)

1. A multi-modal driver emotion auxiliary adjusting method, characterized by comprising the following steps:
S1, collecting multi-modal data information of the driver, the multi-modal data comprising image information, voice information and driving behavior information;
S2, identifying the driver's emotional state in each single modality from the multi-modal data, and judging whether each single-modality emotional state is a road rage signal;
S3, fusing the judgment results for the single-modality emotional states, judging whether the fused result is an overall road rage signal, and grading the overall road rage signal;
S4, outputting different voice reminders according to the grade of the overall road rage signal to adjust the driver's emotion.
2. The multi-modal driver emotion auxiliary adjusting method according to claim 1, wherein S2 includes image information recognition, specifically:
the driver image information captured by a camera is split into frames, each frame is sent to an OpenCV face detector, and the face image of the driver in the vehicle is cropped out in real time;
the face image is resized to 224 × 224 pixels and sent to a VGG16 deep convolutional neural network model for feature extraction, and is classified as a road rage signal or a non-road rage signal.
3. The multi-modal driver emotion auxiliary adjusting method according to claim 1, wherein S2 includes voice information recognition, specifically:
a VAD algorithm captures the speech of the driver in the vehicle in real time, and the speech is sent to an ASR model to be converted into text;
whether the driver's speech contains a wake-up word is then judged; if it does, the speech is sent to the voice interaction module for normal human-computer interaction; if it does not, sentiment analysis is performed and the driver's speech is classified as a road rage signal or a non-road rage signal.
4. The multi-modal driver emotion auxiliary adjusting method according to claim 1, wherein S2 includes driving behavior information recognition, specifically:
driving behavior data of the driver are collected by film sensors mounted on the steering wheel and the brake pedal;
the collected driving behavior data are sent to a random forest model, which classifies them as a road rage signal or a non-road rage signal;
the driving behavior data comprise the driver's average warm-up time after starting, in-place idling ratio, hard-acceleration count and hard-braking count over the past three days.
5. The multi-modal driver emotion auxiliary adjusting method according to claim 1, wherein in S3 a driver emotional state in a single modality that is a road rage signal is recorded as "1" and one that is a non-road rage signal is recorded as "0", and the overall road rage signal is graded as follows:
when the single-modality emotional states sum to 0, there is no road rage;
when the single-modality emotional states sum to 1, the driver has mild road rage;
when the single-modality emotional states sum to 2, the driver has moderate road rage;
when the single-modality emotional states sum to 3, the driver has severe road rage.
CN202010157896.XA (priority and filing date 2020-03-09): Multi-modal driver emotion auxiliary adjusting method, published as CN111329498A, legal status Pending.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010157896.XA CN111329498A (en) 2020-03-09 2020-03-09 Multi-modal driver emotion auxiliary adjusting method

Publications (1)

Publication Number Publication Date
CN111329498A 2020-06-26

Family

ID=71174294

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010157896.XA Pending CN111329498A (en) 2020-03-09 2020-03-09 Multi-modal driver emotion auxiliary adjusting method

Country Status (1)

Country Link
CN (1) CN111329498A (en)


Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100134302A1 (en) * 2008-12-01 2010-06-03 Electronics And Telecommunications Research Institute System and method for controlling emotion of car driver
CN106114516A * 2016-08-31 2016-11-16 Angry-driving behavior modeling and intervention device based on driver characteristics
CN106803423A (en) * 2016-12-27 2017-06-06 智车优行科技(北京)有限公司 Man-machine interaction sound control method, device and vehicle based on user emotion state
CN107235045A * 2017-06-29 2017-10-10 Vehicle-mounted driver road rage state recognition and interaction system considering physiological and operation information
CN107316436A (en) * 2017-07-31 2017-11-03 努比亚技术有限公司 Dangerous driving state processing method, electronic equipment and storage medium
CN108364373A (en) * 2018-02-07 2018-08-03 安徽星网软件技术有限公司 Vehicle behavior monitoring method and device
CN110751381A (en) * 2019-09-30 2020-02-04 东南大学 Road rage vehicle risk assessment and prevention and control method

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112078588A (en) * 2020-08-11 2020-12-15 大众问问(北京)信息科技有限公司 Vehicle control method and device and electronic equipment
CN112784695A (en) * 2020-12-31 2021-05-11 南京视察者智能科技有限公司 Driver abnormal state detection method based on image and voice recognition
CN112784695B (en) * 2020-12-31 2024-02-20 南京视察者智能科技有限公司 Method for detecting abnormal state of driver based on image and voice recognition
US20220355804A1 (en) * 2021-05-05 2022-11-10 Mitac Digital Technology Corporation Method and system for enhancing driving safety
CN113658580A (en) * 2021-06-24 2021-11-16 大众问问(北京)信息科技有限公司 Voice prompt method and device, computer equipment and storage medium
CN115359532A (en) * 2022-08-23 2022-11-18 润芯微科技(江苏)有限公司 Human face emotion capturing and outputting device based on 3D sensing

Similar Documents

Publication Publication Date Title
CN111329498A (en) Multi-modal driver emotion auxiliary adjusting method
CN106803423B (en) Man-machine interaction voice control method and device based on user emotion state and vehicle
CN112277955B (en) Driving assistance method, device, equipment and storage medium
CN106448221B (en) Pushing method of online driving assistance information pushing system and application of pushing method
CN112215097A (en) Method for monitoring driving state of vehicle, vehicle and computer readable storage medium
CN112785837A (en) Method and device for recognizing emotion of user when driving vehicle, storage medium and terminal
US20210362594A1 (en) Car driver alcohol level and sleeping status detection and notification system
CN112784695A (en) Driver abnormal state detection method based on image and voice recognition
CN109541600A (en) A kind of heavy type commercial automobile safe and intelligent driving assistance system
CN116729403A (en) Emotion recognition reminding and control system and method in automobile driving process
CN111904806A (en) Blind guiding system
CN206049658U Angry-driving behavior modeling and intervention device based on driver characteristics
Kamaruddin et al. Driver behaviour state recognition based on speech
JP2010271794A (en) Driving behavior guiding system
KR102270011B1 (en) Deep learning-based autonomous vehicle visualize system for the visually impaired and method thereof
Malta et al. Multi-modal real-world driving data collection, transcription, and integration using Bayesian network
KR102132058B1 (en) Interactive voice communication system embedded in a car
CN114655241A (en) Automobile intelligent multimedia central control screen fatigue driving reminding method
CN112773349A (en) Method and device for constructing emotion log of user driving vehicle, storage medium and terminal
CN111660927A (en) Whistling method and system
Kashevnik et al. Driver intelligent support system in internet of transportation things: Smartphone-based approach
JP2021114081A (en) Drive recorder, recording method, and program
JP3471042B2 (en) Automotive electronics
CN112706691A (en) Vehicle reminding method and device
US20230063708A1 (en) Systems and methods for testing driver awareness within a vehicle and/or intervening during a possible crisis

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination