CN113314106A

CN113314106A - Electric power information query and regulation function calling method based on voice and intention recognition

Info

Publication number: CN113314106A
Application number: CN202110546414.4A
Authority: CN
Inventors: 李铁; 唐俊刺; 皮俊波; 姜枫; 陈晓东; 崔岱; 王淼; 姜狄; 葛延峰; 高凯; 孙文涛; 周志; 王明凯; 高梓济; 李桐; 王刚; 宋进良; 孙茜
Original assignee: State Grid Corp of China SGCC; State Grid Liaoning Electric Power Co Ltd; Electric Power Research Institute of State Grid Liaoning Electric Power Co Ltd
Current assignee: State Grid Corp of China SGCC; State Grid Liaoning Electric Power Co Ltd; Electric Power Research Institute of State Grid Liaoning Electric Power Co Ltd
Priority date: 2021-05-19
Filing date: 2021-05-19
Publication date: 2021-08-27

Abstract

The method comprises the steps of calling a power information query and regulation function based on voice and intention recognition, and outputting a scheduling telephone text by combining an acoustic model and a language model based on deep learning according to a scheduling voice corpus; carrying out scheduling intention recognition in a field recognition model and an intention recognition model based on deep learning according to the regulation instruction word vector; outputting a regulation instruction matching result by the slot position identification model by using a rule method; according to the regulation and control instruction matching result, power grid load flow analysis, short-circuit current calculation and switching-on and switching-off operations are combined to perform power information inquiry, power grid automatic calculation and power automatic scheduling; and inputting the scheduling telephone text and the regulation and control instruction matching result into a speech synthesis engine, converting the scheduling telephone text and the regulation and control instruction matching result into scheduling telephone speech, and performing automatic response of a scheduling telephone. The method realizes the function call of the power information inquiry and the regulation and control system from three aspects of a voice recognition technology in the power regulation and control field, an intention recognition technology in the power regulation and control field and a multi-turn dialogue technology of the regulation and control system.

Description

Electric power information query and regulation function calling method based on voice and intention recognition

Technical Field

The invention relates to the technical field of power grid regulation and control data analysis, in particular to a power information query and regulation and control function calling method based on voice and intention recognition.

Background

At present, as a core driving force of a new industrial revolution, artificial intelligence has remarkable effect in the aspects of enabling and promoting the traditional industry and hastening the emergence of new industries, and is expected to cause great revolution of economic structures. China highly pays attention to the development of the artificial intelligence industry, and with the construction and the improvement of new technologies such as a regulation cloud platform and the like, the data volume and the computing capacity of a power system are greatly improved recently, so that favorable conditions are created for developing the technical research of artificial intelligence robots in the field of power regulation.

Spoken language understanding is an important component of a human-machine dialog system, while intent recognition is a subtask and crucial in spoken language understanding. The accuracy of intent recognition is directly related to the performance of semantic slot filling and aids in the construction of subsequent dialog systems. In the prior art, the intention recognition method mainly comprises a semantic recognition method based on a rule template and a classification algorithm using statistical characteristics. The intent recognition method based on the rule template generally needs to artificially construct the rule template and classify the user intent text according to the category information. The statistical feature classification-based method needs to extract key features of the text, such as characters, word features, N-grams and the like, and then realizes intent classification by training a classifier. In consideration of the difficulty of intention recognition in a man-machine conversation system, the traditional machine learning method cannot understand deep semantic information of user words, and the research of an intention recognition method based on a deep neural network is promoted. With the development of the deep neural network, more and more students apply word vectors, convolutional neural network networks, cyclic neural networks, variable long-term memory networks, gated cyclic units, attention mechanisms and capsule networks to the intention recognition task, and compared with the traditional machine learning method, the deep learning model has the advantages that the recognition performance is greatly improved, and the single intention recognition and the multiple intention recognition are changed into more practical achievements.

With the rapid development of power systems, the market-oriented work of power is promoted, the integration characteristics of large power grids become more and more obvious, and the coupling relationship between different elements in the systems and the external environment is continuously enhanced. The types of power grid resources are continuously enriched, the new energy ratio is continuously improved, new control targets such as the improvement of the new energy consumption level are continuously evolved, and the uncertainty of the power grid state is further enhanced. A series of deep development changes of the power system and the regulation and control service enable the complexity of the scheduling control strategy and the regulation to be continuously improved, and higher requirements are provided for automation and intellectualization of the regulation and control service. The power dispatching control center is a 'command brain' integrating high-value data, analysis rules, expert experience and calculation decisions, the existing regulation and control mode mainly takes manual experience analysis as a main mode, a dispatcher needs to perform experience knowledge correlation on massive and diverse data and scheme models, more repetitive 'human brain labor' exists, and the efficiency is lower. Therefore, intelligent regulation is realized, and the working strength of regulation personnel is reduced.

Through the above analysis, in the prior art, the problems and defects in the real-time analysis of the regulation and control service are as follows:

(1) the existing multi-turn conversation technology has the problems of low intelligent degree, large research and development workload and the like.

(2) The existing power dispatching regulation and control mode mainly adopts manual experience analysis, and dispatching personnel need to perform experience knowledge correlation on massive diverse data and scheme models, so that more repetitive 'human-brain labor' is realized, and the efficiency is lower.

Disclosure of Invention

In order to solve the defects in the prior art, the invention aims to provide a power information query and regulation function calling method based on voice and intention recognition, and the power information query and regulation function calling is realized from three aspects of a voice recognition technology in the power regulation field, an intention recognition technology in the power regulation field and a multi-turn dialogue technology of a regulation system.

The invention adopts the following technical scheme.

The electric power information query and regulation function calling method based on voice and intention recognition comprises the following steps:

step 1, collecting scheduling voice corpora and constructing a scheduling voice corpus;

step 2, using the scheduling voice corpus as input data and inputting the input data into the trained acoustic model and language model; jointly outputting a dispatching telephone text by the trained acoustic model and the language model;

the trained acoustic model and language model are trained by adopting a deep learning algorithm by taking a scheduling voice corpus as a training set;

step 3, collecting a regulation and control instruction corpus, and vectorizing regulation and control instruction corpus words to construct a regulation and control instruction corpus according to the obtained regulation and control instruction word vectors;

step 4, taking the vector of the regulation instruction word as input data, and respectively inputting the input data into the trained field recognition model and the trained intention recognition model;

outputting a field matching result by a field recognition model, and outputting an intention recognition result by an intention recognition model according to the field matching result;

the method comprises the following steps that a trained field recognition model and an intention recognition model are trained by adopting a deep learning algorithm by taking a control instruction corpus as a training set;

step 5, taking the intention recognition result as input data of the slot position recognition model, and outputting a regulation and control instruction matching result by the slot position recognition model;

the slot position identification model extracts semantic tags from the scheduling instructions by using a rule analysis algorithm;

step 6, according to the regulation and control instruction matching result, defining a scheduling instruction to prepare for carrying out power scheduling; determining the running state of the power grid by combining power grid load flow calculation, short-circuit current calculation and switching-on and switching-off operations; meanwhile, according to the matching result of the regulation and control instruction, the power information query, the automatic power grid calculation and the automatic power dispatching response operation are executed;

and 7, inputting the scheduling telephone text and the regulation and control instruction matching result into a voice synthesis engine, converting the scheduling telephone text and the regulation and control instruction matching result into scheduling telephone voice through the voice synthesis engine, and automatically responding to the scheduling telephone by using the scheduling telephone voice.

Preferably, the first and second electrodes are formed of a metal,

in the step 1, noise reduction processing and feature extraction are carried out on the collected scheduling voice corpus, and the obtained feature sequence is stored in a scheduling voice corpus; the scheduling voice corpus comprises a general corpus and an electric power professional corpus, and further comprises a standard voice signal for matching degree testing.

Preferably, the first and second electrodes are formed of a metal,

the step 2 comprises the following steps:

step 2.1, using the scheduling voice corpus as input data and inputting the input data into the trained acoustic model and language model;

step 2.2, the trained acoustic model carries out noise processing and acoustic feature extraction on the scheduling voice corpus to output an acoustic feature sequence; obtaining the matching degree of the acoustic feature sequence, namely the score of an acoustic model, by using a standard voice signal;

step 2.3, the trained language model carries out noise processing and semantic feature extraction on the scheduling voice corpus to output a semantic feature sequence; obtaining the matching degree of the semantic feature sequence, namely the language model score by using a standard voice signal;

step 2.4, taking the feature sequence with the highest comprehensive score of the acoustic model and the language model as a dispatching telephone feature sequence;

and 2.5, performing language decoding on the scheduling telephone feature sequence based on the electric power professional corpus to obtain a scheduling telephone text.

Preferably, the first and second electrodes are formed of a metal,

in step 2.2, standard voice signals are used as matching degree test input data and input into the trained acoustic model, and the acoustic model outputs a standard acoustic feature sequence; and calculating the matching degree of the acoustic feature sequence and the standard acoustic feature sequence based on a distance algorithm.

Step 2.3, standard voice signals are used as matching degree test input data and input into the trained language model, and the language model outputs a standard semantic feature sequence; and calculating the matching degree of the semantic feature sequence and the standard semantic feature sequence based on a distance algorithm.

Preferably, the first and second electrodes are formed of a metal,

step 6, integrating and summarizing equipment tide, power grid events, operation modes, risk early warning, equipment maintenance, accident plans and meteorological information, and inquiring and watching various electric power information in the same interface; the interface supports the dragging of the information module, and the currently required information can be customized and displayed according to the requirements.

Further, step 6 further comprises: and the functions of report self-customization, picture self-generation and telephone self-response are realized by interacting with a regulation knowledge base.

Further, step 6 further comprises: and according to the switching operation knowledge graph and the disconnecting link position image recognition result, the intelligent ticket forming, the automatic execution of the operation ticket and the confirmation of the disconnecting link position of the operation ticket are realized.

Further, the picture self-generation is to integrate and summarize the equipment tide, the power grid event, the operation mode, the risk early warning, the equipment maintenance, the accident plan and the meteorological information, and check various information in one interface.

Preferably, the first and second electrodes are formed of a metal,

the step 7 comprises the following steps: acquiring a scheduling telephone text and a regulation and control instruction matching result; through setting the characteristic parameters of the synthesized voice, converting the text of the dispatching telephone and the matching result of the regulating instruction into voice by using a voice synthesis engine; the synthesized voice is broadcasted to the regulating personnel.

Compared with the prior art, the invention has the beneficial effects that:

and developing the application of an artificial intelligence technology in regulating and controlling the operation of the streamlined business based on the voice and intention recognition, and taking the intention recognition as a core module of a multi-turn conversation scene for understanding the intention of a dispatcher, converting the real intention of the dispatcher into a machine language and transmitting the machine language to a computer. The method faces to the actual scene, accurately identifies the information intention, realizes the flow intelligent execution of the regulation and control operation business, particularly the function calling of the power information inquiry and regulation and control system, lightens the flow business burden of the regulation and control professional, improves the working efficiency and the safety level, and ensures the safe and stable operation of the power grid.

The invention has the beneficial effects that the power grid regulates and controls the transactional work intelligent execution technology, realizes the power information query, search and function call based on voice, and realizes the functions of customizing reports, automatically composing pictures, automatically answering typical service calls and the like. The intelligent interaction of regulation and control things can be realized through voice recognition and intention recognition, the switching operation data information is managed and controlled on line, a dispatcher is assisted to execute the whole flow, and the power grid system level warning capability and the intelligent fault handling capability are improved. The intelligent control system has the advantages that simple and repetitive control work is replaced by an artificial intelligence means, the intelligence level of application products in the control field is improved, a dispatcher can be liberated from the repetitive work, and the dispatcher can have the ability to think more work which is beneficial to the operation of a power grid.

Drawings

FIG. 1 is a flow chart of a method for calling a power information query and regulation function based on voice and intention recognition according to the present invention;

fig. 2 is a schematic diagram illustrating a power information query and a regulation and control system function call implemented by a power regulation and control field speech recognition technology, a power regulation and control field intention recognition technology, and a regulation and control system multi-turn dialogue technology in an embodiment of the present invention.

Detailed Description

The present application is further described below with reference to the accompanying drawings. The following examples are only for illustrating the technical solutions of the present invention more clearly, and the protection scope of the present application is not limited thereby.

As shown in fig. 1, the method for calling the power information query and regulation function based on voice and intention recognition includes:

step 1, collecting a scheduling voice corpus and constructing a scheduling voice corpus.

In particular, the amount of the solvent to be used,

the trained acoustic model and language model are trained by adopting a deep learning algorithm by taking a scheduling voice corpus as a training set.

In particular, the amount of the solvent to be used,

the step 2 comprises the following steps:

and 2.1, taking the scheduling voice corpus as input data and inputting the scheduling voice corpus into the trained acoustic model and language model.

Step 2.2, the trained acoustic model carries out noise processing and acoustic feature extraction on the scheduling voice corpus to output an acoustic feature sequence; and obtaining the matching degree of the acoustic feature sequence, namely the acoustic model score by using the standard voice signal.

In particular, the amount of the solvent to be used,

Step 2.3, the trained language model carries out noise processing and semantic feature extraction on the scheduling voice corpus to output a semantic feature sequence; and obtaining the matching degree of the semantic feature sequence, namely the language model score by using the standard voice signal.

In particular, the amount of the solvent to be used,

And 2.4, taking the feature sequence with the highest comprehensive score of the acoustic model and the language model as a dispatching telephone feature sequence.

And 3, collecting the control instruction corpus, and vectorizing the control instruction corpus words to construct a control instruction corpus according to the obtained control instruction word vectors.

the trained field recognition model and the trained intention recognition model are trained by adopting a deep learning algorithm by taking a control instruction corpus as a training set.

and the slot position identification model extracts semantic tags from the scheduling instructions by using a rule analysis algorithm.

Step 6, according to the regulation and control instruction matching result, defining a scheduling instruction to prepare for carrying out power scheduling; determining the running state of the power grid by combining power grid load flow calculation, short-circuit current calculation and switching-on and switching-off operations; and simultaneously, according to the regulation and control instruction matching result, performing power information query, automatic power grid calculation and automatic power dispatching response operation.

In particular, the amount of the solvent to be used,

The picture self-generation is to integrate and summarize the equipment tide, the power grid event, the operation mode, the risk early warning, the equipment maintenance, the accident plan and the meteorological information, and check various information in one interface.

In particular, the amount of the solvent to be used,

In the preferred embodiment of the invention, the power information query and the regulation and control system function call are realized from three aspects of a voice recognition technology in the power regulation and control field, an intention recognition technology in the power regulation and control field and a multi-turn conversation technology of the regulation and control system, as shown in fig. 2. Firstly, a corpus of a general small sample set and an electric power professional lexicon is constructed, and an acoustic model and a language model are trained based on deep learning. And obtaining a characteristic sequence by noise processing and characteristic extraction of the dispatching voice telephone, matching the characteristic sequence with the characteristic sequence in the training library after voice decoding, and synthesizing the characteristic sequence with the highest score according to the acoustic model and the language model to obtain an output dispatching telephone text. Secondly, a field identification model and an intention identification model of the dispatching instruction are respectively established by adopting a deep learning method based on the linguistic data in the instruction category of the regulating and controlling field, in view of the fact that the regulating and controlling system is difficult to understand the dispatching instruction once in practical application, a large number of slot position identification models are established by using a rule method, the real intention of a regulating and controlling person is determined by matching with the slot position of the dispatching instruction, and then the electric power information inquiry, automatic dispatching, automatic calculation and other operations are carried out by combining with the services of electric network analysis, calculation, operation and the like in the intelligent regulating and controlling system. And the instruction reply information generated by the regulation and control system converts the text into voice through the voice synthesis engine and broadcasts the voice to the regulation and control personnel.

In the above embodiments, the implementation may be wholly or partially realized by software, hardware, firmware, or any combination thereof. When used in whole or in part, can be implemented in a computer program product that includes one or more computer instructions. When loaded or executed on a computer, cause the flow or functions according to embodiments of the invention to occur, in whole or in part. The computer may be a general purpose computer, a special purpose computer, a network of computers, or other programmable device. The computer instructions may be stored in a computer readable storage medium or transmitted from one computer readable storage medium to another, for example, the computer instructions may be transmitted from one website site, computer, server, or data center to another website site, computer, server, or data center via wire (e.g., coaxial cable, fiber optic, Digital Subscriber Line (DSL), or wireless (e.g., infrared, wireless, microwave, etc.)). The computer-readable storage medium can be any available medium that can be accessed by a computer or a data storage device, such as a server, a data center, etc., that includes one or more of the available media. The usable medium may be a magnetic medium (e.g., floppy Disk, hard Disk, magnetic tape), an optical medium (e.g., DVD), or a semiconductor medium (e.g., Solid State Disk (SSD)), among others.

Compared with the prior art, the invention has the beneficial effects that:

The present applicant has described and illustrated embodiments of the present invention in detail with reference to the accompanying drawings, but it should be understood by those skilled in the art that the above embodiments are merely preferred embodiments of the present invention, and the detailed description is only for the purpose of helping the reader to better understand the spirit of the present invention, and not for limiting the scope of the present invention, and on the contrary, any improvement or modification made based on the spirit of the present invention should fall within the scope of the present invention.

Claims

1. The electric power information query and regulation function calling method based on voice and intention recognition is characterized in that,

the method comprises the following steps:

step 2, using the scheduling voice corpus as input data and inputting the input data into the trained acoustic model and language model; the trained acoustic model and language model are trained by adopting a deep learning algorithm by taking a scheduling voice corpus as a training set; jointly outputting a dispatching telephone text by the trained acoustic model and the language model;

step 4, taking the vector of the regulation instruction word as input data, and respectively inputting the input data into the trained field recognition model and the trained intention recognition model; outputting a field matching result by a field recognition model, and outputting an intention recognition result by an intention recognition model according to the field matching result; the trained field recognition model and the trained intention recognition model are trained by adopting a deep learning algorithm by taking a control instruction corpus as a training set;

step 5, taking the intention recognition result as input data of the slot position recognition model, and outputting a regulation and control instruction matching result by the slot position recognition model; the slot position identification model extracts semantic tags from a scheduling instruction by using a rule analysis algorithm;

2. The power information query and regulation function calling method based on voice and intention recognition according to claim 1,

3. The power information query and regulation function calling method based on voice and intention recognition according to claim 2,

the step 2 comprises the following steps:

4. The power information query and regulation function calling method based on voice and intention recognition according to claim 3,

in the step 2.2, a standard voice signal is used as matching degree test input data and is input into a trained acoustic model, and a standard acoustic feature sequence is output by the acoustic model; and calculating the matching degree of the acoustic feature sequence and the standard acoustic feature sequence based on a distance algorithm.

5. The power information query and regulation function calling method based on voice and intention recognition according to claim 3,

in the step 2.3, the standard voice signal is used as matching degree test input data and is input into the trained language model, and the language model outputs a standard semantic feature sequence; and calculating the matching degree of the semantic feature sequence and the standard semantic feature sequence based on a distance algorithm.

6. The power information query and regulation function calling method based on voice and intention recognition according to claim 1,

7. The real-time regulation and control business analysis method based on intention recognition of claim 6,

step 6 also includes: and the functions of report self-customization, picture self-generation and telephone self-response are realized by interacting with a regulation knowledge base.

8. The power information query and regulation function calling method based on voice and intention recognition according to claim 6,

step 6 also includes: and according to the switching operation knowledge graph and the disconnecting link position image recognition result, the intelligent ticket forming, the automatic execution of the operation ticket and the confirmation of the disconnecting link position of the operation ticket are realized.

9. The method for real-time analysis of regulatory business based on intent recognition as recited in claim 7,

the picture self-generation is to integrate and summarize equipment tide, power grid events, operation modes, risk early warning, equipment maintenance, accident plans and meteorological information, and check various information in one interface.

10. The power information query and regulation function calling method based on voice and intention recognition according to claim 1,