CN112507095A

CN112507095A - Information identification method based on weak supervised learning and related equipment

Info

Publication number: CN112507095A
Application number: CN202011481937.7A
Authority: CN
Inventors: 谢攀
Original assignee: Ping An International Smart City Technology Co Ltd
Current assignee: Ping An International Smart City Technology Co Ltd
Priority date: 2020-12-15
Filing date: 2020-12-15
Publication date: 2021-03-16

Abstract

The invention relates to the technical field of artificial intelligence, and provides an information identification method based on weak supervised learning, which comprises the following steps: acquiring a plurality of frequently asked question answering (FAQ) data in a government scene, and constructing a first label data set based on the plurality of FAQ data; performing multi-label text classification on the first label data set by using a first Convolutional Neural Network (CNN) to obtain a multi-label CNN model; obtaining a plurality of second label data sets, and adjusting the multi-label CNN model by using the plurality of second label data sets to obtain a final multi-label classifier; receiving an input user question; inputting the user questions into a final multi-label classifier to obtain a plurality of government affair entities matched with the user questions; outputting a plurality of government entities. The invention also relates to a block chain technology, which can upload a plurality of government affair entities to the block chain. The method can be applied to the fields of intelligent government affairs, intelligent communities and the like which need information identification, and therefore the development of intelligent cities is promoted.

Description

Information identification method based on weak supervised learning and related equipment

Technical Field

The invention relates to the technical field of artificial intelligence, in particular to an information identification method based on weak supervised learning and related equipment.

Background

At present, in a government scene, various consultation questions are often encountered, answers need to be provided, and the government question-answering robot plays a key role in question-answering. The government affair question and answer robot can assist government personnel to answer consultation questions of users, and the consultation questions generally correspond to relevant government affair entities, such as: registering individual industrial and commercial enterprises information, handling residence certificates, examining and approving pesticide advertisements, subsiding intellectual property rights, verifying and issuing production and operation licenses of breeding birds and the like.

However, in practice, it is found that government entities are generally specialized, and different users have different spoken names; in addition, the government entity information is hidden in the consultation problem expressed by the user, and it is difficult to accurately extract the government entity, so that the user experience is poor.

Therefore, how to effectively identify the government affair entity in the government affair scene is a technical problem to be solved urgently.

Disclosure of Invention

In view of the foregoing, there is a need for an information identification method and related apparatus based on weak supervised learning, which can effectively identify government affair entities in a government affair scene.

The first aspect of the present invention provides an information identification method based on weak supervised learning, including:

acquiring a plurality of frequently asked question answering (FAQ) data in a government scene, and constructing a first label data set based on the FAQ data;

performing multi-label text classification on the first label data set by using a first Convolutional Neural Network (CNN) to obtain a multi-label CNN model, wherein a full link layer in the first CNN corresponds to N activation functions, the N is the same as the dimension of an output label sequence, and the multi-label CNN model is a weakly supervised model;

obtaining a plurality of second label data sets, and adjusting the multi-label CNN model by using the plurality of second label data sets to obtain a final multi-label classifier;

receiving an input user question;

inputting the user questions into the final multi-label classifier to obtain a plurality of government affair entities matched with the user questions;

outputting the plurality of government entities.

In one possible implementation, the constructing the first tag data set based on the plurality of FAQ data includes:

scanning answer data in a plurality of FAQ data through a Chinese language model Ngram;

converting the answer data into a first vector according to a word vector model, and converting the government affair entities in a government affair entity library into a second vector according to the word vector model;

cosine similarity calculation is carried out on the first vector and the second vector to obtain a similarity score;

determining the government entity with the similarity score larger than a preset threshold value as a label matched with the FAQ data;

and constructing a first label data set according to each FAQ data and the matched label of the FAQ data.

In one possible implementation manner, in the first tag data set, each question corresponds to one tag list, each tag in the tag list corresponds to one government entity, and in the tag list, a tag of a government entity matching the question is set as a first identifier, and a tag of a government entity not matching the question is set as a second identifier.

In a possible implementation manner, the performing, by using a first convolutional neural network CNN, multi-label text classification on the first label data set to obtain a multi-label CNN model includes:

performing two-class training on each label in the first label data set by using a first Convolutional Neural Network (CNN) and adopting N sigmoid activation functions;

calculating a loss function of the label of each dimension, and calculating an average loss function of the labels in all dimensions;

determining the average loss function as a first loss function for the first tag data set;

and adjusting the model parameters of the first CNN to minimize the first loss function to obtain a multi-label CNN model.

In a possible implementation manner, the adjusting the multi-label CNN model by using the plurality of second label data sets to obtain a final multi-label classifier includes:

acquiring historical model parameters of the multi-label CNN model;

performing iterative training on the historical model parameters of the multi-label CNN model by adopting the plurality of second label data sets, and calculating a second loss function, wherein the second loss function is an average loss function of labels in all dimensions;

and adjusting the historical model parameters to minimize the second loss function to obtain the final multi-label classifier.

In one possible implementation, the outputting the plurality of government entities comprises:

acquiring a historical consultation field of a current user;

calculating a first matching degree of each government entity and the historical consultation field;

acquiring a second matching degree of each government entity and the user questions;

weighting the first matching degree and the second matching degree to obtain a weighted value of each government entity;

and outputting the plurality of government entities according to the order of the weighted values from high to low.

In a possible implementation manner, the information identification method based on weak supervised learning further includes:

judging whether a selection instruction of a current user for the plurality of government affair entities is received within preset time;

if the selection instruction of the current user for the plurality of government affair entities is not received within the preset time, obtaining the historical question-answer record of the current user;

determining the consultation field of the current user from the historical question-answer records;

screening out target government affair entities matched with the consultation field from the plurality of government affair entities;

and outputting the detailed content of the target government entity.

A second aspect of the present invention provides an information identifying apparatus comprising:

the system comprises an acquisition module, a data processing module and a data processing module, wherein the acquisition module is used for acquiring a plurality of frequently asked question answers (FAQ) data in a government scene and constructing a first label data set based on the FAQ data;

a classification module, configured to perform multi-label text classification on the first label data set by using a first Convolutional Neural Network (CNN) to obtain a multi-label CNN model, where a full link layer in the first CNN corresponds to N activation functions, N is the same as a dimension of an output label sequence, and the multi-label CNN model is a weakly supervised model;

the obtaining module is further configured to obtain a plurality of second tag data sets;

an adjusting module, configured to adjust the multi-label CNN model using the plurality of second label data sets, to obtain a final multi-label classifier;

the receiving module is used for receiving input user questions;

an input module, configured to input the user question into the final multi-label classifier, and obtain a plurality of government affair entities matching the user question;

and the output module is used for outputting the plurality of government affair entities.

A third aspect of the present invention provides an electronic device comprising a processor and a memory, wherein the processor is configured to implement the weak supervised learning based information identification method when executing a computer program stored in the memory.

A fourth aspect of the present invention provides a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, implements the weak supervised learning based information identification method.

According to the technical scheme, the method can be applied to the fields of intelligent government affairs, intelligent communities and the like which need information identification, and therefore the development of intelligent cities is promoted. In the invention, a first label data set is constructed, a government entity recognition task is converted into a multi-label classification task, a first Convolutional Neural Network (CNN) is utilized to perform multi-label text classification on the first label data set to obtain a multi-label CNN model, and a plurality of second label data sets are used to adjust the multi-label CNN model to obtain a final multi-label classifier, namely, a model training optimization process from weak supervision learning to accurate adjustment is realized, the workload of manual labeling is greatly reduced, and meanwhile, the efficiency of model training and the accuracy of the model for government entity recognition are improved.

Drawings

Fig. 1 is a flowchart of a preferred embodiment of an information identification method based on weak supervised learning according to the present invention.

FIG. 2 is a functional block diagram of an information recognition apparatus according to a preferred embodiment of the present disclosure.

Fig. 3 is a schematic structural diagram of an electronic device implementing an information identification method based on weakly supervised learning according to a preferred embodiment of the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

The terms "first" and "second" in the description and claims of the present application and the above-described drawings are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It will be appreciated that the data so used may be interchanged under appropriate circumstances such that the embodiments described herein may be practiced otherwise than as specifically illustrated or described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.

It should be noted that the description relating to "first", "second", etc. in the present invention is for descriptive purposes only and is not to be construed as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defined as "first" or "second" may explicitly or implicitly include at least one such feature. In addition, technical solutions between various embodiments may be combined with each other, but must be realized by a person skilled in the art, and when the technical solutions are contradictory or cannot be realized, such a combination should not be considered to exist, and is not within the protection scope of the present invention.

The electronic device is a device capable of automatically performing numerical calculation and/or information processing according to preset or stored instructions, and the hardware thereof includes, but is not limited to, a microprocessor, an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA), a Digital Signal Processor (DSP), an embedded device, and the like. The electronic device may also include a network device and/or a user device. The network device includes, but is not limited to, a single network server, a server group consisting of a plurality of network servers, or a Cloud Computing (Cloud Computing) based Cloud consisting of a large number of hosts or network servers. The user device includes, but is not limited to, any electronic product that can interact with a user through a keyboard, a mouse, a remote controller, a touch pad, or a voice control device, for example, a personal computer, a tablet computer, a smart phone, a Personal Digital Assistant (PDA), or the like.

Referring to fig. 1, fig. 1 is a flowchart illustrating an information identification method based on weakly supervised learning according to a preferred embodiment of the present invention. The order of the steps in the flowchart may be changed, and some steps may be omitted.

S11, obtaining a plurality of frequently asked question answering (FAQ) data in a government scene, and constructing a first label data set based on the FAQ data.

Where, FAQ is an abbreviation for english freqently ask Questions, chinese means "Frequently Asked Questions", or more colloquially "Frequently Asked Questions answers".

In the government affair scene, a large amount of FAQ data are usually available, and the data mainly have several sources, 1, a common question bank of government affair customer service personnel, and 2, real questions of users. Such data have the following characteristics: 1. no manual marking is performed; 2. the question expression is diversified and spoken, while the answer is generally given by professional personnel, and the expression is relatively standardized and specialized; 3. typically for a particular government entity.

Wherein the first label data set is the weakly supervised label data set.

The government entities such as individual industrial and commercial enterprises register information, transact residence certificates, examine and approve pesticide advertisements, fund intellectual property rights, verify and issue operation licenses for production of breeders and the like.

Specifically, the constructing a first tag data set based on the plurality of FAQ data includes:

Among them, N-Gram is a common Language Model in large vocabulary continuous speech recognition, called Chinese Language Model (CLM). The Chinese language model can realize automatic conversion to Chinese characters by using collocation information between adjacent words in the context.

Where a Word vector model such as Word2vec, Word2vec is a group of related models used to generate a Word vector. The word2vec model may be used to map each word to a vector, which may be used to represent word-to-word relationships, the vector being a hidden layer of a neural network.

In the first tag data set, each question corresponds to a tag list, each tag in the tag list corresponds to a government entity, and in the tag list, the tag of the government entity matched with the question is set as a first identifier, and the tag of the government entity unmatched with the question is set as a second identifier.

Specifically, for example, the tag list may be composed of numbers having a value of 1 (i.e., the first identifier) or 0 (i.e., the second identifier), and the length of the tag list is the number of government entities. The government affair entity corresponding to the label with the value of 1 is a matched government affair entity, and the government affair entity corresponding to the label with the value of 0 is an unmatched government affair entity.

S12, using a first Convolutional Neural Network (CNN) to perform multi-label text classification on the first label data set to obtain a multi-label CNN model, wherein a full link layer in the first CNN corresponds to N activation functions, N is the same as the dimension of an output label sequence, and the multi-label CNN model is a weakly supervised model.

Among them, Convolutional Neural Networks (CNN) are a kind of feed forward Neural Networks (fed forward Neural Networks) that contain convolution computation and have a deep structure. Structure of conventional CNN: (convolutional layer + nonlinear activation function (Relu or tanh) + pooling layer) x n + multiple fully connected layers.

In the invention, the first CNN sets the last full link layer in the traditional CNN as the corresponding label length, adds one sigmoid activation function to each dimension, and totally records N sigmoid activation functions, wherein each sigmoid activation function corresponds to one dimension in a label list.

Specifically, the using the first convolutional neural network CNN to perform multi-label text classification on the first label data set to obtain a multi-label CNN model includes:

The first label data set is trained by using the first convolutional neural network CNN, so that a weakly supervised model, namely a multi-label CNN model, can be obtained, and the multi-label CNN model belongs to a weakly supervised multi-label classifier and can roughly identify a government entity (namely the government entity).

And S13, obtaining a plurality of second label data sets, and adjusting the multi-label CNN model by using the plurality of second label data sets to obtain the final multi-label classifier.

Wherein each of the second tag data sets is an accurate set of question and answer data labeled manually, i.e., each question is configured with an accurate government entity and a tag.

Specifically, the adjusting the multi-label CNN model using the plurality of second label data sets to obtain the final multi-label classifier includes:

acquiring historical model parameters of the multi-label CNN model;

The multi-label CNN model is adjusted through a plurality of second label data sets which are labeled manually, so that the historical model parameters of the multi-label CNN model can be optimized, more accurate model parameters can be obtained, and the final multi-label classifier after iterative training is higher in identification precision and better in identification effect.

And S14, receiving the input user question.

Wherein the user question is a government affairs question.

And S15, inputting the user questions into the final multi-label classifier, and obtaining a plurality of government affair entities matched with the user questions.

Wherein the plurality of government entities are possible government entities with which the current user wants to consult.

And S16, outputting the plurality of government affair entities.

Specifically, the outputting the plurality of government affair entities includes:

acquiring a historical consultation field of a current user;

In this optional implementation, the consulting field of the same user is usually fixed, and each government entity and the historical consulting field may be subjected to conversion operation of word segmentation and sentence vectors, and after the vectors are obtained, cosine similarity calculation is performed, so that the first matching degree may be obtained. Among them, the conversion of the government affairs entity and the history consultation field into a vector belongs to the prior art and is not described in detail.

The first matching degree and the second matching degree can be preset with weights, and the weighted value of each government entity is calculated according to the weights, and the weighted value can be used for measuring the degree of conformity of the government entity with the user questions. A plurality of government entities are output according to the weighted values, so that the current user can more intuitively and preferentially see the government entities meeting the intention of the user without searching one by one, and the consultation satisfaction of the user can be improved.

Optionally, the method further includes:

and outputting the detailed content of the target government entity.

In the optional implementation mode, when a plurality of possible government affair entities are output, the current user is difficult to select due to personal professional level limitation, in this case, the consultation field to which the questions frequently asked by the user belong can be determined based on historical question and answer records, the range can be further reduced, and the target government affair entity matched with the consultation field can be screened out, so that more accurate service can be provided for the current user, and the consultation satisfaction degree of the user can be improved.

Optionally, the method further includes:

uploading the plurality of government entities to a blockchain.

Wherein, in order to ensure the privacy and the security of the data, the plurality of government affair entities can be uploaded to the block chain for storage.

In the method flow described in fig. 1, a first label data set is constructed, a government entity recognition task is converted into a multi-label classification task, an improved convolutional neural network CNN is used to perform multi-label text classification on the first label data set to obtain a multi-label CNN model, and a plurality of second label data sets labeled manually are used to adjust the multi-label CNN model to obtain a final multi-label classifier, so that a model training optimization process from weak supervision learning to accurate adjustment is realized, the workload of manual labeling is greatly reduced, and meanwhile, the efficiency of model training and the accuracy of the model for government entity recognition are improved.

According to the embodiment, the method and the system can be applied to the fields of intelligent government affairs, intelligent communities and the like which need information identification, and therefore development of intelligent cities is promoted. The above description is only a specific embodiment of the present invention, but the scope of the present invention is not limited thereto, and it will be apparent to those skilled in the art that modifications may be made without departing from the inventive concept of the present invention, and these modifications are within the scope of the present invention.

Referring to fig. 2, fig. 2 is a functional block diagram of a preferred embodiment of an information recognition apparatus according to the present invention.

In some embodiments, the information identification device is run in an electronic device. The information identifying means may comprise a plurality of functional modules consisting of program code segments. Program codes of various program segments in the information identification device may be stored in the memory and executed by the at least one processor to perform part or all of the steps of the weak supervised learning based information identification method described in fig. 1.

In this embodiment, the information recognition apparatus may be divided into a plurality of functional modules according to the functions performed by the information recognition apparatus. The functional module may include: the system comprises an acquisition module 201, a classification module 202, an adjustment module 203, a receiving module 204, an input module 205 and an output module 206. The module referred to herein is a series of computer program segments capable of being executed by at least one processor and capable of performing a fixed function and is stored in memory.

The obtaining module 201 is configured to obtain multiple frequently asked question answers (FAQ) data in a government scene, and construct a first tag data set based on the multiple FAQ data.

The government entity is a government entity, such as information registration of individual industrial and commercial enterprises, residence permit handling, pesticide advertisement approval, intellectual property right maintenance, verification and issue of operation licenses for breeding birds.

A classification module 202, configured to perform multi-label text classification on the first label data set by using a first convolutional neural network CNN to obtain a multi-label CNN model, where a full link layer in the first CNN corresponds to N activation functions, N is the same as the dimension of an output label sequence, and the multi-label CNN model is a weakly supervised model.

The obtaining module 201 is further configured to obtain a plurality of second tag data sets.

An adjusting module 203, configured to adjust the multi-label CNN model using the multiple second label data sets, so as to obtain a final multi-label classifier.

acquiring historical model parameters of the multi-label CNN model;

A receiving module 204, configured to receive an input user question.

Wherein the user question is a government affairs question.

An input module 205, configured to input the user question into the final multi-label classifier, and obtain a plurality of government entities matching the user question.

An output module 206, configured to output the plurality of government entities.

acquiring a historical consultation field of a current user;

In this optional implementation, the consulting field of the same user is usually fixed, and each government entity and the historical consulting field may be subjected to conversion operation of word segmentation and sentence vectors, and after the vectors are obtained, cosine similarity calculation is performed, so that the first matching degree may be obtained. The first matching degree and the second matching degree can be preset with weights, and the weighted value of each government entity is calculated according to the weights, and the weighted value can be used for measuring the degree of conformity of the government entity with the user questions. A plurality of government entities are output according to the weighted values, so that the current user can more intuitively and preferentially see the government entities meeting the intention of the user without searching one by one, and the consultation satisfaction of the user can be improved.

Optionally, the information identification apparatus further includes:

the judging module is used for judging whether a selection instruction of the current user for the plurality of government affair entities is received within preset time;

the obtaining module 201 is further configured to obtain a historical question and answer record of the current user if a selection instruction of the current user for the multiple government affair entities is not received within a preset time;

the determining module is used for determining the consultation field of the current user from the historical question-answer records;

the screening module is used for screening out target government affair entities matched with the consultation field from the plurality of government affair entities;

the output module 206 is configured to output details of the target government entity.

In the information recognition device described in fig. 2, a first label data set is constructed, a government entity recognition task is converted into a multi-label classification task, an improved convolutional neural network CNN is utilized to perform multi-label text classification on the first label data set, a multi-label CNN model is obtained, and a plurality of second label data sets labeled manually are used to adjust the multi-label CNN model, so that a final multi-label classifier is obtained, that is, a model training optimization process from weak supervision learning to accurate adjustment is realized, the workload of manual labeling is greatly reduced, and meanwhile, the efficiency of model training and the accuracy of the model for government entity recognition are improved.

Fig. 3 is a schematic structural diagram of an electronic device implementing an information identification method based on weakly supervised learning according to a preferred embodiment of the present invention. The electronic device 3 comprises a memory 31, at least one processor 32, a computer program 33 stored in the memory 31 and executable on the at least one processor 32, and at least one communication bus 34.

Those skilled in the art will appreciate that the schematic diagram shown in fig. 3 is merely an example of the electronic device 3, and does not constitute a limitation of the electronic device 3, and may include more or less components than those shown, or combine some components, or different components, for example, the electronic device 3 may further include an input/output device, a network access device, and the like.

The at least one Processor 32 may be a Central Processing Unit (CPU), other general purpose Processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other Programmable logic device, discrete Gate or transistor logic, discrete hardware components, etc. The processor 32 may be a microprocessor or the processor 32 may be any conventional processor or the like, and the processor 32 is a control center of the electronic device 3 and connects various parts of the whole electronic device 3 by various interfaces and lines.

The memory 31 may be used to store the computer program 33 and/or the module/unit, and the processor 32 may implement various functions of the electronic device 3 by running or executing the computer program and/or the module/unit stored in the memory 31 and calling data stored in the memory 31. The memory 31 may mainly include a program storage area and a data storage area, wherein the program storage area may store an operating system, an application program required by at least one function (such as a sound playing function, an image playing function, etc.), and the like; the storage data area may store data (such as audio data) created according to the use of the electronic device 3, and the like. In addition, the memory 31 may include non-volatile and volatile memory, such as a hard disk, a memory, a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), at least one magnetic disk storage device, a Flash memory device, or other storage devices.

With reference to fig. 1, the memory 31 in the electronic device 3 stores a plurality of instructions to implement an information identification method based on weak supervised learning, and the processor 32 can execute the plurality of instructions to implement:

receiving an input user question;

outputting the plurality of government entities.

Specifically, the processor 32 may refer to the description of the relevant steps in the embodiment corresponding to fig. 1 for a specific implementation method of the instruction, which is not described herein again.

In the electronic device 3 described in fig. 3, the government entity recognition task is converted into a multi-label classification task by constructing a first label data set, the first label data set is subjected to multi-label text classification by using an improved convolutional neural network CNN to obtain a multi-label CNN model, and the multi-label CNN model is adjusted by using a plurality of second label data sets labeled manually to obtain a final multi-label classifier, so that a model training optimization process from weak supervision learning to accurate adjustment is realized, the workload of manual labeling is greatly reduced, and meanwhile, the efficiency of model training and the accuracy of the model for government entity recognition are improved.

The integrated modules/units of the electronic device 3 may be stored in a computer-readable storage medium if they are implemented in the form of software functional units and sold or used as separate products. Based on such understanding, all or part of the flow of the method according to the embodiments of the present invention may also be implemented by a computer program, which may be stored in a computer-readable storage medium, and when the computer program is executed by a processor, the steps of the method embodiments may be implemented. Wherein the computer program comprises computer program code, which may be in the form of source code, object code, an executable file or some intermediate form, etc. The computer-readable medium may include: any entity or device capable of carrying said computer program code, a recording medium, a usb-disk, a removable hard disk, a magnetic disk, an optical disk, a computer Memory, a Read-Only Memory (ROM), a random access Memory.

In the embodiments provided in the present invention, it should be understood that the disclosed system, apparatus and method may be implemented in other ways. For example, the above-described apparatus embodiments are merely illustrative, and for example, the division of the modules is only one logical functional division, and other divisions may be realized in practice.

The modules described as separate parts may or may not be physically separate, and parts displayed as modules may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present embodiment.

In addition, functional modules in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, or in a form of hardware plus a software functional module.

It will be evident to those skilled in the art that the invention is not limited to the details of the foregoing illustrative embodiments, and that the present invention may be embodied in other specific forms without departing from the spirit or essential attributes thereof. The present embodiments are therefore to be considered in all respects as illustrative and not restrictive, the scope of the invention being indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein. Any reference signs in the claims shall not be construed as limiting the claim concerned. A plurality of units or means recited in the present invention may also be implemented by software or hardware.

Finally, it should be noted that the above embodiments are only for illustrating the technical solutions of the present invention and not for limiting, and although the present invention is described in detail with reference to the preferred embodiments, it should be understood by those skilled in the art that modifications or equivalent substitutions may be made on the technical solutions of the present invention without departing from the spirit and scope of the technical solutions of the present invention.

Claims

1. An information identification method based on weak supervised learning is characterized in that the information identification method based on weak supervised learning comprises the following steps:

receiving an input user question;

outputting the plurality of government entities.

2. The weak supervised learning based information identification method of claim 1, wherein the constructing a first label data set based on a plurality of FAQ data comprises:

3. The weak supervised learning based information identification method according to claim 2, wherein in the first label data set, each question corresponds to one label list, each label in the label list corresponds to one government entity, and in the label list, a label of the government entity matching the question is set as a first identifier, and a label of the government entity not matching the question is set as a second identifier.

4. The information identification method based on weak supervised learning as claimed in claim 1, wherein the performing multi-label text classification on the first label data set by using a first Convolutional Neural Network (CNN) to obtain a multi-label CNN model comprises:

5. The weak supervised learning based information recognition method of claim 1, wherein the adjusting the multi-label CNN model by using the plurality of second label data sets to obtain a final multi-label classifier comprises:

acquiring historical model parameters of the multi-label CNN model;

6. The weak supervised learning based information identification method of claim 1, wherein the outputting the plurality of government entities comprises:

acquiring a historical consultation field of a current user;

7. The weak supervised learning based information identification method according to any one of claims 1 to 5, wherein the weak supervised learning based information identification method further comprises:

and outputting the detailed content of the target government entity.

8. An information recognition apparatus, characterized in that the information recognition apparatus comprises:

the receiving module is used for receiving input user questions;

9. An electronic device, characterized in that the electronic device comprises a processor and a memory, the processor is used for executing a computer program stored in the memory to realize the information identification method based on weak supervised learning as claimed in any one of claims 1 to 7.

10. A computer-readable storage medium storing at least one instruction, which when executed by a processor, implements a weak supervised learning based information identification method as recited in any one of claims 1 to 7.