CN111259673B - Legal decision prediction method and system based on feedback sequence multitask learning - Google Patents

Legal decision prediction method and system based on feedback sequence multitask learning

Info

Publication number
CN111259673B
Authority
CN
China
Prior art keywords: vector, criminal, case, task, prediction
Legal status: Active (the status listed is an assumption and is not a legal conclusion; Google has not performed a legal analysis)
Application number: CN202010031722.9A
Other languages: Chinese (zh)
Other versions: CN111259673A (en)
Inventors: 张春云 (Zhang Chunyun), 崔超然 (Cui Chaoran), 尹义龙 (Yin Yilong)
Current assignee: Shandong University of Finance and Economics (the listed assignee may be inaccurate; Google has not performed a legal analysis)
Original assignee: Shandong University of Finance and Economics
Events:
Application filed by Shandong University of Finance and Economics
Priority to CN202010031722.9A
Publication of CN111259673A
Application granted
Publication of CN111259673B
Legal status: Active

Classifications

    • G06N3/044 — Recurrent networks, e.g. Hopfield networks (under G06N3/04 Architecture; G06N3/02 Neural networks; G06N3/00 Computing arrangements based on biological models; Section G Physics, G06 Computing)
    • G06N3/045 — Combinations of networks
    • G06Q10/04 — Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem" (under G06Q10/00 Administration; Management)
    • G06Q50/18 — Legal services (under G06Q50/10 Services; G06Q50/00 ICT specially adapted for business processes of specific sectors)
    • Y02D10/00 — Energy efficient computing, e.g. low power processors, power management or thermal management


Abstract

The invention discloses a legal decision prediction method and system based on feedback sequence multitask learning. The method comprises the following steps: a text feature representation of the case description is learned using a single-task, representation-learning-based legal prediction method; then, for each subtask, the information of the preceding task and the feedback information of the following task are taken as additional input to the current task, so that both the sequence relation and the reverse verification relation among the subtasks are considered, thereby realizing legal decision prediction based on feedback sequence multitask learning. By combining single-task representation learning with a feedback-based sequential multitask learning method, the invention exploits the advantages of both in legal decision prediction: it overcomes the drawback that single-task representation learning makes no targeted use of the complementary information of other tasks, and, compared with conventional multitask learning methods, it improves the accuracy and robustness of the decision prediction results.

Description

Legal decision prediction method and system based on feedback sequence multitask learning
Technical Field
The invention belongs to the technical field of judicial decision prediction, and particularly relates to a legal decision prediction method and system based on feedback sequence multitask learning.
Background
The statements in this section merely provide background information related to the present disclosure and may not necessarily constitute prior art.
Legal decision prediction aims to predict the decision outcome of a legal case from the case facts. As a core technology of legal assistant systems, it has important application value and practical significance, and studying it in depth is worthwhile. On the one hand, legal decision prediction can provide low-cost, high-quality legal consulting services to people unfamiliar with legal terminology and complex decision procedures. On the other hand, it can provide convenient reference material for professionals (e.g., lawyers and judges) and improve their work efficiency. Currently, legal decision prediction mainly involves three subtasks: prediction of the relevant laws, prediction of the crimes, and prediction of the criminal periods. All three are currently treated as classification tasks over the related laws, crimes, and criminal periods. Representative current approaches are single-task legal decision prediction methods based on representation learning and multitask legal decision prediction methods built on several related subtasks.
The representation-learning-based legal decision prediction method mainly uses a deep neural network, trained on a large number of labeled samples, to encode the semantics of the case, realizing a mapping from the symbol space to a vector space; prediction of the relevant laws, crimes, and criminal periods is then performed on the semantic vector representation of the case description. The drawback of this single-task approach is that it addresses each task in isolation: classification of the single task is based only on the case description features, without considering the influence of the other tasks on the current one.
The multitask-based legal decision prediction method mainly exploits the associations among the subtasks of legal decision making: a shared representation in the shallow layers lets the tasks mutually exchange and complement the domain information they have learned, while the last layer of the model trains a different classifier on the features of each task, finally classifying several tasks in parallel. More specifically, the subtasks of legal decision prediction have both a dependency relationship (a sequence relation) and a verification relationship (feedback verification). In general, a judge first determines the relevant laws from the case description, then determines the crime based on those relevant laws, and finally determines the corresponding criminal period based on the relevant laws and the determined crime. Conversely, through feedback, the predicted crime can verify the relevant laws involved, and the predicted criminal period can verify both the laws and the crime involved. However, most existing multitask legal decision prediction methods simply apply a multitask learning classification framework to the several related tasks, and rarely consider the sequence relation and the feedback verification relation among them.
Disclosure of Invention
In order to overcome the shortcomings of the prior art, the invention provides a legal decision prediction method based on feedback sequence multitask learning. It overcomes the drawback of single-task representation learning, which considers only the features of one task and cannot exploit the information shared with other related tasks, and it adds sequence relation information and feedback verification information between tasks within a multitask framework.
To achieve the above object, one or more embodiments of the present invention provide the following technical solutions:
A legal decision prediction method based on feedback sequence multitask learning comprises the following steps:
a text feature representation of the case description is learned using a single-task, representation-learning-based legal prediction method;
for each subtask, the information of the preceding task and the feedback information of the following task are taken as input to the current task, so that the sequence relation and the reverse verification relation among the subtasks are considered, thereby realizing legal decision prediction based on feedback sequence multitask learning.
According to a further technical scheme, a training dataset of case descriptions and their related laws, crimes, and criminal periods is obtained from a data center server and stored in a database;
text feature representation learning is performed on each case description to obtain its vector representation;
text feature representation learning is performed on all laws, crimes, and criminal periods to obtain their feature vector representations.
According to a further technical scheme, a multitask pre-training model for law prediction, crime prediction, and criminal-period prediction from the case description is constructed, and the corresponding pre-classification vectors of the three subtasks are obtained;
the law-article vector pointed to by the pre-classification vector of the law prediction task is fused with the case description representation vector to obtain the case-law representation vector;
the crime vector corresponding to the crime pointed to by the pre-classification vector of the crime prediction task is fused with the case representation vector to obtain the case-crime representation vector;
the criminal-period vector corresponding to the criminal period pointed to by the pre-classification vector of the criminal-period prediction task is fused with the case representation vector to obtain the case-criminal-period representation vector;
the case-law vector, the case-crime vector, and the case-criminal-period vector are input into a bidirectional long short-term memory neural network to obtain high-level semantic representations of the three vectors;
classifiers for the law, crime, and criminal period are constructed on the high-level feature representations of the case-law, case-crime, and case-criminal-period vectors;
the high-level feature representations are input into the three classifiers to predict the law, crime, and criminal period.
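The feature-fusion step described above can be sketched as follows. This is a toy illustration with invented dimensions and randomly initialized weights, not the patent's trained model: the label vector pointed to by the largest pre-classification score is concatenated with the case vector and projected through a fully connected layer.

```python
import math
import random

def argmax(xs):
    # index of the largest pre-classification score
    return max(range(len(xs)), key=lambda i: xs[i])

def fully_connected(x, weights, bias):
    # one dense layer with tanh activation; one weight row per output unit
    return [math.tanh(sum(w_j * x_j for w_j, x_j in zip(row, x)) + b)
            for row, b in zip(weights, bias)]

def fuse(case_vec, score_vec, label_vecs, weights, bias):
    # pick the label vector pointed to by the largest pre-classification
    # score, concatenate it with the case vector, and project through an FC layer
    chosen = label_vecs[argmax(score_vec)]
    return fully_connected(case_vec + chosen, weights, bias)

# toy dimensions: 4-dim case vector, 4-dim label vectors, 4-dim fused output
random.seed(0)
d_i = [0.1, -0.2, 0.3, 0.05]           # case description vector (hypothetical)
lr_i = [0.1, 0.7, 0.2]                 # law pre-classification scores (hypothetical)
law_vecs = [[random.uniform(-1, 1) for _ in range(4)] for _ in range(3)]
W = [[random.uniform(-0.5, 0.5) for _ in range(8)] for _ in range(4)]
b = [0.0] * 4
dl_i = fuse(d_i, lr_i, law_vecs, W, b)  # case-law representation vector
```

The same fusion applies unchanged to the crime and criminal-period tasks, with `cr_i`/`pr_i` in place of `lr_i`.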
According to a further technical scheme, the multitask pre-training model for law, crime, and criminal-period prediction from the case description is constructed, and the corresponding pre-classification vectors of the three subtasks are obtained, as follows:
the obtained case description vectors are input into a multitask classifier; training this multitask classification model pre-classifies the laws, crimes, and criminal periods, yielding the law classification vectors, crime prediction vectors, and criminal-period prediction vectors.
According to a further technical scheme, a BERT model is pre-trained on the case fact descriptions of the training dataset to obtain a language model for the legal prediction tasks, yielding vector representations of the D case fact descriptions.
According to a further technical scheme, the law-article vectors, crime description vectors, and criminal-period description vectors are obtained by dictionary lookup over the law-article contents, crime descriptions, and criminal-period descriptions, based on the BERT model.
According to a further technical scheme, using the case fact descriptions in the training dataset and their corresponding law, crime, and criminal-period labels, a hard-parameter-sharing multitask learning method is used to obtain the law classification vector, crime prediction vector, and criminal-period prediction vector of each case fact description.
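Hard parameter sharing can be sketched as a minimal, hypothetical model in which all three tasks share one hidden layer while each keeps its own output head. The dimensions and weights below are invented for illustration; the embodiment's actual model is trained, not hand-specified.

```python
import math

def dense(x, W, b):
    # linear layer: one weight row per output unit
    return [sum(wj * xj for wj, xj in zip(row, x)) + bi for row, bi in zip(W, b)]

def relu(x):
    return [max(0.0, v) for v in x]

def softmax(x):
    m = max(x)
    e = [math.exp(v - m) for v in x]
    s = sum(e)
    return [v / s for v in e]

class HardSharedMultitask:
    # hard parameter sharing: one hidden layer shared by all tasks,
    # plus one task-specific output head per task
    def __init__(self, shared, heads):
        self.shared = shared          # (W, b) of the shared hidden layer
        self.heads = heads            # task name -> (W, b) of that task's head

    def forward(self, x):
        h = relu(dense(x, *self.shared))
        return {task: softmax(dense(h, *Wb)) for task, Wb in self.heads.items()}

# toy model: 3-dim input, 2-dim shared layer, three task heads
shared = ([[0.2, -0.1, 0.4], [0.0, 0.3, -0.2]], [0.1, 0.0])
heads = {
    "law":   ([[0.5, -0.5], [0.1, 0.2]], [0.0, 0.0]),
    "crime": ([[0.3, 0.3], [-0.2, 0.1], [0.4, 0.0]], [0.0, 0.0, 0.0]),
    "term":  ([[0.2, 0.1], [0.1, 0.2]], [0.0, 0.0]),
}
model = HardSharedMultitask(shared, heads)
scores = model.forward([1.0, 0.5, -0.2])  # pre-classification scores per task
```

Each entry of `scores` plays the role of one pre-classification vector (lr_i, cr_i, or pr_i).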
According to a further technical scheme, the gates in LSTM blocks are used to encode the sequence relation between tasks in multitask learning and the verification relation between a subsequent task and the current task:
Step 1: for each d_i in the batch of case fact descriptions D, obtain the law prediction result vector lr_i from the pre-classification results of the multitask pre-trained classification model, take the law-article vector l_j corresponding to the largest element of lr_i, concatenate it with the case fact description vector d_i, and input the result to a fully connected layer to obtain the case-law vector representation dl_i.
Step 2: for each d_i, obtain the crime prediction result vector cr_i from the pre-classification results of the multitask pre-trained classification model, take the crime vectors corresponding to the elements of cr_i whose values exceed 0.5, concatenate these vectors with the case fact description vector d_i, and input the result to a fully connected layer to obtain the case-crime vector representation dc_i.
Step 3: for each d_i, obtain the criminal-period prediction result vector pr_i from the pre-classification results of the multitask pre-trained classification model, take the criminal-period vector p_j corresponding to the largest element of pr_i, concatenate it with the case fact description vector d_i, and input the result to a fully connected layer to obtain the case-criminal-period vector representation dp_i.
Step 4: randomly initialize the initial cell state c_0 and output state h_0 of the forward LSTM module; with the case-law vector dl_i as input, compute the module's cell state c_1^f and forward output state h_1^f.
Step 5: with c_1^f and h_1^f as the previous cell state and output state of the forward LSTM module, and the case-crime vector dc_i as input, compute the module's cell state c_2^f and forward output state h_2^f.
Step 6: with c_2^f and h_2^f as the previous cell state and output state of the forward LSTM module, and the case-criminal-period vector dp_i as input, compute the module's cell state c_3^f and forward output state h_3^f.
Step 7: randomly initialize the previous cell state and output state of the reverse LSTM module; with the case-criminal-period vector dp_i as input, compute the module's cell state c_1^b and reverse output state h_1^b.
Step 8: with c_1^b and h_1^b as the previous cell state and output state of the reverse LSTM module, and the case-crime vector dc_i as input, compute the module's cell state c_2^b and reverse output state h_2^b.
Step 9: with c_2^b and h_2^b as the previous cell state and output state of the reverse LSTM module, and the case-law vector dl_i as input, compute the module's cell state c_3^b and reverse output state h_3^b.
Step 10: concatenate the forward and reverse output states position by position to obtain [h_1^f; h_3^b], [h_2^f; h_2^b], and [h_3^f; h_1^b]; for the case description d_i, use them as inputs to the law classifier, the crime classifier, and the criminal-period classifier respectively, compute the cross-entropy loss for the batch, and update the parameters.
Step 11: if the iteration count is below the limit, jump back to Step 1.
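The bidirectional pass over the three fused vectors can be sketched as follows: a minimal pure-Python LSTM step (biases omitted, random weights, toy dimensions — all hypothetical, not the trained model) runs the sequence law → crime → criminal period forward, the reverse order backward, and concatenates the two output states at each position.

```python
import math
import random

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def lstm_step(x, h, c, W):
    # one LSTM step with forget (f), input (i), output (o) gates and
    # candidate state (g); biases are omitted for brevity
    def gate(name, act):
        return [act(sum(w * v for w, v in zip(W[name][k], x + h)))
                for k in range(len(h))]
    f = gate("f", sigmoid); i = gate("i", sigmoid)
    o = gate("o", sigmoid); g = gate("g", math.tanh)
    c_new = [fk * ck + ik * gk for fk, ik, ck, gk in zip(f, i, c, g)]
    h_new = [ok * math.tanh(ck) for ok, ck in zip(o, c_new)]
    return h_new, c_new

def bilstm(seq, W_f, W_b, hidden):
    zeros = [0.0] * hidden
    fwd, h, c = [], zeros, zeros
    for x in seq:                      # law -> crime -> term (sequence relation)
        h, c = lstm_step(x, h, c, W_f)
        fwd.append(h)
    bwd, h, c = [], zeros, zeros
    for x in reversed(seq):            # term -> crime -> law (feedback check)
        h, c = lstm_step(x, h, c, W_b)
        bwd.append(h)
    bwd.reverse()                      # realign to forward positions
    return [f + b for f, b in zip(fwd, bwd)]  # concatenate per position

random.seed(1)
dim, hidden = 4, 3
def rand_w():
    return {g: [[random.uniform(-0.5, 0.5) for _ in range(dim + hidden)]
                for _ in range(hidden)] for g in "fiog"}
dl, dc, dp = ([random.uniform(-1, 1) for _ in range(dim)] for _ in range(3))
outs = bilstm([dl, dc, dp], rand_w(), rand_w(), hidden)
# one 2*hidden-dim output per sub-task, fed to the law/crime/term classifiers
```

The per-position concatenation is what pairs each task's forward state with the reverse state computed after its following tasks, which is how the feedback verification enters the representation.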
The invention also discloses a legal decision prediction system based on feedback sequence multitask learning, comprising:
a text feature representation learning module, which learns a text feature representation of the case description using a single-task, representation-learning-based legal prediction method;
a legal decision prediction module, which takes the information of the preceding task and the feedback information of the following task of each subtask as input to the current task, considers the sequence relation and the reverse verification relation among the subtasks, and realizes legal decision prediction based on feedback sequence multitask learning.
The one or more technical solutions above have the following beneficial effects:
The invention extends legal decision prediction from considering a single subtask to a multitask learning method that considers the sequence relation and reverse verification relation among tasks. On the one hand, multitask learning lets the subtasks complement one another through shared information; on the other hand, taking the information of the preceding task and the feedback information of the following task as input to each current task accounts for the sequence relation and reverse verification relation among the subtasks and further improves the prediction accuracy of legal decision prediction.
By combining single-task representation learning with a feedback-based sequential multitask learning method, the invention exploits the advantages of both in legal decision prediction: it overcomes the drawback that single-task representation learning makes no targeted use of the complementary information of other tasks and, compared with conventional multitask learning methods, improves the accuracy and robustness of the decision prediction results.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the invention.
FIG. 1 is a flow chart of a legal decision prediction method based on feedback sequence multitasking learning in an embodiment of the invention;
FIG. 2 is a schematic diagram of a multi-task pre-training classification model according to an embodiment of the invention.
Detailed Description
It should be noted that the following detailed description is exemplary and is intended to provide further explanation of the invention. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs.
It is noted that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to limit the exemplary embodiments of the invention. As used herein, the singular forms are intended to include the plural forms as well, unless the context clearly indicates otherwise; furthermore, the terms "comprises" and/or "comprising", when used in this specification, specify the presence of the stated features, steps, operations, devices, components, and/or combinations thereof.
Embodiments of the invention and features of the embodiments may be combined with each other without conflict.
The general idea of the invention:
A text feature representation of the case description is learned using a single-task, representation-learning-based legal prediction method; the sequence relation and reverse verification relation among the subtasks are considered by taking the information of each subtask's preceding task and the feedback information of its following task as input to the current task; finally, legal decision prediction based on feedback sequence multitask learning is realized.
The key technical step: the sequence relation and reverse verification are realized with a bidirectional LSTM. The forward LSTM takes the preceding task and the current task as input, modeling the sequence between tasks; the reverse LSTM takes the following task and the current task as input, realizing reverse verification of the tasks. This part corresponds to the eighth step below.
Example 1
Referring to fig. 1, this embodiment discloses a legal decision prediction method based on feedback sequence multitask learning, comprising the following specific steps:
The first step: obtain a training dataset of case descriptions and their related laws, crimes, and criminal periods.
The second step: perform text feature representation learning on the case descriptions to obtain their vector representations.
The third step: perform text feature representation learning on all laws, crimes, and criminal periods to obtain their feature vector representations.
Fourth step: construct a multitask pre-training model for law prediction, crime prediction, and criminal-period prediction from the case description, and obtain the corresponding pre-classification vectors of the three subtasks.
Fifth step: fuse the law-article vector pointed to by the pre-classification vector of the law prediction task with the case description representation vector to obtain the case-law representation vector.
Sixth step: fuse the crime vector corresponding to the crime pointed to by the pre-classification vector of the crime prediction task with the case representation vector to obtain the case-crime representation vector.
Seventh step: fuse the criminal-period vector corresponding to the criminal period pointed to by the pre-classification vector of the criminal-period prediction task with the case representation vector to obtain the case-criminal-period representation vector.
Eighth step: input the case-law vector, the case-crime vector, and the case-criminal-period vector into a bidirectional long short-term memory neural network (Bidirectional Long Short-Term Memory, Bi-LSTM) to obtain high-level semantic representations of the three vectors.
Ninth step: construct classifiers for the law, crime, and criminal period on the high-level feature representations of these three vectors.
Tenth step: output the prediction results for the law, crime, and criminal period.
In the second step, a vector representation d_i (i = 1, 2, ..., D) of each case description text is obtained by a representation-learning-based method.
In the third step, representation learning is likewise used to obtain the vectors of each law article, crime, and criminal period, denoted l_i (i = 1, 2, ..., L), c_i (i = 1, 2, ..., C), and p_i (i = 1, 2, ..., P) respectively.
In the fourth step, the case description vectors d_i (i = 1, 2, ..., D) obtained in the second step are input into a multitask classifier; training this multitask classification model pre-classifies the laws, crimes, and criminal periods, yielding the law classification vectors lr_i, crime prediction vectors cr_i, and criminal-period prediction vectors pr_i (i = 1, 2, ..., D).
In the fifth step, each case description vector d_i is fused with the law-article vector l_j pointed to by the corresponding law prediction vector lr_i obtained in the fourth step, giving the case-law representation vector dl_i (i = 1, 2, ..., D).
In the sixth step, each case description vector d_i is fused with the crime vector c_j pointed to by the corresponding crime prediction vector cr_i obtained in the fourth step, giving the case-crime representation vector dc_i (i = 1, 2, ..., D).
In the seventh step, each case description vector d_i is fused with the criminal-period vector p_j pointed to by the corresponding criminal-period prediction vector pr_i obtained in the fourth step, giving the case-criminal-period representation vector dp_i (i = 1, 2, ..., D).
In the eighth step, the case-law, case-crime, and case-criminal-period vectors obtained in the fifth, sixth, and seventh steps are input in sequence into a Bi-LSTM network for training, yielding the corresponding high-level feature representations of the three vectors.
In the ninth step, these high-level feature representations are input into the three classifiers to predict the law, crime, and criminal period.
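The per-task classifiers of the ninth step can be sketched as a linear layer plus softmax, trained with cross-entropy as in Step 10; all numbers below are illustrative only.

```python
import math

def softmax(z):
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]

def cross_entropy(probs, gold):
    # negative log-likelihood of the gold class
    return -math.log(probs[gold])

def classify(h, W, b):
    # linear layer over the high-level feature representation, then softmax
    return softmax([sum(w * v for w, v in zip(row, h)) + bi
                    for row, bi in zip(W, b)])

h_law = [0.3, -0.1, 0.8]                      # hypothetical case-law features
W_law = [[0.5, 0.2, -0.1], [-0.3, 0.4, 0.6]]  # two candidate law articles
probs = classify(h_law, W_law, [0.0, 0.0])
loss = cross_entropy(probs, gold=1)
```

The crime and criminal-period classifiers have the same shape, differing only in the number of output classes.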
In this embodiment, text feature representation learning:
Feature representation learning for text means representing the semantic, syntactic, and other information of the text in a low-dimensional dense vector space through a modeling method, and then performing computation and reasoning in that space. Representation learning for text is largely divided into three granularities: word vector representations, sentence vector representations, and document vector representations.
This embodiment mainly uses the BERT model released by Google. BERT stands for Bidirectional Encoder Representations from Transformers, i.e., the encoder of a bidirectional Transformer. Its main innovation lies in the pre-training objectives, namely the masked language model (Masked Language Model) and next sentence prediction (Next Sentence Prediction), which capture word-, sentence-, and discourse-level features respectively. With the BERT model, pre-training on the case fact descriptions of the training dataset yields a language model for the legal prediction tasks, and from it the vector representations d_i (i = 1, 2, ..., D) of the D case fact descriptions. At the same time, for the law-article contents, crime descriptions, and criminal-period descriptions in this task, dictionary lookup based on the BERT model yields the law-article vectors l_i (i = 1, 2, ..., L), crime description vectors c_i (i = 1, 2, ..., C), and criminal-period description vectors p_i (i = 1, 2, ..., P).
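As a stand-in for the BERT-based "dictionary lookup", the sketch below uses a hypothetical token-embedding table with mean pooling. The real embodiment would use BERT's trained encoder; the tokens, dimensions, and values here are invented for illustration.

```python
# hypothetical 3-dim token embedding table standing in for a trained encoder
emb = {
    "theft":    [0.9, 0.1, 0.0],
    "property": [0.7, 0.2, 0.1],
    "article":  [0.0, 0.5, 0.5],
    "264":      [0.1, 0.4, 0.6],
}

def encode(tokens, table, dim=3):
    # mean-pool the embeddings of known tokens; unknown tokens are skipped
    vecs = [table[t] for t in tokens if t in table]
    if not vecs:
        return [0.0] * dim
    return [sum(v[k] for v in vecs) / len(vecs) for k in range(dim)]

crime_vec = encode(["theft", "property"], emb)  # crime description vector
law_vec = encode(["article", "264"], emb)       # law-article content vector
```

The same lookup-and-pool pattern would produce the criminal-period description vectors.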
In this embodiment, the multitask pre-trained classification model:
Referring to fig. 2, multitask learning (Multitask Learning, MTL) is one of the transfer learning approaches. Transfer learning can be understood as defining a source domain and a target domain, learning in the source domain, and transferring the learned knowledge to the target domain to improve learning there. Deep learning has two common multitask modes: hard and soft sharing of hidden-layer parameters. This embodiment takes the hard-parameter-sharing mechanism as an example, but is not limited to it; hard sharing is typically implemented by sharing the hidden layers among all tasks while keeping a separate output layer for each specific task.
Using the case fact descriptions in the training dataset and their corresponding law, crime, and criminal-period labels, the hard-parameter-sharing multitask learning method yields, for each case fact description, the law classification vector lr_i, the crime prediction vector cr_i, and the criminal-period prediction vector pr_i (i = 1, 2, ..., D).
In this embodiment, a two-way long and short memory module network (Bi-LSTM) modeling task sequence relationships and feedback relationships:
A long short-term memory network (LSTM) is a recurrent neural network designed to mitigate the vanishing-gradient problem of ordinary recurrent neural networks (RNNs). An LSTM network is built from LSTM blocks; the gates in each block can retain values over arbitrary time spans, and a block typically contains three gates: a forget gate, an input gate and an output gate. LSTM is mainly used to encode contextual information in time series. The invention employs the gates in the LSTM block to encode the sequential relationship between tasks in multi-task learning and the verification relationship between a subsequent task and the current task. The learning process is as follows:
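A minimal sketch of the three gates in one LSTM block, with illustrative random weights rather than a trained cell:

```python
import numpy as np

rng = np.random.default_rng(3)
DIM, HID = 8, 8
# One weight matrix per gate plus one for the candidate cell content.
Wf, Wi, Wo, Wc = (rng.normal(scale=0.1, size=(DIM + HID, HID)) for _ in range(4))

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_cell(x, h_prev, c_prev):
    z = np.concatenate([x, h_prev])     # input + previous output state
    f = sigmoid(z @ Wf)                 # forget gate: what to drop from c
    i = sigmoid(z @ Wi)                 # input gate: what to write to c
    o = sigmoid(z @ Wo)                 # output gate: what to expose
    c = f * c_prev + i * np.tanh(z @ Wc)  # new cell state
    h = o * np.tanh(c)                  # new output state
    return h, c

x = rng.normal(size=DIM)
h1, c1 = lstm_cell(x, np.zeros(HID), np.zeros(HID))
print(h1.shape, c1.shape)
```

It is these gated updates of (h, c) that Steps 4-9 below chain across the law article, charge and prison-term tasks.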
Step 1: for each case fact description vector d_i in a batch of m descriptions drawn from D, obtain the prediction vector lr_i for law articles from the pre-classification results of the multi-task pre-trained classification model, take out the law article vector l_j corresponding to the largest element of lr_i, concatenate it with the case fact description vector d_i, and feed the result to a fully connected layer to obtain the case-law-article vector representation dl_i;
Step 2: for each d_i, obtain the charge prediction vector cr_i from the pre-classification results of the multi-task pre-trained classification model, take out the charge vectors c_j corresponding to the elements of cr_i whose value exceeds 0.5, concatenate these vectors with d_i, and feed the result to a fully connected layer to obtain the case-charge vector representation dc_i;
Step 3: for each d_i, obtain the prison-term prediction vector pr_i from the pre-classification results of the multi-task pre-trained classification model, take out the prison-term vector p_j corresponding to the largest element of pr_i, concatenate it with d_i, and feed the result to a fully connected layer to obtain the case-prison-term vector representation dp_i;
Step 4: randomly initialize the initial cell state c_0 and output state h_0 of the forward LSTM module; with the case-law-article vector dl_i as the input vector, compute the module's cell state c_1^f and forward output state h_1^f;
Step 5: with the cell state c_1^f and the forward output state h_1^f as the previous-moment cell state and input state of the forward LSTM module, and the case-charge vector dc_i as the input vector, compute the cell state c_2^f and the forward output state h_2^f;
Step 6: with the cell state c_2^f and the forward output state h_2^f as the previous-moment cell state and input state of the forward LSTM module, and the case-prison-term vector dp_i as the input vector, compute the cell state c_3^f and the forward output state h_3^f;
Step 7: with the randomly initialized previous-moment cell state and output state of the reverse LSTM module, and the case-prison-term vector dp_i as the input vector, compute the cell state c_1^b and the reverse output state h_1^b;
Step 8: with the cell state c_1^b and the reverse output state h_1^b as the previous-moment cell state and input state of the reverse LSTM module, and the case-charge vector dc_i as the input vector, compute the cell state c_2^b and the reverse output state h_2^b;
Step 9: with the cell state c_2^b and the reverse output state h_2^b as the previous-moment cell state and input state of the reverse LSTM module, and the case-law-article vector dl_i as the input vector, compute the cell state c_3^b and the reverse output state h_3^b.
Step 10: splice the forward and reverse output states of each task to obtain h_i^l = [h_1^f; h_3^b], h_i^c = [h_2^f; h_2^b] and h_i^p = [h_3^f; h_1^b], which, as the representations corresponding to the case fact description d_i, are input to the law article classifier, the charge classifier and the prison-term classifier respectively; the cross-entropy loss function for the batch input is computed and the parameters are updated.
Step 11: if the number of iterations is less than the limit, return to Step 1.
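The Steps 1-10 procedure can be sketched end to end as follows. The weights, label vectors and pre-classification scores below are illustrative stand-ins for the trained model, not the embodiment's actual parameters:

```python
import numpy as np

# Sketch: fuse the case vector with the selected label vectors (Steps 1-3),
# run a forward LSTM over (dl, dc, dp) and a reverse LSTM over (dp, dc, dl)
# (Steps 4-9), then splice matching outputs as classifier inputs (Step 10).
rng = np.random.default_rng(4)
DIM = 8

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def make_cell():
    W = rng.normal(scale=0.1, size=(4, 2 * DIM, DIM))
    def cell(x, h, c):
        z = np.concatenate([x, h])
        f, i, o = (sigmoid(z @ W[k]) for k in range(3))
        c_new = f * c + i * np.tanh(z @ W[3])   # forget + input gates
        h_new = o * np.tanh(c_new)              # output gate
        return h_new, c_new
    return cell

# Steps 1-3: select the predicted label vector(s) and fuse with the case vector.
W_fc = rng.normal(scale=0.1, size=(2 * DIM, DIM))
def fuse(d_i, label_vec):
    return np.tanh(np.concatenate([d_i, label_vec]) @ W_fc)

d_i = rng.normal(size=DIM)
law_vecs = rng.normal(size=(5, DIM))
charge_vecs = rng.normal(size=(4, DIM))
term_vecs = rng.normal(size=(3, DIM))
lr_i = np.array([.1, .6, .1, .1, .1])          # law article pre-classification
cr_i = np.array([.7, .2, .8, .1])              # multi-label charge scores
pr_i = np.array([.2, .5, .3])                  # prison-term pre-classification
dl = fuse(d_i, law_vecs[lr_i.argmax()])
dc = fuse(d_i, charge_vecs[cr_i > 0.5].mean(axis=0))
dp = fuse(d_i, term_vecs[pr_i.argmax()])

# Steps 4-9: one pass per direction, carrying (h, c) between tasks.
def run(cell, seq):
    h, c = rng.normal(size=DIM), rng.normal(size=DIM)  # random initial state
    outs = []
    for x in seq:
        h, c = cell(x, h, c)
        outs.append(h)
    return outs

hf = run(make_cell(), [dl, dc, dp])
hb = run(make_cell(), [dp, dc, dl])

# Step 10: splice forward and reverse outputs belonging to the same task.
h_law, h_charge, h_term = (np.concatenate([hf[j], hb[2 - j]]) for j in range(3))
print(h_law.shape, h_charge.shape, h_term.shape)
```

Each spliced vector carries both the preceding-task information (forward pass) and the feedback from subsequent tasks (reverse pass) before reaching its classifier.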
Example two
An object of this embodiment is to provide a computing device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor, when executing the program, implements the steps of the legal decision prediction method based on feedback sequence multi-task learning of the first embodiment.
Example III
An object of the present embodiment is to provide a computer-readable storage medium.
A computer-readable storage medium having stored thereon a computer program which, when executed by a processor, implements the steps of the legal decision prediction method based on feedback sequence multi-task learning of the first embodiment.
Example IV
The invention also discloses a legal decision prediction system based on feedback sequence multitask learning, which comprises the following steps:
the text feature representation learning module is used for realizing text feature representation learning of the case description by using a single-task legal prediction method based on representation learning;
and the legal judgment prediction module takes the information of the preceding task and the feedback information of the following task of each subtask as the input of the current task, considers the sequence relation and the reverse verification relation among the subtasks, and realizes the legal judgment prediction based on feedback sequence multitask learning.
The steps involved in the devices of the second, third and fourth embodiments correspond to those of the first embodiment of the method, and the detailed description of the embodiments can be found in the related description section of the first embodiment. The term "computer-readable storage medium" should be taken to include a single medium or multiple media including one or more sets of instructions; it should also be understood to include any medium capable of storing, encoding or carrying a set of instructions for execution by a processor and that cause the processor to perform any one of the methods of the present invention.
It will be appreciated by those skilled in the art that the modules or steps of the invention described above may be implemented by a general-purpose computing device; alternatively, they may be implemented by program code executable by a computing device, so that they may be stored in a storage device and executed by the computing device, or fabricated separately as individual integrated circuit modules, or multiple modules or steps among them may be fabricated as a single integrated circuit module. The present invention is not limited to any specific combination of hardware and software.
The above description is only of the preferred embodiments of the present invention and is not intended to limit the present invention, but various modifications and variations can be made to the present invention by those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention should be included in the protection scope of the present invention.
While the foregoing description of the embodiments of the present invention has been presented in conjunction with the drawings, it should be understood that it is not intended to limit the scope of the invention, but rather, it is intended to cover all modifications or variations within the scope of the invention as defined by the claims of the present invention.

Claims (8)

1. A law decision prediction method based on feedback sequence multitask learning is characterized by comprising the following steps:
the text characteristic representation learning of the case description is realized by using a single-task law prediction method based on representation learning;
by taking the information of the preceding task and the feedback information of the following task of each subtask as the input of the current task, the sequence relation and the reverse verification relation among the subtasks are considered, and legal judgment prediction based on feedback sequence multitask learning is realized;
constructing a multi-task pre-training model for law article prediction, charge prediction and prison-term prediction based on the case description, and obtaining the corresponding pre-classification vectors of the three subtasks;
performing feature fusion on the law article vector pointed to by the pre-classification vector of the law article prediction task and the case description representation vector to obtain a case-law-article representation vector;
performing feature fusion on the charge vector corresponding to the charge pointed to by the pre-classification vector of the charge prediction task and the case description representation vector to obtain a case-charge representation vector;
performing feature fusion on the prison-term vector corresponding to the prison term pointed to by the pre-classification vector of the prison-term prediction task and the case description representation vector to obtain a case-prison-term representation vector;
inputting the case-law-article vector, the case-charge vector and the case-prison-term vector into a bidirectional long short-term memory network to obtain high-level semantic representations of the three vectors;
constructing classifiers for law articles, charges and prison terms based on the high-level semantic representations of the case-law-article vector, the case-charge vector and the case-prison-term vector;
inputting the high-level semantic representations into the three classifiers to predict the law article, the charge and the prison term;
the gates in the LSTM block are adopted to encode the sequential relationship between tasks in multi-task learning and the verification relationship between the subsequent task and the current task:
Step 1: for each case description representation vector d_i in a batch of case fact descriptions D, obtain the prediction vector lr_i for law articles from the pre-classification results of the multi-task pre-trained classification model, take out the law article vector l_j corresponding to the largest element of lr_i, concatenate it with the case description representation vector d_i, and feed the result to a fully connected layer to obtain the case-law-article vector representation dl_i;
Step 2: for each case description representation vector d_i, obtain the charge prediction vector cr_i from the pre-classification results of the multi-task pre-trained classification model, take out the charge vectors corresponding to the selected elements of cr_i, concatenate these vectors with d_i, and feed the result to a fully connected layer to obtain the case-charge vector representation dc_i;
Step 3: for each case description representation vector d_i, obtain the prison-term prediction vector pr_i from the pre-classification results of the multi-task pre-trained classification model, take out the prison-term vector p_j corresponding to the largest element of pr_i, concatenate it with d_i, and feed the result to a fully connected layer to obtain the case-prison-term vector representation dp_i;
Step 4: randomly initialize the initial cell state c_0 and output state h_0 of the forward LSTM module; with the case-law-article vector dl_i as the input vector, compute the module's cell state c_1^f and forward output state h_1^f;
Step 5: with the cell state c_1^f and the forward output state h_1^f as the previous-moment cell state and input state of the forward LSTM module, and the case-charge vector dc_i as the input vector, compute the cell state c_2^f and the forward output state h_2^f;
Step 6: with the cell state c_2^f and the forward output state h_2^f as the previous-moment cell state and input state of the forward LSTM module, and the case-prison-term vector dp_i as the input vector, compute the cell state c_3^f and the forward output state h_3^f;
Step 7: with the randomly initialized previous-moment cell state and output state of the reverse LSTM module, and the case-prison-term vector dp_i as the input vector, compute the cell state c_1^b and the reverse output state h_1^b;
Step 8: with the cell state c_1^b and the reverse output state h_1^b as the previous-moment cell state and input state of the reverse LSTM module, and the case-charge vector dc_i as the input vector, compute the cell state c_2^b and the reverse output state h_2^b;
Step 9: with the cell state c_2^b and the reverse output state h_2^b as the previous-moment cell state and input state of the reverse LSTM module, and the case-law-article vector dl_i as the input vector, compute the cell state c_3^b and the reverse output state h_3^b;
Step 10: splice the forward and reverse output states of each task to obtain h_i^l = [h_1^f; h_3^b], h_i^c = [h_2^f; h_2^b] and h_i^p = [h_3^f; h_1^b], which, as the representations corresponding to the case description vector d_i, are input to the law article classifier, the charge classifier and the prison-term classifier respectively; the cross-entropy loss function for the batch input is computed and the parameters are updated;
Step 11: if the number of iterations is less than the limit, return to Step 1.
2. The legal decision prediction method based on feedback sequence multi-task learning as recited in claim 1, wherein a training data set of case descriptions and their related law articles, charges and prison terms is obtained from a data center server and stored in a database;
text feature representation learning is performed on the case descriptions to obtain their vector representations;
text feature representation learning is performed on all law articles, charges and prison terms to obtain their feature vector representations.
3. The legal decision prediction method based on feedback sequence multi-task learning as claimed in claim 1, wherein a multi-task pre-training model for law article prediction, charge prediction and prison-term prediction based on the case description is constructed, and the corresponding pre-classification vectors of the three subtasks are obtained:
the obtained case description representation vectors are input to a multi-task classifier, and the multi-task classification model is trained to pre-classify law articles, charges and prison terms, thereby obtaining law article classification vectors, charge prediction vectors and prison-term prediction vectors.
4. The legal decision prediction method based on feedback sequence multi-task learning of claim 1, wherein a language model for the legal prediction tasks is obtained by pre-training a BERT model on the case description training data set, yielding D case description representation vectors;
based on the BERT model, a dictionary-lookup scheme is adopted for the law article contents, charge descriptions and prison-term descriptions to obtain law article vectors, charge description vectors and prison-term description vectors.
5. The legal decision prediction method based on feedback sequence multi-task learning as claimed in claim 1, wherein the law article classification vector, charge prediction vector and prison-term prediction vector of each case description are obtained by applying a hard parameter sharing multi-task learning method to the case descriptions and the corresponding law article, charge and prison-term labels in the training data set.
6. A legal decision prediction system based on feedback sequence multitasking learning, comprising:
the text feature representation learning module is used for realizing text feature representation learning of the case description by using a single-task legal prediction method based on representation learning;
the legal judgment prediction module takes the information of the preceding task and the feedback information of the following task of each subtask as the input of the current task, considers the sequence relation and the reverse verification relation among the subtasks, and realizes the legal judgment prediction based on feedback sequence multitask learning;
constructing a multi-task pre-training model for law article prediction, charge prediction and prison-term prediction based on the case description, and obtaining the corresponding pre-classification vectors of the three subtasks;
performing feature fusion on the law article vector pointed to by the pre-classification vector of the law article prediction task and the case description representation vector to obtain a case-law-article representation vector;
performing feature fusion on the charge vector corresponding to the charge pointed to by the pre-classification vector of the charge prediction task and the case description representation vector to obtain a case-charge representation vector;
performing feature fusion on the prison-term vector corresponding to the prison term pointed to by the pre-classification vector of the prison-term prediction task and the case description representation vector to obtain a case-prison-term representation vector;
inputting the case-law-article vector, the case-charge vector and the case-prison-term vector into a bidirectional long short-term memory network to obtain high-level semantic representations of the three vectors;
constructing classifiers for law articles, charges and prison terms based on the high-level semantic representations of the case-law-article vector, the case-charge vector and the case-prison-term vector;
inputting the high-level semantic representations into the three classifiers to predict the law article, the charge and the prison term;
the gates in the LSTM block are adopted to encode the sequential relationship between tasks in multi-task learning and the verification relationship between the subsequent task and the current task:
Step 1: for each case description representation vector d_i in a batch of case fact descriptions D, obtain the prediction vector lr_i for law articles from the pre-classification results of the multi-task pre-trained classification model, take out the law article vector l_j corresponding to the largest element of lr_i, concatenate it with the case description representation vector d_i, and feed the result to a fully connected layer to obtain the case-law-article vector representation dl_i;
Step 2: for each case description representation vector d_i, obtain the charge prediction vector cr_i from the pre-classification results of the multi-task pre-trained classification model, take out the charge vectors corresponding to the selected elements of cr_i, concatenate these vectors with d_i, and feed the result to a fully connected layer to obtain the case-charge vector representation dc_i;
Step 3: for each case description representation vector d_i, obtain the prison-term prediction vector pr_i from the pre-classification results of the multi-task pre-trained classification model, take out the prison-term vector p_j corresponding to the largest element of pr_i, concatenate it with d_i, and feed the result to a fully connected layer to obtain the case-prison-term vector representation dp_i;
Step 4: randomly initialize the initial cell state c_0 and output state h_0 of the forward LSTM module; with the case-law-article vector dl_i as the input vector, compute the module's cell state c_1^f and forward output state h_1^f;
Step 5: with the cell state c_1^f and the forward output state h_1^f as the previous-moment cell state and input state of the forward LSTM module, and the case-charge vector dc_i as the input vector, compute the cell state c_2^f and the forward output state h_2^f;
Step 6: with the cell state c_2^f and the forward output state h_2^f as the previous-moment cell state and input state of the forward LSTM module, and the case-prison-term vector dp_i as the input vector, compute the cell state c_3^f and the forward output state h_3^f;
Step 7: with the randomly initialized previous-moment cell state and output state of the reverse LSTM module, and the case-prison-term vector dp_i as the input vector, compute the cell state c_1^b and the reverse output state h_1^b;
Step 8: with the cell state c_1^b and the reverse output state h_1^b as the previous-moment cell state and input state of the reverse LSTM module, and the case-charge vector dc_i as the input vector, compute the cell state c_2^b and the reverse output state h_2^b;
Step 9: with the cell state c_2^b and the reverse output state h_2^b as the previous-moment cell state and input state of the reverse LSTM module, and the case-law-article vector dl_i as the input vector, compute the cell state c_3^b and the reverse output state h_3^b;
Step 10: splice the forward and reverse output states of each task to obtain h_i^l = [h_1^f; h_3^b], h_i^c = [h_2^f; h_2^b] and h_i^p = [h_3^f; h_1^b], which, as the representations corresponding to the case description vector d_i, are input to the law article classifier, the charge classifier and the prison-term classifier respectively; the cross-entropy loss function for the batch input is computed and the parameters are updated;
Step 11: if the number of iterations is less than the limit, return to Step 1.
7. A computing device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor, when executing the program, implements the steps of the legal decision prediction method based on feedback sequence multi-task learning of any one of claims 1-5.
8. A computer-readable storage medium having stored thereon a computer program which, when executed by a processor, implements the steps of the legal decision prediction method based on feedback sequence multi-task learning of any one of claims 1-5.
CN202010031722.9A 2020-01-13 2020-01-13 Legal decision prediction method and system based on feedback sequence multitask learning Active CN111259673B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010031722.9A CN111259673B (en) 2020-01-13 2020-01-13 Legal decision prediction method and system based on feedback sequence multitask learning

Publications (2)

Publication Number Publication Date
CN111259673A CN111259673A (en) 2020-06-09
CN111259673B true CN111259673B (en) 2023-05-09

Family

ID=70945221


Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112015659A (en) * 2020-09-02 2020-12-01 三维通信股份有限公司 Prediction method and device based on network model
CN112131370B (en) * 2020-11-23 2021-03-12 四川大学 Question-answer model construction method and system, question-answer method and device and trial system

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108229582A (en) * 2018-02-01 2018-06-29 浙江大学 Entity recognition dual training method is named in a kind of multitask towards medical domain
CN109241528A (en) * 2018-08-24 2019-01-18 讯飞智元信息科技有限公司 A kind of measurement of penalty prediction of result method, apparatus, equipment and storage medium
CN109255119A (en) * 2018-07-18 2019-01-22 五邑大学 A kind of sentence trunk analysis method and system based on the multitask deep neural network for segmenting and naming Entity recognition
CN109376227A (en) * 2018-10-29 2019-02-22 山东大学 A kind of prison term prediction technique based on multitask artificial neural network
CN109829055A (en) * 2019-02-22 2019-05-31 苏州大学 User's law article prediction technique based on filtering door machine
CN109919175A (en) * 2019-01-16 2019-06-21 浙江大学 A kind of more classification methods of entity of combination attribute information

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
"Legal Judgment Prediction via Multi-Perspective Bi-Feedback Network";Wenmian Yang 等;《arXiv》;20190516;第1-7页 *
融入罪名关键词的法律判决预测多任务学习模型;刘宗林 等;《清华大学学报(自然科学版)》;20190410;第1-8页 *



Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant