CN113505207A - Machine reading understanding method and system for financial public opinion research and report - Google Patents

Machine reading understanding method and system for financial public opinion research and report Download PDF

Info

Publication number
CN113505207A
CN113505207A CN202110748656.1A CN202110748656A CN113505207A CN 113505207 A CN113505207 A CN 113505207A CN 202110748656 A CN202110748656 A CN 202110748656A CN 113505207 A CN113505207 A CN 113505207A
Authority
CN
China
Prior art keywords
data
financial
public opinion
answer
labeling
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110748656.1A
Other languages
Chinese (zh)
Other versions
CN113505207B (en
Inventor
成昊
龚慧敏
敖翔
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhongke Suzhou Intelligent Computing Technology Research Institute
Original Assignee
Zhongke Suzhou Intelligent Computing Technology Research Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhongke Suzhou Intelligent Computing Technology Research Institute filed Critical Zhongke Suzhou Intelligent Computing Technology Research Institute
Priority to CN202110748656.1A priority Critical patent/CN113505207B/en
Publication of CN113505207A publication Critical patent/CN113505207A/en
Application granted granted Critical
Publication of CN113505207B publication Critical patent/CN113505207B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation or dialogue systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3344Query execution using natural language analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • G06F16/355Class or cluster creation or modification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/194Calculation of difference between files
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Artificial Intelligence (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Human Computer Interaction (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • Medical Informatics (AREA)
  • Computing Systems (AREA)
  • Electrically Operated Instructional Devices (AREA)

Abstract

The invention discloses a machine reading understanding method and a machine reading understanding system for financial public opinion research, wherein the method mainly comprises data formulation and collection, training data marking, deep learning model construction and answer organization, and specifically predefines the question set of a user according to the requirements of the financial vertical field, and collects public opinion data associated with the question set; finding out data which is matched with the problems in the predefined problem set from public opinion data through keyword matching, screening out sentences containing problem answers in the data by using a supervised model, and labeling the data; acquiring vector representation of characters by using a pre-trained BERT model in the financial field, and interacting data and problems by using an attention mechanism in a natural language processing method to obtain fusion vector representation which can be understood by a computer; and logically combining more than two answers fed back by the deep learning model. According to the technical scheme, the accuracy rate and the processing efficiency of machine reading understanding are improved by using the supervised model of the labeled data.

Description

Machine reading understanding method and system for financial public opinion research and report
Technical Field
The invention relates to a technology for solving article semantics and answering related questions by a computer, in particular to a method and a system for machine reading understanding in the financial field based on a supervised and deep learning algorithm.
Background
Machine Reading Comprehension (MRC) is a technique that uses algorithms to make computing mechanisms solve article semantics and answer related questions. Since both articles and questions take the form of human language, machine-read understanding falls into the category of Natural Language Processing (NLP) and is one of the most recent topics among them. In recent years, with the development of machine learning, especially deep learning, machine reading understanding research has advanced sufficiently, and the head and corner of the user can be exposed in practical application.
More than 2016, statistical learning methods were used, involving a large amount of feature engineering, which was time-consuming and labor-intensive. After 2016, SQuAD datasets were released, some attention-based matching models, such as BiDAF, LSTM, etc., appeared. This has been followed by relatively complex models of various network structures, and correlation efforts have been made to capture matching relationships between questions and chapters through complex network structures. After 2018, with the emergence of various pre-trained language models, the reading understanding model effect is greatly improved in a near step, because the capability of a presentation layer becomes very strong, and a task-related network structure becomes simple.
In machine-reading understanding technology applications, there are four common tasks, which are described below:
firstly, completing shape filling: given article C, hiding one of the words or entities a (a ∈ C) as a question to fill in the gap, the completion gap-filling task requires that the correct word or entity a be filled in by maximizing the conditional probability P (a | C- { a }).
II, selecting a plurality of items: given an article C, a question Q, and a series of candidate answer sets, the multiple choice task picks out the correct answer question Q from the candidate answer set a by maximizing the conditional probability.
Thirdly, fragment extraction: given an article C (which contains n words) and a question Q, the segment extraction task extracts successive subsequences from the article as correct answers to the question by maximizing the conditional probability P (a | C, Q).
Fourthly, freely answering: given article C and question Q, the correct answer a to answer freely may sometimes not be a subsequence of article C, i.e., a ⊆ C or a lean C. The free-answer task predicts the correct answer a to answer the question Q by maximizing the conditional probability P (a | C, Q).
Free question answering is the most difficult of the four tasks and is also the task of most interest and concern in the industry. The answer form of the free answer task is very flexible, the understanding of natural language can be well tested, and the method is most close to the practical application, but the data set structure of the task is relatively difficult, and how to effectively evaluate the model effect needs to be deeply researched.
As shown in fig. 1, a typical machine reading understanding system generally includes four modules, namely, embedded coding, feature extraction, article-question interaction and answer prediction, which are described as follows:
embedding and coding: this module converts the input articles and questions in natural language into fixed-dimension vectors for subsequent processing by the machine. Early commonly used methods were traditional word representation methods such as one-hot representation and distributed word vectors, and context-based word representation methods pre-trained by large-scale corpora in recent two years have also been widely used, such as ELMo, GPT, Bert, and the like. Meanwhile, in order to better represent information such as semantic syntax, the word vector may be combined with linguistic features such as part-of-speech tags, named entities, question types, and the like to represent the word vector at a finer granularity.
Feature extraction: the word vector representations of the articles and questions encoded via the embedded coding layer are then passed to the feature extraction module to extract more context information. Common neural network models used in this module are Recurrent Neural Networks (RNNs), Convolutional Neural Networks (CNNs), and transform structures based on a multi-headed self-attention mechanism.
Article-question interaction: to achieve this goal, the article-question interaction module often uses a one-way or two-way attention mechanism to emphasize the portions of the original text that are more relevant to the question. Meanwhile, in order to deeply mine the relationship between the article and the question, the interaction process between the article and the question may be performed multiple times, so as to simulate the repeated reading behavior of human beings in reading understanding.
And (3) answer prediction: this module makes the final answer prediction based on the accumulated information from the three modules. The implementation of this module is highly task-dependent, as common machine-reading understanding tasks can be categorized by answer type.
However, the accuracy of the existing machine reading understanding model cannot meet the relatively complex requirements of the financial field in the industry, the response speed cannot meet the requirements of real-time question and answer, and the problem that the answer cannot be answered cannot be identified, so that the given answer and the problem do not meet or are far away from each other under specific conditions, and the reference significance is lacked.
Disclosure of Invention
In view of the defects of the prior art, the invention aims to provide a machine reading understanding method and a machine reading understanding system for financial public opinion research and newspaper, and solves the problems of insufficient accuracy and practicability and low efficiency of machine reading understanding in the financial field.
The invention achieves a technical solution of the above purpose: machine reading understanding method of financial public opinion research and newspaper is characterized by comprising the following steps:
data formulation and collection, corresponding to the requirements of the financial vertical field, predefining a question set of a user, and collecting public opinion data associated with the question set;
training data labeling, namely finding out data which is relevant to the problems in a predefined problem set from public sentiment data through keyword matching, screening out sentences containing problem answers in the data by utilizing a supervised model, and labeling the data;
constructing a deep learning model, namely acquiring vector representation of characters by using a pre-trained BERT model in the financial field, and interacting data and problems by using an attention mechanism in a natural language processing method to obtain fusion vector representation which can be understood by a computer;
and (4) answer organization, wherein more than two answers fed back by the deep learning model are logically combined.
The other technical solution of the invention for realizing the above purpose is as follows: machine of finance public opinion research and newspaper reads understanding system, its characterized in that includes:
the data formulating and collecting unit is used for predefining a question set of a user corresponding to the requirements of the financial vertical field and collecting public opinion data associated with the question set;
the training data labeling unit is used for finding out data which is relevant to the problems in the predefined problem set from the public sentiment data through keyword matching, screening out sentences containing problem answers in the data by utilizing a supervised model, and labeling the data;
the deep learning model building unit is used for acquiring vector representation of characters by utilizing a pre-trained BERT model in the financial field, and then interacting data and problems by an attention mechanism in a natural language processing method to obtain fusion vector representation which can be understood by a computer;
and the answer organization unit is used for logically combining more than two answers fed back by the deep learning model.
The new technical solution for detecting the target provided by the invention has obvious progress: the method and the system utilize a supervised model of high-quality labeled data, and improve the accuracy of machine reading understanding; for the input data of the last thousand characters, the processing speed is shortened to 500 ms/time, more emphasis is placed on judging whether the collected data has content points which can be used for answering the questions, and the effect of expert rule type question answering can be achieved by using lower cost.
Drawings
Fig. 1 is a topological schematic of a typical machine reading understanding system.
FIG. 2 is a schematic diagram of the main steps of the machine reading understanding method of the present invention.
Fig. 3 is a detailed flow chart of the machine reading understanding method of the present invention.
Detailed Description
The following detailed description of the embodiments of the present invention is provided in conjunction with the accompanying drawings to make the technical solution of the present invention easier to understand and grasp, so as to define the protection scope of the present invention more clearly.
Aiming at the technical development level of the current machine reading understanding and the insufficient current situation that the current machine reading understanding cannot meet the related requirements of the financial field, the invention innovatively provides a method and a system for machine reading understanding of the financial field based on a supervised deep learning algorithm, so as to solve the problems of insufficient accuracy and practicability and low efficiency of machine reading understanding of the financial field
The machine reading understanding method in the financial field is shown in fig. 2 and mainly comprises four main steps of data formulation and collection, training data labeling, deep learning model construction and answer organization. And the detailed flow implementation structure is shown in fig. 3.
In summary understanding of each step, data specification and collection refers to defining questions which may be asked by a user in advance according to requirements of the financial vertical field, screening out two parts of key questions and common questions by setting a screening threshold value related to the amount of the questions, and searching public opinion data such as news and research reports related to the questions through a web crawler.
The training data labeling refers to finding out data which is relevant to a predefined key problem from the collected public sentiment data through keyword matching, and delivering the data for manual labeling.
The deep learning model construction means that a proper model which can solve the problems needs to be constructed for the prepared training data. Conventional machine learning models do not process such document data well, requiring deep learning models of large scale parameters and structures to process. According to the scheme, a BERT (bidirectional Encoder retrieval from transformations) model obtained by pre-training in the financial field is used for obtaining vector representation of characters, and the model is characterized by good character processing effect, small model and high efficiency aiming at the financial field; and secondly, interacting data and key problems through an Attention mechanism (Attention) in a natural language processing technology to obtain a fusion vector representation which can be understood by a computer.
The sentences containing all key question answers in the data can be screened out by utilizing the stability of the deep learning model (with the supervision function). It should be noted that, when there is no answer related to a key question in a certain piece of data, the corresponding article is labeled as a zero answer set "answer", that is, unlabeled data, which is a key point that can identify a question that cannot be answered. Since this step has a great influence on the deep learning model, the labeling result of the data needs to be manually screened to avoid errors.
The answer organization refers to a built public opinion database and a trained deep learning model, and an answer is returned because the task of the model is reading understanding, namely inputting an input (data and question) in the form of input. This form is not intuitive for human review or summarization, and requires the formulation of an answer organization strategy that logically combines multiple answers. The more specific answer organization process is as follows: selecting one of more keyword text similarity matching algorithms for recalling the first ten data of any problem; II, inquiring all sub-questions or key words of the corresponding questions of the first ten data one by one through the built deep learning model, and obtaining the best answer of each data corresponding to all the sub-questions; III, optimally sorting the answers of the subproblems and comparing the answers with the sorting of the recall data; and IV, taking the splicing result of the first two non-empty answers of one of the sub-questions as the component of the corresponding sub-question in the final answer. The answers obtained by logical organization are more suitable for the reading impression of human beings.
The keyword text similarity matching algorithm has the possibility of diversified selection, and is based on the problem word vector consulted by the user
Figure DEST_PATH_IMAGE002
Public opinion data contained article word vector set
Figure DEST_PATH_IMAGE004
Where d represents the number of articles recalled and k represents the word vector dimension.
Alternative keyword text similarity matching algorithms include: 1. calculating the Euclidean distance:
Figure DEST_PATH_IMAGE006
2. calculating cosine distance:
Figure DEST_PATH_IMAGE008
3. calculating the Jacard similarity coefficient:
Figure DEST_PATH_IMAGE010
wherein Q represents the original text of the question and P represents the original text of the article;
4. pearson correlation coefficient:
Figure DEST_PATH_IMAGE012
the system is realized by programming and modifying a computer corresponding to the machine reading and understanding method. The system architecture body formed by the specific programming comprises the following four parts: the data formulating and collecting unit is used for predefining a question set of a user corresponding to the requirements of the financial vertical field and collecting public opinion data associated with the question set; through a manual input interface of a computer, the user inputs the problems related to the financial field into a background database and formats and stores the problems, and a screening threshold value can be set for screening key problems and common problems of a predefined problem set. (ii) a And accessing the internet cloud data through a network input interface, collecting various information and research reports associated with the question set, and storing the information and the research reports in an independent database in a data-by-data (different lengths).
And the training data labeling unit is used for finding out data which is relevant to important questions in the predefined question set from the public opinion data through keyword matching, screening out sentences containing question answers in the data by utilizing a supervised model, and labeling the data. The mass data processed by the unit are labeled and classified, and higher fine-grained support is provided for the machine learning process of the subsequent deep learning model.
The deep learning model building unit specifically realizes the following description of data and problem interaction:
the former module of the unit obtains the character vector representation through a BERT model pre-trained in the financial field, and comprises the following input: questions of user consultation
Figure RE-RE-DEST_PATH_IMAGE008
(ii) a Related articles
Figure RE-RE-DEST_PATH_IMAGE010
Wherein
Figure RE-RE-DEST_PATH_IMAGE012
Is a collection of articles that are,
Figure RE-RE-DEST_PATH_IMAGE014
(ii) a And (3) outputting: problem word vector representation
Figure RE-RE-DEST_PATH_IMAGE016
(ii) a Article word vector representation
Figure RE-RE-DEST_PATH_IMAGE018
Wherein
Figure RE-RE-DEST_PATH_IMAGE020
Is a set of word vectors for an article,
Figure RE-RE-DEST_PATH_IMAGE022
the process is as follows: initializing the identifiers [ CLS ], [ SEP ], and executing according to the following program flow:
Figure DEST_PATH_IMAGE029
the latter module of the unit interacts data and questions through attention mechanism in natural language processing method, including input: hidden layer output of BERT
Figure RE-DEST_PATH_IMAGE025
(ii) a And (3) outputting: the location of the beginning and end of the answer to the question in the article
Figure RE-DEST_PATH_IMAGE027
The process is as follows: the output Q, P of the previous section of modules is obtained and executed as follows:
Figure DEST_PATH_IMAGE038
and the answer organization unit is used for logically combining more than two answers fed back by the deep learning model, and the detailed description of the specific logic organization process is omitted. And the result of the answer organization is presented through an interface which is externally output by the computer.
From a more intuitive, pictorial example: when a computer system applying the machine reading understanding method of the financial public opinion research and report inputs a problem of 'big plate rising and falling conditions' in a problem input program. And the public opinion data which can be collected through internet access is large in scale, so that ten pieces of most relevant data in the database are recalled through keyword matching algorithms such as 'big dish', 'trend', 'fluctuation', and the like, the ten pieces of data are respectively merged with problems and serve as a built deep learning model to carry out data input for machine reading understanding, and the answer of each piece of data is obtained. And finally, combining the answer processing by using an answer organization interface to obtain a final answer suitable for the human reading perception.
Similarly, the problems of financial network security, scientific plate stock movement and the like are all suitable for the operation and realization of the machine reading understanding method exemplarily described in the previous paragraph.
In summary, the machine reading understanding method and system for applying the financial opinion research of the present invention can be seen in detail in conjunction with the illustrated embodiments, which have outstanding substantive features and significant progress. The method and the system utilize a supervised model of high-quality labeled data, and improve the accuracy of machine reading understanding; for the input data of the last thousand characters, the processing speed is shortened to 500 ms/time, more emphasis is placed on judging whether the collected data has content points which can be used for answering the questions, and the effect of expert rule type question answering can be achieved by using lower cost.
In addition to the above embodiments, the present invention may have other embodiments, and any technical solutions formed by equivalent substitutions or equivalent transformations are within the scope of the present invention as claimed.

Claims (8)

1. A machine reading understanding method of financial public opinion research and newspaper is characterized by comprising the following steps:
data formulation and collection, corresponding to the requirements of the financial vertical field, predefining a question set of a user, and collecting public opinion data associated with the question set;
training data labeling, namely finding out data which is relevant to the problems in a predefined problem set from public sentiment data through keyword matching, screening out sentences containing problem answers in the data by utilizing a supervised model, and labeling the data;
constructing a deep learning model, namely acquiring vector representation of characters by using a pre-trained BERT model in the financial field, and interacting data and problems by using an attention mechanism in a natural language processing method to obtain fusion vector representation which can be understood by a computer;
and (4) answer organization, wherein more than two answers fed back by the deep learning model are logically combined.
2. The machine-readable understanding method of financial public opinion research and report as claimed in claim 1, wherein: and setting a screening threshold value in data formulation and collection, and screening key problems and common problems for a predefined problem set.
3. The machine-readable understanding method of financial public opinion research and report as claimed in claim 1, wherein: in the training data labeling, for the part of data which is not found to be relevant to the questions in the predefined question set, labeling as a zero answer set.
4. The machine-readable understanding method of financial public opinion research according to claim 1 or 3, wherein: in the training data labeling, manual screening is carried out on the labeled data.
5. The machine-readable understanding method of financial public opinion research and report as claimed in claim 1, wherein: the answer organization process comprises the following steps:
selecting one of more keyword text similarity matching algorithms for recalling the first ten data of any problem;
II, inquiring all sub-questions or key words of the corresponding questions of the first ten data one by one through the built deep learning model, and obtaining the best answer of each data corresponding to all the sub-questions;
III, optimally sorting the answers of the subproblems and comparing the answers with the sorting of the recall data;
and IV, taking the splicing result of the first two non-empty answers of one of the sub-questions as the component of the corresponding sub-question in the final answer.
6. A machine reading understanding system of finance public opinion research and newspaper is characterized by comprising:
the data formulating and collecting unit is used for predefining a question set of a user corresponding to the requirements of the financial vertical field and collecting public opinion data associated with the question set;
the training data labeling unit is used for finding out data which is relevant to the problems in the predefined problem set from the public sentiment data through keyword matching, screening out sentences containing problem answers in the data by utilizing a supervised model, and labeling the data;
the deep learning model building unit is used for acquiring vector representation of characters by utilizing a pre-trained BERT model in the financial field, and then interacting data and problems by an attention mechanism in a natural language processing method to obtain fusion vector representation which can be understood by a computer;
and the answer organization unit is used for logically combining more than two answers fed back by the deep learning model.
7. The system of machine-readable understanding of financial public opinion research as claimed in claim 6, wherein: and a screening threshold value is set in the data formulating and collecting unit and is used for screening the key problems and the common problems of the predefined problem set.
8. The system of machine-readable understanding of financial public opinion research as claimed in claim 6, wherein: the training data labeling unit also comprises a labeling module used for labeling the part of data which is not found out to be relevant to the questions in the predefined question set with a zero answer set.
CN202110748656.1A 2021-07-02 2021-07-02 Machine reading understanding method and system for financial public opinion research report Active CN113505207B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110748656.1A CN113505207B (en) 2021-07-02 2021-07-02 Machine reading understanding method and system for financial public opinion research report

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110748656.1A CN113505207B (en) 2021-07-02 2021-07-02 Machine reading understanding method and system for financial public opinion research report

Publications (2)

Publication Number Publication Date
CN113505207A true CN113505207A (en) 2021-10-15
CN113505207B CN113505207B (en) 2024-02-20

Family

ID=78009840

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110748656.1A Active CN113505207B (en) 2021-07-02 2021-07-02 Machine reading understanding method and system for financial public opinion research report

Country Status (1)

Country Link
CN (1) CN113505207B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114330718A (en) * 2021-12-23 2022-04-12 北京百度网讯科技有限公司 Method and device for extracting causal relationship and electronic equipment

Citations (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140006012A1 (en) * 2012-07-02 2014-01-02 Microsoft Corporation Learning-Based Processing of Natural Language Questions
CN108304372A (en) * 2017-09-29 2018-07-20 腾讯科技(深圳)有限公司 Entity extraction method and apparatus, computer equipment and storage medium
CN109492076A (en) * 2018-09-20 2019-03-19 西安交通大学 A kind of network-based community's question and answer website answer credible evaluation method
CN111177326A (en) * 2020-04-10 2020-05-19 深圳壹账通智能科技有限公司 Key information extraction method and device based on fine labeling text and storage medium
US20200193153A1 (en) * 2018-12-17 2020-06-18 Cognition IP Technology Inc. Multi-segment text search using machine learning model for text similarity
CN111414461A (en) * 2020-01-20 2020-07-14 福州大学 Intelligent question-answering method and system fusing knowledge base and user modeling
CN111415740A (en) * 2020-02-12 2020-07-14 东北大学 Method and device for processing inquiry information, storage medium and computer equipment
CN111611361A (en) * 2020-04-01 2020-09-01 西南电子技术研究所(中国电子科技集团公司第十研究所) Intelligent reading, understanding, question answering system of extraction type machine
CN111708873A (en) * 2020-06-15 2020-09-25 腾讯科技(深圳)有限公司 Intelligent question answering method and device, computer equipment and storage medium
CN112100344A (en) * 2020-08-18 2020-12-18 淮阴工学院 Financial field knowledge question-answering method based on knowledge graph
CN112101423A (en) * 2020-08-22 2020-12-18 上海昌投网络科技有限公司 Multi-model fused FAQ matching method and device
CN112541052A (en) * 2020-12-01 2021-03-23 北京百度网讯科技有限公司 Method, device, equipment and storage medium for determining answer of question
KR20210033782A (en) * 2019-09-19 2021-03-29 에스케이텔레콤 주식회사 System and Method for Robust and Scalable Dialogue
WO2021082953A1 (en) * 2019-10-29 2021-05-06 平安科技(深圳)有限公司 Machine reading understanding method and apparatus, storage medium, and device
CN112800203A (en) * 2021-02-05 2021-05-14 江苏实达迪美数据处理有限公司 Question-answer matching method and system fusing text representation and knowledge representation

Patent Citations (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140006012A1 (en) * 2012-07-02 2014-01-02 Microsoft Corporation Learning-Based Processing of Natural Language Questions
CN108304372A (en) * 2017-09-29 2018-07-20 腾讯科技(深圳)有限公司 Entity extraction method and apparatus, computer equipment and storage medium
CN109492076A (en) * 2018-09-20 2019-03-19 西安交通大学 A kind of network-based community's question and answer website answer credible evaluation method
US20200193153A1 (en) * 2018-12-17 2020-06-18 Cognition IP Technology Inc. Multi-segment text search using machine learning model for text similarity
KR20210033782A (en) * 2019-09-19 2021-03-29 에스케이텔레콤 주식회사 System and Method for Robust and Scalable Dialogue
WO2021082953A1 (en) * 2019-10-29 2021-05-06 平安科技(深圳)有限公司 Machine reading understanding method and apparatus, storage medium, and device
CN111414461A (en) * 2020-01-20 2020-07-14 福州大学 Intelligent question-answering method and system fusing knowledge base and user modeling
CN111415740A (en) * 2020-02-12 2020-07-14 东北大学 Method and device for processing inquiry information, storage medium and computer equipment
CN111611361A (en) * 2020-04-01 2020-09-01 西南电子技术研究所(中国电子科技集团公司第十研究所) Intelligent reading, understanding, question answering system of extraction type machine
CN111177326A (en) * 2020-04-10 2020-05-19 深圳壹账通智能科技有限公司 Key information extraction method and device based on fine labeling text and storage medium
CN111708873A (en) * 2020-06-15 2020-09-25 腾讯科技(深圳)有限公司 Intelligent question answering method and device, computer equipment and storage medium
CN112100344A (en) * 2020-08-18 2020-12-18 淮阴工学院 Financial field knowledge question-answering method based on knowledge graph
CN112101423A (en) * 2020-08-22 2020-12-18 上海昌投网络科技有限公司 Multi-model fused FAQ matching method and device
CN112541052A (en) * 2020-12-01 2021-03-23 北京百度网讯科技有限公司 Method, device, equipment and storage medium for determining answer of question
CN112800203A (en) * 2021-02-05 2021-05-14 江苏实达迪美数据处理有限公司 Question-answer matching method and system fusing text representation and knowledge representation

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
毛先领 等: "问答***研究综述", 《计算机科学与探索》, vol. 6, no. 3, pages 193 - 207 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114330718A (en) * 2021-12-23 2022-04-12 北京百度网讯科技有限公司 Method and device for extracting causal relationship and electronic equipment

Also Published As

Publication number Publication date
CN113505207B (en) 2024-02-20

Similar Documents

Publication Publication Date Title
CN107832400B (en) A kind of method that location-based LSTM and CNN conjunctive model carries out relationship classification
US20220147836A1 (en) Method and device for text-enhanced knowledge graph joint representation learning
CN111143576A (en) Event-oriented dynamic knowledge graph construction method and device
CN103049435B (en) Text fine granularity sentiment analysis method and device
CN110750635B (en) French recommendation method based on joint deep learning model
CN112148832B (en) Event detection method of dual self-attention network based on label perception
CN111797241B (en) Event Argument Extraction Method and Device Based on Reinforcement Learning
CN101710343A (en) Body automatic build system and method based on text mining
CN110321563A (en) Text emotion analysis method based on mixing monitor model
CN113051914A (en) Enterprise hidden label extraction method and device based on multi-feature dynamic portrait
Zhang et al. History-based article quality assessment on Wikipedia
CN114896386A (en) Film comment semantic emotion analysis method and system based on BilSTM
Chen et al. Research on automatic essay scoring of composition based on CNN and OR
CN116775872A (en) Text processing method and device, electronic equipment and storage medium
CN110162651B (en) News content image-text disagreement identification system and identification method based on semantic content abstract
CN111831624A (en) Data table creating method and device, computer equipment and storage medium
CN117236676A (en) RPA process mining method and device based on multi-mode event extraction
CN115759119A (en) Financial text emotion analysis method, system, medium and equipment
CN111400449A (en) Regular expression extraction method and device
CN113505207A (en) Machine reading understanding method and system for financial public opinion research and report
CN117574898A (en) Domain knowledge graph updating method and system based on power grid equipment
CN114970553B (en) Information analysis method and device based on large-scale unmarked corpus and electronic equipment
CN116127954A (en) Dictionary-based new work specialized Chinese knowledge concept extraction method
CN113779994B (en) Element extraction method, element extraction device, computer equipment and storage medium
CN115391522A (en) Text topic modeling method and system based on social platform metadata

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant