CN113836884A - Official document template recommendation method and system - Google Patents

Official document template recommendation method and system Download PDF

Info

Publication number
CN113836884A
CN113836884A CN202111105392.4A CN202111105392A CN113836884A CN 113836884 A CN113836884 A CN 113836884A CN 202111105392 A CN202111105392 A CN 202111105392A CN 113836884 A CN113836884 A CN 113836884A
Authority
CN
China
Prior art keywords
official document
official
document
template
historical data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111105392.4A
Other languages
Chinese (zh)
Inventor
周剑明
林俊德
陈立峰
林诚汉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujia Newland Software Engineering Co ltd
Original Assignee
Fujia Newland Software Engineering Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujia Newland Software Engineering Co ltd filed Critical Fujia Newland Software Engineering Co ltd
Priority to CN202111105392.4A priority Critical patent/CN113836884A/en
Publication of CN113836884A publication Critical patent/CN113836884A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • G06F40/186Templates
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Biomedical Technology (AREA)
  • Computing Systems (AREA)
  • Molecular Biology (AREA)
  • Evolutionary Computation (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Biophysics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention provides a method and a system for recommending an official document template in the technical field of official document processing, wherein the method comprises the following steps: step S10, obtaining official document historical data and preprocessing the official document historical data; step S20, creating an official document template extraction model, and extracting a plurality of official document templates from official document historical data by using the official document template extraction model; step S30, creating and training an official document template matching model for matching the official document template and the official document title; and step S40, acquiring a new official document title input by the user, matching a corresponding official document template based on the new official document title and the official document template matching model, and completing official document template recommendation. The invention has the advantages that: the efficiency and the quality of official document writing have greatly been promoted.

Description

Official document template recommendation method and system
Technical Field
The invention relates to the technical field of official document processing, in particular to an official document template recommendation method and system.
Background
The official document writing has unique writing format and writing content, the writing format of rigorous professional can reflect the familiarity of the writer with the service, and the official document content can effectively exert the due information transfer utility; the writer needs to keep continuous learning input and writing output of the updated contents of the related fields for a long time to ensure the quality of the written contents.
At present, in the official document writing process, there are often pain spots such as complicated format adjustment, difficult searching of writing materials, and easy careless omission of character proofreading, and writers often need to consume a large amount of time on format typesetting and content writing, resulting in low official document processing efficiency, and then resulting in untimely work transaction processing, affecting the whole efficiency of enterprise organization operation, and increasing the labor cost of enterprises.
Therefore, how to provide a method and a system for recommending a document template to improve the efficiency and quality of document writing becomes an urgent problem to be solved.
Disclosure of Invention
The technical problem to be solved by the invention is to provide a method and a system for recommending a document template, so that the efficiency and the quality of document writing are improved.
In a first aspect, the present invention provides a method for recommending an official document template, comprising the following steps:
step S10, obtaining official document historical data and preprocessing the official document historical data;
step S20, creating an official document template extraction model, and extracting a plurality of official document templates from official document historical data by using the official document template extraction model;
step S30, creating and training an official document template matching model for matching the official document template and the official document title;
and step S40, acquiring a new official document title input by the user, matching a corresponding official document template based on the new official document title and the official document template matching model, and completing official document template recommendation.
Further, the step S10 is specifically:
acquiring official document historical data from an official document database, and carrying out preprocessing including data cleaning and data standardization on the official document historical data;
the official document historical data at least comprises an official document ID, a department ID, an official document title, a text, a sender, a receiver, receiving time and processing time;
the data cleaning specifically comprises the steps of carrying out validity check on the historical data of the official document, eliminating illegal characters and removing duplicate of the official document;
the data standardization is specifically to unify the official document historical data by official document coding, official document format and official document type.
Further, the step S20 is specifically:
creating an official document template extraction model based on a neural network, and performing paradigm training including pre-training and fine-tuning on the official document template extraction model by utilizing the preprocessed official document historical data;
and extracting a plurality of official document templates from official document historical data by using the trained official document template extraction model, and storing each official document template into a template database.
Further, the step S30 is specifically:
and establishing a document template matching model for matching the document template and the document titles based on a neural network, extracting a plurality of document titles from the document historical data to form a document title training set, and performing pre-training and fine-tuning paradigm training on the document template matching model by using the document title training set.
Further, the step S40 is specifically:
acquiring new official document titles input by a user, matching the new official document titles with official document titles in official document historical data by using a BM25 algorithm, screening N official document titles with highest similarity, inputting the official document titles with the highest similarity into an official document template matching model respectively to match with corresponding official document templates, and completing official document template recommendation.
In a second aspect, the present invention provides a document template recommendation system, including the following modules:
the system comprises a document historical data preprocessing module, a document historical data processing module and a document data processing module, wherein the document historical data preprocessing module is used for acquiring document historical data and preprocessing the document historical data;
the official document template generating module is used for creating an official document template extraction model and extracting a plurality of official document templates from official document historical data by using the official document template extraction model;
the official document template matching model creating module is used for creating and training an official document template matching model used for matching the official document template with the official document title;
and the official document template recommendation module is used for acquiring a new official document title input by a user, matching a corresponding official document template based on the new official document title and the official document template matching model, and completing official document template recommendation.
Further, the official document historical data preprocessing module specifically comprises:
acquiring official document historical data from an official document database, and carrying out preprocessing including data cleaning and data standardization on the official document historical data;
the official document historical data at least comprises an official document ID, a department ID, an official document title, a text, a sender, a receiver, receiving time and processing time;
the data cleaning specifically comprises the steps of carrying out validity check on the historical data of the official document, eliminating illegal characters and removing duplicate of the official document;
the data standardization is specifically to unify the official document historical data by official document coding, official document format and official document type.
Further, the official document template generation module specifically is:
creating an official document template extraction model based on a neural network, and performing paradigm training including pre-training and fine-tuning on the official document template extraction model by utilizing the preprocessed official document historical data;
and extracting a plurality of official document templates from official document historical data by using the trained official document template extraction model, and storing each official document template into a template database.
Further, the official document template matching model creation module specifically is:
and establishing a document template matching model for matching the document template and the document titles based on a neural network, extracting a plurality of document titles from the document historical data to form a document title training set, and performing pre-training and fine-tuning paradigm training on the document template matching model by using the document title training set.
Further, the official document template recommendation module specifically comprises:
acquiring new official document titles input by a user, matching the new official document titles with official document titles in official document historical data by using a BM25 algorithm, screening N official document titles with highest similarity, inputting the official document titles with the highest similarity into an official document template matching model respectively to match with corresponding official document templates, and completing official document template recommendation.
The invention has the advantages that:
by creating the official document template extraction model and the official document template matching model, and performing paradigm training including pre-training and fine-tuning on the official document template extraction model and the official document template matching model, the official document template extraction model can more accurately identify entity information, so that a high-quality official document template is generated, the official document template matching model can better match official document titles and official document templates, a BM25 algorithm is used for matching new official document titles input by a user and official document titles in official document historical data, the official document title with the highest similarity is selected and input into the official document template matching model, so that the corresponding official document template can be matched immediately, the efficiency and the quality of official document template recommendation are greatly improved, a writer can quickly write by using the recommended high-quality official document template, the efficiency and the quality of official document writing are greatly improved, and the official document processing efficiency is finally improved, the whole efficiency of enterprise's mechanism operation is promoted, the cost of labor of enterprise is reduced.
Drawings
The invention will be further described with reference to the following examples with reference to the accompanying drawings.
FIG. 1 is a flowchart of a document template recommendation method according to the present invention.
FIG. 2 is a schematic structural diagram of an official document template recommendation system according to the present invention.
Detailed Description
The technical scheme in the embodiment of the application has the following general idea: creating an official document template extraction model to extract an official document template from official document historical data, creating an official document template matching model to match the official document template and the official document title, matching a new official document title input by a user with the official document title in the official document historical data by utilizing a BM25 algorithm, screening out the official document template with the highest similarity to input the official document title and match the corresponding official document template in the official document template matching model, and recommending the official document template so as to improve the efficiency and quality of recommending the official document template and further improve the efficiency and quality of writing the official document.
Referring to fig. 1 to 2, a preferred embodiment of a document template recommendation method according to the present invention includes the following steps:
step S10, obtaining official document historical data and preprocessing the official document historical data;
step S20, creating an official document template extraction model, and extracting a plurality of official document templates from official document historical data by using the official document template extraction model;
step S30, creating and training an official document template matching model for matching the official document template and the official document title; performing semantic matching on the official document template and the official document title through the official document template matching model;
and step S40, acquiring a new official document title input by the user, matching a corresponding official document template based on the new official document title and the official document template matching model, and completing official document template recommendation.
The method comprises the steps that a user can immediately obtain a high-quality recommended official document template only by inputting a new official document title, and then the official document template is used for fast writing.
The step S10 specifically includes:
acquiring official document historical data from an official document database, and carrying out preprocessing including data cleaning and data standardization on the official document historical data;
the official document historical data at least comprises an official document ID, a department ID, an official document title, a text, a sender, a receiver, receiving time and processing time;
the data cleaning specifically comprises the steps of carrying out validity check on the historical data of the official document, eliminating illegal characters and removing duplicate of the official document;
the data standardization is specifically to unify the official document historical data by official document coding, official document format and official document type.
The step S20 specifically includes:
creating an official document template extraction model based on a neural network, and performing paradigm training including pre-training and fine-tuning on the official document template extraction model by utilizing the preprocessed official document historical data;
and extracting a plurality of official document templates from official document historical data by using the trained official document template extraction model, and storing each official document template into a template database.
The official document template extraction model is used for identifying entity information including names of people, place names, mechanism names and proper nouns in the text, replacing the identified entity information with spaces, and further merging and normalizing the text to generate an official document template; merging rules comprise continuous space merging, invalid word merging, html label rules and paragraph label rules.
Pre-training (pre-training/trained) refers to a model that is pre-trained or refers to a process of pre-training a model; fine-tuning (fine-tuning) refers to the process of applying a pre-trained model to its own data set to adapt the parameters to its own data set.
As most people lack enough data sets during model training, and overfitting is easily caused if the model is trained from the beginning, the method of pre-training and fine-tuning is adopted, so that time and computing resources are saved, and a better effect can be achieved quickly.
The step S30 specifically includes:
and establishing a document template matching model for matching the document template and the document titles based on a neural network, extracting a plurality of document titles from the document historical data to form a document title training set, and performing pre-training and fine-tuning paradigm training on the document template matching model by using the document title training set.
Extracting the official document titles in the official document historical data according to the format of 'title \ tsim _ title \ tdissim _ title' to form an official document title training set; wherein, title, sim _ title and dissim _ title are space word segmentation official document titles, which are segmented by extracting a model through the official document template, and are separated by a tab character '\ t'; sim _ title represents a positive case similar to the title, and dissim _ title represents a random negative case dissimilar to the title; and (3) generating a short text semantic matching document template matching model (SimNet model) special for the document title field by adopting the paradigm training of 'pre-training + similar fine tuning'.
The step S40 specifically includes:
acquiring new official document titles input by a user, matching the new official document titles with official document titles in official document historical data by utilizing a BM25 algorithm, screening N official document titles with highest similarity, respectively inputting the official document titles with the highest similarity into an official document template matching model to match with corresponding official document templates, and returning the matched official document templates through official document IDs to complete official document template recommendation. Namely, the BM25 algorithm is used for realizing rough recall, and the similar official document titles of topN are quickly searched in the official document historical data and returned for the new official document title.
The invention discloses a preferred embodiment of a document template recommendation system, which comprises the following modules:
the system comprises a document historical data preprocessing module, a document historical data processing module and a document data processing module, wherein the document historical data preprocessing module is used for acquiring document historical data and preprocessing the document historical data;
the official document template generating module is used for creating an official document template extraction model and extracting a plurality of official document templates from official document historical data by using the official document template extraction model;
the official document template matching model creating module is used for creating and training an official document template matching model used for matching the official document template with the official document title; performing semantic matching on the official document template and the official document title through the official document template matching model;
and the official document template recommendation module is used for acquiring a new official document title input by a user, matching a corresponding official document template based on the new official document title and the official document template matching model, and completing official document template recommendation.
The method comprises the steps that a user can immediately obtain a high-quality recommended official document template only by inputting a new official document title, and then the official document template is used for fast writing.
The official document historical data preprocessing module specifically comprises:
acquiring official document historical data from an official document database, and carrying out preprocessing including data cleaning and data standardization on the official document historical data;
the official document historical data at least comprises an official document ID, a department ID, an official document title, a text, a sender, a receiver, receiving time and processing time;
the data cleaning specifically comprises the steps of carrying out validity check on the historical data of the official document, eliminating illegal characters and removing duplicate of the official document;
the data standardization is specifically to unify the official document historical data by official document coding, official document format and official document type.
The official document template generation module specifically comprises:
creating an official document template extraction model based on a neural network, and performing paradigm training including pre-training and fine-tuning on the official document template extraction model by utilizing the preprocessed official document historical data;
and extracting a plurality of official document templates from official document historical data by using the trained official document template extraction model, and storing each official document template into a template database.
The official document template extraction model is used for identifying entity information including names of people, place names, mechanism names and proper nouns in the text, replacing the identified entity information with spaces, and further merging and normalizing the text to generate an official document template; merging rules comprise continuous space merging, invalid word merging, html label rules and paragraph label rules.
Pre-training (pre-training/trained) refers to a model that is pre-trained or refers to a process of pre-training a model; fine-tuning (fine-tuning) refers to the process of applying a pre-trained model to its own data set to adapt the parameters to its own data set.
As most people lack enough data sets during model training, and overfitting is easily caused if the model is trained from the beginning, the method of pre-training and fine-tuning is adopted, so that time and computing resources are saved, and a better effect can be achieved quickly.
The official document template matching model creating module specifically comprises:
and establishing a document template matching model for matching the document template and the document titles based on a neural network, extracting a plurality of document titles from the document historical data to form a document title training set, and performing pre-training and fine-tuning paradigm training on the document template matching model by using the document title training set.
Extracting the official document titles in the official document historical data according to the format of 'title \ tsim _ title \ tdissim _ title' to form an official document title training set; wherein, title, sim _ title and dissim _ title are space word segmentation official document titles, which are segmented by extracting a model through the official document template, and are separated by a tab character '\ t'; sim _ title represents a positive case similar to the title, and dissim _ title represents a random negative case dissimilar to the title; and (3) generating a short text semantic matching document template matching model (SimNet model) special for the document title field by adopting the paradigm training of 'pre-training + similar fine tuning'.
The official document template recommendation module specifically comprises:
acquiring new official document titles input by a user, matching the new official document titles with official document titles in official document historical data by utilizing a BM25 algorithm, screening N official document titles with highest similarity, respectively inputting the official document titles with the highest similarity into an official document template matching model to match with corresponding official document templates, and returning the matched official document templates through official document IDs to complete official document template recommendation. Namely, the BM25 algorithm is used for realizing rough recall, and the similar official document titles of topN are quickly searched in the official document historical data and returned for the new official document title.
In summary, the invention has the advantages that:
by creating the official document template extraction model and the official document template matching model, and performing paradigm training including pre-training and fine-tuning on the official document template extraction model and the official document template matching model, the official document template extraction model can more accurately identify entity information, so that a high-quality official document template is generated, the official document template matching model can better match official document titles and official document templates, a BM25 algorithm is used for matching new official document titles input by a user and official document titles in official document historical data, the official document title with the highest similarity is selected and input into the official document template matching model, so that the corresponding official document template can be matched immediately, the efficiency and the quality of official document template recommendation are greatly improved, a writer can quickly write by using the recommended high-quality official document template, the efficiency and the quality of official document writing are greatly improved, and the official document processing efficiency is finally improved, the whole efficiency of enterprise's mechanism operation is promoted, the cost of labor of enterprise is reduced.
Although specific embodiments of the invention have been described above, it will be understood by those skilled in the art that the specific embodiments described are illustrative only and are not limiting upon the scope of the invention, and that equivalent modifications and variations can be made by those skilled in the art without departing from the spirit of the invention, which is to be limited only by the appended claims.

Claims (10)

1. An official document template recommendation method is characterized in that: the method comprises the following steps:
step S10, obtaining official document historical data and preprocessing the official document historical data;
step S20, creating an official document template extraction model, and extracting a plurality of official document templates from official document historical data by using the official document template extraction model;
step S30, creating and training an official document template matching model for matching the official document template and the official document title;
and step S40, acquiring a new official document title input by the user, matching a corresponding official document template based on the new official document title and the official document template matching model, and completing official document template recommendation.
2. The official document template recommendation method of claim 1, characterized in that: the step S10 specifically includes:
acquiring official document historical data from an official document database, and carrying out preprocessing including data cleaning and data standardization on the official document historical data;
the official document historical data at least comprises an official document ID, a department ID, an official document title, a text, a sender, a receiver, receiving time and processing time;
the data cleaning specifically comprises the steps of carrying out validity check on the historical data of the official document, eliminating illegal characters and removing duplicate of the official document;
the data standardization is specifically to unify the official document historical data by official document coding, official document format and official document type.
3. The official document template recommendation method of claim 1, characterized in that: the step S20 specifically includes:
creating an official document template extraction model based on a neural network, and performing paradigm training including pre-training and fine-tuning on the official document template extraction model by utilizing the preprocessed official document historical data;
and extracting a plurality of official document templates from official document historical data by using the trained official document template extraction model, and storing each official document template into a template database.
4. The official document template recommendation method of claim 1, characterized in that: the step S30 specifically includes:
and establishing a document template matching model for matching the document template and the document titles based on a neural network, extracting a plurality of document titles from the document historical data to form a document title training set, and performing pre-training and fine-tuning paradigm training on the document template matching model by using the document title training set.
5. The official document template recommendation method of claim 1, characterized in that: the step S40 specifically includes:
acquiring new official document titles input by a user, matching the new official document titles with official document titles in official document historical data by using a BM25 algorithm, screening N official document titles with highest similarity, inputting the official document titles with the highest similarity into an official document template matching model respectively to match with corresponding official document templates, and completing official document template recommendation.
6. An official document template recommendation system is characterized in that: the system comprises the following modules:
the system comprises a document historical data preprocessing module, a document historical data processing module and a document data processing module, wherein the document historical data preprocessing module is used for acquiring document historical data and preprocessing the document historical data;
the official document template generating module is used for creating an official document template extraction model and extracting a plurality of official document templates from official document historical data by using the official document template extraction model;
the official document template matching model creating module is used for creating and training an official document template matching model used for matching the official document template with the official document title;
and the official document template recommendation module is used for acquiring a new official document title input by a user, matching a corresponding official document template based on the new official document title and the official document template matching model, and completing official document template recommendation.
7. The official document template recommendation system of claim 6, characterized in that: the official document historical data preprocessing module specifically comprises:
acquiring official document historical data from an official document database, and carrying out preprocessing including data cleaning and data standardization on the official document historical data;
the official document historical data at least comprises an official document ID, a department ID, an official document title, a text, a sender, a receiver, receiving time and processing time;
the data cleaning specifically comprises the steps of carrying out validity check on the historical data of the official document, eliminating illegal characters and removing duplicate of the official document;
the data standardization is specifically to unify the official document historical data by official document coding, official document format and official document type.
8. The official document template recommendation system of claim 6, characterized in that: the official document template generation module specifically comprises:
creating an official document template extraction model based on a neural network, and performing paradigm training including pre-training and fine-tuning on the official document template extraction model by utilizing the preprocessed official document historical data;
and extracting a plurality of official document templates from official document historical data by using the trained official document template extraction model, and storing each official document template into a template database.
9. The official document template recommendation system of claim 6, characterized in that: the official document template matching model creating module specifically comprises:
and establishing a document template matching model for matching the document template and the document titles based on a neural network, extracting a plurality of document titles from the document historical data to form a document title training set, and performing pre-training and fine-tuning paradigm training on the document template matching model by using the document title training set.
10. The official document template recommendation system of claim 6, characterized in that: the official document template recommendation module specifically comprises:
acquiring new official document titles input by a user, matching the new official document titles with official document titles in official document historical data by using a BM25 algorithm, screening N official document titles with highest similarity, inputting the official document titles with the highest similarity into an official document template matching model respectively to match with corresponding official document templates, and completing official document template recommendation.
CN202111105392.4A 2021-09-22 2021-09-22 Official document template recommendation method and system Pending CN113836884A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111105392.4A CN113836884A (en) 2021-09-22 2021-09-22 Official document template recommendation method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111105392.4A CN113836884A (en) 2021-09-22 2021-09-22 Official document template recommendation method and system

Publications (1)

Publication Number Publication Date
CN113836884A true CN113836884A (en) 2021-12-24

Family

ID=78960218

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111105392.4A Pending CN113836884A (en) 2021-09-22 2021-09-22 Official document template recommendation method and system

Country Status (1)

Country Link
CN (1) CN113836884A (en)

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2019061525A (en) * 2017-09-27 2019-04-18 トッパン・フォームズ株式会社 Content recommendation system, content recommendation method, and program
CN109657223A (en) * 2018-12-18 2019-04-19 安徽省泰岳祥升软件有限公司 Automatic writing method and device for official document
CN110162607A (en) * 2019-02-20 2019-08-23 北京捷风数据技术有限公司 A kind of government organization document information retroactive method and device based on convolutional neural networks
US20190392066A1 (en) * 2018-06-26 2019-12-26 Adobe Inc. Semantic Analysis-Based Query Result Retrieval for Natural Language Procedural Queries
CN111522948A (en) * 2020-04-22 2020-08-11 中电科新型智慧城市研究院有限公司 Method and system for intelligently processing official document
CN112560419A (en) * 2020-12-08 2021-03-26 中国电子科技集团公司第十五研究所 Automatic document generation method and system
CN112597274A (en) * 2020-12-18 2021-04-02 深圳市彬讯科技有限公司 Document determination method, device, equipment and storage medium based on BM25 algorithm
CN113239181A (en) * 2021-05-14 2021-08-10 廖伟智 Scientific and technological literature citation recommendation method based on deep learning
CN113254574A (en) * 2021-03-15 2021-08-13 河北地质大学 Method, device and system for auxiliary generation of customs official documents

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2019061525A (en) * 2017-09-27 2019-04-18 トッパン・フォームズ株式会社 Content recommendation system, content recommendation method, and program
US20190392066A1 (en) * 2018-06-26 2019-12-26 Adobe Inc. Semantic Analysis-Based Query Result Retrieval for Natural Language Procedural Queries
CN109657223A (en) * 2018-12-18 2019-04-19 安徽省泰岳祥升软件有限公司 Automatic writing method and device for official document
CN110162607A (en) * 2019-02-20 2019-08-23 北京捷风数据技术有限公司 A kind of government organization document information retroactive method and device based on convolutional neural networks
CN111522948A (en) * 2020-04-22 2020-08-11 中电科新型智慧城市研究院有限公司 Method and system for intelligently processing official document
CN112560419A (en) * 2020-12-08 2021-03-26 中国电子科技集团公司第十五研究所 Automatic document generation method and system
CN112597274A (en) * 2020-12-18 2021-04-02 深圳市彬讯科技有限公司 Document determination method, device, equipment and storage medium based on BM25 algorithm
CN113254574A (en) * 2021-03-15 2021-08-13 河北地质大学 Method, device and system for auxiliary generation of customs official documents
CN113239181A (en) * 2021-05-14 2021-08-10 廖伟智 Scientific and technological literature citation recommendation method based on deep learning

Similar Documents

Publication Publication Date Title
US10783367B2 (en) System and method for data extraction and searching
US10963692B1 (en) Deep learning based document image embeddings for layout classification and retrieval
US20130159277A1 (en) Target based indexing of micro-blog content
CN112395410B (en) Entity extraction-based industry public opinion recommendation method and device and electronic equipment
US10713425B2 (en) System and method for generating a proposal based on a request for proposal (RFP)
CN113704667A (en) Automatic extraction processing method and device for bidding announcement
EP4141818A1 (en) Document digitization, transformation and validation
CN116484808A (en) Method and device for generating controllable text for official document
CN113283984A (en) Personal loan information input method and device
CN116384344A (en) Document conversion method, device and storage medium
CN117095419A (en) PDF document data processing and information extracting device and method
CN113836884A (en) Official document template recommendation method and system
Prakash et al. Information extraction in unstructured multilingual web documents
Tarride et al. Handwritten text recognition from crowdsourced annotations
CN116306506A (en) Intelligent mail template method based on content identification
CN114495138A (en) Intelligent document identification and feature extraction method, device platform and storage medium
Romero et al. A historical document handwriting transcription end-to-end system
US20230308731A1 (en) Method for providing service of producing multimedia conversion content by using image resource matching, and apparatus thereof
Lam et al. Information Extraction from Documents: Question Answering Vs Token Classification in Real-World Setups
CN117708434B (en) Keyword-based user recommendation browsing content generation method
CN114169317B (en) Rule-based character attribute extraction method and system
CN116681042B (en) Content summary generation method, system and medium based on keyword extraction
Adam et al. Data extraction and matching The EurHisFirm experience
CN114528460A (en) Credit risk monitoring method for behavior of body losing
Shukla et al. A Review Mostly on Identification of Early Tamil Extracts Actors Through Historical Writing

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination