CN113641695A - Character extraction method, device, equipment and medium for database statement - Google Patents

Character extraction method, device, equipment and medium for database statement Download PDF

Info

Publication number
CN113641695A
CN113641695A CN202110920713.XA CN202110920713A CN113641695A CN 113641695 A CN113641695 A CN 113641695A CN 202110920713 A CN202110920713 A CN 202110920713A CN 113641695 A CN113641695 A CN 113641695A
Authority
CN
China
Prior art keywords
character
identifier
target
character string
sql statement
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110920713.XA
Other languages
Chinese (zh)
Other versions
CN113641695B (en
Inventor
唐山
范渊
刘博�
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
DBAPPSecurity Co Ltd
Original Assignee
DBAPPSecurity Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by DBAPPSecurity Co Ltd filed Critical DBAPPSecurity Co Ltd
Priority to CN202110920713.XA priority Critical patent/CN113641695B/en
Publication of CN113641695A publication Critical patent/CN113641695A/en
Application granted granted Critical
Publication of CN113641695B publication Critical patent/CN113641695B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/242Query formulation
    • G06F16/2433Query languages

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The application discloses a character extraction method, a device, equipment and a medium of database sentences, wherein the method comprises the following steps: when a target character string in a target SQL sentence is to be extracted, setting a corresponding regular expression according to the target character string; if the screening character string meeting the regular expression exists in the target SQL statement, adding a first identifier and a second identifier at the initial position and the end position of the screening character string respectively to obtain a modified SQL statement; character-by-character matching is carried out on the modified SQL statement; and when the first identifier and the second identifier in the modified SQL statement are matched, extracting the character string between the first identifier and the second identifier to obtain a target character string. Obviously, the method can convert the target character string in the target SQL sentence into the character string which can be identified by the character-by-character matching rule, so that the target character string which can not be identified in the target SQL sentence can be extracted, and the purpose of extracting the special content characters from the SQL sentence is achieved.

Description

Character extraction method, device, equipment and medium for database statement
Technical Field
The invention relates to the technical field of computers, in particular to a method, a device, equipment and a medium for extracting characters of database sentences.
Background
SQL (Structured Query Language) statements are a Query and programming Language for databases. In the prior art, if a target character string in an SQL statement is to be extracted, character-by-character matching is usually required to be performed on the SQL statement. Such as: if the parsed SQL statement is: when the select from staff hand id is 2, when character 2 is detected, the character 2 is extracted using the corresponding parameter extraction template. However, when some special content characters in the SQL statement need to be extracted, because the special content characters are not character strings that can be identified by the existing character-by-character matching rules, a problem occurs in that the special content characters cannot be identified from the SQL statement. Such as: if the parsed SQL statement is: the special content character "zhang san" cannot be extracted from the SQL statement by using the existing character-by-character matching rule. At present, no effective solution exists for the technical problem.
Disclosure of Invention
In view of the above, the present invention provides a method, an apparatus, a device and a medium for extracting characters from a database statement, so as to extract special content characters from an SQL statement. The specific scheme is as follows:
a character extraction method of a database statement comprises the following steps:
when a target character string in a target SQL statement is to be extracted, setting a corresponding regular expression according to the target character string, and performing rule matching on the target SQL statement by using the regular expression;
if the screening character string meeting the regular expression exists in the target SQL statement, adding a first identifier and a second identifier at the initial position and the end position of the screening character string respectively to obtain a modified SQL statement; the first identifier and the second identifier are identifiers which are distinguished from any character in the target SQL statement;
performing character-by-character matching on the modified SQL statement;
and when the first identifier and the second identifier in the modified SQL statement are matched, extracting the character string between the first identifier and the second identifier to obtain the target character string.
Preferably, the target character string comprises a special character and/or an SQL character string and/or a word.
Preferably, the method further comprises the following steps:
and storing the regular expression.
Preferably, the first identifier and the second identifier are specifically user-defined identifiers.
Preferably, the first identifier and the second identifier are embodied as identifiers occurring in pairs.
Preferably, the method further comprises the following steps:
and if the screening character string meeting the regular expression does not exist in the target SQL sentence, re-executing the step of setting the corresponding regular expression according to the target character string and performing rule matching on the target SQL sentence by using the regular expression until the screening character string meeting the regular expression exists in the target SQL sentence.
Correspondingly, the invention also discloses a character extraction device of the database statement, which comprises the following steps:
the rule matching module is used for setting a corresponding regular expression according to a target character string when the target character string in the target SQL sentence is to be extracted, and performing rule matching on the target SQL sentence by using the regular expression;
the character adding module is used for respectively adding a first identifier and a second identifier at the initial position and the end position of the screening character string to obtain a modified SQL statement if the screening character string meeting the regular expression exists in the target SQL statement; the first identifier and the second identifier are identifiers which are distinguished from any character in the target SQL statement;
the character matching module is used for carrying out character-by-character matching on the modified SQL statement;
and the character extraction module is used for extracting the character string between the first identifier and the second identifier to obtain the target character string when the first identifier and the second identifier exist in the modified SQL statement.
Correspondingly, the invention also discloses a character extraction device of the database statement, which comprises the following steps:
a memory for storing a computer program;
a processor for implementing the steps of a character extraction method of a database statement as disclosed in the foregoing when executing the computer program.
Accordingly, the present invention also discloses a computer readable storage medium having stored thereon a computer program which, when being executed by a processor, implements the steps of a method for character extraction of a database statement as disclosed in the foregoing.
It can be seen that, in the present invention, when a target character string in a target SQL statement needs to be extracted, a corresponding regular expression is first set according to the target character string, and the regular expression is used to perform rule matching on the target SQL statement; if the screening character string meeting the regular expression exists in the target SQL statement, adding a first identifier and a second identifier at the initial position and the end position of the screening character string respectively to obtain a modified SQL statement; the first identifier and the second identifier are identifiers which are different from any character in the target SQL statement; and then, carrying out character-by-character matching on the modified SQL statement, and when a first identifier and a second identifier in the modified SQL statement are matched, extracting a character string between the first identifier and the second identifier to extract a target character string in the target SQL statement. Obviously, the method provided by the invention can convert the target character string in the target SQL sentence into the character string which can be identified by the character-by-character matching rule, so that the target character string which can not be identified in the target SQL sentence can be extracted, and the purpose of extracting the special content characters from the SQL sentence is achieved. Correspondingly, the character extraction device, the equipment and the medium for the database sentences also have the beneficial effects.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the provided drawings without creative efforts.
Fig. 1 is a flowchart of a character extraction method for a database statement according to an embodiment of the present invention;
FIG. 2 is a block diagram of a device for extracting characters from a database statement according to an embodiment of the present invention;
fig. 3 is a structural diagram of a character extraction device for a database statement according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Referring to fig. 1, fig. 1 is a flowchart of a character extraction method of a database statement according to an embodiment of the present invention, where the character extraction method includes:
step S11: when a target character string in a target SQL statement is to be extracted, setting a corresponding regular expression according to the target character string, and performing rule matching on the target SQL statement by using the regular expression;
step S12: if the screening character string meeting the regular expression exists in the target SQL statement, adding a first identifier and a second identifier at the initial position and the end position of the screening character string respectively to obtain a modified SQL statement;
the first identifier and the second identifier are identifiers which are different from any character in the target SQL statement;
step S13: character-by-character matching is carried out on the modified SQL statement;
step S14: and when the first identifier and the second identifier in the modified SQL statement are matched, extracting the character string between the first identifier and the second identifier to obtain a target character string.
In this embodiment, a method for extracting characters from an SQL statement is provided, by which special content characters can be extracted from the SQL statement. Specifically, when a target character string in a target SQL statement is to be extracted, a corresponding regular expression is set according to the target character string, and the regular expression is used to perform rule matching on the target SQL statement. Wherein, the target character string is a special content character, such as: 2021-7-211: 11:11, jasmine, cola, etc.
If the screening character string meeting the regular expression exists in the target SQL statement, the regular expression set according to the target character string is adapted to the target character string, in this case, a first identifier and a second identifier need to be added to the initial position and the end position of the screening character string in the target SQL statement respectively, and the modified SQL statement is obtained. The first identifier and the second identifier added to the start position and the end position of the filter string may be any identifiers that can be distinguished from any character in the target SQL statement.
After the first identifier and the second identifier are added to the initial position and the end position of the screened character of the target SQL statement to obtain a modified SQL statement, character-by-character matching can be carried out on the modified SQL statement; after the first identifier and the second identifier are matched in the SQL statement in a character-by-character matching mode, the target character string can be extracted from the position between the first identifier and the second identifier.
Such as: when the target SQL statement is selected from static time, 2020-9-811:4:11, if the target SQL statement is matched by using the character-by-character matching rule in the prior art, the character string "2020-9-811: 4: 11" cannot be extracted from the target SQL statement. In this case, the regular expression may be set according to the character string "2020-9-811: 4: 11" first. The regular expression corresponding to the character string '2020-9-811: 4: 11' in the target SQL statement is as follows: (\ d {2} | \ d {4}) (? ([0] {1} \ d {1} | [1] {1} [0-2] {1}) (? ([0-2] {1} \ d {1} | [3] {1} [0-1] {1}) (? ([0-1] {1} \ d {1} | [2] {1} [0-3] {1}) (? ([0-5] {1} \ d {1}) (? ([0-5] {1} \ d {1}) before character-by-character matching is performed on the target SQL statement, whether a screening character string meeting the regular expression exists in the target SQL statement needs to be detected. If the filtering character string "2020-9-811: 4: 11" satisfying the regular expression exists in the target SQL statement, identifiers &% # and #% &arerespectively added at positions before and after the filtering character string "2020-9-811: 4: 11", and at this time, the target SQL statement becomes a modified SQL statement select &fromstaff time &% # 2020-9-811:4:11 #% &.
And then, performing character-by-character analysis on the modified SQL statement according to the existing character-by-character matching rule, and when the identifiers of &% # and #% & exist, extracting the character string between the identifiers of &% # and #% & in the modified SQL statement, so as to extract the character string '2020-9-811: 4: 11' from the modified SQL statement.
In addition, in practical applications, the method provided by this embodiment may also be used to analyze the SQL statement select from staff name, since the existing character-by-character matching rule cannot be used to extract "three pages" in the SQL statement. Specifically, when the SQL statement select from stop work where name is three, a regular expression corresponding to "three" needs to be set first, and the SQL statement select from stop work where name is three is matched with "three" by using the regular expression corresponding to "three", if a character string of "three" is matched in the SQL statement, then &% & and &% & m% are added before and after "three", so that the SQL statement select from stop work where name is three.
Then, the existing character-by-character matching rule is used for analyzing the SQL statement select from staff hand where name &% & three &% & when the character strings in the SQL statement are identified as &% & and &% & the & +, the character string in the middle of &% & and &% & can be extracted, so that the special character string of 'three' can be extracted from the SQL statement select from staff hand where name &% & three &% & the special character string can be extracted.
It can be understood that, compared with the prior art, the technical solution provided in this embodiment can convert the target character string in the target SQL statement into a character string recognizable by the character-by-character matching rule, so that the special content characters can be extracted from the target SQL statement by the method.
It can be seen that, in this embodiment, when a target character string in a target SQL statement needs to be extracted, first, a corresponding regular expression is set according to the target character string, and the regular expression is used to perform rule matching on the target SQL statement; if the screening character string meeting the regular expression exists in the target SQL statement, adding a first identifier and a second identifier at the initial position and the end position of the screening character string respectively to obtain a modified SQL statement; the first identifier and the second identifier are identifiers which are different from any character in the target SQL statement; and then, carrying out character-by-character matching on the modified SQL statement, and when a first identifier and a second identifier in the modified SQL statement are matched, extracting a character string between the first identifier and the second identifier to extract a target character string in the target SQL statement. Obviously, the method provided by the embodiment can convert the target character string in the target SQL statement into the character string recognizable by the character-by-character matching rule, so that the target character string which cannot be recognized in the target SQL statement can be extracted, and the purpose of extracting the special content characters from the SQL statement is achieved.
Based on the above embodiments, this embodiment further explains and optimizes the technical solution, and as a preferred implementation, the target character string includes a special character and/or an SQL character string and/or a character.
Specifically, in this embodiment, the target character string of the special content character in the target SQL statement may be a special character, or may be an SQL character string or a text, etc. Such as: the target character string may be zhang san, lie si, 2020-9-8, name, etc., as long as the character string cannot be extracted from the target SQL statement by using the existing character-by-character matching rule.
Therefore, by the technical scheme provided by the embodiment, the application range of the SQL statement character extraction method provided by the application can be further widened.
Based on the foregoing embodiment, this embodiment further describes and optimizes the technical solution, and as a preferred implementation, the character extraction method further includes:
and storing the regular expression.
In this embodiment, after the regular expression corresponding to the target character string is set, the regular expression may also be stored in order to expand the template extraction library of the target SQL statement. Therefore, in the subsequent matching process of the character string in the target SQL statement, the regular expression can be directly used for judging whether the character string which is required to be extracted by the user exists in the target SQL statement.
Based on the above embodiment, this embodiment further describes and optimizes the technical solution, and as a preferred implementation, the first identifier and the second identifier are specifically user-defined identifiers.
In this embodiment, when the first identifier and the second identifier are added to the start position and the end position of the screening character string of the target SQL statement, the first identifier and the second identifier may be set according to the code writing habit of the user. The first identifier and the second identifier defined by the user may be the same or different, as long as the first identifier and the second identifier can be distinguished from the characters in the target SQL statement, and character-by-character matching of the target SQL statement is easy.
Obviously, by the technical scheme provided by the embodiment, the flexibility of the user in setting the first identifier and the second identifier can be relatively improved.
Based on the above embodiments, this embodiment further describes and optimizes the technical solution, and as a preferred implementation, the first identifier and the second identifier are specifically identifiers appearing in pairs.
In practical applications, when the first identifier and the second identifier are added to the starting position and the ending position of the screening character string of the target SQL statement, the first identifier and the second identifier may also be set as identifiers that appear in pairs, such as: ", [ sic ], { }, [ sic ], and the like.
It is conceivable that, when the first identifier and the second identifier are set as identifiers appearing in pairs, in the process of performing character-by-character matching on the modified SQL statement, matching and recognition of the paired identifiers appearing in the modified SQL statement is easier, which not only improves the extraction efficiency of the target character string, but also improves the extraction accuracy of the target character string.
Obviously, by the technical scheme provided by the embodiment, the accuracy in extracting the target character string can be further ensured.
Based on the foregoing embodiment, this embodiment further describes and optimizes the technical solution, and as a preferred implementation, the character extraction method further includes:
and if the screening character string meeting the regular expression does not exist in the target SQL sentence, re-executing the step of setting the corresponding regular expression according to the target character string and performing rule matching on the target SQL sentence by using the regular expression until the screening character string meeting the regular expression exists in the target SQL sentence.
In this embodiment, if there is no filtering character string satisfying the regular expression in the target SQL statement, it indicates that the regular expression set according to the target character string is not adapted to the target character string. In this case, in order to ensure smooth proceeding of subsequent steps and to be able to extract a target character string of a special content character from a target SQL statement, the step of setting a corresponding regular expression according to the target character string and performing rule matching on the target SQL statement by using the regular expression needs to be executed again until a screening character string satisfying the regular expression appears in the target SQL statement.
Obviously, the technical scheme provided by the embodiment can further ensure that the target character string of the special content character can be extracted from the target SQL statement.
Referring to fig. 2, fig. 2 is a structural diagram of a character extraction device of a database statement according to an embodiment of the present invention, where the character extraction device includes:
the rule matching module 21 is configured to, when a target character string in a target SQL statement is to be extracted, set a corresponding regular expression according to the target character string, and perform rule matching on the target SQL statement by using the regular expression;
the character adding module 22 is configured to, if a screening character string satisfying the regular expression exists in the target SQL statement, add a first identifier and a second identifier at a start position and an end position of the screening character string, respectively, to obtain a modified SQL statement; the first identifier and the second identifier are identifiers which are different from any character in the target SQL statement;
the character matching module 23 is used for performing character-by-character matching on the modified SQL statement;
and the character extraction module 24 is configured to, when the first identifier and the second identifier exist in the modified SQL statement, extract a character string between the first identifier and the second identifier to obtain a target character string.
The character extraction device for the database sentences, provided by the embodiment of the invention, has the beneficial effects of the character extraction method for the database sentences.
Referring to fig. 3, fig. 3 is a structural diagram of a character extraction device of a database statement according to an embodiment of the present invention, where the character extraction device includes:
a memory 31 for storing a computer program;
a processor 32, for implementing the steps of the character extraction method of the SQL statement as disclosed in the foregoing when executing the computer program.
The character extraction equipment for the database sentences, provided by the embodiment of the invention, has the beneficial effects of the character extraction method for the database sentences.
Correspondingly, the embodiment of the invention also discloses a computer readable storage medium, wherein a computer program is stored on the computer readable storage medium, and when the computer program is executed by a processor, the steps of the character extraction method of the database statement as disclosed in the foregoing are realized.
The computer-readable storage medium provided by the embodiment of the invention has the beneficial effects of the character extraction method of the database statement disclosed in the foregoing.
The embodiments are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same or similar parts among the embodiments are referred to each other. The device disclosed by the embodiment corresponds to the method disclosed by the embodiment, so that the description is simple, and the relevant points can be referred to the method part for description.
Finally, it should also be noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. But also includes other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element.
The method, the device, the equipment and the medium for extracting the characters of the database sentences are introduced in detail, specific examples are applied in the text to explain the principles and the implementation mode of the invention, and the description of the embodiments is only used for helping to understand the method and the core idea of the invention; meanwhile, for a person skilled in the art, according to the idea of the present invention, there may be variations in the specific embodiments and the application scope, and in summary, the content of the present specification should not be construed as a limitation to the present invention.

Claims (9)

1. A character extraction method for a database statement, comprising:
when a target character string in a target SQL statement is to be extracted, setting a corresponding regular expression according to the target character string, and performing rule matching on the target SQL statement by using the regular expression;
if the screening character string meeting the regular expression exists in the target SQL statement, adding a first identifier and a second identifier at the initial position and the end position of the screening character string respectively to obtain a modified SQL statement; the first identifier and the second identifier are identifiers which are distinguished from any character in the target SQL statement;
performing character-by-character matching on the modified SQL statement;
and when the first identifier and the second identifier in the modified SQL statement are matched, extracting the character string between the first identifier and the second identifier to obtain the target character string.
2. The character extraction method according to claim 1, wherein the target character string includes a special character and/or an SQL character string and/or a letter.
3. The character extraction method according to claim 1, further comprising:
and storing the regular expression.
4. The character extraction method according to claim 1, wherein the first identifier and the second identifier are specifically user-defined identifiers.
5. The character extraction method according to claim 1, wherein the first identifier and the second identifier are specifically identifiers that appear in pairs.
6. The character extraction method according to any one of claims 1 to 5, characterized by further comprising:
and if the screening character string meeting the regular expression does not exist in the target SQL sentence, re-executing the step of setting the corresponding regular expression according to the target character string and performing rule matching on the target SQL sentence by using the regular expression until the screening character string meeting the regular expression exists in the target SQL sentence.
7. A character extraction apparatus for a database sentence, comprising:
the rule matching module is used for setting a corresponding regular expression according to a target character string when the target character string in the target SQL sentence is to be extracted, and performing rule matching on the target SQL sentence by using the regular expression;
the character adding module is used for respectively adding a first identifier and a second identifier at the initial position and the end position of the screening character string to obtain a modified SQL statement if the screening character string meeting the regular expression exists in the target SQL statement; the first identifier and the second identifier are identifiers which are distinguished from any character in the target SQL statement;
the character matching module is used for carrying out character-by-character matching on the modified SQL statement;
and the character extraction module is used for extracting the character string between the first identifier and the second identifier to obtain the target character string when the first identifier and the second identifier exist in the modified SQL statement.
8. A character extraction device of a database sentence, comprising:
a memory for storing a computer program;
a processor for implementing the steps of a character extraction method of a database statement as claimed in any one of claims 1 to 6 when executing said computer program.
9. A computer-readable storage medium, having stored thereon a computer program which, when being executed by a processor, carries out the steps of a method for character extraction of a database statement as claimed in any one of claims 1 to 6.
CN202110920713.XA 2021-08-11 2021-08-11 Character extraction method, device, equipment and medium for database statement Active CN113641695B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110920713.XA CN113641695B (en) 2021-08-11 2021-08-11 Character extraction method, device, equipment and medium for database statement

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110920713.XA CN113641695B (en) 2021-08-11 2021-08-11 Character extraction method, device, equipment and medium for database statement

Publications (2)

Publication Number Publication Date
CN113641695A true CN113641695A (en) 2021-11-12
CN113641695B CN113641695B (en) 2024-03-22

Family

ID=78420894

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110920713.XA Active CN113641695B (en) 2021-08-11 2021-08-11 Character extraction method, device, equipment and medium for database statement

Country Status (1)

Country Link
CN (1) CN113641695B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102609459A (en) * 2012-01-12 2012-07-25 神州数码网络(北京)有限公司 Method and device for string matching based on regular expression
US20170308576A1 (en) * 2016-04-26 2017-10-26 International Business Machines Corporation Character matching in text processing
CN112783919A (en) * 2021-02-02 2021-05-11 广州海量数据库技术有限公司 Method and device for processing character strings of query statement

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102609459A (en) * 2012-01-12 2012-07-25 神州数码网络(北京)有限公司 Method and device for string matching based on regular expression
US20170308576A1 (en) * 2016-04-26 2017-10-26 International Business Machines Corporation Character matching in text processing
CN112783919A (en) * 2021-02-02 2021-05-11 广州海量数据库技术有限公司 Method and device for processing character strings of query statement

Also Published As

Publication number Publication date
CN113641695B (en) 2024-03-22

Similar Documents

Publication Publication Date Title
CN110321432B (en) Text event information extraction method, electronic device and nonvolatile storage medium
CN109388795B (en) Named entity recognition method, language recognition method and system
US20180190270A1 (en) System and method for semantic analysis of speech
US11138377B2 (en) Automated document analysis comprising company name recognition
CN107943786B (en) Chinese named entity recognition method and system
CN108170708B (en) Vehicle entity identification method, electronic equipment, storage medium and system
JP6427466B2 (en) Synonym pair acquisition apparatus, method and program
CN109522396B (en) Knowledge processing method and system for national defense science and technology field
CN111104801A (en) Text word segmentation method, system, device and medium based on website domain name
KR20120064559A (en) Apparatus and method for question analysis for open web question-answering
CN114266256A (en) Method and system for extracting new words in field
CN112287071A (en) Text relation extraction method and device and electronic equipment
CN106156340A (en) A kind of name entity link method
CN113032371A (en) Database grammar analysis method and device and computer equipment
CN112560425A (en) Template generation method and device, electronic equipment and storage medium
Kuncham et al. Statistical sandhi splitter for agglutinative languages
CN110956043A (en) Domain professional vocabulary word embedding vector training method, system and medium based on alias standardization
CN113641695A (en) Character extraction method, device, equipment and medium for database statement
CN110986972A (en) Information processing method and device for vehicle navigation
JP4148247B2 (en) Vocabulary acquisition method and apparatus, program, and computer-readable recording medium
CN107247708B (en) Surname identification method and system
Sharum et al. Name extraction for unstructured Malay text
CN115934921B (en) Task type answer determining method and device, electronic equipment and storage medium
Fachrurrozi et al. Identification of Ambiguous Sentence Pattern in Indonesian Using Shift-Reduce Parsing
CN112579466B (en) Method and device for generating test cases and computer readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant