CN109933702B - Retrieval display method, device, equipment and storage medium - Google Patents

Retrieval display method, device, equipment and storage medium Download PDF

Info

Publication number
CN109933702B
CN109933702B CN201910180992.3A CN201910180992A CN109933702B CN 109933702 B CN109933702 B CN 109933702B CN 201910180992 A CN201910180992 A CN 201910180992A CN 109933702 B CN109933702 B CN 109933702B
Authority
CN
China
Prior art keywords
text
different
area
display
keyword
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910180992.3A
Other languages
Chinese (zh)
Other versions
CN109933702A (en
Inventor
张济徽
尹华华
王新波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Smart Bud Information Technology Suzhou Co ltd
Original Assignee
Smart Bud Information Technology Suzhou Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Smart Bud Information Technology Suzhou Co ltd filed Critical Smart Bud Information Technology Suzhou Co ltd
Priority to CN201910180992.3A priority Critical patent/CN109933702B/en
Publication of CN109933702A publication Critical patent/CN109933702A/en
Application granted granted Critical
Publication of CN109933702B publication Critical patent/CN109933702B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The embodiment of the invention discloses a retrieval display method, a retrieval display device, retrieval display equipment and a storage medium. The method comprises the following steps: acquiring a keyword input by a user; matching the keywords with different text regions of the target document to generate display modes of the different text regions; the display mode is displayed, and through the technical scheme of the invention, the search function can be optimized and the search experience of a user can be enhanced.

Description

Retrieval display method, device, equipment and storage medium
Technical Field
The embodiment of the invention relates to computer technology, in particular to a retrieval display method, a retrieval display device, retrieval display equipment and a storage medium.
Background
Content retrieval is a core function of an IT system, a search engine or a SaaS application providing data services, and retrieval result pages generated aiming at different retrieval keywords or retrieval formulas are used for performing result preview, operation or optimized retrieval formula further search for users, so that the specific display mode of the retrieval result pages is very important for products providing the data services.
The existing search engines mainly comprise a search engine of Google and a hundredth search engine. The search engine of Google mainly adopts the PageRank algorithm proposed by Sergey Brin and Lawrence Page, which determines the importance of web pages based on web pages linked from many high-quality web pages to rank the web pages, and currently, more than 200 dimensions are used for ranking the web pages. The Baidu search engine mainly uses a hyperchain analysis algorithm to sequence the search results of the web crawlers. At present, algorithms of Google search engines and Baidu search engines only solve the problem of webpage or article sequencing and do not solve the problem of better display of search result pages.
In the prior art, the display purposes of a search result page of a patent document and a search result page of a search engine are different, a general webpage is an unstructured text, and the search engine partially displays matched contents according to the matching of titles or local texts and displays the matched contents. While general papers or patent documents are structured texts with titles, abstracts, claims, descriptions, drawings and the like, these different parts are all parts which are viewed and previewed by patent searchers according to the intentions of searching, but the display result pages of the existing search engines cannot well show the different parts.
Even professional patent search engine tools have different display views, such as list views, table views and graphic views, which cannot well show the search result proportion of different parts of a structured paper or a patent, some search results are matched according to titles and abstracts, but the search results are not displayed according to the matching degree of the parts of titles, abstracts, claims, specifications and drawings.
The use of current search results pages is problematic: as shown in fig. 1A, is an illustration of a tabular view. The search result page obtained by searching according to the artificial intelligence keyword cannot show the matching degree of the patent and the artificial intelligence no matter the title, the abstract or the description in the search result page. As shown in fig. 1B, is an illustration of a standard view. According to the artificial intelligence keyword, the search result page is searched, and the matching degree of the patent and the artificial intelligence can not be seen no matter the title, the abstract or the description in the search result page. As shown in fig. 1C, a search result page is a schematic diagram, which is a matching search based on the classified keywords or academic keywords, but cannot compare the matching proportion visually with the abstract, the specification, the claims and other places. As shown in fig. 1D, another schematic diagram of a search result page is shown, where text matching and thumbnail display are displayed according to titles, abstracts and the like, and matching density and weight corresponding to keywords cannot be well shown.
Disclosure of Invention
The embodiment of the invention provides a retrieval display method, a retrieval display device, retrieval display equipment and a storage medium, so as to realize the optimization of a search function and enhance the search experience of a user.
In a first aspect, an embodiment of the present invention provides a retrieval display method, including:
acquiring a keyword input by a user;
matching the keywords with different text regions of the target document to generate display modes of the different text regions;
and displaying the display mode.
Further, the matching the keyword with different text regions of the target document to generate a display mode of different text regions includes:
presetting area weights of different text regions of the target document;
determining the display area of the text region according to the product of the area weight of the text region and the length of the text content corresponding to the text region;
and generating display modes of different text regions according to the display areas.
Further, generating the display mode of different text regions includes:
determining a first number of hits of the keyword in the text region, wherein the first number of hits is the number of times of matching of the keyword with text content in the text region;
determining the matching weight of the text region according to the first number of hits;
confirming the display color of the corresponding text area in a first database according to the matching weight, wherein the first database is pre-established with the corresponding relation between the display color and the matching weight;
and generating display modes of different text areas according to the display colors.
Further, the text area includes a plurality of different blocks, and the display mode for generating different text areas includes:
determining a second number of hits of the keyword in different blocks of the text area, wherein the second number of hits is the number of times of matching of the keyword with text contents in different blocks of the text area;
determining the thermal weight of the different blocks according to the second number of hits;
confirming the color depth of the corresponding text area in a second database according to the thermal weight, wherein the second database is pre-established with the corresponding relation between the color depth and the thermal weight;
and generating display modes of different text regions according to the color depth.
Further, the text area includes a plurality of different blocks, and after the display modes of different text areas are generated, the method further includes:
determining a hit position of the keyword in the text region, wherein the hit position is a position when the keyword and the text content in the text region are successfully matched;
determining a hit block according to the hit position, wherein the hit position is within the range of the hit block;
highlighting the hit block or tagging the hit block with one of the keyword, an abbreviation of the keyword, and a substitute for the keyword.
Further, the determining the hit position of the keyword in the text region includes:
acquiring the line number of the keyword when the matching of the text area is successful as a first position parameter;
acquiring a line offset of the keyword when the text area is successfully matched as a second position parameter;
and determining the hit position according to the first position parameter and the second position parameter.
Further, the display mode comprises: at least one of rectangular, circular, and elliptical.
In a second aspect, an embodiment of the present invention further provides a retrieval display apparatus, where the apparatus includes:
the acquisition module is used for acquiring keywords input by a user;
the generating module is used for matching the keywords with different text regions of the target document so as to generate display modes of the different text regions;
and the display module is used for displaying the display mode.
Further, the generating module comprises:
a setting unit configured to set area weights of different text regions of the target document in advance;
the determining unit is used for determining the display area of the text region according to the product of the area weight of the text region and the length of the text content corresponding to the text region;
and the graph generating unit is used for generating display modes of different text regions according to the display area.
Further, the generating module comprises:
a first number-of-hits determination unit configured to determine a first number of hits of the keyword in the text region, the first number of hits being a number of times that the keyword and text content of the text region are matched;
a matching weight determination unit for determining a matching weight of the text region according to the first number of hits;
the display color confirmation unit is used for confirming the display color of the corresponding text area in a first database according to the matching weight, and the first database is pre-established with the corresponding relation between the display color and the matching weight;
and the display mode generating unit is used for generating display modes of different text areas according to the display colors.
Further, the text area includes a plurality of different blocks, and the generating module is specifically configured to:
determining a second number of hits of the keyword in different blocks of the text area, wherein the second number of hits is the number of times that the keyword is matched with the text content in different blocks of the text area;
determining thermodynamic weights of the different blocks according to the second number of hits;
confirming the color depth of the corresponding text area in a second database according to the thermal weight, wherein the second database is pre-established with the corresponding relation between the color depth and the thermal weight;
and generating display modes of different text areas according to the color depth.
Further, the method also comprises the following steps:
a hit position determining unit, configured to determine a hit position of the keyword in the text region, where the hit position is a position where matching between the keyword and text content in the text region is successful;
a hit block determination unit for determining a hit block according to the hit position, the hit position being within the hit block;
a marking unit for highlighting the hit block or marking the hit block using one of the keyword, an abbreviation of the keyword, and a substitute of the keyword.
Further, the hit position determination unit is specifically configured to:
acquiring the line number of the keyword when the matching of the text area is successful as a first position parameter;
acquiring a line offset of the keyword when the matching of the keyword in the text area is successful as a second position parameter;
and determining the hit position according to the first position parameter and the second position parameter.
Further, the display mode comprises: at least one of rectangular, circular, and elliptical.
In a third aspect, an embodiment of the present invention further provides a computer device, including a memory, a processor, and a computer program stored on the memory and executable on the processor, where the processor executes the computer program to implement the retrieval presentation method according to any one of the embodiments of the present invention.
In a fourth aspect, the embodiment of the present invention further provides a computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the retrieval presentation method according to any one of the embodiments of the present invention.
The embodiment of the invention obtains the key words input by the user; matching the keywords with different text regions of the target document to generate display modes of the different text regions; the display mode is displayed, the search function can be optimized, a user can conveniently and quickly know the integral matching result of the search, and the search experience of the user is enhanced.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings needed to be used in the embodiments will be briefly described below, it should be understood that the following drawings only illustrate some embodiments of the present invention and therefore should not be considered as limiting the scope, and for those skilled in the art, other related drawings can be obtained according to the drawings without inventive efforts.
FIG. 1A is a diagram of a table view in the prior art;
FIG. 1B is a diagram of a standard view in the prior art;
FIG. 1C is a diagram illustrating a search results page in the prior art;
FIG. 1D is a schematic diagram of another search results page in the prior art;
FIG. 2 is a flowchart of a retrieval displaying method according to a first embodiment of the present invention;
FIG. 3A is a flowchart of a retrieval displaying method according to a second embodiment of the present invention;
FIG. 3B is a diagram comparing the representation of a sub-item of the patent search result list according to the second embodiment of the present invention by the second border size of the rectangular box with a sub-item of the patent search result list according to the prior art;
FIG. 4A is a flowchart of a retrieval display method according to a third embodiment of the present invention;
fig. 4B is a diagram comparing the representation manner of a sub-item of the patent search result list through the second border size and color of the rectangular box in the third embodiment of the present invention with that of a sub-item of the patent search result list in the prior art;
FIG. 5A is a flowchart of a retrieval displaying method according to a fourth embodiment of the present invention;
FIG. 5B is a diagram comparing the size of a sub-item of the patent search result list in the second frame of the rectangular frame and the representation of the thermodynamic diagram with a sub-item of the patent search result list in the prior art;
FIG. 5C is a diagram of the second border size of the rectangular box and the representation of the thermodynamic diagram of a sub-item of the patent search result list in the fourth embodiment of the present invention;
FIG. 6A is a flowchart of a retrieval displaying method according to a fifth embodiment of the present invention;
FIG. 6B is a diagram of a search keyword according to a fifth embodiment of the present invention;
FIG. 6C is a comparison graph of a sub-item of the patent search result list marked by the second border size of the rectangular box and the representation manner of the thermodynamic diagram and by using the keywords in the fifth embodiment of the present invention with a sub-item of the patent search result list in the prior art;
fig. 6D is a diagram illustrating a manner in which a sub-item of the patent search result list is represented by the second border size of the rectangular box and the thermodynamic diagram and is marked by using a keyword according to a fifth embodiment of the present invention;
FIG. 6E is a diagrammatic illustration of a number of different modes of presentation of a fifth embodiment of the invention;
FIG. 7 is a schematic structural diagram of a retrieval display apparatus according to a sixth embodiment of the present invention;
fig. 8 is a schematic structural diagram of a computer device in a seventh embodiment of the present invention.
Detailed Description
The present invention will be described in further detail with reference to the accompanying drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the invention and are not limiting of the invention. It should be further noted that, for the convenience of description, only some structures related to the present invention are shown in the drawings, not all of them.
It should be noted that: like reference numbers and letters refer to like items in the following figures, and thus, once an item is defined in one figure, it need not be further defined and explained in subsequent figures. Meanwhile, in the description of the present invention, the terms "first", "second", and the like are used only for distinguishing the description, and are not to be construed as indicating or implying relative importance.
Example one
Fig. 2 is a flowchart of a retrieval display method according to an embodiment of the present invention, where the present embodiment is applicable to a retrieval display situation, the method may be executed by a retrieval display apparatus according to an embodiment of the present invention, and the apparatus may be implemented in a software and/or hardware manner, as shown in fig. 2, the method specifically includes the following steps:
s110, obtaining the key words input by the user.
The keyword may be in a search form or in other forms, for example, the keyword may be 5G communication or XX company, which is not limited in this embodiment of the present invention.
Specifically, the manner of obtaining the keywords input by the user may be that the user manually inputs the keywords through an external device, for example, the user inputs the keywords by pressing a control on a keyboard, and may also input the keywords through a voice form, for example, the user inputs the keywords in a "5G communication", obtains the voice input by the user, converts the voice input by the user into a text form through a voice recognition technology, and further obtains the keywords input by the user, and also obtains the keywords input by the user through other manners, for example, obtains the keywords input by the user through a gesture, and the embodiment of the present invention is not limited to this.
And S120, matching the keywords with different text regions of the target document to generate display modes of the different text regions.
The target document may be a patent document or a patent application document, may also be a thesis, and may also be another structured document, which is not limited in this embodiment of the present invention.
If the target document is a patent document or a patent application document, the different text areas of the target document are 5 different text areas including a title area, a summary area, a claim area, a specification area and a drawing area according to the structured text characteristics of the patent document or the patent application document. In alternative embodiments, different text regions of the target document may also be selectively or eligibly displayed, for example, the different text regions of the target document in the structured text of the patent document or the patent application document may be 4 different text regions of a title region, a summary region, a claim region and a specification region, or may be 3 different text regions of the summary region, the claim region and the specification region.
The matching between the keyword and different text regions of the target document may be performed in such a manner that the keyword is sequentially matched with different text regions of the target document, for example, if the keyword is keyword a, the different text regions of the target document include: a text region T, a text region Q, a text region C, a text region D and a text region B; and matching the keyword A with the text area T, the text area Q, the text area C, the text area D and the text area B in sequence.
Specifically, the method of matching the keyword with different text regions of the target document to generate the display modes of different text regions may be to match the keyword with different text regions of the target document, determine a display area according to the number of matched characters, determine a display mode according to the display area, determine a display area according to a product of weights of different text regions of the target document and lengths of text contents corresponding to the text regions, determine a display mode according to the display area, or may determine a matching weight of a text region according to the number of times of matching the text contents of the keyword with the text regions, further determine a display color of the text region according to the matching weight, determine a display image according to the display color and the display area, and determine thermal weights, color depths, display colors, and display areas of different blocks according to the number of times of matching the text contents of the different blocks of the keyword with the text regions, where the embodiment of the present invention does not limit this.
And S130, displaying the display mode.
The display manner may include a rectangle, an ellipse, or a circle, which is not limited in this embodiment of the present invention.
Specifically, the display manner may be that different text regions are surrounded by rectangular frames with different second frame sizes, where the first frame sizes of the rectangular frames are the same, and the second frame sizes of the rectangular frames are related to the lengths of the text contents of the different text regions, so that a previewer can visually recognize the different second frame sizes of the rectangular frames; different colors can be preset for different text areas, and a previewer can be intuitively known through the different colors; or the hit rates of different text regions can be identified by using different colors, wherein the hit rate refers to the number of times of matching between the keywords and the text contents of the text regions; the hit density can be displayed through the thermodynamic diagrams, the text area needs to be divided to obtain different blocks, the thermodynamic diagrams are obtained according to the matching times of the keywords and the text content of the different blocks in the text area, and the different text areas are displayed through the different thermodynamic diagrams; the matching position of the keyword may also be embodied in the form of a mark, which is not limited in this embodiment of the present invention. The first frame size of the rectangular frame in this embodiment may be the size of the display mode along the width or horizontal direction, and the second frame size of the rectangular frame in the display mode may be the size of the display mode along the height or vertical direction.
Optionally, the displaying method for generating different text regions includes:
determining a first number of hits of the keyword in the text region, wherein the first number of hits is the number of times of matching of the keyword with text content in the text region;
determining the matching weight of the text region according to the first number of hits;
confirming the display color of the corresponding text area in a first database according to the matching weight, wherein the first database is pre-established with the corresponding relation between the display color and the matching weight;
and generating display modes of different text areas according to the display colors.
Specifically, the display color of the corresponding text region is confirmed by the first number of hits of the keyword in the text region, and then the display mode of different text regions is generated according to the display color, for example, the keyword is 10 times in the first number of hits of the region a, the keyword is 5 times in the first number of hits of the region B, the keyword is 2 times in the first number of hits of the region C, the matching weight of the region a is determined to be Q, the matching weight of the region B is determined to be W, the matching weight of the region C is determined to be P, in the first database, the text display color corresponding to the matching weight of Q is blue, the text display color corresponding to the matching weight of W is green, the text display color corresponding to the matching weight of P is red, the text display color corresponding to the region a is determined to be blue, the display color of the region B is green, and the display color of the region C is red. Optionally, the text area includes a plurality of different blocks, and the display manner for generating different text areas includes:
determining a second number of hits of the keyword in different blocks of the text area, wherein the second number of hits is the number of times that the keyword is matched with the text content in different blocks of the text area;
determining thermodynamic weights of the different blocks according to the second number of hits;
confirming the color depth of the corresponding text area in a second database according to the thermal weight, wherein the second database is pre-established with the corresponding relation between the color depth and the thermal weight;
and generating display modes of different text regions according to the color depth.
Specifically, the color depth of the corresponding text region is determined by the second hit numbers of the keywords in the different blocks of the text region, and then the display modes of the different text regions are generated according to the color depth, for example, the region a includes a block X, a block Y, and a block Z, the first hit number of the keywords in the block X of the region a is 10 times, the first hit number of the keywords in the block Y of the region a is 5 times, the first hit number of the keywords in the block Z of the region a is 2 times, the thermal weight of the block X is determined to be U, the thermal weight of the block Y is determined to be V, the thermal weight of the block Z is determined to be W, in the second database, the color depth of the block corresponding to U is L, the color depth of the block corresponding to V is M, the thermal weight of the block corresponding to W is N, the color depth of the text region is determined according to the color depths of the different blocks, and then the color depth of the block X in the region a is determined to be L, the color depth of the block Y is M, and the color depth of the block corresponding to Z is N.
According to the technical scheme of the embodiment, keywords input by a user are obtained; matching the keywords with different text regions of the target document to generate display modes of the different text regions; the display mode is displayed, keyword matching can be carried out according to different text regions, then the search function is optimized, different display modes are generated through different text regions, intuitive knowledge can be provided for a retrieval user, and the search experience of the user is enhanced.
Example two
Fig. 3A is a flowchart of a retrieval display method in a second embodiment of the present invention, where the embodiment is optimized based on the foregoing embodiment, and in this embodiment, the displaying manner of matching the keyword with different text regions of the target document to generate different text regions includes: presetting area weights of different text regions of the target document; determining the display area of the text region according to the product of the area weight of the text region and the length of the text content corresponding to the text region; and generating display modes of different text areas according to the display areas.
As shown in fig. 3A, the method of this embodiment specifically includes the following steps:
s210, acquiring the keywords input by the user.
S220, the area weight of different text areas of the target document is preset.
The method for setting the area weight of different text regions of the target document may be determined according to the number of words of the text content of different text regions, for example, the area weight of a text region in which the number of words of the text content of the text region is small may be set to be larger; the setting may also be directly performed according to the usage habit of the user, or may be performed according to the importance degree of the text region, which is not limited in the embodiment of the present invention.
Specifically, the area weight of the different text regions of the target document is preset, and for example, if the target document is a patent document or a patent application document, and the different text regions of the target document are a title region, a summary region, a claim region, a description region and a drawing region, respectively, the weight of the title region is preset to be 25%, the weight of the summary region is preset to be 20%, the weight of the claim region is preset to be 25%, the weight of the description region is preset to be 15%, and the weight of the drawing region is preset to be 15%.
In a specific example, the graphic result page is divided into T, a, C, D and B according to the structural text features of the patent document or the patent application document, which correspond to 5 different areas of the title, the abstract, the claims, the specification and the drawings respectively, and the graphic result page display mode of matching ratio redo of the above 5 different parts is matched according to the keywords input by the user. The areas of the different rectangles represent different text proportions of the original text, as shown in fig. 3B, which is a sub-item of the search result list of the patent document or the patent application document, the left side is a sub-item of the search result list of the patent document or the patent application document in the embodiment of the present invention, and the right side is a search result in the prior art, wherein the sub-item refers to a target document in the search result list, and if a target document related to the keyword can be searched, the search result list includes at least one target document. The title, abstract, claims, specification and drawings are shown as rectangles, as shown in fig. 3B, wherein the height 1 of each rectangle is different, and the relative size of each rectangle is schematically shown according to the text length of the patent document or patent application document, so as to provide a previewer with an intuitive understanding that different text areas correspond to different rectangle heights 1. In order not to distort the rectangular scale too much (the title may be little text and the description is large) the rectangular display is designed with a certain weight so that the scale is moderate to reflect the text height of different areas of the patent document or patent application document. The height calculation for the rectangular display has two steps:
the method comprises the following steps: as shown in Table one, a certain weight of the text defining the different parts.
Watch 1
Figure BDA0001991329230000081
Step two: and obtaining the second border size of the rectangular boxes of different areas according to the different area weights multiplied by the corresponding text lengths, so that the height proportional relation of each part can be obtained, and then calculating the height of each part according to the total height of the displayed thumbnail. As shown in table two, the second frame size of the rectangular frame is calculated.
Watch two
Figure BDA0001991329230000091
And S230, determining the display area of the text region according to the product of the area weight of the text region and the length of the text content corresponding to the text region.
The length of the text content corresponding to the text region may be the number of lines or words of the text, for example, the length of the text content corresponding to the text region may be the length of the text in the title region, the length of the text in the abstract region, the length of the text in the claim region, the length of the text in the description region, and the length of the text in the drawing region, where the length of the text corresponding to the drawing region may be obtained by performing character recognition on the drawing region in advance to obtain the text corresponding to the drawing region, and then obtaining the length of the text corresponding to the drawing region.
Specifically, the display area of the text region is determined according to the product of the area weight of the text region and the length of the text content corresponding to the text region, for example, the second border size of the rectangular box of the text region is determined according to the product of the area weight of the text region and the length of the text content corresponding to the text region, so as to obtain the proportional relationship of the display heights of different text regions, and the heights of different text regions are calculated according to the total height of the displayed thumbnail.
S240, generating display modes of different text areas according to the display areas.
The display area refers to the display area of different text regions, and the different text regions may be displayed in a rectangular form, a circular form, or an elliptical form, which is not limited in the embodiment of the present invention.
Specifically, the display mode of generating different text regions according to the display area may be, for example, generating rectangles with different display areas on the peripheries of different text regions according to the display area, and because the rectangles have the same length, the previewer may be given an intuitive understanding through the height of the rectangle.
And S250, displaying the display mode.
Optionally, the display mode includes: at least one of rectangular, circular, and elliptical.
According to the technical scheme of the embodiment, area weights of different text areas of a target document are preset; determining the display area of the text region according to the product of the area weight of the text region and the length of the text content corresponding to the text region; the display modes of different text areas are generated according to the display areas, the search function can be optimized, a user can conveniently and accurately position the position, whether the patent document or the patent application document is related to the content which the user wants to search or not is checked, and the search experience of the user is enhanced.
EXAMPLE III
Fig. 4A is a flowchart of a retrieval display method in a third embodiment of the present invention, which is optimized based on the second embodiment, in this embodiment, a display manner for generating different text regions according to the display area includes: determining a first number of hits of the keyword in the text region, wherein the first number of hits is the number of times of matching of the keyword with text content in the text region; determining the matching weight of the text region according to the first number of hits; confirming the display color of the corresponding text area in a first database according to the matching weight, wherein the first database is pre-established with the corresponding relation between the display color and the matching weight; and generating display modes of different text areas according to the display colors and the display areas.
As shown in fig. 4A, the method of this embodiment specifically includes the following steps:
s310, obtaining the key words input by the user.
S320, the area weight of different text areas of the target document is preset.
S330, determining the display area of the text region according to the product of the area weight of the text region and the length of the text content corresponding to the text region.
S340, determining the first number of hits of the keyword in the text area, wherein the first number of hits is the number of times of matching between the keyword and the text content of the text area.
The number of times of matching the keyword with the text content in the text region is to match the keyword with the text content in different text regions in sequence to obtain the number of times of matching the text content in different text regions of the keyword, for example, the number of times of matching the keyword with the text content in the text region P may be 1 when the keyword is the keyword a and the content corresponding to the text region P is "the keyword a is other content (the other content does not include the keyword a)".
S350, determining the matching weight of the text region according to the first number of hits.
Specifically, the way of determining the matching weight of the text region according to the first number of hits may be that the greater the first number of hits, the greater the matching weight of the text region, or the greater the first number of hits, the smaller the matching weight of the text region, which is not limited in this embodiment of the present invention.
S360, confirming the display color of the corresponding text area in the first database according to the matching weight, wherein the corresponding relation between the display color and the matching weight is pre-established in the first database.
The corresponding relationship between the display color and the matching weight is pre-established, for example, the display color a corresponds to the matching weight P, the display color B corresponds to the matching weight Q, and the display color C corresponds to the matching weight W.
Specifically, the display color of the corresponding text region is determined in the first database according to the matching weight, for example, the display color of the text region corresponding to the matching weight may be obtained by querying the first database according to the matching weight.
And S370, generating display modes of different text regions according to the display colors and the display areas.
Specifically, the display colors and the display areas of different text regions are different, so that the area weights of different text regions and the matching times of the different text regions and the keywords can be visually embodied, and the user can browse more conveniently.
And S380, displaying the display mode.
In a specific example, as shown in fig. 4B, the left side is a sub-item of the search result list of the patent document or the patent application document in the embodiment of the present invention, and the right side is the search result in the prior art, and the number of hits of the keyword in different areas is represented by different colors or different shades, which is convenient for the searcher of the patent document or the patent application document to quickly find out which part is closely related to the keyword, and jump to the corresponding area for viewing. The system can be configured by specifically selecting which color to set according to the visibility requirements of different customers. The colors shown in fig. 4B are schematic diagrams and the system may be made configurable for different colors. As shown in fig. 4B, different text regions correspond to different rectangular heights 1. The matching degree can also be shown by the depth of the color, and the depth of each color is calculated as follows: first number of hits = number of times of matching of keyword with text content of text region, N Hit To represent different numbers of first hits (N represents different parts), the matching algorithm for a specific keyword can be determined by a scheme so that is generally well established in the industrylr or lucene. The implementation is illustrated with the basic RGB primaries: according to the RGB specification, white: (255, 255, 255) black: (0,0,0). For example (139, 35, 39) represents a default deep red color (the system may be configured with other colors) when all hits are made. Setting default offsets according to red, green and blue:
Offset red =255-139=116
Offset green =255-35=220
Offset blue =255-39=216
calculating the red, green and blue values according to the hit rate:
red=max(255-(Offset red *N Hit ),0)
green=max(255-(Offset gren *N Hit ),0)
blue=max(255-(Offset blue *N Hit ),0)
N Hit and (5) obtaining the values of red, green and blue according to the number of hits. The obtained red, green and blue values can be directly used as the colors of the corresponding regions, and the weight N of each region can be combined weight The final color of each part was obtained:
T Color =N Weight *RGB(red,green,blue,N Hit )
the weight is lower because the text of the title is less, while the weight of the description will be higher, which is the opposite way to calculate the height of the rectangle. As shown in table three, the weight distribution table of the rectangular color is shown.
Watch III
Color weights Specific gravity of Description of the invention
T
Weight 20% Weight indicating title area
A weight 15% Weight representing summary area
C
weight 20% Weights representing claim regions
D
Weight 30% Weight representing region of the specification
B weight 15% Weights representing regions of the drawing
As shown in table four, the color calculation method is for different parts:
watch four
Figure BDA0001991329230000111
Figure BDA0001991329230000121
According to the technical scheme, the area weights of different text areas of the target document are preset, the display area of the text area is determined according to the product of the area weight of the text area and the length of text content corresponding to the text area, the first number of hits of a keyword in the text area is determined, the matching weight of the text area is determined according to the first number of hits, the display color of the corresponding text area is confirmed in the first database according to the matching weight, the corresponding relation between the display color and the matching weight is pre-established in the first database, the display modes of different text areas are generated according to the display color and the display area, the search function can be optimized, the user can more conveniently accurately position the position through the display of different display colors and display areas, whether the patent document or the patent application document is related to the content which the user wants to search is checked, and the user search experience is enhanced.
Example four
Fig. 5A is a flowchart of a retrieval display method according to a fourth embodiment of the present invention, which is optimized based on the second embodiment, in this embodiment, the text region includes a plurality of different blocks, and the display manner of generating different text regions according to the display area includes: determining a second number of hits of the keyword in different blocks of the text area, wherein the second number of hits is the number of times that the keyword is matched with the text content in different blocks of the text area; determining thermodynamic weights of the different blocks according to the second number of hits; confirming the color depth of the corresponding text area in a second database according to the thermal weight, wherein the second database is pre-established with the corresponding relation between the color depth and the thermal weight; and generating display modes of different text regions according to the color depth, the display color and the display area.
As shown in fig. 5A, the method of this embodiment specifically includes the following steps:
and S410, acquiring the keywords input by the user.
S420, area weights of different text regions of the target document are preset, and the text regions comprise a plurality of different blocks.
S430, determining the display area of the text region according to the product of the area weight of the text region and the length of the text content corresponding to the text region.
S440, determining a second number of hits of the keyword in different blocks of the text region, the second number of hits being the number of times the keyword matches the text content of the different blocks of the text region.
The text area is divided into different blocks in advance, each text area includes at least two blocks, and the specific block division form is not limited in the embodiment of the present invention.
Specifically, the manner of determining the second number of hits of the keyword in different blocks of the text region may be to match the keyword with different blocks of the text region in sequence, so as to obtain the number of times that the keyword is matched with the text content in different blocks of the text region.
S450, determining the thermal weight of different blocks according to the second hit number.
The relationship between the second number of hits and the thermal weight is that the larger the second number of hits is, the larger the thermal weight is.
Specifically, the manner of determining the thermal power weights of the different blocks according to the second number of hits may be a database in which a correspondence between the second number of hits and the thermal power weights is preset, and the database is queried according to the currently obtained second number of hits to obtain the thermal power weights of the different blocks corresponding thereto.
And S460, confirming the color depth of the corresponding text area in a second database according to the thermal weight, wherein the second database is pre-established with the corresponding relation between the color depth and the thermal weight.
For example, the color depth corresponding to the thermal weight may be obtained by querying the second database according to the thermal weight, and the color depth of the text region may be determined according to the obtained color depth.
Optionally, a corresponding relationship between the color depth of the block and the thermal weight may be pre-established, the color depth of the block corresponding to the current thermal weight may be queried according to the current thermal weight, and the color depth of the text region may be obtained according to the color depths of different blocks in the same text region.
And S470, generating display modes of different text regions according to the color depth, the display color and the display area.
And S480, displaying the display mode.
In a specific example, as shown in fig. 5B, the left side is a sub-item of the search result list of the patent document or the patent application document in the embodiment of the present invention, the right side is the search result in the prior art, different areas can be presented by using different thermodynamic diagrams, different parts of the same area have different distributions of search hits, the same color-dark thermodynamic diagram represents the density of the hits, and the hit range of the search condition of the user, so that the user can conveniently pinpoint the position and check whether the patent document or the patent application document is related to the content that the user wants to search. As shown in fig. 5B, the block hit rate corresponding to the middle part 2 of the abstract area is the highest, the block hit rate corresponding to the upper right corner part 3 of the specification area is the highest, and the hit matching degree of the text is represented by a representation manner of thermodynamic diagrams for hit blocks in different parts of the same area. The specific matching method is shown in fig. 5C, which takes the description of the specification area as an example: dividing the specification area into x blocks, y represents the offset of each line, x represents the number of lines, calculating which block the index search key falls in according to the matching condition of the index search key, namely the offset number returned by solr, and showing the thermodynamic diagram by looking at the area where the index matching is shown: hotspot = { X = n ,Y m H, where hotspot represents a set of Hotspots, X n Representing the abscissa, Y, of a rectangle m Indicating the rectangular ordinate, hotspot corresponds to three areas 4 as the higher hit block, as shown in fig. 5C.
According to the technical scheme, area weights of different text areas of a target document are preset, the text areas comprise a plurality of different blocks, display areas of the text areas are determined according to the product of the area weights of the text areas and the lengths of text contents corresponding to the text areas, second hits of keywords in the different blocks of the text areas are determined, thermal weights of the different blocks are determined according to the second hits, color depths of the corresponding text areas are confirmed in a second database according to the thermal weights, corresponding relations between the color depths and the thermal weights are preset in the second database, display modes of the different text areas are generated according to the color depths, the display colors and the display areas, a search function can be optimized, the matching degree of hit of the text can be better displayed through a thermodynamic diagram, a user can conveniently accurately position the text, whether the patent document or the patent application document is related to the contents which the user wants to search or not is checked, and user search experience is enhanced.
EXAMPLE five
Fig. 6A is a flowchart of a retrieval display method in a fifth embodiment of the present invention, which is optimized based on the second embodiment, in this embodiment, the text region includes a plurality of different blocks, and after generating display modes of different text regions according to the display areas, the method further includes: determining a hit position of the keyword in the text region, wherein the hit position is a position when the keyword and the text content in the text region are successfully matched; determining a hit block according to the hit position, wherein the hit position is within the range of the hit block; highlighting the hit block or tagging the hit block with one of the keyword, an abbreviation of the keyword, and an alternative to the keyword.
As shown in fig. 6A, the method of this embodiment specifically includes the following steps:
and S510, acquiring the keywords input by the user.
S520, the area weight of different text areas of the target document is preset.
S530, determining the display area of the text region according to the product of the area weight of the text region and the length of the text content corresponding to the text region.
And S540, generating display modes of different text regions according to the display areas.
And S550, displaying the display mode.
Optionally, the display mode includes: at least one of rectangular, circular, and elliptical.
And S560, determining the hit position of the keyword in the text area, wherein the hit position is the position when the keyword is successfully matched with the text content in the text area.
Specifically, the hit position of the keyword in the text region is determined, for example, if the position where the matching between the keyword and the text content in the text region is successful is position a, the position of the keyword in the hit position in the text region is position a.
S570, determining the hit block according to the hit position, wherein the hit position is within the range of the hit block.
S580, the hit block is highlighted or marked with one of the keyword, an abbreviation of the keyword, and a substitute word of the keyword.
The number of the keywords may be one or multiple, and if there is one keyword, the keywords are directly marked, and if there are multiple keywords, the keywords may be sequentially marked, which is not limited in the embodiment of the present invention.
Specifically, the manner of highlighting the hit block or marking the hit block by using one of the keyword, the abbreviation of the keyword, and the substitute word of the keyword may be to mark the keyword directly on the hit block, or may preset a corresponding relationship between the keyword and a mark symbol, and to mark the mark symbol corresponding to the keyword according to the search of the keyword, for example, the keyword a may be directly marked on the hit block, or the keyword a may be preset to correspond to Q, so that the mark Q may be marked on the hit block.
Optionally, the determining the hit position of the keyword in the text region includes:
and acquiring the line number of the keyword when the matching in the text area is successful as a first position parameter.
And the first position parameter is the line number of the keyword when the matching in the text area is successful.
And acquiring the line offset of the keyword when the text area is successfully matched as a second position parameter.
And the second position parameter is the line offset of the keyword when the text area is successfully matched.
And determining the hit position according to the first position parameter and the second position parameter.
Specifically, different search conditions are used for different hit display modes in the same area. For example, as shown in fig. 6B, there is a search formula, there are three search conditions C1, C2, and C3, after the search, as shown in fig. 6C, the left side is a sub-item of the search result list of the patent document or the patent application document in the embodiment of the present invention, and the right side is the search result in the prior art, the display effect of the search result page is as follows, the hit block 5 corresponds to the hit block of C1, the hit block 6 corresponds to the hit block of C2, the hit block 7 corresponds to the hit block of C3, the hit positions of C1, C2, and C3 in the specification area are different, and these different blocks and the prompt information, which are C1, C2, and C3 in this embodiment, respectively, can help the previewer well and better quickly locate to the search place to perform the preview and view.
For example, the description may be given in the specification area, and the method may be the same for the remaining areas. As shown in fig. 6D, the specification area is divided into x by y blocks.
The formula of the hit positions of the keywords in the different search formulas in the text area is as follows:
Search C[i] ={X n ,Y m }
wherein, search C[i] Different keywords in the index are represented, and an independent keyword is between different logical operators, such as TA: (5 g) AND ALL _ AN: (Hua is) AND DESC: the search formula (communication) has three keywords, which are: search C[1] : namely TA: (5g) Search, search C[2] : ALL _ AN: (Hua is), search C[3] : and (4) DESC: (communication).
Specifically, { X n ,Y m H, calculating the hit position of different keywords in the text area, specifically, according to the offset returned by the search engine solr, the number of lines X of the keyword in the text area when matching is successful n Then according to the specific line number and the character of each lineWidth, offset-X n *Line width And obtaining the line offset of the keyword when the text area is successfully matched, and then segmenting the text area according to the rectangle, and finally calculating which block the keyword falls into, namely the hit block. It should be noted that different matching cursors (offsets) may appear for the same keyword, and then this time a set Search of rectangles is returned for the same keyword C[i] ={X n ,Y m }。
As shown in fig. 6E, the display manner is not only rectangular, but also oval, square, circle or other different layouts to represent the distribution of hits in different areas. It may also be arranged in a transverse, or longitudinal, arrangement.
The embodiment of the invention can improve the user search experience, the patent literature or patent application literature search is a professional structured data search mode, and the display mode of the result page is also structured, so that the keyword input by the user can be accurately matched. In addition, the embodiment of the present invention may be used for searching patent documents or patent application documents, or structured searching of other articles, such as articles, or other structured texts, so the protection scope may be any display mode with a structure, which is not limited to the presentation mode of the TACDB, and as long as structured data that can be searched, the embodiment of the present invention may be displayed by the retrieval display method in the embodiment of the present invention, thereby enhancing the user search experience.
According to the technical scheme, the area weights of different text areas of the target document are preset, the display area of the text area is determined according to the product of the area weight of the text area and the length of the text content corresponding to the text area, the display modes of the different text areas are generated according to the display area, the hit position of the keyword in the text area is determined, the hit position is the position when the keyword and the text content in the text area are successfully matched, hit blocks are highlighted or marked by using one of the keyword, the abbreviation of the keyword and the alternative word of the keyword, a previewer can be well helped, the searched place can be better and quickly located, previewing and viewing are carried out, and the user search experience is enhanced.
Example six
Fig. 7 is a schematic structural diagram of a retrieval display apparatus according to a sixth embodiment of the present invention. The embodiment may be applicable to the case of retrieval display, the apparatus may be implemented in a software and/or hardware manner, and the apparatus may be integrated in any device providing a retrieval display function, as shown in fig. 7, where the retrieval display apparatus specifically includes: an acquisition module 610, a generation module 620, and a display module 630.
The acquiring module 610 is configured to acquire a keyword input by a user;
a generating module 620, configured to match the keyword with different text regions of the target document to generate display manners of the different text regions;
a display module 630, configured to display the display manner.
Optionally, the generating module includes:
a setting unit, configured to preset area weights of different text regions of the target document;
the determining unit is used for determining the display area of the text region according to the product of the area weight of the text region and the length of the text content corresponding to the text region;
and the graph generating unit is used for generating display modes of different text regions according to the display area.
Optionally, the generating module includes:
a first number-of-hits determination unit configured to determine a first number of hits of the keyword in the text region, the first number of hits being a number of times that the keyword and a text content of the text region are matched;
a matching weight determination unit for determining a matching weight of the text region according to the first number of hits;
the display color confirmation unit is used for confirming the display color of the corresponding text area in a first database according to the matching weight, and the first database is pre-established with the corresponding relation between the display color and the matching weight;
and the display mode generating unit is used for generating display modes of different text areas according to the display colors.
Optionally, the text area includes a plurality of different blocks, and the generating module is specifically configured to:
determining a second number of hits of the keyword in different blocks of the text area, wherein the second number of hits is the number of times that the keyword is matched with the text content in different blocks of the text area;
determining the thermal weight of the different blocks according to the second number of hits;
confirming the color depth of the corresponding text area in a second database according to the thermal weight, wherein the second database is pre-established with the corresponding relation between the color depth and the thermal weight;
and generating display modes of different text regions according to the color depth.
Optionally, the method further includes:
a hit position determining unit, configured to determine a hit position of the keyword in the text region, where the hit position is a position where matching of the keyword and text content of the text region is successful;
a hit block determination unit for determining a hit block according to the hit position, the hit position being within the hit block;
a marking unit for highlighting the hit block or marking the hit block using one of the keyword, an abbreviation of the keyword, and a substitute of the keyword.
Optionally, the hit position determining unit is specifically configured to:
acquiring the line number of the keyword when the matching in the text area is successful as a first position parameter;
acquiring a line offset of the keyword when the text area is successfully matched as a second position parameter;
and determining the hit position according to the first position parameter and the second position parameter.
Optionally, the display mode includes: at least one of rectangular, circular, and elliptical.
The product can execute the method provided by any embodiment of the invention, and has corresponding functional modules and beneficial effects of the execution method.
According to the technical scheme of the embodiment, keywords input by a user are obtained; matching the keywords with different text regions of the target document to generate display modes of the different text regions; the display mode is displayed, so that the search function can be optimized, and the search experience of a user can be enhanced.
EXAMPLE seven
Fig. 8 is a schematic structural diagram of a computer device in a seventh embodiment of the present invention. FIG. 8 illustrates a block diagram of an exemplary computer device 12 suitable for use in implementing embodiments of the present invention. The computer device 12 shown in fig. 8 is only an example and should not bring any limitations to the functionality or scope of use of the embodiments of the present invention.
As shown in FIG. 8, computer device 12 is in the form of a general purpose computing device. The components of computer device 12 may include, but are not limited to: one or more processors or processing units 16, a system memory 28, and a bus 18 that couples various system components including the system memory 28 and the processing unit 16.
Bus 18 represents one or more of any of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, and a processor or local bus using any of a variety of bus architectures. By way of example, such architectures include, but are not limited to, industry Standard Architecture (ISA) bus, micro-channel architecture (MAC) bus, enhanced ISA bus, video Electronics Standards Association (VESA) local bus, and Peripheral Component Interconnect (PCI) bus.
Computer device 12 typically includes a variety of computer system readable media. Such media may be any available media that is accessible by computer device 12 and includes both volatile and nonvolatile media, removable and non-removable media.
The system memory 28 may include computer system readable media in the form of volatile memory, such as Random Access Memory (RAM) 30 and/or cache memory 32. Computer device 12 may further include other removable/non-removable, volatile/nonvolatile computer system storage media. By way of example only, storage system 34 may be used to read from and write to non-removable, nonvolatile magnetic media (not shown in FIG. 8, and commonly referred to as a "hard drive"). Although not shown in FIG. 8, a magnetic disk drive for reading from and writing to a removable, nonvolatile magnetic disk (e.g., a "floppy disk") and an optical disk drive for reading from or writing to a removable, nonvolatile optical disk (e.g., a CD-ROM, DVD-ROM, or other optical media) may be provided. In these cases, each drive may be connected to bus 18 by one or more data media interfaces. Memory 28 may include at least one program product having a set (e.g., at least one) of program modules that are configured to carry out the functions of embodiments of the invention.
A program/utility 40 having a set (at least one) of program modules 42 may be stored, for example, in memory 28, such program modules 42 including, but not limited to, an operating system, one or more application programs, other program modules, and program data, each of which examples or some combination thereof may comprise an implementation of a network environment. Program modules 42 generally carry out the functions and/or methodologies of the described embodiments of the invention.
Computer device 12 may also communicate with one or more external devices 14 (e.g., keyboard, pointing device, display 24, etc.), with one or more devices that enable a user to interact with computer device 12, and/or with any devices (e.g., network card, modem, etc.) that enable computer device 12 to communicate with one or more other computing devices. Such communication may be through an input/output (I/O) interface 22. In the computer device 12 of the present embodiment, the display 24 is not provided as a separate body but is embedded in the mirror surface, and when the display surface of the display 24 is not displayed, the display surface of the display 24 and the mirror surface are visually integrated. Also, computer device 12 may communicate with one or more networks (e.g., a Local Area Network (LAN), a Wide Area Network (WAN), and/or a public network such as the Internet) through network adapter 20. As shown, the network adapter 20 communicates with the other modules of the computer device 12 over the bus 18. It should be understood that although not shown in the figures, other hardware and/or software modules may be used in conjunction with computer device 12, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAID systems, tape drives, and data backup storage systems, among others.
The processing unit 16 executes various functional applications and data processing by running programs stored in the system memory 28, for example, to implement the retrieval presentation method provided by the embodiment of the present invention: acquiring a keyword input by a user; matching the keywords with different text regions of the target document to generate display modes of the different text regions; and displaying the display mode.
Example eight
An eighth embodiment of the present invention provides a computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the retrieval and presentation method provided in all the embodiments of the present invention: acquiring a keyword input by a user; matching the keywords with different text regions of the target document to generate display modes of the different text regions; and displaying the display mode.
Any combination of one or more computer-readable media may be employed. The computer readable medium may be a computer readable signal medium or a computer readable storage medium. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples (a non-exhaustive list) of the computer readable storage medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.
A computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take any of a variety of forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may also be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device.
Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.
Computer program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, smalltalk, C + + or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any type of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet service provider).
It is to be noted that the foregoing is only illustrative of the preferred embodiments of the present invention and the technical principles employed. It will be understood by those skilled in the art that the present invention is not limited to the particular embodiments described herein, but is capable of various obvious changes, rearrangements and substitutions as will now become apparent to those skilled in the art without departing from the scope of the invention. Therefore, although the present invention has been described in greater detail by the above embodiments, the present invention is not limited to the above embodiments, and may include other equivalent embodiments without departing from the spirit of the present invention, and the scope of the present invention is determined by the scope of the appended claims.

Claims (9)

1. A retrieval display method is characterized by comprising the following steps:
acquiring a keyword input by a user;
matching the keywords with different text regions of a target document to generate a display mode of different text regions, wherein the target document is a structured document, and the different text regions of the target document are related to the type of the target document;
displaying the display mode;
the matching of the keywords with different text regions of the target document to generate display modes of different text regions comprises the following steps:
presetting area weights of different text regions of the target document;
determining the display area of the text region according to the product of the area weight of the text region and the length of the text content corresponding to the text region;
generating display modes of different text regions according to the display areas;
determining area weights according to different importance degrees of the text regions;
determining the display area of the text region according to the product of the area weight of the text region and the length of the text content corresponding to the text region comprises:
determining a second border size of a rectangular box of the text region according to the product of the area weight of the text region and the length of the text content corresponding to the text region;
and determining a rectangular area according to a first border size of the rectangular frame and a second border size of the rectangular frame, wherein the first border sizes of the rectangular frames of different text regions are the same.
2. The method of claim 1, wherein generating the presentation of the different text regions comprises:
determining a first number of hits of the keyword in the text region, wherein the first number of hits is the number of times of matching of the keyword with text content in the text region;
determining the matching weight of the text region according to the first number of hits;
confirming the display color of the corresponding text area in a first database according to the matching weight, wherein the first database is pre-established with the corresponding relation between the display color and the matching weight;
and generating display modes of different text areas according to the display colors.
3. The method of claim 1, wherein the text region comprises a plurality of different blocks, and wherein generating the presentation of the different text regions comprises:
determining a second number of hits of the keyword in different blocks of the text area, wherein the second number of hits is the number of times of matching of the keyword with text contents in different blocks of the text area;
determining thermodynamic weights of the different blocks according to the second number of hits;
confirming the color depth of the corresponding text area in a second database according to the thermal weight, wherein the second database is pre-established with the corresponding relation between the color depth and the thermal weight;
and generating display modes of different text areas according to the color depth.
4. The method according to claim 1, wherein the text area includes a plurality of different blocks, and after generating the display modes of the different text areas, the method further comprises:
determining a hit position of the keyword in the text region, wherein the hit position is a position when the keyword and the text content in the text region are successfully matched;
determining a hit block according to the hit position, wherein the hit position is within the range of the hit block;
highlighting the hit block or tagging the hit block with one of the keyword, an abbreviation of the keyword, and a substitute for the keyword.
5. The method according to claim 4, wherein the determining the hit position of the keyword in the text region comprises:
acquiring the line number of the keyword when the matching of the text area is successful as a first position parameter;
acquiring a line offset of the keyword when the text area is successfully matched as a second position parameter;
and determining the hit position according to the first position parameter and the second position parameter.
6. The method according to any one of claims 1 to 5, wherein the presentation means comprises: at least one of rectangular, circular, and elliptical.
7. A retrieval presentation device, comprising:
the acquisition module is used for acquiring keywords input by a user; the generating module is used for matching the keywords with different text regions of the target document so as to generate display modes of the different text regions;
the display module is used for displaying the display mode;
the target document is a structured document, and different text areas of the target document are related to the type of the target document;
the generation module comprises:
a setting unit, configured to preset area weights of different text regions of the target document;
the determining unit is used for determining the display area of the text region according to the product of the area weight of the text region and the length of the text content corresponding to the text region;
the graphic generation unit is used for generating display modes of different text areas according to the display areas;
determining area weights according to different importance degrees of the text regions;
determining the display area of the text region according to the product of the area weight of the text region and the length of the text content corresponding to the text region comprises:
determining a second border size of a rectangular box of the text region according to the product of the area weight of the text region and the length of the text content corresponding to the text region;
and determining a rectangular area according to a first border size of the rectangular frame and a second border size of the rectangular frame, wherein the first border sizes of the rectangular frames of different text regions are the same.
8. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the method according to any of claims 1-6 when executing the program.
9. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the method according to any one of claims 1-6.
CN201910180992.3A 2019-03-11 2019-03-11 Retrieval display method, device, equipment and storage medium Active CN109933702B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910180992.3A CN109933702B (en) 2019-03-11 2019-03-11 Retrieval display method, device, equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910180992.3A CN109933702B (en) 2019-03-11 2019-03-11 Retrieval display method, device, equipment and storage medium

Publications (2)

Publication Number Publication Date
CN109933702A CN109933702A (en) 2019-06-25
CN109933702B true CN109933702B (en) 2022-12-16

Family

ID=66986710

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910180992.3A Active CN109933702B (en) 2019-03-11 2019-03-11 Retrieval display method, device, equipment and storage medium

Country Status (1)

Country Link
CN (1) CN109933702B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111144210B (en) * 2019-11-26 2023-07-18 泰康保险集团股份有限公司 Image structuring processing method and device, storage medium and electronic equipment
CN112445830B (en) * 2020-11-26 2024-05-14 湖南智慧政务区块链科技有限公司 Data analysis system based on block chain technology
CN117573847B (en) * 2024-01-16 2024-05-07 浙江同花顺智能科技有限公司 Visualized answer generation method, device, equipment and storage medium

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8082240B2 (en) * 2007-02-28 2011-12-20 Classe Qsl, S.L. System for retrieving information units
CN101419661B (en) * 2007-10-26 2011-08-24 国际商业机器公司 Method for displaying image based on text in image and system
CN102902688B (en) * 2011-07-27 2016-08-10 汉王科技股份有限公司 Keyword lookup result presentation method and device
US8805842B2 (en) * 2012-03-30 2014-08-12 Her Majesty The Queen In Right Of Canada, As Represented By The Minister Of National Defence, Ottawa Method for displaying search results
CN107644082B (en) * 2013-03-07 2021-04-16 海信视像科技股份有限公司 Information searching and displaying method and device
CN103473282B (en) * 2013-08-29 2016-10-05 北京奇虎科技有限公司 A kind of apparatus and method generating the Hot Contents page
CN103631856B (en) * 2013-10-17 2017-01-11 四川大学 Subject visualization method for Chinese document set
CN105005562B (en) * 2014-04-15 2018-09-21 索意互动(北京)信息技术有限公司 The display processing method and device of retrieval result
CN104361042B (en) * 2014-10-29 2019-02-12 中国建设银行股份有限公司 A kind of information retrieval method and device
CN106503468A (en) * 2016-11-05 2017-03-15 深圳市前海安测信息技术有限公司 Electronic health record analysis system and method for the medical information based on coupling keyword
CN106844717A (en) * 2017-02-08 2017-06-13 北京小米移动软件有限公司 Webpage search display methods and device
CN108038096A (en) * 2017-11-10 2018-05-15 平安科技(深圳)有限公司 Knowledge database documents method for quickly retrieving, application server computer readable storage medium storing program for executing
CN108984685B (en) * 2018-06-29 2020-07-03 北京字节跳动网络技术有限公司 Method and device for automatically displaying document directory

Also Published As

Publication number Publication date
CN109933702A (en) 2019-06-25

Similar Documents

Publication Publication Date Title
US6389435B1 (en) Method and system for copying a freeform digital ink mark on an object to a related object
JP4746136B2 (en) Rank graph
US8364673B2 (en) System and method for dynamically and interactively searching media data
US8892554B2 (en) Automatic word-cloud generation
US9367523B2 (en) System and method for using design features to search for page layout designs
CN109933702B (en) Retrieval display method, device, equipment and storage medium
US8606789B2 (en) Method for layout based document zone querying
US20080021891A1 (en) Searching a document using relevance feedback
US20100325115A1 (en) Method and system for displaying and processing electronic file list
US20140379719A1 (en) System and method for tagging and searching documents
US20030236778A1 (en) Drawing search support apparatus and drawing search method
JP2008192055A (en) Content search method and content search apparatus
US20130055076A1 (en) Online Analysis and Display of Correlated Information
WO2014206151A1 (en) System and method for tagging and searching documents
EP2110758B1 (en) Searching method based on layout information
JP2007317034A (en) Image processing apparatus, image processing method, program, and recording medium
US10175850B2 (en) Search inquiry method using contextual annotation
JP5516918B2 (en) Image element search
US9552378B2 (en) Method and apparatus for saving search query as metadata with an image
CN111814481B (en) Shopping intention recognition method, device, terminal equipment and storage medium
CN105653571A (en) Bookmark storage and bookmark operation instruction responding method, and browser
US9208150B2 (en) Automatic association of informational entities
CN110162773A (en) Title estimator
KR20090037704A (en) Meta data generation method for intutive image search
JP6157965B2 (en) Electronic device, method, and program

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant