CN116627908A - Keyword searching method, device and equipment - Google Patents

Keyword searching method, device and equipment Download PDF

Info

Publication number
CN116627908A
CN116627908A CN202310378092.6A CN202310378092A CN116627908A CN 116627908 A CN116627908 A CN 116627908A CN 202310378092 A CN202310378092 A CN 202310378092A CN 116627908 A CN116627908 A CN 116627908A
Authority
CN
China
Prior art keywords
target
keyword
target object
image
page
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310378092.6A
Other languages
Chinese (zh)
Inventor
宋敏
方俊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fuxin Kunpeng Beijing Information Technology Co ltd
Original Assignee
Fuxin Kunpeng Beijing Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fuxin Kunpeng Beijing Information Technology Co ltd filed Critical Fuxin Kunpeng Beijing Information Technology Co ltd
Priority to CN202310378092.6A priority Critical patent/CN116627908A/en
Publication of CN116627908A publication Critical patent/CN116627908A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/14Details of searching files based on file metadata
    • G06F16/148File search processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/14Details of searching files based on file metadata
    • G06F16/156Query results presentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • G06V30/41Analysis of document content
    • G06V30/413Classification of content, e.g. text, photographs or tables
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Artificial Intelligence (AREA)
  • Multimedia (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a keyword searching method, a keyword searching device and keyword searching equipment, and relates to the technical field of format document processing, wherein the method comprises the following steps: responding to a search instruction aiming at a target keyword, performing image conversion processing on each page in the format document to obtain an image corresponding to each page; determining whether a target object comprising the target keyword exists in the image, wherein the target object comprises at least one of a text, a picture and a table; and displaying the target keyword and the type identifier corresponding to the target object under the condition that the target object exists. The keyword searching method, device and equipment provided by the invention are used for searching texts, pictures, tables and the like comprising target keywords in the format document.

Description

Keyword searching method, device and equipment
Technical Field
The present invention relates to the field of format document processing technologies, and in particular, to a keyword searching method, apparatus and device.
Background
An Open-layout Document (OFD) is a national layout Document format standard in China, and is an autonomous format belonging to China.
In the related art, a user cannot search for texts, pictures, tables and the like including certain keywords in the OFD reading process, so that the user experience of the OFD is poor.
Therefore, how to find text, pictures, tables and the like including certain keywords in the OFD becomes a technical problem to be solved.
Disclosure of Invention
The invention provides a keyword searching method, device and equipment, which are used for searching texts, pictures, tables and the like comprising certain keywords in an OFD.
In a first aspect, the present invention provides a keyword searching method, including:
responding to a search instruction aiming at a target keyword, performing image conversion processing on each page in the format document to obtain an image corresponding to each page;
determining whether a target object comprising the target keyword exists in the image, wherein the target object comprises at least one of a text, a picture and a table;
and displaying the target keyword and the type identifier corresponding to the target object under the condition that the target object exists.
According to the keyword searching method provided by the invention, the determining whether the target object including the target keyword exists in the image comprises the following steps:
performing optical character recognition on the image to obtain text information in the image;
determining whether the text information comprises the target keyword;
and if the text information comprises the target keyword, determining that the image comprises the target object.
According to the keyword searching method provided by the invention, the determining that the image includes the target object includes:
converting the page corresponding to the image into an extensible markup language file;
determining a target label corresponding to a position range comprising the position according to the position of the target keyword in the text information and the position range corresponding to each label in the extensible markup language file;
and determining the object included in the target label as the target object included in the image.
According to the keyword searching method provided by the invention, the type identifier corresponding to the target object is displayed, and the method comprises the following steps:
determining the object attribute of the target object in the target label as the target type of the target object;
and determining the identifier corresponding to the target type as the type identifier corresponding to the target object, and displaying the type identifier corresponding to the target object.
According to the keyword searching method provided by the invention, the method further comprises the following steps:
and responding to the touch instruction of the type identifier, and displaying the target object corresponding to the type identifier.
According to the keyword searching method provided by the invention, the method further comprises the following steps:
and displaying the page identification of the target page where the target keyword is located and the frequency of the target keyword in the target page.
According to the keyword searching method provided by the invention, the displaying of the target keyword and the type identifier corresponding to the target object includes:
displaying a search result list;
and displaying the target keyword and the type identifier corresponding to the target object in the search result list.
In a second aspect, the present invention provides a keyword search apparatus, including:
the response module is used for responding to the search instruction aiming at the target keyword, carrying out image conversion processing on each page in the format document and obtaining an image corresponding to each page;
a determining module, configured to determine whether a target object including the target keyword exists in the image, where the target object includes at least one of a text, a picture, and a table;
and the display module is used for displaying the target keyword and the type identifier corresponding to the target object under the condition that the target object exists.
According to the keyword searching device provided by the invention, the determining module is specifically used for:
performing optical character recognition on the image to obtain text information in the image;
determining whether the text information comprises the target keyword;
and if the text information comprises the target keyword, determining that the image comprises the target object.
According to the keyword searching device provided by the invention, the determining module is specifically used for:
converting the page corresponding to the image into an extensible markup language file;
determining a target label corresponding to a position range comprising the position according to the position of the target keyword in the text information and the position range corresponding to each label in the extensible markup language file;
and determining the object included in the target label as the target object included in the image.
According to the keyword searching device provided by the invention, the display module is specifically used for:
determining the object attribute of the target object in the target label as the target type of the target object;
and determining the identifier corresponding to the target type as the type identifier corresponding to the target object, and displaying the type identifier corresponding to the target object.
According to the keyword searching device provided by the invention, the display module is also used for:
and under the condition that the response module responds to the touch instruction of the type identifier, displaying the target object corresponding to the type identifier.
According to the keyword searching device provided by the invention, the display module is also used for:
and displaying the page identification of the target page where the target keyword is located and the frequency of the target keyword in the target page.
According to the keyword searching device provided by the invention, the display module is also used for:
displaying a search result list;
and displaying the target keyword and the type identifier corresponding to the target object in the search result list.
The invention also provides an electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the keyword search method as in any one of the first aspects when executing the program.
The present invention also provides a non-transitory computer readable storage medium having stored thereon a computer program which, when executed by a processor, implements the keyword search method of any one of the first aspects above.
The invention also provides a computer program product comprising a computer program which when executed by a processor implements the keyword search method of any one of the first aspects above.
The invention provides a keyword searching method, a keyword searching device and keyword searching equipment, wherein in the method, images corresponding to all pages are obtained through image conversion processing of all pages in a format document; determining whether a target object comprising the target keyword exists in the image, wherein the target object comprises at least one of a text, a picture and a table, displaying the target keyword and a type identifier corresponding to the target object when the target object exists, and searching the text, the picture, the table and the like comprising the target keyword in the format document, so that the technical problem of how to search the OFD for the text, the picture, the table and the like comprising certain keywords is solved.
Drawings
In order to more clearly illustrate the invention or the technical solutions of the prior art, the following description will briefly explain the drawings used in the embodiments or the description of the prior art, and it is obvious that the drawings in the following description are some embodiments of the invention, and other drawings can be obtained according to the drawings without inventive effort for a person skilled in the art.
FIG. 1 is a schematic flow chart of a keyword searching method provided by the invention;
FIG. 2 is a schematic diagram of a display interface according to the present invention;
FIG. 3 is a second schematic diagram of a display interface provided by the present invention;
FIG. 4 is a third schematic diagram of a display interface according to the present invention;
FIG. 5 is a schematic diagram of a keyword search device according to the present invention;
fig. 6 is a schematic diagram of the physical structure of the electronic device provided by the present invention.
Detailed Description
For the purpose of making the objects, technical solutions and advantages of the present invention more apparent, the technical solutions of the present invention will be clearly and completely described below with reference to the accompanying drawings, and it is apparent that the described embodiments are some embodiments of the present invention, not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
The keyword searching method provided by the invention is described below with reference to specific embodiments.
Fig. 1 is a schematic flow chart of a keyword searching method provided by the invention. As shown in fig. 1, the method includes:
and step 101, responding to a search instruction aiming at the target keyword, and performing image conversion processing on each page in the layout document to obtain an image corresponding to each page.
Optionally, the execution main body of the keyword searching method provided by the invention can be electronic equipment, and also can be a keyword searching device arranged in the electronic equipment, and the device is realized by combining software and/or hardware.
The electronic device may be, for example, a notebook, a desktop computer, or the like.
The layout Document may be, for example, an Open-layout Document (OFD), a portable file format (Portable Document Format, PDF) Document, or the like.
The lookup instruction may be an instruction generated by the electronic device based on a keyword lookup operation for the target keyword.
The target keyword may be, for example, any keyword that the user needs to find, such as "image", "OFDM", etc.
The keyword search operation may be a search operation input by a user in a display interface on which a layout document is displayed. The display interface is displayed on the electronic device.
Optionally, an OFD parser may be included in the electronic device, and the OFD parser may be used to parse the layout document into pages.
Step 102, determining whether a target object comprising a target keyword exists in the image, wherein the target object comprises at least one of a text, a picture and a table.
The above step 102 is performed for each image corresponding to each page.
The target object may also be a formula, a graph, or the like, for example, and is not limited herein, as long as the target keyword is included.
Alternatively, characters in the image may be identified to obtain text information, and in the case that the target keyword is included in the text information, it is determined that a target object including the target keyword exists in the image.
And step 103, displaying the target keyword and the type identifier corresponding to the target object under the condition that the target object exists.
Optionally, displaying the target keyword may include: displaying a preset number of characters comprising the target keyword.
For example, in the case where the text information includes "open document fod, OFD mobile reader supports viewing OFD/PDF document", if the target keyword is fod, the preset number is 10, then "fod mobile reader", or "open document fod, OFD mobile" may be displayed.
Alternatively, in the present invention, a target object may also be displayed.
Optionally, displaying the type identifier corresponding to the target object includes:
in the case that the target object is text, a type identifier corresponding to the text can be displayed;
in the case that the target object is a picture, a type identifier corresponding to the picture can be displayed;
in the case that the target object is a table, a type identifier corresponding to the table may be displayed.
Alternatively, in the case that the target object is text, the type identifier corresponding to the text may not be displayed.
In the keyword searching method provided in the embodiment of fig. 1, an image corresponding to each page is obtained by performing image conversion processing on each page in the layout document; determining whether a target object comprising the target keyword exists in the image, wherein the target object comprises at least one of a text, a picture and a table, displaying the target keyword and a type identifier corresponding to the target object when the target object exists, and searching the text, the picture, the table and the like comprising the target keyword in the format document, so that the technical problem of how to search the OFD for the text, the picture, the table and the like comprising certain keywords is solved.
In some embodiments, step 102 may specifically include:
performing optical character recognition (optical character recognition, OCR) on the image to obtain text information in the image;
determining whether the text information comprises a target keyword;
and if the text information comprises the target keyword, determining that the image comprises the target object.
Optionally, determining whether the text information includes the target keyword includes:
performing character matching processing on the target keywords and the text information;
and determining that the text information comprises the target keyword in the case that the character and the target keyword are matched to be the same.
In the invention, the image is subjected to OCR to obtain text information, and whether the image comprises the target object is determined by the text information, so that the search of non-text content comprising the target keyword in the format document (namely, the picture comprising the target keyword can be searched) can be effectively solved, and the search efficiency and the search accuracy are improved.
In some embodiments, determining that the target object is included in the image comprises:
converting the page corresponding to the image into an extensible markup language file (Extensible Markup Language, XML);
determining a target label corresponding to a position range comprising the position according to the position of the target keyword in the text information and the position range corresponding to each label in the extensible markup language file;
and determining the object included in the target label as the target object included in the image.
The position of the target keyword in the text information is the position of the target keyword in the page (corresponding to the image to which the text information corresponds).
For example, the code corresponding to a certain tag in the extensible markup language file is as follows:
<ofd:ImageObject ID="922"CTM="6.25937 0 0 5.16705 0 0"Boundary="138.77185 87.73356 6.25937 5.16706"ResourceID="923">
<ofd:Clips>
<ofd:Clip>
<ofd:Area>
<ofd:Path ID="924"Boundary="0.00001 -0.00002 1 1"Fill="true">
<ofd:AbbreviatedData>M-0 0L 1 0L 1 1L-0 1C</ofd:AbbreviatedData>
</ofd:Path>
</ofd:Area>
</ofd:Clip>
</ofd:Clips>。
the labels correspond to the position ranges 138.77185, 87.73356, 6.25937 and 5.16706. Wherein 138.77185 is the distance between the left boundary of the area occupied by the tag and the left boundary of the page, 87.73356 is the distance between the upper boundary of the area occupied by the tag and the upper boundary of the page, 6.25937 is the distance between the right boundary of the area occupied by the tag and the right boundary of the page, and 5.16706 is the distance between the lower boundary of the area occupied by the tag and the lower boundary of the page.
For example, when the position range corresponding to the tag is 138.77185, 87.73356, 6.25937, 5.16706, if the position of the target keyword in the text information is 140, 88, 7, 6, the position range corresponding to the tag includes the position of the target keyword in the text information, the tag is the target tag, and the object "922" included in the target tag is the target object (i.e., the picture of the mark 922).
In some embodiments, displaying the type identifier corresponding to the target object includes:
determining the object attribute of the target object in the target label as the target type of the target object;
and determining the identifier corresponding to the target type as the type identifier corresponding to the target object, and displaying the type identifier corresponding to the target object.
For example, if the object attribute of the target object in the target tag is "ImageObject", the target type is a picture type.
Optionally, the electronic device stores a correspondence between a plurality of types and identifiers corresponding to the plurality of types, and after determining the target type, the identifier corresponding to the target type may be searched for in the correspondence according to the target type, and the identifier corresponding to the target type is determined as the type identifier corresponding to the target object.
In some embodiments, displaying the target keyword and the type identifier corresponding to the target object includes:
displaying a search result list;
and displaying the target keywords and the type identifiers corresponding to the target objects in a search result list.
The list of search results is described below in connection with the display interface shown in fig. 2.
Fig. 2 is a schematic diagram of a display interface provided by the present invention. For example, as shown in fig. 2, the electronic device may display a display interface 20, where a layout document 201 is displayed in the display interface 20, and the display interface includes a search result list 202.
For example, in the case where the target keyword is "reader", the type identifier corresponding to the target keyword and the target object is displayed in the search result list 202.
For example, the type identifier 2021 is displayed in the case where the target object is text, the type identifier 2022 is displayed in the case where the target object is a picture, and the type identifier 2023 is displayed in the case where the target object is a table.
In some embodiments, the method provided by the invention further comprises:
and displaying the page identification of the target page where the target keyword is located and the frequency of the target keyword in the target page.
FIG. 3 is a second schematic diagram of the display interface provided by the present invention. On the basis of fig. 2, as shown in fig. 3, the page identifier of the target page where the target keyword is located and the number of times that the target keyword appears in the target page may also be displayed in the search result list 202.
For example, the page identification of the target page where the target keyword "reader" is located is "page 2", and the number of occurrences of the "reader" in the target page ("page 2") is "2"; the page identifier of the target page where the target keyword 'reader' is located is 'page 19', and the number of times of appearance of the 'reader' in the target page is '1'; the page identification of the target page where the target keyword "reader" is located is "page 23", and the number of occurrences of the "reader" in the target page is "1".
Optionally, the total number of times the target keyword "reader" appears in all pages "36" may also be in the search results list 202.
In some embodiments, the method provided by the invention further comprises: and responding to the touch instruction of the type identifier, and displaying the target object corresponding to the type identifier.
FIG. 4 is a third schematic diagram of the display interface provided by the present invention. On the basis of fig. 3, as shown in fig. 4, the display interface 21 is displayed in response to a touch instruction of the type identifier 2022.
The page 19 is displayed in the display interface 21, and the target object 210 corresponding to the type identifier 2022 is displayed.
In the invention, the touch control instruction of the type identifier is responded, the target object corresponding to the type identifier is displayed, and the query experience of the user can be provided.
The keyword searching device provided by the invention is described below, and the keyword searching device described below and the keyword searching method described above can be referred to correspondingly.
Fig. 5 is a schematic diagram of a keyword search device according to the present invention. As shown in fig. 5, the apparatus includes:
the response module 501 is configured to respond to a search instruction for a target keyword, and perform image conversion processing on each page in the layout document to obtain an image corresponding to each page;
a determining module 502, configured to determine whether a target object including the target keyword exists in the image, where the target object includes at least one of text, a picture, and a table;
and the display module 503 is configured to display the target keyword and a type identifier corresponding to the target object when it is determined that the target object exists.
The keyword searching device provided by the invention can execute the keyword searching method, and has the same beneficial effects as the keyword searching method, and the description is omitted here.
According to the keyword searching device provided by the invention, the determining module 502 is specifically configured to:
performing optical character recognition on the image to obtain text information in the image;
determining whether the text information comprises the target keyword;
and if the text information comprises the target keyword, determining that the image comprises the target object.
According to the keyword searching device provided by the invention, the determining module 502 is specifically configured to:
converting the page corresponding to the image into an extensible markup language file;
determining a target label corresponding to a position range comprising the position according to the position of the target keyword in the text information and the position range corresponding to each label in the extensible markup language file;
and determining the object included in the target label as the target object included in the image.
According to the keyword searching device provided by the invention, the display module 503 is specifically configured to:
determining the object attribute of the target object in the target label as the target type of the target object;
and determining the identifier corresponding to the target type as the type identifier corresponding to the target object, and displaying the type identifier corresponding to the target object.
According to the keyword searching device provided by the invention, the display module 503 is further configured to:
and displaying the target object corresponding to the type identifier under the condition that the response module 501 responds to the touch instruction of the type identifier.
According to the keyword searching device provided by the invention, the display module 503 is further configured to:
and displaying the page identification of the target page where the target keyword is located and the frequency of the target keyword in the target page.
According to the keyword searching device provided by the invention, the display module 503 is further configured to:
displaying a search result list;
and displaying the target keyword and the type identifier corresponding to the target object in the search result list.
Fig. 6 is a schematic diagram of the physical structure of the electronic device provided by the present invention. As shown in fig. 6, the electronic device may include: processor 610, communication interface (Communications Interface) 620, memory 630, and communication bus 640, wherein processor 610, communication interface 620, and memory 630 communicate with each other via communication bus 640. The processor 610 may invoke logic instructions in the memory 630 to perform a key lookup method comprising: responding to a search instruction aiming at a target keyword, performing image conversion processing on each page in the format document to obtain an image corresponding to each page; determining whether a target object comprising the target keyword exists in the image, wherein the target object comprises at least one of a text, a picture and a table; and displaying the target keyword and the type identifier corresponding to the target object under the condition that the target object exists.
Further, the logic instructions in the memory 630 may be implemented in the form of software functional units and stored in a computer-readable storage medium when sold or used as a stand-alone product. Based on this understanding, the technical solution of the present invention may be embodied essentially or in a part contributing to the prior art or in a part of the technical solution, in the form of a software product stored in a storage medium, comprising several instructions for causing a computer device (which may be a personal computer, a server, a network device, etc.) to perform all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a random access Memory (RAM, random Access Memory), a magnetic disk, or an optical disk, or other various media capable of storing program codes.
In another aspect, the present invention also provides a computer program product, the computer program product including a computer program, the computer program being storable on a non-transitory computer readable storage medium, the computer program, when executed by a processor, being capable of performing the keyword search method provided by the above methods, the method comprising: responding to a search instruction aiming at a target keyword, performing image conversion processing on each page in the format document to obtain an image corresponding to each page; determining whether a target object comprising the target keyword exists in the image, wherein the target object comprises at least one of a text, a picture and a table; and displaying the target keyword and the type identifier corresponding to the target object under the condition that the target object exists.
In yet another aspect, the present invention also provides a non-transitory computer readable storage medium having stored thereon a computer program which, when executed by a processor, is implemented to perform the keyword search method provided by the above methods, the method comprising: responding to a search instruction aiming at a target keyword, performing image conversion processing on each page in the format document to obtain an image corresponding to each page; determining whether a target object comprising the target keyword exists in the image, wherein the target object comprises at least one of a text, a picture and a table; and displaying the target keyword and the type identifier corresponding to the target object under the condition that the target object exists.
The apparatus embodiments described above are merely illustrative, wherein the elements illustrated as separate elements may or may not be physically separate, and the elements shown as elements may or may not be physical elements, may be located in one place, or may be distributed over a plurality of network elements. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of this embodiment. Those of ordinary skill in the art will understand and implement the present invention without undue burden.
From the above description of the embodiments, it will be apparent to those skilled in the art that the embodiments may be implemented by means of software plus necessary general hardware platforms, or of course may be implemented by means of hardware. Based on this understanding, the foregoing technical solution may be embodied essentially or in a part contributing to the prior art in the form of a software product, which may be stored in a computer readable storage medium, such as ROM/RAM, a magnetic disk, an optical disk, etc., including several instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the method described in the respective embodiments or some parts of the embodiments.
Finally, it should be noted that: the above embodiments are only for illustrating the technical solution of the present invention, and are not limiting; although the invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical scheme described in the foregoing embodiments can be modified or some technical features thereof can be replaced by equivalents; such modifications and substitutions do not depart from the spirit and scope of the technical solutions of the embodiments of the present invention.

Claims (10)

1. A keyword search method, comprising:
responding to a search instruction aiming at a target keyword, performing image conversion processing on each page in the format document to obtain an image corresponding to each page;
determining whether a target object comprising the target keyword exists in the image, wherein the target object comprises at least one of a text, a picture and a table;
and displaying the target keyword and the type identifier corresponding to the target object under the condition that the target object exists.
2. The keyword search method of claim 1, wherein the determining whether a target object including the target keyword exists in the image comprises:
performing optical character recognition on the image to obtain text information in the image;
determining whether the text information comprises the target keyword;
and if the text information comprises the target keyword, determining that the image comprises the target object.
3. The keyword search method of claim 2, wherein the determining that the image includes the target object includes:
converting the page corresponding to the image into an extensible markup language file;
determining a target label corresponding to a position range comprising the position according to the position of the target keyword in the text information and the position range corresponding to each label in the extensible markup language file;
and determining the object included in the target label as the target object included in the image.
4. The keyword searching method according to claim 3, wherein displaying the type identifier corresponding to the target object includes:
determining the object attribute of the target object in the target label as the target type of the target object;
and determining the identifier corresponding to the target type as the type identifier corresponding to the target object, and displaying the type identifier corresponding to the target object.
5. The keyword search method of any one of claims 1 to 4, further comprising:
and responding to the touch instruction of the type identifier, and displaying the target object corresponding to the type identifier.
6. The keyword search method of any one of claims 1 to 4, further comprising:
and displaying the page identification of the target page where the target keyword is located and the frequency of the target keyword in the target page.
7. The keyword searching method according to any one of claims 1 to 4, wherein the displaying the target keyword and the type identifier corresponding to the target object includes:
displaying a search result list;
and displaying the target keyword and the type identifier corresponding to the target object in the search result list.
8. A keyword search apparatus, comprising:
the response module is used for responding to the search instruction aiming at the target keyword, carrying out image conversion processing on each page in the format document and obtaining an image corresponding to each page;
a determining module, configured to determine whether a target object including the target keyword exists in the image, where the target object includes at least one of a text, a picture, and a table;
and the display module is used for displaying the target keyword and the type identifier corresponding to the target object under the condition that the target object exists.
9. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor implements the key word searching method of any of claims 1 to 7 when the program is executed by the processor.
10. A non-transitory computer readable storage medium having stored thereon a computer program, wherein the computer program when executed by a processor implements the keyword search method of any one of claims 1 to 7.
CN202310378092.6A 2023-04-11 2023-04-11 Keyword searching method, device and equipment Pending CN116627908A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310378092.6A CN116627908A (en) 2023-04-11 2023-04-11 Keyword searching method, device and equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310378092.6A CN116627908A (en) 2023-04-11 2023-04-11 Keyword searching method, device and equipment

Publications (1)

Publication Number Publication Date
CN116627908A true CN116627908A (en) 2023-08-22

Family

ID=87590935

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310378092.6A Pending CN116627908A (en) 2023-04-11 2023-04-11 Keyword searching method, device and equipment

Country Status (1)

Country Link
CN (1) CN116627908A (en)

Similar Documents

Publication Publication Date Title
CN110362370B (en) Webpage language switching method and device and terminal equipment
US20090125529A1 (en) Extracting information based on document structure and characteristics of attributes
CN108108342B (en) Structured text generation method, search method and device
US8868556B2 (en) Method and device for tagging a document
CN111459977B (en) Conversion of natural language queries
KR20130142121A (en) Multi-modal approach to search query input
US9514113B1 (en) Methods for automatic footnote generation
US20140012841A1 (en) Weight-based stemming for improving search quality
US20110258202A1 (en) Concept extraction using title and emphasized text
WO2020056977A1 (en) Knowledge point pushing method and device, and computer readable storage medium
CN105843800A (en) DOI-based language information display method and device
CN111737443B (en) Answer text processing method and device and key text determining method
Ugale et al. Document management system: A notion towards paperless office
US20150106701A1 (en) Input support method and information processing system
CN112416142A (en) Method and device for inputting characters and electronic equipment
CN110489032B (en) Dictionary query method for electronic book and electronic equipment
CN114676133A (en) Index creating method, device, equipment and storage medium
CN109783612B (en) Report data positioning method and device, storage medium and terminal
CN113779364A (en) Searching method based on label extraction and related equipment thereof
CN112527954A (en) Unstructured data full-text search method and system and computer equipment
US20230050371A1 (en) Method and device for personalized search of visual media
CN116627908A (en) Keyword searching method, device and equipment
CN115687663A (en) Video retrieval and marking method, system and storage medium based on full text search
CN110647666B (en) Intelligent matching method and device for templates and formulas and computer readable storage medium
CN114330240A (en) PDF document analysis method and device, computer equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination