CN111597107A - Information output method and device and electronic equipment - Google Patents

Info

Publication number
CN111597107A
Authority
CN
China
Prior art keywords
identification character
character string
identification
file
meet
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010321640.8A
Other languages
Chinese (zh)
Other versions
CN111597107B (en)
Inventor
张健
刘晨阳
孙圣达
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing ByteDance Network Technology Co Ltd
Original Assignee
Beijing ByteDance Network Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing ByteDance Network Technology Co Ltd
Priority to CN202010321640.8A
Publication of CN111597107A
Application granted
Publication of CN111597107B
Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/36 Preventing errors by testing or debugging software
    • G06F11/3604 Software analysis for verifying properties of programs
    • G06F11/3608 Software analysis for verifying properties of programs using formal methods, e.g. model checking, abstract interpretation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/903 Querying
    • G06F16/90335 Query processing
    • G06F16/90344 Query processing by using string matching techniques
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/958 Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00 Energy efficient computing, e.g. low power processors, power management or thermal management

Abstract

The embodiments of the present disclosure disclose an information output method, an information output device and electronic equipment. One embodiment of the method comprises: acquiring a first identification character string set, wherein an identification character string is used for identifying description information to be displayed in a webpage; extracting a plurality of identification character strings from a code file of a predefined webpage to obtain a second identification character string set; determining, based on a first identification character string in the first identification character string set, a second identification character string in the second identification character string set that does not meet a first preset requirement; and outputting the second identification character strings that are contained in the second identification character string set and do not meet the first preset requirement. This embodiment improves the accuracy of determining, from the code file of the predefined webpage, the second identification character strings that do not meet the first preset requirement.

Description

Information output method and device and electronic equipment
Technical Field
The embodiment of the disclosure relates to the technical field of computers, in particular to an information output method, an information output device and electronic equipment.
Background
With the continuous development of computer technology, various software products emerge. It should be understood that the software product may be an application that provides various functions (e.g., shopping, chat, etc.).
Before a software product goes online, the information to be displayed in a webpage needs to be prepared in advance, and it is necessary to check whether the identification information of that information in the code file of the webpage meets requirements. It is understood that the information to be displayed is identified in the code file by the identification information.
To detect whether the identification information contained in the code file meets the requirements, a related approach extracts a plurality of pieces of identification information from the code file and then directly detects which of the extracted pieces do not meet the requirements. Detecting the extracted identification information directly may result in lower detection accuracy for the identification information contained in the code file.
Disclosure of Invention
This disclosure is provided to introduce concepts in a simplified form that are further described below in the detailed description. This disclosure is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used to limit the scope of the claimed subject matter.
The embodiment of the disclosure provides an information output method, an information output device and electronic equipment, and improves the accuracy of determining a second identification character string which does not meet a first preset requirement from a code file of a predefined webpage.
In a first aspect, an embodiment of the present disclosure provides an information output method, including: acquiring a first identification character string set, wherein the identification character string is used for identifying description information to be displayed in a webpage; extracting a plurality of identification character strings from a code file of a predefined webpage to obtain a second identification character string set; determining a second identification character string which does not meet a first preset requirement in a second identification character string set based on a first identification character string in the first identification character string set; and outputting the second identification character strings which are contained in the second identification character string set and do not meet the first preset requirement.
In a second aspect, an embodiment of the present disclosure provides an information output apparatus, including: an acquisition unit, configured to acquire a first identification character string set, wherein an identification character string is used for identifying description information to be displayed in a webpage; an extraction unit, configured to extract a plurality of identification character strings from a code file of a predefined webpage to obtain a second identification character string set; a determining unit, configured to determine, based on a first identification character string in the first identification character string set, a second identification character string in the second identification character string set that does not meet a first preset requirement; and an output unit, configured to output the second identification character strings which are contained in the second identification character string set and do not meet the first preset requirement.
In a third aspect, an embodiment of the present disclosure provides an electronic device, including: one or more processors; a storage device for storing one or more programs which, when executed by the one or more processors, cause the one or more processors to implement the information output method according to the first aspect.
In a fourth aspect, an embodiment of the present disclosure provides a computer-readable medium on which a computer program is stored, which when executed by a processor, implements the steps of the information output method according to the first aspect.
According to the information output method, the information output device and the electronic equipment, the first identification character string set is obtained firstly, the identification character strings are extracted from the code file of the predefined webpage to obtain the second identification character string set, then the identification character strings which do not meet the first preset requirement in the second identification character string set are determined based on the identification character strings in the first identification character string set, and the identification character strings which do not meet the first preset requirement and are contained in the second identification character string set are further output. On the basis of the first identification character string contained in the first identification character string set, the second identification character string which does not accord with the first preset requirement in the second identification character string set is determined, so that the accuracy of determining the second identification character string which does not accord with the first preset requirement from the second identification character string set is improved. Further, the accuracy of determining the second identification character string which does not meet the first preset requirement from the code file of the predefined webpage is improved.
Drawings
The above and other features, advantages and aspects of various embodiments of the present disclosure will become more apparent by referring to the following detailed description when taken in conjunction with the accompanying drawings. Throughout the drawings, the same or similar reference numbers refer to the same or similar elements. It should be understood that the drawings are schematic and that elements and features are not necessarily drawn to scale.
FIG. 1 is a flow diagram of some embodiments of an information output method according to the present disclosure;
FIG. 2 is a schematic diagram of one application scenario of an information output method according to the present disclosure;
FIG. 3 is a flow chart of still further embodiments of information output methods according to the present disclosure;
FIG. 4 is a schematic block diagram of some embodiments of an information output device according to the present disclosure;
FIG. 5 is an exemplary system architecture to which the information output methods of some embodiments of the present disclosure may be applied;
FIG. 6 is a schematic diagram of a basic structure of an electronic device provided in accordance with some embodiments of the present disclosure.
Detailed Description
Embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While certain embodiments of the present disclosure are shown in the drawings, it is to be understood that the present disclosure may be embodied in various forms and should not be construed as limited to the embodiments set forth herein, but rather are provided for a more thorough and complete understanding of the present disclosure. It should be understood that the drawings and embodiments of the disclosure are for illustration purposes only and are not intended to limit the scope of the disclosure.
It should be understood that the various steps recited in the method embodiments of the present disclosure may be performed in a different order, and/or performed in parallel. Moreover, method embodiments may include additional steps and/or omit performing the illustrated steps. The scope of the present disclosure is not limited in this respect.
The term "include" and variations thereof as used herein are open-ended, i.e., "including but not limited to". The term "based on" is "based, at least in part, on". The term "one embodiment" means "at least one embodiment"; the term "another embodiment" means "at least one additional embodiment"; the term "some embodiments" means "at least some embodiments". Relevant definitions for other terms will be given in the following description.
It should be noted that the terms "first", "second", and the like in the present disclosure are only used for distinguishing different devices, modules or units, and are not used for limiting the order or interdependence relationship of the functions performed by the devices, modules or units.
It is noted that the modifiers "a", "an", and "the" in this disclosure are intended to be illustrative rather than limiting, and those skilled in the art will understand that they should be read as "one or more" unless the context clearly dictates otherwise.
The names of messages or information exchanged between devices in the embodiments of the present disclosure are for illustrative purposes only, and are not intended to limit the scope of the messages or information.
Referring to fig. 1, a flow of some embodiments of an information output method according to the present disclosure is shown. As shown in fig. 1, the information output method includes the following steps:
Step 101, acquiring a first identification character string set.
In the present embodiment, an execution subject of the information output method (e.g., terminal apparatuses 501, 502 shown in fig. 5) may acquire the first identification string set.
The identification character string is used for identifying the description information to be displayed in the webpage. In practice, an identification string may uniquely identify a piece of descriptive information.
The descriptive information may be information described in one or more languages. In practice, the description information may describe information conveyed to the user by the web page.
In some scenarios, the executing agent of the information output method may obtain the first identification character string set from local storage or from a communicatively connected server (e.g., server 504 shown in fig. 5).
Step 102, extracting a plurality of identification character strings from a code file of a predefined webpage to obtain a second identification character string set.
In this embodiment, the execution subject of the information output method may extract a plurality of identification character strings from a code file of a predefined web page, to obtain a second identification character string set.
The predefined web page is a web page specified in advance. Accordingly, the code file of the predefined web page may be one or more code files that generate the predefined web page.
It is understood that the set of the plurality of identification strings extracted from the code file of the predefined web page is the second set of identification strings.
In some scenarios, the executing agent of the information output method may extract a plurality of identification character strings from a code file of a predefined web page through a pre-trained machine learning model, to obtain a second identification character string set.
The execution body of the information output method may execute step 101 and step 102 in parallel, or may execute them one after the other.
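As a rough illustration of running step 101 and step 102 in parallel, the following Python sketch submits both steps to a thread pool. The functions `load_first_set` and `extract_second_set` are hypothetical stand-ins with hard-coded sample data; the source does not prescribe any particular concurrency mechanism.

```python
from concurrent.futures import ThreadPoolExecutor


def load_first_set():
    # Hypothetical stand-in for step 101 (e.g., reading from local storage or a server).
    return {"home.title", "cart.checkout"}


def extract_second_set():
    # Hypothetical stand-in for step 102 (extracting strings from the code file).
    return {"home.title", "profile.avatar"}


def run_steps_in_parallel():
    """Run both steps concurrently and return (first_set, second_set)."""
    with ThreadPoolExecutor(max_workers=2) as pool:
        first_future = pool.submit(load_first_set)
        second_future = pool.submit(extract_second_set)
        return first_future.result(), second_future.result()


first_set, second_set = run_steps_in_parallel()
print(sorted(first_set), sorted(second_set))
```

Because the two steps do not depend on each other, either ordering or parallel execution yields the same pair of sets.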
Step 103, determining a second identification character string which does not meet the first preset requirement in the second identification character string set based on the first identification character string in the first identification character string set.
In this embodiment, after obtaining the first identification character string set and the second identification character string set, the execution main body of the information output method may determine, based on the first identification character string included in the first identification character string set, the second identification character string in the second identification character string set that does not meet the first preset requirement.
The first preset requirement may be a preset requirement for a second identification string included in the second identification string set.
In some scenarios, the execution subject of the information output method may perform the following steps for each of the second identification strings in the second identification string set. Specifically, the executing entity of the information output method may determine a similarity between the second identification character string and each of the first identification character strings in the first identification character string set, and in response to the determined similarity being less than a preset similarity threshold, the executing entity of the information output method may determine that the second identification character string does not meet the first preset requirement.
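A minimal Python sketch of the similarity-based check described above. The source does not say which similarity measure or threshold is used, so `difflib.SequenceMatcher` and the value 0.8 are assumptions. The sketch also assumes that a second identification character string is flagged only when its similarity to every first identification character string is below the threshold. The function names and sample keys are hypothetical.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Return a score in [0, 1]; difflib's ratio is an assumed metric."""
    return SequenceMatcher(None, a, b).ratio()


def find_nonconforming(first_set, second_set, threshold=0.8):
    """Return second strings whose best similarity to any first string is below threshold."""
    flagged = []
    for candidate in sorted(second_set):
        # Best match against the reference (first) set; 0.0 if that set is empty.
        best = max((similarity(candidate, ref) for ref in first_set), default=0.0)
        if best < threshold:
            flagged.append(candidate)
    return flagged


first_ids = {"home.title", "home.footer", "cart.checkout"}
second_ids = {"home.title", "profile.avatar"}
print(find_nonconforming(first_ids, second_ids))
```

Under this reading, a near-miss such as a one-letter typo can still score above the threshold, so the exact-match variant described later in the document is stricter.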
Step 104, outputting the second identification character strings which are contained in the second identification character string set and do not meet the first preset requirement.
In this embodiment, after determining the second identification character string included in the second identification character string set and not meeting the first preset requirement, the execution main body of the information output method may output the second identification character string included in the second identification character string set and not meeting the first preset requirement.
In some scenarios, the executing body of the information output method may output, to the first list, a second identification string of the second identification string set that does not meet the first preset requirement. The executing body of the information output method can also output a second identification character string which meets the first preset requirement in the second identification character string set to a second list. Here, the first list and the second list are not the same.
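The two-list output can be sketched in Python as a simple partition. The function name and the sorted ordering are illustrative assumptions; the source only requires that the two lists be distinct.

```python
def partition_by_requirement(second_set, nonconforming):
    """Split second_set into (first_list, second_list).

    first_list holds strings that do not meet the first preset requirement,
    second_list holds the strings that do.
    """
    bad = set(nonconforming)
    first_list = [s for s in sorted(second_set) if s in bad]
    second_list = [s for s in sorted(second_set) if s not in bad]
    return first_list, second_list


failing, passing = partition_by_requirement(
    {"home.title", "profile.avatar"}, ["profile.avatar"]
)
print(failing, passing)
```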
Referring to fig. 2, an application scenario of an information output method according to an embodiment of the present disclosure is shown. As shown in fig. 2, first, the terminal device 201 may acquire a first identification string set 202 from a communicatively connected server (not shown in the figure). Then, the terminal device 201 may extract a plurality of identification strings from the code file of the predefined web page, resulting in a second set of identification strings 203. Then, the terminal device 201 may determine, based on the first identification string in the first identification string set 202, the second identification string 2031 in the second identification string set 203 that does not meet the first preset requirement. Further, the terminal device 201 may output the second identification string 2031.
In this embodiment, after a plurality of identification character strings are extracted from a code file of a predefined webpage to obtain a second identification character string set, a second identification character string which does not meet a first preset requirement in the second identification character string set is determined based on a first identification character string included in the first identification character string set. On the basis of the first identification character string contained in the first identification character string set, the second identification character string contained in the second identification character string set and not meeting the first preset requirement is determined, so that the accuracy of determining the second identification character string not meeting the first preset requirement from the second identification character string set is improved to a certain extent. That is, the accuracy of determining the second identification character string that does not meet the first preset requirement from the code file of the predefined web page is improved.
In some optional implementations, the code file of the predefined webpage contains a start extraction identifier and an end extraction identifier for the identification character string.
It is understood that the start extraction identifier may be an identifier marking where extraction of the identification character string starts. The end extraction identifier may be an identifier marking where extraction of the identification character string ends. For example, the start extraction identifier may be "intl (", and the end extraction identifier may be ")".
At this time, the execution subject of the information output method may execute step 102 in the manner shown in step L1 through step L3.
Step L1, extract information between the start extraction identifier and the corresponding end extraction identifier from the code file of the predefined web page.
In some scenarios, the execution subject of the information output method may identify, from a code file of a predefined web page, a start extraction identifier for extracting the identification character string and an end extraction identifier corresponding to the start extraction identifier. Further, the execution body of the information output method may extract information located between the start extraction identifier and the end extraction identifier.
It should be noted that the end extraction identifier corresponding to a start extraction identifier may be the first end extraction identifier recognized after it, according to the execution order of the code.
Step L2, determining an identification character string based on the extracted information.
In some scenarios, the executing body of the information output method may determine the extracted information as an identification string.
Step L3, obtaining a second identification character string set based on the determined plurality of identification character strings.
In some scenarios, the execution subject of the information output method may take a set of the determined plurality of identification strings as the second identification string set.
Therefore, the extraction of the identification character string from the code file of the predefined webpage is realized by setting the extraction identification in the code file of the predefined webpage.
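Steps L1 through L3 can be sketched in Python as a plain text scan. This is an illustrative sketch under several assumptions: the start identifier is written `intl(` without a space, the "first end extraction identifier recognized" is approximated by the first `)` that follows in the text, and the function name and sample source are hypothetical.

```python
def extract_between(code: str, start: str = "intl(", end: str = ")") -> list[str]:
    """Return the text between each start identifier and the first end identifier after it."""
    extracted = []
    pos = 0
    while True:
        begin = code.find(start, pos)
        if begin == -1:
            break
        content_start = begin + len(start)
        finish = code.find(end, content_start)
        if finish == -1:
            break  # Unmatched start identifier: nothing more to extract.
        extracted.append(code[content_start:finish])
        pos = finish + len(end)
    return extracted


source = "const title = intl('home.title');\nconst label = intl('cart.checkout');"
# Step L1: extract the raw information; steps L2/L3: treat each piece as an
# identification string and collect the pieces into the second set.
second_set = set(extract_between(source))
print(sorted(second_set))
```

In this basic variant the extracted information still includes the surrounding quote characters, which is what the reference identifiers described next are meant to handle.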
In some optional implementations, the code file of the predefined web page includes a start reference identifier and an end reference identifier that reference the identification string.
The start reference identifier may be an identifier marking the start of a referenced identification character string. Accordingly, the end reference identifier may be an identifier marking the end of the referenced identification character string. For example, the start reference identifier may be "'", and the end reference identifier may be "'".
Here, the identification character string referenced by the start reference identifier and the end reference identifier is located between the start extraction identifier and the end extraction identifier. For example, the format for referencing an identification character string key in the code file of the predefined web page may be intl ('key').
Here, the execution subject of the information output method may execute step L2 in the following manner.
Specifically, in response to determining that the start reference identifier and the end reference identifier are included in the extraction information, a character string referenced by the start reference identifier and the end reference identifier included in the extraction information is taken as an identification character string.
Therefore, when the identification character string is referenced by the starting reference identifier and the ending reference identifier in the code file of the predefined webpage, after the extraction information between the starting extraction identifier and the ending extraction identifier is extracted, the integrity of the extracted identification character string can be ensured by determining that the starting reference identifier and the ending reference identifier are contained in the extraction information.
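The check for reference identifiers can be sketched in Python as follows. The sketch assumes single quotes as both reference identifiers, as in the example above, and returns `None` when either one is missing; the function name is hypothetical.

```python
def key_from_extraction(info: str, start_ref: str = "'", end_ref: str = "'"):
    """Return the string enclosed by the reference identifiers, or None if either is missing."""
    begin = info.find(start_ref)
    if begin == -1:
        return None
    content_start = begin + len(start_ref)
    finish = info.find(end_ref, content_start)
    if finish == -1:
        return None  # Start reference identifier found, but no matching end.
    return info[content_start:finish]


print(key_from_extraction("'home.title'"))
print(key_from_extraction("titleKey"))
```

Extracted information without both reference identifiers (for example a variable name passed instead of a literal) is skipped rather than treated as an identification character string.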
In some alternative implementations, the execution subject of the information output method may execute step 103 as follows.
First, determining whether each second identification character string in the second identification character string set exists in the first identification character string set.
In some scenarios, the execution subject of the information output method may perform the following steps for each of the second identification strings in the second identification string set. Specifically, the execution subject of the information output method may determine whether the second identification string exists in the first identification string set.
Second, taking each second identification character string that does not exist in the first identification character string set as a second identification character string that does not meet the first preset requirement.
That is to say, the second identification character strings in the second identification character string set that are not present in the first identification character string set are the second identification character strings that do not meet the first preset requirement.
Therefore, the second identification character strings which do not meet the first preset requirement are determined by screening out those second identification character strings in the second identification character string set that are not in the first identification character string set.
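This membership test amounts to a set difference. A minimal Python sketch, with a hypothetical function name and sample keys:

```python
def missing_from_first(first_set, second_set):
    """Return the second strings that are absent from the first set, in sorted order."""
    return sorted(s for s in second_set if s not in first_set)


first_ids = {"home.title", "cart.checkout"}
second_ids = {"home.title", "home.fotter"}
print(missing_from_first(first_ids, second_ids))
```

Unlike a similarity threshold, exact membership also flags keys that differ from a known key by a single character.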
Continuing to refer to fig. 3, a flow chart of still further embodiments of information output methods according to the present disclosure is shown. As shown in fig. 3, the information output method includes the following steps:
Step 301, obtaining a target spreadsheet file.
In the present embodiment, the execution subject of the information output method (e.g., terminal apparatuses 501, 502 shown in fig. 5) may acquire the target spreadsheet file.
The description information includes first language description information and second language description information. It is to be understood that the first language description information may be information described by the first language. The second language description information may be information described by a second language.
Here, the first language is different from the second language. For example, the first language may be Chinese and the second language may be English. In practice, the first language description information and the second language description information contained in the same description information have the same meaning. For example, the first language is Chinese, and accordingly, the first language description information is "tail"; the second language is English, and accordingly, the second language description information is "Bottom".
The target spreadsheet file may be a spreadsheet file containing one or more spreadsheets. Also, the spreadsheet contained in the target spreadsheet file may contain a plurality of rows and a plurality of columns. For example, the target spreadsheet file may be an electronic file in the ".xls" format, the ".et" format, and the like. In some scenarios, the target spreadsheet file may be a spreadsheet file maintained by one or more users for a certain length of time. In this case, the first identification string extracted from the target spreadsheet file has higher reliability.
A plurality of presentation data is stored in the target spreadsheet file. Here, the presentation data may be data used to implement presentation of the description information in the web page. The presentation data includes an identification character string, first language description information, and second language description information. It is understood that the identification character string included in the presentation data is used to identify the first language description information and the second language description information included in the presentation data.
The identification string, the first language description information, and the second language description information are distributed in columns in the target spreadsheet file. It will be understood that the target spreadsheet file includes a column for storing the identification string, a column for storing the first language description information, and a column for storing the second language description information. In practice, the identification character string, the first language description information and the second language description information contained in the same display data are located in the same row in the target spreadsheet file.
In some scenarios, the executing agent of the information output method may determine a storage address of the target spreadsheet file in the server in advance, and then obtain the target spreadsheet file from the storage address.
Step 302, obtaining a plurality of identification character strings from the column in which the identification character strings are located in the target spreadsheet file.
In this embodiment, after obtaining the target spreadsheet file, the executing entity of the information output method may obtain the stored identification strings from the column of the identification string in the target spreadsheet file.
Step 303, obtaining a first identification character string set based on the obtained plurality of identification character strings.
In this embodiment, after obtaining the plurality of identification character strings from the target spreadsheet file, the executing body of the information output method may obtain the first identification character string set based on the obtained plurality of identification character strings.
In some scenarios, the execution subject of the information output method may take a set formed by the acquired plurality of identification character strings as the first identification character string set.
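Steps 302 and 303 can be sketched in Python as reading one column into a set. As an assumption for illustration, the spreadsheet is represented here by CSV text read with the standard `csv` module rather than an actual ".xls" file; the column names `key`, `zh` and `en` and the sample rows are hypothetical.

```python
import csv
import io

# Hypothetical sheet: one column each for the identification string,
# the first language description and the second language description.
SHEET = """key,zh,en
home.title,首页,Home
cart.checkout,结算,Checkout
"""


def first_identification_set(sheet_text: str, id_column: str = "key") -> set[str]:
    """Collect the non-empty values of the identification-string column into a set."""
    reader = csv.DictReader(io.StringIO(sheet_text))
    result = set()
    for row in reader:
        value = (row.get(id_column) or "").strip()
        if value:
            result.add(value)
    return result


print(sorted(first_identification_set(SHEET)))
```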
Step 304, extracting a plurality of identification character strings from the code file of the predefined webpage to obtain a second identification character string set.
Step 305, determining a second identification character string which does not meet the first preset requirement in the second identification character string set based on the first identification character string in the first identification character string set.
Step 306, outputting the second identification character string which is included in the second identification character string set and does not meet the first preset requirement.
Step 304, step 305, and step 306 may be performed in a similar manner as step 102, step 103, and step 104 in the embodiment shown in fig. 1, and the above description for step 102, step 103, and step 104 also applies to step 304, step 305, and step 306, and is not repeated here.
In this embodiment, a plurality of identification character strings are obtained from the column in which the identification character strings are located in the target spreadsheet file, and the first identification character string set is obtained from them, so that the first identification character strings are extracted from the target spreadsheet file. Because the identification character strings, the first language description information, and the second language description information of the presentation data are arranged in columns in the target spreadsheet file, the maintainability of the presentation data can be improved.
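Steps 302 and 303 can be illustrated with a minimal sketch. It assumes, purely for illustration, that the target spreadsheet file has been exported as CSV text and that its columns are headed `key`, `lang1`, and `lang2`; these names and the function `read_identification_strings` are hypothetical and do not appear in the disclosure.

```python
import csv
import io


def read_identification_strings(csv_text, id_column="key"):
    """Collect the non-empty values found in the identification-string column."""
    reader = csv.DictReader(io.StringIO(csv_text))
    first_set = set()
    for row in reader:
        value = (row.get(id_column) or "").strip()
        if value:  # rows without an identification string are skipped here
            first_set.add(value)
    return first_set


# Hypothetical sheet content: identification string, first language, second language.
SAMPLE = "key,lang1,lang2\nindex_title,首页,Home\nindex_login,登录,Log in\n"
first_id_set = read_identification_strings(SAMPLE)
```

Using a set matches the scenario in which the first identification character string set is simply the collection of strings read from the column.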
In some alternative implementations, before obtaining the identification character string from the column in which the identification character string is located in the target spreadsheet file, the execution subject of the information output method may perform the following steps.
First step: determining whether presentation data that does not meet a second preset requirement exists in the target spreadsheet file.
The second preset requirement may be a requirement preset for presentation data stored in the target spreadsheet file.
In some scenarios, the executing entity of the information output method may determine whether duplicate presentation data exists in the target spreadsheet file. In response to determining that there is duplicate presentation data in the target spreadsheet file, the executing entity of the information output method may determine that there is presentation data in the target spreadsheet file that does not meet the second preset requirement.
Second step: in response to determining that presentation data that does not meet the second preset requirement exists in the target spreadsheet file, updating that presentation data in the target spreadsheet file.
In some scenarios, in response to determining that duplicate presentation data exists in the target spreadsheet file, the execution subject of the information output method may delete the duplicate presentation data in the target spreadsheet file, thereby ensuring that each piece of presentation data in the target spreadsheet file is stored only once.
In this way, before the identification character strings are obtained from the target spreadsheet file, presentation data stored in the target spreadsheet file that does not meet the second preset requirement is identified and updated. This ensures that the presentation data stored in the target spreadsheet file meets the second preset requirement. It further ensures that the identification character strings extracted from the target spreadsheet file, or the description information identified by those identification character strings, meet the second preset requirement.
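The duplicate-removal scenario above can be sketched as follows. The sketch assumes each piece of presentation data is held as a dictionary with the hypothetical fields `key`, `lang1`, and `lang2`, and treats two pieces as duplicates only when all three fields are equal; the disclosure does not fix the exact duplicate criterion.

```python
def remove_duplicate_rows(rows):
    """Keep the first occurrence of each (key, lang1, lang2) combination."""
    seen = set()
    unique = []
    for row in rows:
        fingerprint = (row["key"], row["lang1"], row["lang2"])
        if fingerprint not in seen:
            seen.add(fingerprint)
            unique.append(row)
    return unique


rows = [
    {"key": "index_title", "lang1": "首页", "lang2": "Home"},
    {"key": "index_title", "lang1": "首页", "lang2": "Home"},  # duplicate
    {"key": "index_login", "lang1": "登录", "lang2": "Log in"},
]
deduped = remove_duplicate_rows(rows)
```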
In some alternative implementations, the execution subject of the information output method may determine whether presentation data that does not satisfy the second preset requirement exists in the target spreadsheet file in the following manner.
Specifically, in response to determining that presentation data lacking an identification character string exists in the target spreadsheet file, it is determined that presentation data that does not meet the second preset requirement exists in the target spreadsheet file.
Here, the execution subject of the information output method may update presentation data in the target spreadsheet file that does not satisfy the second preset requirement in the following manner.
Specifically, for display data lacking the identification character string, the identification character string of the display data is generated based on a preset generation strategy, and the generated identification character string is added as the identification character string of the display data in the target spreadsheet file.
The preset generation policy may be a policy set in advance for generating identification character strings. In some scenarios, the preset generation policy may include the following steps: first, generating a random number; then, concatenating the name of the spreadsheet in which the presentation data is located with the generated random number to obtain the identification character string generated for the presentation data. For example, if presentation data A lacks an identification character string, the name of the spreadsheet in which presentation data A is located is "index", and the generated random number is 2, then the identification character string generated for presentation data A may be "index 2".
Therefore, when the display data stored in the target electronic form file lack the identification character string, the identification character string of the display data can be generated according to a preset generation strategy, and the identification character string of the display data is added in the target electronic form file. Thus, it is ensured that the display data stored in the target electronic form file does not lack the identification character string.
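A minimal sketch of such a generation policy is given below. The field names, the function `fill_missing_ids`, the range of the random number, and the retry on collision are all illustrative assumptions; the disclosure only states that the spreadsheet name and a random number are concatenated.

```python
import random


def fill_missing_ids(rows, sheet_name, rng=None, upper=10**6):
    """Give every row that lacks a key one built from the sheet name and a random number."""
    rng = rng or random.Random()
    used = {row["key"] for row in rows if row.get("key")}
    for row in rows:
        if not row.get("key"):
            candidate = f"{sheet_name}{rng.randrange(upper)}"
            # Retrying on collision is an added safeguard, not part of the source text.
            while candidate in used:
                candidate = f"{sheet_name}{rng.randrange(upper)}"
            row["key"] = candidate
            used.add(candidate)
    return rows


rows = [
    {"key": "index_title", "lang1": "首页", "lang2": "Home"},
    {"key": "", "lang1": "登录", "lang2": "Log in"},  # missing identification string
]
filled = fill_missing_ids(rows, "index", rng=random.Random(0))
```

Passing a seeded `random.Random` makes the generated strings reproducible, which can be convenient when the spreadsheet is kept under version control.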
In some alternative implementations, the execution subject of the information output method may determine whether presentation data that does not satisfy the second preset requirement exists in the target spreadsheet file in the following manner.
Specifically, the execution subject of the information output method may perform the following steps S1 and S2 for the presentation data stored in the target spreadsheet file. In step S1, the execution subject may query the benchmark presentation data file for the description information identified by the identification character string included in the presentation data. In step S2, in response to determining that the queried description information does not match the description information included in the presentation data, the execution subject may determine that presentation data that does not meet the second preset requirement exists in the target spreadsheet file.
The benchmark presentation data file may be a file prepared in advance to unify the usage standard of presentation data.
In some scenarios, the execution subject of the information output method may perform step S1 and step S2 for each piece of presentation data stored in the target spreadsheet file.
Here, the execution subject of the information output method may update presentation data in the target spreadsheet file that does not satisfy the second preset requirement in the following manner.
First step: outputting prompt information.
The prompt information prompts the user to decide whether to update the description information that does not match the description information in the benchmark presentation data file. Here, description information stored in the target spreadsheet file does not match description information stored in the benchmark presentation data file when the first language description information stored in one differs from the first language description information stored in the other, or the second language description information stored in one differs from the second language description information stored in the other. It will be appreciated that the user may decide, based on the prompt information, whether to update the description information in the target spreadsheet file that does not match the description information in the benchmark presentation data file.
Second step: in response to detecting that the user performs an operation of updating the description information in the target spreadsheet file, updating, in the target spreadsheet file, the description information that does not match the description information in the benchmark presentation data file.
In some scenarios, when the description information identified by the same identification character string in the target spreadsheet file and in the benchmark presentation data file does not match, the execution subject of the information output method may update the description information identified by that identification character string in the target spreadsheet file to the description information identified by that identification character string in the benchmark presentation data file.
In this way, the benchmark presentation data file is used to identify description information in the target spreadsheet file that does not conform to the usage standard, and that description information is updated. This ensures that the description information stored in the target spreadsheet file conforms to the usage standard.
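The comparison against the benchmark and the user-confirmed update can be sketched as below. The sketch assumes the benchmark is available as a dictionary mapping each identification character string to a pair of descriptions, and models the user's response to the prompt as a hypothetical `confirm` callback; how rows with no benchmark entry are treated is not stated in the disclosure, so they are simply skipped here.

```python
def find_mismatches(rows, benchmark):
    """Steps S1 and S2: return rows whose descriptions differ from the benchmark."""
    mismatched = []
    for row in rows:
        expected = benchmark.get(row["key"])  # S1: query by identification string
        if expected is None:
            continue  # assumption: rows absent from the benchmark are left alone
        if (row["lang1"], row["lang2"]) != expected:  # S2: compare descriptions
            mismatched.append(row)
    return mismatched


def apply_benchmark(rows, benchmark, confirm):
    """Overwrite each mismatched row with the benchmark text if the user agrees."""
    for row in find_mismatches(rows, benchmark):
        if confirm(row):  # stands in for the prompt information and user operation
            row["lang1"], row["lang2"] = benchmark[row["key"]]
    return rows


benchmark = {"index_login": ("登录", "Log in")}
accepted = [{"key": "index_login", "lang1": "登录", "lang2": "Sign in"}]
declined = [{"key": "index_login", "lang1": "登录", "lang2": "Sign in"}]
apply_benchmark(accepted, benchmark, confirm=lambda row: True)
apply_benchmark(declined, benchmark, confirm=lambda row: False)
```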
With further reference to fig. 4, as an implementation of the methods shown in the above figures, the present disclosure provides some embodiments of an information output apparatus, which correspond to the method embodiment shown in fig. 1, and which can be applied in various electronic devices.
As shown in fig. 4, the information output apparatus of the present embodiment includes an obtaining unit 401, an extraction unit 402, a determination unit 403, and an output unit 404. The obtaining unit 401 is configured to acquire a first identification character string set, wherein the identification character string is used for identifying the description information to be displayed in the webpage. The extraction unit 402 is configured to extract a plurality of identification character strings from the code file of the predefined webpage to obtain a second identification character string set. The determination unit 403 is configured to determine, based on the first identification character strings in the first identification character string set, a second identification character string in the second identification character string set that does not meet the first preset requirement. The output unit 404 is configured to output the second identification character strings that are contained in the second identification character string set and do not meet the first preset requirement.
In this embodiment, specific processes of the obtaining unit 401, the extracting unit 402, the determining unit 403, and the outputting unit 404 of the information outputting apparatus and technical effects thereof may refer to the related descriptions of step 101, step 102, step 103, and step 104 in the corresponding embodiment of fig. 1, and are not described herein again.
In some optional implementations, the description information includes first language description information and second language description information. The obtaining unit 401 is further configured to: obtain a target spreadsheet file, wherein a plurality of pieces of presentation data are stored in the target spreadsheet file, the presentation data includes identification character strings, first language description information, and second language description information, and the identification character strings, the first language description information, and the second language description information are arranged in columns in the target spreadsheet file; obtain a plurality of identification character strings from the column in which the identification character strings are located in the target spreadsheet file; and obtain a first identification character string set based on the obtained plurality of identification character strings.
In some alternative implementations, the information output device may further include an updating unit (not shown in the figure). The update unit is used for: determining whether display data which do not meet second preset requirements exist in the target electronic form file or not; and updating the display data which do not meet the second preset requirement in the target electronic form file in response to determining that the display data which do not meet the second preset requirement exist in the target electronic form file.
In some optional implementations, the update unit is further configured to: in response to determining that presentation data lacking an identification character string exists in the target spreadsheet file, determine that presentation data that does not meet the second preset requirement exists in the target spreadsheet file; and, for the presentation data lacking an identification character string in the target spreadsheet file, generate the identification character string of the presentation data based on a preset generation policy and add the generated identification character string to the target spreadsheet file as the identification character string of the presentation data.
In some optional implementations, the update unit is further configured to perform the following steps for the presentation data stored in the target spreadsheet file: querying the benchmark presentation data file for the description information identified by the identification character string contained in the presentation data; in response to determining that the queried description information does not match the description information contained in the presentation data, determining that presentation data that does not meet the second preset requirement exists in the target spreadsheet file; outputting prompt information, wherein the prompt information prompts the user to decide whether to update the description information that does not match the description information in the benchmark presentation data file; and, in response to detecting that the user performs an operation of updating the description information in the target spreadsheet file, updating, in the target spreadsheet file, the description information that does not match the description information in the benchmark presentation data file.
In some optional implementations, the determining unit 403 is further configured to: for a second identification character string in the second identification character string set, determining whether the second identification character string exists in the first identification character string set; and taking the second identification character string which does not exist in the first identification character string set as the second identification character string which does not meet the first preset requirement.
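The membership check performed by the determining unit can be sketched in a few lines. The function name `find_undefined_ids` and the sample strings are hypothetical; the logic follows the implementation above, in which a second identification character string fails the first preset requirement when it is absent from the first identification character string set.

```python
def find_undefined_ids(first_set, second_set):
    """Return, in sorted order, the second-set strings that are not in the first set."""
    return sorted(s for s in second_set if s not in first_set)


first_set = {"index_title", "index_login"}    # e.g. read from the spreadsheet
second_set = {"index_title", "index_logout"}  # e.g. extracted from the code file
undefined = find_undefined_ids(first_set, second_set)
```

Sorting the result is only a presentation choice so that the output is stable between runs.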
In some optional implementations, the code file includes a start extraction identifier and an end extraction identifier for the identification character strings. The extraction unit 402 is further configured to: extract, from the code file of the predefined webpage, the extraction information located between a start extraction identifier and the corresponding end extraction identifier; determine an identification character string according to the extracted extraction information; and obtain a second identification character string set based on the determined plurality of identification character strings.
In some alternative implementations, the code file includes a start reference identifier and an end reference identifier for the identification character strings. The extraction unit 402 is further configured to: in response to determining that the start reference identifier and the end reference identifier are contained in the extraction information, use the character string referenced by the start reference identifier and the end reference identifier contained in the extraction information as the identification character string.
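The two-stage extraction can be sketched with regular expressions. The concrete identifiers used here are assumptions chosen only for illustration: `{{` and `}}` stand in for the start and end extraction identifiers, and `$t('` and `')` stand in for the start and end reference identifiers. Extraction information that contains no reference identifiers is ignored in this sketch, which is likewise an assumption.

```python
import re

# Hypothetical extraction identifiers: "{{" ... "}}".
EXTRACT_SPAN = re.compile(r"\{\{(.*?)\}\}", re.DOTALL)
# Hypothetical reference identifiers: "$t('" ... "')".
REFERENCE = re.compile(r"\$t\('([^']*)'\)")


def extract_second_set(code_text):
    """Collect the strings referenced inside each extracted span of the code file."""
    second_set = set()
    for span in EXTRACT_SPAN.findall(code_text):
        # Only strings wrapped by the reference identifiers count as identification strings.
        second_set.update(REFERENCE.findall(span))
    return second_set


page = (
    "<h1>{{ $t('index_title') }}</h1>"
    "<p>{{ user.name }}</p>"
    "<a>{{ $t('index_logout') }}</a>"
)
second_set = extract_second_set(page)
```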
With further reference to fig. 5, fig. 5 illustrates an exemplary system architecture to which the information output methods of some embodiments of the present disclosure may be applied.
As shown in fig. 5, the system architecture may include terminal devices 501, 502, a network 503, and a server 504. The network 503 is the medium used to provide communication links between the terminal devices 501, 502 and the server 504. The network 503 may include various types of connections, such as wired or wireless communication links, or fiber optic cables.
The terminal devices 501, 502 may interact with a server 504 via a network 503 to receive or send messages or the like. Various applications may be installed on the terminal devices 501, 502. For example, the terminal devices 501 and 502 may have an application program that supports data detection, an application program that supports shopping, and the like installed thereon. In some scenarios, the terminal device 501, 502 may extract a plurality of identification strings from a code file of a predefined web page and use a set of the extracted plurality of identification strings as the second set of identification strings. Further, the terminal device 501, 502 may determine, based on the first identification string set, an identification string in the second identification string set that does not meet the first preset requirement. Still further, the terminal devices 501 and 502 may output identification strings that do not meet the first preset requirement in the second identification string set.
The terminal devices 501 and 502 may be hardware or software. When the terminal devices 501, 502 are hardware, they may be various electronic devices having a display screen and supporting data detection, including but not limited to smart phones, tablet computers, laptop computers, desktop computers, and the like. When the terminal devices 501 and 502 are software, they can be installed in the electronic devices listed above, and may be implemented as multiple pieces of software or software modules, or as a single piece of software or software module. No particular limitation is imposed here.
The server 504 may be a server that provides various services. In some scenarios, the server 504 may provide the first set of identification strings to the terminal devices 501, 502. In some scenarios, the terminal device 501, 502 may obtain the first set of identification strings locally. At this time, the server 504 may not be included in the system architecture shown in fig. 5.
The server 504 may be hardware or software. When the server 504 is hardware, it can be implemented as a distributed server cluster composed of a plurality of servers, or as a single server. When the server 504 is software, it may be implemented as multiple pieces of software or software modules (e.g., multiple pieces of software or software modules used to provide distributed services), or as a single piece of software or software module. No particular limitation is imposed here.
It should be noted that the information output method provided by the embodiment of the present disclosure may be executed by the server 504, and accordingly, the information output apparatus may be provided in the server 504.
It should be understood that the number of terminal devices, networks, and servers in fig. 5 is merely illustrative. There may be any number of terminal devices, networks, and servers, as desired for implementation.
Referring now to fig. 6, shown is a schematic diagram of an electronic device (e.g., the terminal device of fig. 5) suitable for use in implementing some embodiments of the present disclosure. The terminal device in some embodiments of the present disclosure may include, but is not limited to, a mobile terminal such as a mobile phone, a notebook computer, a digital broadcast receiver, a PDA (personal digital assistant), a PAD (tablet computer), a PMP (portable multimedia player), a vehicle-mounted terminal (e.g., a car navigation terminal), and the like, and a fixed terminal such as a digital TV, a desktop computer, and the like. The electronic device shown in fig. 6 is only an example, and should not bring any limitation to the functions and the scope of use of the embodiments of the present disclosure.
As shown in fig. 6, the electronic device may include a processing means (e.g., a central processing unit, a graphics processor, etc.) 601, which may perform various appropriate actions and processes according to a program stored in a Read Only Memory (ROM)602 or a program loaded from a storage means 608 into a Random Access Memory (RAM) 603. In the RAM 603, various programs and data necessary for the operation of the electronic apparatus are also stored. The processing device 601, the ROM 602, and the RAM 603 are connected to each other via a bus 604. An input/output (I/O) interface 605 is also connected to bus 604.
Generally, the following devices may be connected to the I/O interface 605: input devices 606 including, for example, a touch screen, touch pad, keyboard, mouse, camera, microphone, accelerometer, gyroscope, etc.; output devices 607 including, for example, a Liquid Crystal Display (LCD), a speaker, a vibrator, and the like; storage 608 including, for example, tape, hard disk, etc.; and a communication device 609. The communication means 609 may allow the electronic device to communicate with other devices wirelessly or by wire to exchange data. While fig. 6 illustrates an electronic device having various means, it is to be understood that not all illustrated means are required to be implemented or provided. More or fewer devices may alternatively be implemented or provided. Each block shown in fig. 6 may represent one device or may represent multiple devices as desired.
In particular, according to some embodiments of the present disclosure, the processes described above with reference to the flow diagrams may be implemented as computer software programs. For example, some embodiments of the present disclosure include a computer program product comprising a computer program carried on a non-transitory computer readable medium, the computer program containing program code for performing the method illustrated by the flow chart. In such an embodiment, the computer program may be downloaded and installed from a network via the communication means 609, or may be installed from the storage means 608, or may be installed from the ROM 602. The computer program, when executed by the processing device 601, performs the above-described functions defined in the methods of the embodiments of the present disclosure.
It should be noted that the computer readable medium described in some embodiments of the present disclosure may be a computer readable signal medium or a computer readable storage medium or any combination of the two. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples of the computer readable storage medium may include, but are not limited to: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In some embodiments of the disclosure, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. In some embodiments of the present disclosure, however, a computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may also be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. 
Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to: electrical wires, optical cables, RF (radio frequency), etc., or any suitable combination of the foregoing.
In some embodiments, clients and servers may communicate using any currently known or future developed network protocol, such as HTTP (HyperText Transfer Protocol), and may be interconnected with any form or medium of digital data communication (e.g., a communications network). Examples of communication networks include a local area network ("LAN"), a wide area network ("WAN"), an internetwork (e.g., the Internet), and peer-to-peer networks (e.g., ad hoc peer-to-peer networks), as well as any currently known or future developed network.
The computer readable medium may be included in the electronic device or may exist separately without being incorporated in the electronic device. The computer readable medium carries one or more programs which, when executed by the electronic device, cause the electronic device to: acquiring a first identification character string set, wherein the identification character string is used for identifying description information to be displayed in a webpage; extracting a plurality of identification character strings from a code file of a predefined webpage to obtain a second identification character string set; determining a second identification character string which does not meet a first preset requirement in a second identification character string set based on a first identification character string in the first identification character string set; and outputting the second identification character strings which are contained in the second identification character string set and do not meet the first preset requirement.
Computer program code for carrying out operations for embodiments of the present disclosure may be written in any combination of one or more programming languages, including but not limited to object oriented programming languages such as Java, Smalltalk, and C++, and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any type of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet service provider).
The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
The units described in some embodiments of the present disclosure may be implemented by software, and may also be implemented by hardware. The names of the units do not form a limitation on the units themselves in some cases, and for example, the output unit may also be described as a unit that outputs a second identification string included in the second identification string set that does not meet the first preset requirement.
The functions described herein above may be performed, at least in part, by one or more hardware logic components. For example, without limitation, exemplary types of hardware logic components that may be used include: Field Programmable Gate Arrays (FPGAs), Application Specific Integrated Circuits (ASICs), Application Specific Standard Products (ASSPs), systems on a chip (SOCs), Complex Programmable Logic Devices (CPLDs), and the like.
In the context of this disclosure, a machine-readable medium may be a tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. The machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. A machine-readable medium may include, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples of a machine-readable storage medium would include an electrical connection based on one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
The foregoing description is only of the preferred embodiments of the disclosure and illustrates the principles of the technology employed. It will be appreciated by those skilled in the art that the scope of the disclosure in the embodiments of the present disclosure is not limited to technical solutions formed by the particular combination of the above-described features, but also encompasses other technical solutions formed by any combination of the above-described features or their equivalents without departing from the scope of the present disclosure. For example, a technical solution may be formed by interchanging the above features with (but not limited to) features having similar functions disclosed in this disclosure.

Claims (11)

1. An information output method, comprising:
acquiring a first identification character string set, wherein the identification character string is used for identifying description information to be displayed in a webpage;
extracting a plurality of identification character strings from a code file of a predefined webpage to obtain a second identification character string set;
determining a second identification character string which does not meet a first preset requirement in the second identification character string set based on a first identification character string in the first identification character string set;
and outputting the second identification character strings which are contained in the second identification character string set and do not meet the first preset requirement.
2. The method according to claim 1, wherein the description information includes first language description information and second language description information; and
the acquiring of the first identification string set includes:
obtaining a target spreadsheet file, wherein a plurality of pieces of presentation data are stored in the target spreadsheet file, the presentation data comprises identification character strings, first language description information and second language description information, and the identification character strings, the first language description information and the second language description information are arranged in columns in the target spreadsheet file;
acquiring a plurality of identification character strings from the columns where the identification character strings are located in the target electronic form file;
and obtaining the first identification character string set based on the obtained plurality of identification character strings.
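The column-wise acquisition in claim 2 can be illustrated with a minimal sketch. It assumes, purely for illustration, that the target spreadsheet file has been exported as CSV and that its columns are named `key`, `zh` and `en`; neither the file format nor the column names come from the claims.

```python
import csv
import io


def load_first_id_set(csv_text, id_column="key"):
    """Collect the non-empty values found in the identification-string column.

    The column name "key" is a hypothetical choice for this sketch.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    ids = set()
    for row in reader:
        value = (row.get(id_column) or "").strip()
        if value:
            ids.add(value)
    return ids


# Hypothetical spreadsheet content: identifier, first-language text, second-language text.
SAMPLE_SHEET = """key,zh,en
login.title,登录,Log in
login.submit,提交,Submit
,取消,Cancel
"""

first_ids = load_first_id_set(SAMPLE_SHEET)
```

The third sample row has no identifier, so it contributes nothing to the set; claims 3 and 4 describe how such rows may be repaired before this step.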
3. The method of claim 2, wherein, prior to acquiring the plurality of identification character strings from the column of the target spreadsheet file in which the identification character strings are located, the method further comprises:
determining whether display data that does not meet a second preset requirement exists in the target spreadsheet file; and
in response to determining that display data that does not meet the second preset requirement exists in the target spreadsheet file, updating the display data that does not meet the second preset requirement in the target spreadsheet file.
4. The method of claim 3, wherein the determining whether display data that does not meet a second preset requirement exists in the target spreadsheet file comprises:
in response to determining that display data lacking an identification character string exists in the target spreadsheet file, determining that display data that does not meet the second preset requirement exists in the target spreadsheet file; and
the updating of the display data in the target spreadsheet file that does not meet the second preset requirement comprises:
for the display data lacking an identification character string in the target spreadsheet file, generating an identification character string for the display data based on a preset generation strategy, and adding the generated identification character string to the target spreadsheet file as the identification character string of the display data.
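As a rough illustration of claim 4, the sketch below fills in missing identification character strings. The claims do not specify the preset generation strategy; the one used here (a fixed prefix plus a short SHA-1 digest of the second-language text) and the field names `key`, `zh` and `en` are assumptions made only for this example.

```python
import hashlib


def fill_missing_ids(rows, prefix="auto_"):
    """Return copies of the rows in which every row carries an identifier.

    Hypothetical strategy: prefix + first 8 hex digits of SHA-1 of the "en" text.
    """
    filled = []
    for row in rows:
        row = dict(row)  # copy so the caller's data is left untouched
        if not row.get("key"):
            digest = hashlib.sha1(row["en"].encode("utf-8")).hexdigest()[:8]
            row["key"] = prefix + digest
        filled.append(row)
    return filled


rows = [
    {"key": "login.title", "zh": "登录", "en": "Log in"},
    {"key": "", "zh": "提交", "en": "Submit"},
]
filled_rows = fill_missing_ids(rows)
```

A digest-based strategy is deterministic, so the same description always yields the same identifier; a real strategy could equally be a counter or a naming convention.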
5. The method of claim 3, wherein the determining whether display data that does not meet a second preset requirement exists in the target spreadsheet file comprises:
for each piece of display data stored in the target spreadsheet file, performing the following steps: querying a benchmark display data file for the description information identified by the identification character string contained in the display data; and in response to determining that the queried description information does not match the description information contained in the display data, determining that display data that does not meet the second preset requirement exists in the target spreadsheet file; and
the updating of the display data in the target spreadsheet file that does not meet the second preset requirement comprises:
outputting prompt information, wherein the prompt information is used for prompting a user whether to update the description information that does not match the description information in the benchmark display data file; and
in response to detecting that the user performs an operation of updating the description information in the target spreadsheet file, updating, in the target spreadsheet file, the description information that does not match the description information in the benchmark display data file.
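The comparison and confirmed update in claim 5 might look roughly like the following. The benchmark file is modeled as an in-memory dictionary and the user prompt as a `confirm` callback; both, along with the field names, are hypothetical simplifications rather than anything the claims prescribe.

```python
def reconcile(rows, benchmark, confirm):
    """Compare each row with the benchmark entry for the same identifier.

    When the descriptions differ and confirm(row, expected) returns True,
    the row's descriptions are replaced by the benchmark's.
    """
    result = []
    for row in rows:
        expected = benchmark.get(row["key"])
        differs = expected is not None and (
            (row["zh"], row["en"]) != (expected["zh"], expected["en"])
        )
        if differs and confirm(row, expected):
            row = {**row, "zh": expected["zh"], "en": expected["en"]}
        result.append(row)
    return result


# Hypothetical benchmark data and spreadsheet rows.
benchmark = {"login.title": {"zh": "登录", "en": "Log in"}}
rows = [
    {"key": "login.title", "zh": "登录", "en": "Login"},
    {"key": "home.title", "zh": "首页", "en": "Home"},
]

accepted = reconcile(rows, benchmark, lambda row, expected: True)
declined = reconcile(rows, benchmark, lambda row, expected: False)
```

Rows whose identifier is unknown to the benchmark are passed through unchanged in this sketch.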
6. The method of claim 1, wherein the determining, based on the first identification character strings in the first identification character string set, the second identification character strings in the second identification character string set that do not meet a first preset requirement comprises:
for each second identification character string in the second identification character string set, determining whether the second identification character string exists in the first identification character string set; and
taking each second identification character string that does not exist in the first identification character string set as a second identification character string that does not meet the first preset requirement.
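The membership check in claim 6 amounts to a set difference. A minimal sketch, with a hypothetical function name and sample identifiers:

```python
def unmatched_ids(first_ids, second_ids):
    """Return, sorted, the second-set strings that are absent from the first set."""
    return sorted(set(second_ids) - set(first_ids))


# Identifiers known to the spreadsheet versus identifiers found in the page code.
known = {"login.title", "login.submit"}
used_in_code = ["login.title", "login.sumbit", "login.sumbit"]

missing = unmatched_ids(known, used_in_code)
```

Here the misspelled `login.sumbit` is reported once, which is the kind of identifier the method outputs for review.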
7. The method according to any one of claims 1 to 6, wherein the code file contains a start extraction identifier and an end extraction identifier for identification character strings; and
the extracting of a plurality of identification character strings from the code file of the predefined webpage to obtain a second identification character string set comprises:
extracting, from the code file of the predefined webpage, the extraction information located between each start extraction identifier and the corresponding end extraction identifier;
determining an identification character string according to the extracted extraction information; and
obtaining the second identification character string set based on the determined plurality of identification character strings.
8. The method of claim 7, wherein the code file contains a start reference identifier and an end reference identifier for identification character strings; and
the determining of an identification character string according to the extracted extraction information comprises:
in response to determining that the extraction information contains the start reference identifier and the end reference identifier, taking the character string referenced by the start reference identifier and the end reference identifier contained in the extraction information as the identification character string.
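Claims 7 and 8 can be sketched with two regular expressions. The concrete identifiers are not given in the claims; this example assumes a translation call of the form `t(...)`, so that `t(` is the start extraction identifier and `)` the end extraction identifier, and treats a pair of matching quotation marks as the start and end reference identifiers.

```python
import re

# Hypothetical extraction identifiers: start "t(" and end ")".
EXTRACTION = re.compile(r"\bt\((.*?)\)")
# Hypothetical reference identifiers: an opening quote and the same closing quote.
REFERENCE = re.compile(r"""(['"])(.*?)\1""")


def extract_ids(code):
    """Return the identification strings found in the code text."""
    ids = set()
    for info in EXTRACTION.findall(code):
        match = REFERENCE.search(info)
        if match:  # only quoted literals count; variables are skipped in this sketch
            ids.add(match.group(2))
    return ids


PAGE_CODE = """
<h1>{t("login.title")}</h1>
<button>{t('login.submit')}</button>
<span>{t(dynamicKey)}</span>
"""

second_ids = extract_ids(PAGE_CODE)
```

The call `t(dynamicKey)` yields extraction information without reference identifiers, so no identification character string is derived from it here.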
9. An information output apparatus, comprising:
an acquisition unit configured to acquire a first identification character string set, wherein each identification character string is used for identifying description information to be displayed in a webpage;
an extraction unit configured to extract a plurality of identification character strings from a code file of a predefined webpage to obtain a second identification character string set;
a determining unit configured to determine, based on the first identification character strings in the first identification character string set, the second identification character strings in the second identification character string set that do not meet a first preset requirement; and
an output unit configured to output the second identification character strings in the second identification character string set that do not meet the first preset requirement.
10. An electronic device, comprising:
one or more processors; and
a storage device for storing one or more programs,
wherein the one or more programs, when executed by the one or more processors, cause the one or more processors to implement the method of any one of claims 1-8.
11. A computer-readable medium on which a computer program is stored, wherein the program, when executed by a processor, implements the method according to any one of claims 1-8.
CN202010321640.8A 2020-04-22 2020-04-22 Information output method and device and electronic equipment Active CN111597107B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010321640.8A CN111597107B (en) 2020-04-22 2020-04-22 Information output method and device and electronic equipment

Publications (2)

Publication Number Publication Date
CN111597107A (en) 2020-08-28
CN111597107B (en) 2023-04-28

Family

ID=72187644

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010321640.8A Active CN111597107B (en) 2020-04-22 2020-04-22 Information output method and device and electronic equipment

Country Status (1)

Country Link
CN (1) CN111597107B (en)

Citations (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080010683A1 (en) * 2006-07-10 2008-01-10 Baddour Victor L System and method for analyzing web content
US20090106279A1 (en) * 2007-10-18 2009-04-23 Samsung Techwin Co., Ltd. Method of processing tag information and client-server system using the method
CN101599118A (en) * 2009-06-26 2009-12-09 华中师范大学 Html web page is distorted and is detected and localization method
CN104168293A (en) * 2014-09-05 2014-11-26 北京奇虎科技有限公司 Method and system for recognizing suspicious phishing web page in combination with local content rule base
CN105095466A (en) * 2015-07-31 2015-11-25 山东大学 Web text information extraction method
CN105183732A (en) * 2014-06-04 2015-12-23 广州市动景计算机科技有限公司 Method and device for processing webpage
WO2016178068A1 (en) * 2015-05-06 2016-11-10 Yandex Europe Ag System and method for testing web pages
CN106817297A (en) * 2017-01-19 2017-06-09 无锡云商通科技有限公司 A kind of method that spam is recognized by html tag
CN107145591A (en) * 2017-05-17 2017-09-08 广州瞬速信息科技有限公司 A kind of effective content metadata extracting method of webpage based on title
US20180069880A1 (en) * 2016-09-08 2018-03-08 AO Kaspersky Lab System and method for detecting anomalous elements of web pages
US20180260389A1 (en) * 2017-03-08 2018-09-13 Fujitsu Limited Electronic document segmentation and relation discovery between elements for natural language processing
CN108650250A (en) * 2018-04-27 2018-10-12 北京奇安信科技有限公司 Illegal page detection method, system, computer system and readable storage medium storing program for executing
US20180322215A1 (en) * 2016-02-05 2018-11-08 Tencent Technology (Shenzhen) Company Limited Web page access method and apparatus
CN109325202A (en) * 2018-08-23 2019-02-12 阿里巴巴集团控股有限公司 Web page element processing and Redundant Web Pages pattern lookup method and device
CN109558548A (en) * 2017-09-25 2019-04-02 北京国双科技有限公司 A kind of method and Related product for eliminating CSS style redundancy
WO2020044469A1 (en) * 2018-08-29 2020-03-05 Bbソフトサービス株式会社 Illicit webpage detection device, illicit webpage detection device control method, and control program

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
印国成;孙茂圣;: "一种Web文本的自动提取方法" *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112101239A (en) * 2020-09-17 2020-12-18 中国建设银行股份有限公司 Method, device and system for searching missing characters in PDF file
CN112597746A (en) * 2020-12-29 2021-04-02 维沃移动通信有限公司 Data display method and device and electronic equipment
CN113590467A (en) * 2021-06-30 2021-11-02 平安健康保险股份有限公司 Data comparison method, system, computer equipment and computer readable storage medium
CN113590467B (en) * 2021-06-30 2023-07-21 平安健康保险股份有限公司 Data comparison method, system, computer device and computer readable storage medium

Also Published As

Publication number Publication date
CN111597107B (en) 2023-04-28

Similar Documents

Publication Publication Date Title
CN110969012B (en) Text error correction method and device, storage medium and electronic equipment
CN109656923B (en) Data processing method and device, electronic equipment and storage medium
CN111597107B (en) Information output method and device and electronic equipment
CN111459364B (en) Icon updating method and device and electronic equipment
CN115757400B (en) Data table processing method, device, electronic equipment and computer readable medium
CN112287206A (en) Information processing method and device and electronic equipment
CN112684968A (en) Page display method and device, electronic equipment and computer readable medium
CN110781437A (en) Method and device for acquiring webpage image loading duration and electronic equipment
CN111046252B (en) Information processing method, device, medium, electronic equipment and system
CN111596992B (en) Navigation bar display method and device and electronic equipment
CN111752644A (en) Interface simulation method, device, equipment and storage medium
CN111756953A (en) Video processing method, device, equipment and computer readable medium
CN111311358A (en) Information processing method and device and electronic equipment
CN111124541A (en) Configuration file generation method, device, equipment and medium
CN112084441A (en) Information retrieval method and device and electronic equipment
CN113807056B (en) Document name sequence error correction method, device and equipment
CN111460020B (en) Method, device, electronic equipment and medium for resolving message
CN112905090A (en) Spreadsheet processing method, device, terminal and storage medium
CN113393288A (en) Order processing information generation method, device, equipment and computer readable medium
CN113031950A (en) Picture generation method, device, equipment and medium
CN113360704A (en) Voice playing method and device and electronic equipment
CN111708680A (en) Error reporting information analysis method and device, electronic equipment and storage medium
CN111753238A (en) Data mapping method and device and electronic equipment
CN115374320B (en) Text matching method and device, electronic equipment and computer medium
CN111027281B (en) Word segmentation method, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant