CN116382830A - Method and device for determining popup content and electronic equipment - Google Patents

Method and device for determining popup content and electronic equipment Download PDF

Info

Publication number
CN116382830A
CN116382830A CN202310369244.6A CN202310369244A CN116382830A CN 116382830 A CN116382830 A CN 116382830A CN 202310369244 A CN202310369244 A CN 202310369244A CN 116382830 A CN116382830 A CN 116382830A
Authority
CN
China
Prior art keywords
page
corpus
displayed
word
preset
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310369244.6A
Other languages
Chinese (zh)
Inventor
张雪涵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN202310369244.6A priority Critical patent/CN116382830A/en
Publication of CN116382830A publication Critical patent/CN116382830A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/451Execution arrangements for user interfaces
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Human Computer Interaction (AREA)
  • Data Mining & Analysis (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Machine Translation (AREA)

Abstract

The disclosure provides a determination method of popup window content, and relates to the technical fields of natural language processing, big data and the like. The specific scheme is as follows: under the condition that the page to be displayed contains a preset type of component, determining the page type of the page to be displayed based on the layout of the page to be displayed, screening page information in the page to be displayed according to the page type to obtain a reference text, and then analyzing the reference text to generate content to be displayed in a popup window corresponding to the component. Therefore, according to the page type of the page to be displayed, screening the page information in the page to be displayed, determining a reference text, analyzing the reference text, and generating the content of the popup window. The accuracy of the reference text is improved, and therefore the accuracy of the content for generating the popup window is improved.

Description

Method and device for determining popup content and electronic equipment
Technical Field
The disclosure relates to the technical field of artificial intelligence, in particular to the technical fields of natural language processing, big data and the like, and specifically relates to a method and a device for determining popup content and electronic equipment.
Background
In web pages, interactive popups are typically configured to increase the amount of user attention. The content presented in the interactive popup directly affects the conversion rate of the user, and thus, a method for accurately generating the interactive popup content is needed.
Disclosure of Invention
The disclosure provides a method and device for determining popup content and electronic equipment.
According to an aspect of the present disclosure, there is provided a method for determining pop-up content, including:
under the condition that the page to be displayed contains a preset type component, determining the page type of the page to be displayed based on the layout of the page to be displayed;
screening page information in a page to be displayed according to the page type to obtain a reference text;
and analyzing the text to be referred to, and generating the content to be displayed in the popup window corresponding to the component.
According to another aspect of the present disclosure, there is provided a determination apparatus of popup content, including:
the determining module is used for determining the page type of the page to be displayed based on the layout of the page to be displayed under the condition that the page to be displayed contains the components of the preset type;
the screening module is used for screening page information in the page to be displayed according to the page type so as to acquire a reference text;
the generation module is used for analyzing the text to be referred and generating the content to be displayed in the popup window corresponding to the component.
According to another aspect of the present disclosure, there is provided an electronic device including:
at least one processor; and
A memory communicatively coupled to the at least one processor; wherein,,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the methods of the embodiments described above.
According to another aspect of the present disclosure, there is provided a non-transitory computer-readable storage medium storing computer instructions for causing a computer to perform a method according to the above-described embodiments.
It should be understood that the description in this section is not intended to identify key or critical features of the embodiments of the disclosure, nor is it intended to be used to limit the scope of the disclosure. Other features of the present disclosure will become apparent from the following specification.
Drawings
The drawings are for a better understanding of the present solution and are not to be construed as limiting the present disclosure. Wherein:
fig. 1 is a flow chart of a method for determining pop-up content according to an embodiment of the disclosure;
FIG. 2 is a schematic diagram of a page provided by an embodiment of the present disclosure;
FIG. 3 is a flowchart illustrating another method for determining pop-up content according to an embodiment of the present disclosure;
FIG. 4 is a flowchart illustrating another method for determining pop-up content according to an embodiment of the present disclosure;
FIG. 5 is a schematic diagram of another page provided by an embodiment of the present disclosure;
fig. 6 is a schematic structural diagram of another device for determining pop-up content according to an embodiment of the present disclosure;
fig. 7 is a block diagram of an electronic device used to implement determination of popup content in accordance with an embodiment of the present disclosure.
Detailed Description
Exemplary embodiments of the present disclosure are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present disclosure to facilitate understanding, and should be considered as merely exemplary. Accordingly, one of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the present disclosure. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.
Artificial intelligence is the discipline of studying certain mental processes and intelligent behaviors (e.g., learning, reasoning, thinking, planning, etc.) of a person using a computer, and has the technical field of both hardware and software aspects. Artificial intelligence hardware technologies generally include technologies such as sensors, dedicated artificial intelligence chips, cloud computing, distributed storage, big data processing, and the like; the artificial intelligence software technology comprises a computer vision technology, a voice recognition technology, a natural language processing technology, a deep learning technology, a big data processing technology, a knowledge graph technology and the like.
NLP (Natural Language Processing ) is an important direction in the fields of computer science and artificial intelligence, and the content of NLP research includes, but is not limited to, the following branch fields: text classification, information extraction, automatic abstracting, intelligent question and answer, topic recommendation, machine translation, topic word recognition, knowledge base construction, deep text representation, named entity recognition, text generation, text analysis (lexical, syntactic, grammatical, etc.), speech recognition and synthesis, and the like.
Big data, or massive data, refers to data that is so large in size that it is impossible to access, manage, process, and sort data in a reasonable time through the current mainstream software tools, thereby helping business operations decision.
In the disclosure, according to the page type of a page to be displayed, screening page information in the page to be displayed, determining a reference text, analyzing the reference text, and generating the content of a popup window. The accuracy of the reference text is improved, and therefore the accuracy of the content for generating the popup window is improved.
The following describes a method, an apparatus, an electronic device, and a storage medium for determining a pop-up content according to embodiments of the present disclosure in detail with reference to the accompanying drawings.
It should be noted that, the method for determining the popup content according to the present disclosure is configured in a determining device for popup content (hereinafter, simply referred to as determining device) for illustration, and the determining device may be applied to any electronic device, so that the electronic device may perform the determining function of the popup content.
The electronic device may be any device with computing capability, for example, may be a personal computer (Personal Computer, abbreviated as PC), a mobile terminal, and the mobile terminal may be a hardware device with various operating systems, touch screens, and/or display screens, for example, a mobile phone, a tablet computer, a personal digital assistant, a wearable device, and the like.
Fig. 1 is a flow chart of a method for determining pop-up content according to an embodiment of the present disclosure.
As shown in fig. 1, the method includes:
step 101, determining the page type of the page to be displayed based on the layout of the page to be displayed under the condition that the page to be displayed contains the components of the preset type.
The preset type of component can be an entry component for user conversion, such as WeChat. The page types may be of a first type and a second type, which is not limited by the present disclosure.
In addition, the user can establish contact with the business party or pay attention to the business party by clicking a component of a preset type. The first type corresponds to pages in a non-novel domain and the second type corresponds to pages in a novel domain.
In the present disclosure, in the case that a page to be displayed includes a preset type of component, it is illustrated that the page includes a user conversion function. At this time, a popup window containing guiding content can be generated for the preset type of component, so as to attract attention of users and improve the conversion rate of the users.
In the present disclosure, pages in different domains have different page layouts. Therefore, the layout of the page to be displayed can be analyzed, and the page type of the page to be displayed can be determined. Based on the page type of the page to be displayed, service information, selling point information and drainage mode information are extracted from different positions of the page to be displayed in a targeted mode, so that accuracy of determining the content of the popup window is improved.
For example, as shown in FIG. 2 (a), the layout of a novel domain page may be such that it contains large pieces of long text, and the text occupies a larger area in the page. As shown in fig. 2 (b), the layout of the non-novel domain page may be one that contains more components and does not contain long text. The layout of the page to be displayed can be analyzed, and the page type of the page to be displayed can be determined.
Step 102, screening page information in the page to be displayed according to the page type to obtain a reference text.
In the disclosure, the content of the popup window should accurately include service information, selling point information and drainage mode information included in a page to be displayed. In the page of different fields, the business information, the selling point information and the drainage mode information are contained in different positions in the page. Therefore, the page information in the page to be displayed can be screened according to the page type, so that the reference text containing the service information, the selling point information and the drainage mode information can be obtained.
Optionally, when the page type is the first type, the page title field of the page to be displayed, the first page presentation text of the page to be displayed, the title field of the component of the preset type, and the description field of the component of the preset type may be determined as the reference text.
Alternatively, when the page type is the second type, the title field of the component determining the first page presentation text and the preset type in the page to be displayed may be determined as the reference text.
The first page display text is a text which is displayed in the page to be displayed and can be perceived by a user and displayed in the display interface, and comprises text information in pictures in the interface to be displayed. The page title field, the title field of the preset type of component, and the description field of the preset type of component may be obtained in a program file (such as json file) of the interface to be displayed.
And 103, analyzing the reference text to generate contents to be displayed in the popup window corresponding to the component.
In the disclosure, the reference text may be parsed by a preset parsing algorithm to extract corpus for generating popup content corresponding to the component from the reference text. And then, combining the extracted corpus by a preset rule to generate the content of the popup window.
In the disclosure, under the condition that a to-be-displayed page contains a preset type of component, determining the page type of the to-be-displayed page based on the layout of the to-be-displayed page, screening page information in the to-be-displayed page according to the page type to obtain a reference text, and then analyzing the reference text to generate contents to be displayed in a popup window corresponding to the component. Therefore, according to the page type of the page to be displayed, screening the page information in the page to be displayed, determining a reference text, analyzing the reference text, and generating the content of the popup window. The accuracy of the reference text is improved, and therefore the accuracy of the content for generating the popup window is improved.
Fig. 3 is a flowchart of a method for determining pop-up content according to an embodiment of the present disclosure.
As shown in fig. 3, the method includes:
Step 301, determining a page type of the page to be displayed based on the layout of the page to be displayed in the case that the page to be displayed contains the preset type of component.
In the present disclosure, the specific implementation process of step 301 may refer to the detailed description of any embodiment of the present disclosure, which is not repeated herein.
In step 302, in the case that the page type is the first type, the page title field of the page to be displayed, the first page presentation text of the page to be displayed, the title field of the component, and the description field of the component are determined as the reference text.
Wherein the first type corresponds to a page of a non-novel domain.
In the disclosure, the page header field of the page to be displayed includes service information corresponding to the page, so that the page header field of the page to be displayed can be determined as a reference text of the service information corresponding to the page. The first page display text of the page to be displayed contains the selling point information, so that a user perceives the selling point and is attracted to the attention of the user. Therefore, the first page presentation text of the page to be displayed can be determined as the reference text of the selling point information corresponding to the page. The preset type component is used for drainage, and the description field and the title field of the preset type component contain drainage mode information for attracting users. Therefore, the description field and the title field of the preset type component can be determined as the reference text of the drainage mode information corresponding to the page. Thereby improving the accuracy of the reference text.
Step 303, deleting any first word in the page header field to obtain a first corpus when the page header field contains any first word in the first preset word set.
In this disclosure, the page header field typically includes a page tag vocabulary outside the service information, such as a number, "add", "copy", and the like. The number can be screened out by using a preset matching rule, each first word in the first preset word set is matched with the page title field, and the first words are contained in the page title field. And deleting a certain first word in the page title field under the condition that the first word is packaged in the page title field, and acquiring a first corpus (namely business information). Thereby improving the accuracy of the service information.
Step 304, screening the second corpus from the first sentences according to the matching degree between each first sentence in the first page display text and each second word in the second preset word set.
The second word is a word for displaying the selling point, for example, "ultra-low", "hot selling", "immediately subtracting", "going up", "free", "preferential", etc. The historical selling point text can be counted in advance, and the use frequency of each word in the historical selling point text can be determined. And then, removing some common connection words or language words (such as, for example, the like), determining the preset number of words with highest frequency as second words, generating a second preset word set, and storing the second preset word set in a system. Or determining sentences of the preset number of words with highest frequency in the historical selling point text as reference selling point corpus of corresponding words, and storing the preset number of words and the corresponding reference selling point corpus in a system in a correlated manner.
Optionally, the second word set further includes a preset weight of each second word, and the weight of each second word may be determined according to a frequency of occurrence of each second word in the historical selling point text. The higher the frequency of occurrence of the second word in the historical selling point text, the greater the weight corresponding to the second word.
In the disclosure, each second word in the second preset word set is matched with each first sentence in the first page display text, and the matching degree between each first sentence and each second word is determined. Then, the first sentence corresponding to the maximum matching degree may be determined as the second corpus (i.e., sales point information). Thereby improving the accuracy of the selling point information.
Optionally, when the matching degree between a certain first sentence and any second word is greater than a first threshold, the first sentence is determined as the reference corpus. And under the condition that the number of the reference corpora is multiple, determining the reference corpora corresponding to the second word with the largest weight as the second corpus. Thereby improving the accuracy of the second corpus.
Optionally, when the second corpus only includes the second word, the reference selling point corpus corresponding to the second word in the second corpus may be further determined as the second corpus, so as to ensure smoothness of the generated popup window content.
Step 305, screening the third corpus from the second sentences according to the matching degree between each second sentence in the header field and the description field and each third word in the third preset word set.
The third word is a word corresponding to the drainage mode, for example, "try", "send", "get" and the like.
In the disclosure, each third word in a third preset word set is matched with each second sentence in a title field and a description field, and the matching degree between each second sentence and each third word is determined. And then, under the condition that the matching degree of a certain second sentence and any third word is larger than a preset threshold value, determining the second sentence as a third corpus (namely drainage mode information). Or determining the second sentence corresponding to the maximum matching degree as the third corpus. Thereby improving the accuracy of the drainage mode information.
Optionally, under the condition that the matching degree between each second sentence and any third word is smaller than a preset threshold, a preset drainage operation corresponding to the first type page can be determined to be the third corpus. For example, drainage techniques such as "micro consultation", "detailed knowledge of added micro letter", and the like are determined as the third corpus.
It can be understood that by means of matching, the first corpus, the second corpus and the third corpus are determined, so that the complexity of determining the popup content is reduced, and the efficiency of determining the popup content is improved.
Step 306, based on the preset format, the first corpus, the second corpus and the third corpus are combined to generate the content to be displayed in the popup window corresponding to the component.
In the disclosure, the first corpus, the second corpus and the third corpus may be spliced and combined based on a preset format to generate the content to be displayed in the popup window corresponding to the component.
Therefore, the first corpus, the second corpus and the third corpus are extracted from different parts of information of the page to be displayed in a targeted manner, and accuracy of the first corpus, the second corpus and the third corpus is improved. The accuracy of generating the content of the popup window is further improved.
In the disclosure, when a page to be displayed includes a preset type of component, determining a page type of the page to be displayed based on a layout of the page to be displayed, determining a page title field of the page to be displayed, a first page display text of the page to be displayed, a title field of the component and a description field of the component as reference texts when the page type is a first type, deleting any first word in the page title field when the page title field includes any first word in a first preset word set, acquiring a first corpus, screening second corpora from each first sentence according to a matching degree between each first sentence in the first page display text and each second word in a second preset word set, screening third corpora from each second sentence according to a matching degree between each second sentence in the title field and the description field and each third word in a third preset word set, and then combining the first corpora, the second corpora and the third corpora based on a and the preset format, and generating content to be displayed in a window corresponding to the component. Therefore, according to the page type, the first corpus, the second corpus and the third corpus are extracted from different parts of information of the page to be displayed in a targeted manner, and accuracy and efficiency of the first corpus, the second corpus and the third corpus are improved. The accuracy and efficiency of generating the content of the popup window are further improved.
Fig. 4 is a flowchart of a method for determining pop-up content according to an embodiment of the present disclosure.
As shown in fig. 4, the method includes:
in step 401, in the case that the page to be displayed includes a preset type of component, the page type of the page to be displayed is determined based on the layout of the page to be displayed.
In the present disclosure, the specific implementation process of step 401 may refer to the detailed description of any embodiment of the present disclosure, which is not repeated herein.
Step 402, in the case that the page type is the second type, determining the number of characters of each paragraph of the first page presentation text in the page to be displayed.
Wherein the second type corresponds to a page of the novel domain.
In the disclosure, the long text in the page of the novel field is generally novel text content, has no generalization, is not suitable for serving as business information, and does not contain selling point information and drainage mode information. Thus, the number of characters for each paragraph of the first page presentation text in the page to be displayed can be determined. To filter the first page presentation text according to the number of characters in each paragraph.
And step 403, deleting the paragraphs with the number of the corresponding characters larger than a second threshold value in the first page display text, and obtaining a second page display text.
In the method, the paragraphs with the number of the corresponding characters larger than the second threshold value in the first page display text can be deleted, so that the influence of redundant information on the generation of popup window contents is reduced, and the accuracy of determining the popup window contents is improved.
Step 404, determining the second page presentation text and the title field of the component as the reference text.
In the disclosure, the second page presentation text may be determined as a reference text of business information and sales point information corresponding to the page. The preset type component in the novel field page only comprises a title field, so that the title field of the preset type component can be determined to be the reference text of the selling point information or the drainage mode information corresponding to the page. Thereby improving the accuracy of the reference text.
And step 405, extracting a first corpus from the second page display text based on a preset matching rule.
In the disclosure, a novel name, a chapter name, a novel type, or the like can be screened out from the second page display text by using a preset regular matching rule, and the novel type is used as the first corpus. Therefore, the correlation degree between the service information and the novel specific information of the page to be displayed can be improved.
For example, as in the page in fig. 5, a preset regular matching rule may be used to screen out the novel name "×", or the novel type "talk novel" as the first corpus.
Alternatively, in the case that the extraction of the first corpus from the second page display text fails based on the preset matching rule, the preset universal phone may be determined as the first corpus. For example, "hot novels" are determined as the first corpus.
Step 406, screening the second corpus from the third sentences according to the matching degree between each third sentence in the second page display text and each fourth word in the fourth preset word set.
The fourth word is a word for displaying the selling point, for example, "free" and the like. The fourth word may be the same as or different from the second word in the above embodiment. The present disclosure is not limited in this regard.
In the disclosure, each fourth word in a fourth preset word set is matched with each second sentence in the second page display text, and the matching degree between each second sentence and each fourth word is determined. And then, under the condition that the matching degree of a certain second sentence and any fourth word is larger than a preset threshold value, determining the second sentence as a second corpus (namely sales point information). Or determining the second sentence corresponding to the maximum matching degree as the second corpus. Thereby improving the accuracy of the selling point information.
Optionally, a preset selling point phone corresponding to the fourth word in the second corpus may be determined as the second corpus. For example, "new chapter free", "0-element reading", "free acquisition", "free no popup window", "good book free reading", etc. are determined as the second corpus.
Step 407, determining that the title field is a third corpus when the matching degree between the title field and any fifth word in the fifth preset word set is greater than a third threshold.
The fifth word is a word corresponding to the drainage mode.
Optionally, under the condition that the matching degree between the title field and any fifth word in the fifth preset word set is smaller than a third threshold, a preset drainage speech operation (i.e. policy speech operation) corresponding to the second type page may be determined as a third corpus. For example, the "follow-up more wonderful-! Drainage techniques such as "," Add WeChat read full "are determined as the third corpus.
Optionally, the first corpus may be extracted from the second page display text based on a preset matching rule, and if the matching degree between the title field and any fourth word in the fourth preset word set is greater than a fourth threshold, the preset corpus corresponding to any fourth word is determined to be the second corpus, the preset tactical operation is determined to be the third corpus, and based on a preset format, the first corpus, the second corpus and the third corpus are combined to generate the content to be displayed in the popup window corresponding to the component. The accuracy and fluency of popup window content are improved.
Step 408, based on the preset format, the first corpus, the second corpus and the third corpus are combined to generate the content to be displayed in the popup window corresponding to the component.
In this disclosure, the specific implementation process of step 408 may refer to the detailed description of any embodiment of this disclosure, which is not repeated here.
According to the page type, the first corpus, the second corpus and the third corpus are extracted from different parts of information of the page to be displayed in a targeted mode, and accuracy and efficiency of the first corpus, the second corpus and the third corpus are improved. The accuracy and efficiency of generating the content of the popup window are further improved.
In order to achieve the above embodiments, the embodiments of the present disclosure further provide a device for determining pop-up content.
Fig. 6 is a schematic structural diagram of a device for determining pop-up content according to an embodiment of the present disclosure.
As shown in fig. 6, the pop-up content determining apparatus 600 includes: a determining module 610, a screening module 620, a generating module 630.
A determining module 610, configured to determine, based on a layout of the page to be displayed, a page type of the page to be displayed, where the page to be displayed includes a preset type of component;
the screening module 620 is configured to screen page information in a page to be displayed according to a page type, so as to obtain a reference text;
The generating module 630 is configured to parse the text to be referred, and generate content to be displayed in the popup window corresponding to the component.
In one possible implementation manner of the embodiment of the present disclosure, the screening module 620 is configured to:
and under the condition that the page type is the first type, determining a page title field of the page to be displayed, a first page display text of the page to be displayed, a title field of the component and a description field of the component as reference texts.
In one possible implementation manner of the embodiment of the present disclosure, the generating module 630 is configured to:
deleting any first word in the page title field under the condition that the page title field contains any first word in a first preset word set, and acquiring a first corpus;
screening a second corpus from each first sentence according to the matching degree between each first sentence in the first page display text and each second word in a second preset word set;
screening a third corpus from the second sentences according to the matching degree between each second sentence in the title field and the description field and each third word in a third preset word set;
based on a preset format, the first corpus, the second corpus and the third corpus are combined to generate content to be displayed in the popup window corresponding to the component.
In one possible implementation manner of the embodiment of the present disclosure, the generating module 630 is configured to:
the second word set further comprises preset weight of each second word, and if the matching degree between any first sentence and any second word is larger than a first threshold value, any first sentence is determined to be a reference corpus;
and under the condition that the number of the reference corpora is multiple, determining the reference corpora corresponding to the second word with the largest weight as the second corpus.
In one possible implementation manner of the embodiment of the present disclosure, the screening module 620 is configured to:
determining the number of characters of each paragraph of the first page display text in the page to be displayed under the condition that the page type is the second type;
deleting paragraphs with the number of corresponding characters larger than a second threshold value in the first page display text to obtain a second page display text;
and determining the second page display text and the title field of the component as reference text.
In one possible implementation manner of the embodiment of the present disclosure, the generating module 630 is configured to:
extracting a first corpus from the second page display text based on a preset matching rule;
screening a second corpus from each third sentence according to the matching degree between each third sentence in the second page display text and each fourth word in a fourth preset word set;
Under the condition that the matching degree between the title field and any fifth word in the fifth preset word set is larger than a third threshold value, determining that the title field is a third corpus;
based on a preset format, the first corpus, the second corpus and the third corpus are combined to generate content to be displayed in the popup window corresponding to the component.
In one possible implementation manner of the embodiment of the present disclosure, the generating module 630 is configured to:
extracting a first corpus from the second page display text based on a preset matching rule;
under the condition that the matching degree between the title field and any fourth word in the fourth preset word set is larger than a fourth threshold value, determining the preset corpus corresponding to any fourth word as a second corpus;
determining a preset strategy speech as a third corpus;
based on a preset format, the first corpus, the second corpus and the third corpus are combined to generate content to be displayed in the popup window corresponding to the component.
It should be noted that, the explanation of the embodiment of the method for determining the pop-up window content is also applicable to the device of this embodiment, so that the description is omitted here.
In the disclosure, under the condition that a to-be-displayed page contains a preset type of component, determining the page type of the to-be-displayed page based on the layout of the to-be-displayed page, screening page information in the to-be-displayed page according to the page type to obtain a reference text, and then analyzing the reference text to generate contents to be displayed in a popup window corresponding to the component. Therefore, according to the page type of the page to be displayed, screening the page information in the page to be displayed, determining a reference text, analyzing the reference text, and generating the content of the popup window. The accuracy of the reference text is improved, and therefore the accuracy of the content for generating the popup window is improved.
According to an embodiment of the disclosure, the disclosure further provides an electronic device, a readable storage medium.
Fig. 7 illustrates a schematic block diagram of an example electronic device 700 that may be used to implement embodiments of the present disclosure. Electronic devices are intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other appropriate computers. The electronic device may also represent various forms of mobile devices, such as personal digital processing, cellular telephones, smartphones, wearable devices, and other similar computing devices. The components shown herein, their connections and relationships, and their functions, are meant to be exemplary only, and are not meant to limit implementations of the disclosure described and/or claimed herein.
As shown in fig. 7, the apparatus 700 includes a computing unit 701 that can perform various appropriate actions and processes according to a computer program stored in a ROM (Read-Only Memory) 702 or a computer program loaded from a storage unit 708 into a RAM (Random Access Memory ) 703. In the RAM 703, various programs and data required for the operation of the device 700 may also be stored. The computing unit 701, the ROM 702, and the RAM 703 are connected to each other through a bus 704. An I/O (Input/Output) interface 705 is also connected to bus 704.
Various components in device 700 are connected to I/O interface 705, including: an input unit 706 such as a keyboard, a mouse, etc.; an output unit 707 such as various types of displays, speakers, and the like; a storage unit 708 such as a magnetic disk, an optical disk, or the like; and a communication unit 709 such as a network card, modem, wireless communication transceiver, etc. The communication unit 709 allows the device 700 to exchange information/data with other devices via a computer network, such as the internet, and/or various telecommunication networks.
The computing unit 701 may be a variety of general and/or special purpose processing components having processing and computing capabilities. Some examples of computing unit 701 include, but are not limited to, a CPU (Central Processing Unit ), a GPU (Graphic Processing Units, graphics processing unit), various dedicated AI (Artificial Intelligence ) computing chips, various computing units running machine learning model algorithms, a DSP (Digital Signal Processor ), and any suitable processor, controller, microcontroller, etc. The computing unit 701 performs the respective methods and processes described above, for example, a determination method of the popup content. For example, in some embodiments, the method of determining pop-up content may be implemented as a computer software program tangibly embodied on a machine-readable medium, such as storage unit 708. In some embodiments, part or all of the computer program may be loaded and/or installed onto device 700 via ROM 702 and/or communication unit 709. When the computer program is loaded into RAM 703 and executed by the computing unit 701, one or more steps of the above-described method of determining the content of a pop may be performed. Alternatively, in other embodiments, the computing unit 701 may be configured to perform the method of determining the pop content in any other suitable way (e.g. by means of firmware).
Various implementations of the systems and techniques described here above may be implemented in digital electronic circuitry, integrated circuit System, FPGA (Field Programmable Gate Array ), ASIC (Application-Specific Integrated Circuit, application-specific integrated circuit), ASSP (Application Specific Standard Product, special-purpose standard product), SOC (System On Chip ), CPLD (Complex Programmable Logic Device, complex programmable logic device), computer hardware, firmware, software, and/or combinations thereof. These various embodiments may include: implemented in one or more computer programs, the one or more computer programs may be executed and/or interpreted on a programmable system including at least one programmable processor, which may be a special purpose or general-purpose programmable processor, that may receive data and instructions from, and transmit data and instructions to, a storage system, at least one input device, and at least one output device.
Program code for carrying out methods of the present disclosure may be written in any combination of one or more programming languages. These program code may be provided to a processor or controller of a general purpose computer, special purpose computer, or other programmable data processing apparatus such that the program code, when executed by the processor or controller, causes the functions/operations specified in the flowchart and/or block diagram to be implemented. The program code may execute entirely on the machine, partly on the machine, as a stand-alone software package, partly on the machine and partly on a remote machine or entirely on the remote machine or server.
In the context of this disclosure, a machine-readable medium may be a tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. The machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. The machine-readable medium may include, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples of a machine-readable storage medium would include an electrical connection based on one or more wires, a portable computer diskette, a hard disk, RAM, ROM, EPROM (Electrically Programmable Read-Only-Memory, erasable programmable read-Only Memory) or flash Memory, an optical fiber, a CD-ROM (Compact Disc Read-Only Memory), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
To provide for interaction with a user, the systems and techniques described here can be implemented on a computer having: a display device (e.g., CRT (Cathode-Ray Tube) or LCD (Liquid Crystal Display ) monitor) for displaying information to a user; and a keyboard and pointing device (e.g., a mouse or trackball) by which a user can provide input to the computer. Other kinds of devices may also be used to provide for interaction with a user; for example, feedback provided to the user may be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and input from the user may be received in any form, including acoustic input, speech input, or tactile input.
The systems and techniques described here can be implemented in a computing system that includes a background component (e.g., as a data server), or that includes a middleware component (e.g., an application server), or that includes a front-end component (e.g., a user computer having a graphical user interface or a web browser through which a user can interact with an implementation of the systems and techniques described here), or any combination of such background, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication (e.g., a communication network). Examples of communication networks include: LAN (Local Area Network ), WAN (Wide Area Network, wide area network), internet and blockchain networks.
The computer system may include a client and a server. The client and server are typically remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other. The server can be a cloud server, also called a cloud computing server or a cloud host, and is a host product in a cloud computing service system, so that the defects of high management difficulty and weak service expansibility in the traditional physical hosts and VPS service (Virtual Private Server, virtual special servers) are overcome. The server may also be a server of a distributed system or a server that incorporates a blockchain.
It should be appreciated that various forms of the flows shown above may be used to reorder, add, or delete steps. For example, the steps recited in the present disclosure may be performed in parallel, sequentially, or in a different order, provided that the desired results of the disclosed aspects are achieved, and are not limited herein.
The above detailed description should not be taken as limiting the scope of the present disclosure. It will be apparent to those skilled in the art that various modifications, combinations, sub-combinations and alternatives are possible, depending on design requirements and other factors. Any modifications, equivalent substitutions and improvements made within the spirit and principles of the present disclosure are intended to be included within the scope of the present disclosure.

Claims (16)

1. A method of determining pop-up content, comprising:
under the condition that a to-be-displayed page comprises a preset type component, determining the page type of the to-be-displayed page based on the layout of the to-be-displayed page;
screening the page information in the page to be displayed according to the page type to obtain a reference text;
and analyzing the text to be referred to, and generating the content to be displayed in the popup window corresponding to the component.
2. The method of claim 1, wherein the filtering the page information in the page to be displayed according to the page type to obtain the reference text includes:
and under the condition that the page type is the first type, determining the page title field of the page to be displayed, the first page display text of the page to be displayed, the title field of the component and the description field of the component as the reference text.
3. The method of claim 2, wherein the parsing the text to be referred to generates the content to be displayed in the popup corresponding to the component, including:
deleting any first word in the page title field under the condition that the page title field contains any first word in a first preset word set, and acquiring a first corpus;
screening a second corpus from each first sentence according to the matching degree between each first sentence in the first page display text and each second word in a second preset word set;
screening a third corpus from each second sentence according to the matching degree between each second sentence in the title field and the description field and each third word in a third preset word set;
Based on a preset format, the first corpus, the second corpus and the third corpus are combined to generate content to be displayed in the popup window corresponding to the component.
4. The method of claim 3, wherein the screening the second corpus from the first sentences according to the matching degree between each first sentence in the first page display text and each second word in the second preset word set comprises:
the second word set further comprises preset weights of each second word, and if the matching degree between any first sentence and any second word is greater than a first threshold value, determining that any first sentence is a reference corpus;
and under the condition that the number of the reference corpora is multiple, determining the reference corpora corresponding to the second word with the largest weight as the second corpus.
5. The method of claim 1, wherein the filtering the page information in the page to be displayed according to the page type to obtain the reference text includes:
determining the number of characters of each paragraph of the first page display text in the page to be displayed under the condition that the page type is the second type;
Deleting paragraphs with the number of corresponding characters larger than a second threshold value in the first page display text to obtain a second page display text;
and determining the second page display text and the title field of the component as the reference text.
6. The method of claim 5, wherein the parsing the text to be referred to generates the content to be displayed in the popup corresponding to the component, comprising:
extracting a first corpus from the second page display text based on a preset matching rule;
screening a second corpus from each third sentence according to the matching degree between each third sentence in the second page display text and each fourth word in a fourth preset word set;
determining that the title field is a third corpus under the condition that the matching degree between the title field and any fifth word in a fifth preset word set is greater than a third threshold;
based on a preset format, the first corpus, the second corpus and the third corpus are combined to generate content to be displayed in the popup window corresponding to the component.
7. The method of claim 5, wherein the parsing the text to be referred to generates the content to be displayed in the popup corresponding to the component, comprising:
Extracting a first corpus from the second page display text based on a preset matching rule;
under the condition that the matching degree between the title field and any fourth word in a fourth preset word set is larger than a fourth threshold value, determining a preset corpus corresponding to any fourth word as a second corpus;
determining a preset strategy speech as a third corpus;
based on a preset format, the first corpus, the second corpus and the third corpus are combined to generate content to be displayed in the popup window corresponding to the component.
8. A device for determining popup content, comprising:
the determining module is used for determining the page type of the page to be displayed based on the layout of the page to be displayed under the condition that the page to be displayed contains components of a preset type;
the screening module is used for screening the page information in the page to be displayed according to the page type so as to acquire a reference text;
and the generating module is used for analyzing the text to be referred to and generating the content to be displayed in the popup window corresponding to the component.
9. The apparatus of claim 8, wherein the screening module is to:
and under the condition that the page type is the first type, determining the page title field of the page to be displayed, the first page display text of the page to be displayed, the title field of the component and the description field of the component as the reference text.
10. The apparatus of claim 9, wherein the means for generating is configured to:
deleting any first word in the page title field under the condition that the page title field contains any first word in a first preset word set, and acquiring a first corpus;
screening a second corpus from each first sentence according to the matching degree between each first sentence in the first page display text and each second word in a second preset word set;
screening a third corpus from each second sentence according to the matching degree between each second sentence in the title field and the description field and each third word in a third preset word set;
based on a preset format, the first corpus, the second corpus and the third corpus are combined to generate content to be displayed in the popup window corresponding to the component.
11. The apparatus of claim 10, wherein the means for generating is configured to:
the second word set further comprises preset weights of each second word, and if the matching degree between any first sentence and any second word is greater than a first threshold value, determining that any first sentence is a reference corpus;
And under the condition that the number of the reference corpora is multiple, determining the reference corpora corresponding to the second word with the largest weight as the second corpus.
12. The apparatus of claim 8, wherein the screening module is to:
determining the number of characters of each paragraph of the first page display text in the page to be displayed under the condition that the page type is the second type;
deleting paragraphs with the number of corresponding characters larger than a second threshold value in the first page display text to obtain a second page display text;
and determining the second page display text and the title field of the component as the reference text.
13. The apparatus of claim 12, wherein the means for generating is configured to:
extracting a first corpus from the second page display text based on a preset matching rule;
screening a second corpus from each third sentence according to the matching degree between each third sentence in the second page display text and each fourth word in a fourth preset word set;
determining that the title field is a third corpus under the condition that the matching degree between the title field and any fifth word in a fifth preset word set is greater than a third threshold;
Based on a preset format, the first corpus, the second corpus and the third corpus are combined to generate content to be displayed in the popup window corresponding to the component.
14. The apparatus of claim 12, wherein the means for generating is configured to:
extracting a first corpus from the second page display text based on a preset matching rule;
under the condition that the matching degree between the title field and any fourth word in a fourth preset word set is larger than a fourth threshold value, determining a preset corpus corresponding to any fourth word as a second corpus;
determining a preset strategy speech as a third corpus;
based on a preset format, the first corpus, the second corpus and the third corpus are combined to generate content to be displayed in the popup window corresponding to the component.
15. An electronic device, comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein,,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of any one of claims 1-7.
16. A non-transitory computer readable storage medium storing computer instructions for causing the computer to perform the method of any one of claims 1-7.
CN202310369244.6A 2023-04-07 2023-04-07 Method and device for determining popup content and electronic equipment Pending CN116382830A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310369244.6A CN116382830A (en) 2023-04-07 2023-04-07 Method and device for determining popup content and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310369244.6A CN116382830A (en) 2023-04-07 2023-04-07 Method and device for determining popup content and electronic equipment

Publications (1)

Publication Number Publication Date
CN116382830A true CN116382830A (en) 2023-07-04

Family

ID=86967165

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310369244.6A Pending CN116382830A (en) 2023-04-07 2023-04-07 Method and device for determining popup content and electronic equipment

Country Status (1)

Country Link
CN (1) CN116382830A (en)

Similar Documents

Publication Publication Date Title
CN113705187B (en) Method and device for generating pre-training language model, electronic equipment and storage medium
CN113807098B (en) Model training method and device, electronic equipment and storage medium
CN115982376B (en) Method and device for training model based on text, multimode data and knowledge
CN112784589B (en) Training sample generation method and device and electronic equipment
CN112580339B (en) Model training method and device, electronic equipment and storage medium
CN113590776A (en) Text processing method and device based on knowledge graph, electronic equipment and medium
US20230005283A1 (en) Information extraction method and apparatus, electronic device and readable storage medium
US20220121668A1 (en) Method for recommending document, electronic device and storage medium
CN114021548A (en) Sensitive information detection method, training method, device, equipment and storage medium
CN112926308A (en) Method, apparatus, device, storage medium and program product for matching text
CN114398943B (en) Sample enhancement method and device thereof
CN114970540A (en) Method and device for training text audit model
CN114880520B (en) Video title generation method, device, electronic equipment and medium
WO2023016163A1 (en) Method for training text recognition model, method for recognizing text, and apparatus
CN113051396B (en) Classification recognition method and device for documents and electronic equipment
CN113641724B (en) Knowledge tag mining method and device, electronic equipment and storage medium
CN115048523A (en) Text classification method, device, equipment and storage medium
CN114780821A (en) Text processing method, device, equipment, storage medium and program product
CN116382830A (en) Method and device for determining popup content and electronic equipment
CN114676699A (en) Entity emotion analysis method and device, computer equipment and storage medium
CN113221566A (en) Entity relationship extraction method and device, electronic equipment and storage medium
CN113342179A (en) Input text processing method and device, electronic equipment and storage medium
CN111767728A (en) Short text classification method, device, equipment and storage medium
CN114490976B (en) Method, device, equipment and storage medium for generating dialogue abstract training data
CN113705206B (en) Emotion prediction model training method, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination