CN105373570B - Management method and terminal for browser history records - Google Patents

Management method and terminal for browser history records Download PDF

Info

Publication number
CN105373570B
CN105373570B CN201410444334.8A CN201410444334A CN105373570B CN 105373570 B CN105373570 B CN 105373570B CN 201410444334 A CN201410444334 A CN 201410444334A CN 105373570 B CN105373570 B CN 105373570B
Authority
CN
China
Prior art keywords
plain text
ith
browsing
pages
attention
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201410444334.8A
Other languages
Chinese (zh)
Other versions
CN105373570A (en
Inventor
丁跞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ZTE Corp
Original Assignee
ZTE Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ZTE Corp filed Critical ZTE Corp
Priority to CN201410444334.8A priority Critical patent/CN105373570B/en
Publication of CN105373570A publication Critical patent/CN105373570A/en
Application granted granted Critical
Publication of CN105373570B publication Critical patent/CN105373570B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Information Transfer Between Computers (AREA)

Abstract

The embodiment of the invention discloses a management method of a browser history record, which is applied to a terminal and comprises the following steps: determining N plain text pages in all historical webpages, wherein N is an integer greater than or equal to 2; obtaining an attention parameter of an ith plain text page in the N plain text pages, wherein the attention parameter is used for representing the attention degree of a user to the ith plain text page, and i is an integer greater than 1 and less than or equal to N; and sequencing the webpage identifications corresponding to the N plain text pages according to the attention parameter. The embodiment of the invention also discloses a terminal.

Description

Management method and terminal for browser history records
Technical Field
The invention relates to the field of information management, in particular to a management method and a terminal for a browser history record.
Background
With the rapid development of wireless communication technology and internet technology, more and more users can surf the internet on the terminal by using the browser, and the browser has a history recording function and can record websites visited by the users, so that the users can directly search in the history record when needing to review the websites visited by the users, and the use of the users is facilitated.
At present, the inquiry of the historical webpage record of the browser can be inquired according to the access time, the terminal can record the time of each time the user accesses the website, the historical webpages accessed by the user are sorted according to the time sequence, and when the user needs to inquire the historical webpages, the user can check the historical webpages which the user wants to search according to the sorted sequence. However, as the time of using the browser by the user is longer and longer, the data recorded by the browser is more and more, when the user needs to search for a webpage viewed before a long time, the user hardly memorizes the browsing time, and even if an approximate time is recalled, it takes a lot of time to search in the time period, that is, when the user wants to query the webpage viewed before, the user cannot accurately and quickly find the historical webpage he wants.
Therefore, the prior art has the technical problem of low efficiency of querying the history of the browser.
Disclosure of Invention
In view of this, embodiments of the present invention are intended to provide a method and a terminal for managing a browser history record, so as to improve the efficiency of querying the history record of a browser, so that a user can quickly and accurately query a history webpage to be queried.
In order to achieve the purpose, the technical scheme of the invention is realized as follows:
in a first aspect, an embodiment of the present invention provides a method for managing a browser history, where the method includes: determining N plain text pages in all historical webpages, wherein N is an integer greater than or equal to 2; obtaining an attention parameter of an ith plain text page in the N plain text pages, wherein the attention parameter is used for representing the attention degree of a user to the ith plain text page, and i is an integer greater than 1 and less than or equal to N; and sequencing the webpage identifications corresponding to the N plain text pages according to the attention parameter.
Further, the determining N plain text pages in all the historical web pages includes: obtaining the ratio of the length of the text with the link attribute to the length of the text with the non-link attribute in each historical webpage; and determining the N historical webpages of which the ratios meet preset conditions as the N plain text pages.
Further, before determining N plain text pages in all the historical web pages, the method further includes: obtaining the ratio of the length of the text with the link attribute to the length of the text with the non-link attribute in the current browsed webpage; when the ratio meets the preset condition, adding a plain text page identifier to the currently browsed webpage; determining N plain text pages in all historical webpages, wherein the N plain text pages comprise: and determining the N historical webpages with the plain text page identifications as the N plain text pages.
Further, the obtaining of the attention parameter of the ith plain text page in the N plain text pages includes: determining an effective browsing operation group based on the time length between two adjacent browsing operations, wherein the effective browsing operation group is the two adjacent browsing operations performed on the ith plain text page within a preset operation time length; based on the effective browsing operation group, obtaining a total effective browsing duration of the ith plain text page and obtaining a total effective browsing height of the ith plain text page, wherein the total effective browsing duration is used for representing a sum of effective attention durations of the user to the ith plain text page, and the total effective browsing height is used for representing a sum of effective attention areas of the user to the ith plain text page; and obtaining the attention parameter of the ith plain text page according to the total effective browsing duration and the total effective browsing height.
Further, the obtaining of the total effective browsing height of the ith plain text page includes: obtaining the offset of the heights of the two adjacent effective browsing operation groups, wherein the height of the effective browsing operation group is the distance between the initial position of the effective browsing operation group on the ith plain text page and the initial position of the browsing window of the ith plain text page; determining the effective height of the effective browsing operation group according to the offset and the height value of the browsing window, wherein the effective height is an effective attention area when the user performs the effective browsing operation group on the ith plain text page; and obtaining the total effective browsing height of the ith plain text page according to the effective height.
Further, the obtaining of the attention parameter of the ith plain text page includes: obtaining average attention values of the first i plain text pages in the N plain text pages, wherein the average attention values are used for representing the average attention degree of the user to the first i plain text pages; and obtaining the attention parameter according to a configuration value of a time attenuation factor and the average attention value of the first i plain text pages, wherein the time attenuation factor is used for representing the priority of the ith plain text page, and the time attenuation factor is greater than or equal to 0.
Further, the obtaining an average attention value of the first i plain text pages in the N plain text pages includes: and obtaining the average attention value of the first i plain text pages based on the average attention value of the first i-1 plain text pages in the N plain text pages.
Further, the average attention value of the first i plain text pages is specifically obtained by the following formula:
Figure BDA0000564405950000031
wherein E isiThe average attention value of the first i plain text pages is obtained; ei-1The average attention value of the first i-1 plain text pages is obtained; a. theiIs the ratio of the total effective browsing duration to the total effective browsing height.
Further, the attention parameter is specifically obtained by the following formula:
Vi=Ai×(1-τ)+(T-TS)/(TN-TS)×Ei×τ
wherein, ViThe attention degree parameter of the ith plain text page is obtained; a. theiThe ratio of the total effective browsing duration to the total effective browsing height of the ith plain text page is obtained; τ is the time decay factor; t is the time when the user browses the webpage; t isSA valid time set for the user; t isNTime to initiate a history lookup for the user; eiIs the average attention value of the ith plain text page.
In a second aspect, an embodiment of the present invention provides a terminal, where the terminal includes: the device comprises a determining unit, an obtaining unit and a sorting unit; the determining unit is used for determining N plain text pages in all historical webpages, wherein N is an integer greater than or equal to 2; the obtaining unit is configured to obtain an attention parameter of an ith plain text page in the N plain text pages, where the attention parameter is used to represent an attention degree of a user to the ith plain text page, and i is an integer greater than 0 and less than or equal to N; and the sequencing unit is used for sequencing the webpage identifications corresponding to the N plain text pages according to the attention parameter.
Further, the determining unit is specifically configured to obtain a ratio of a length of a text with a link attribute to a length of a text with a non-link attribute in each history webpage; and determining the N historical webpages of which the ratios meet preset conditions as the N plain text pages.
Further, the determining unit is specifically configured to obtain, before determining N plain text pages in all the historical webpages, a ratio between a length of a text having a link attribute and a length of a text having a non-link attribute in a currently browsed webpage; when the ratio meets the preset condition, adding a plain text page identifier to the currently browsed webpage; and the server is further used for determining the N historical webpages with the plain text page identifications as the N plain text pages.
Further, the obtaining unit includes: determining a subunit, a first obtaining subunit and a second obtaining subunit; the determining subunit is configured to determine an effective browsing operation group based on a duration between two adjacent browsing operations, where the effective browsing operation group is the two adjacent browsing operations performed on the ith plain text page within a preset operation duration; the first obtaining subunit is configured to obtain, based on the effective browsing operation group, a total effective browsing duration of the ith plain text page and a total effective browsing height of the ith plain text page, where the total effective browsing duration is used to represent a sum of effective attention durations of the user to the ith plain text page, and the total effective browsing height is used to represent a sum of effective attention areas of the user to the ith plain text page; and the second obtaining subunit is configured to obtain the attention parameter of the ith plain text page according to the total effective browsing duration and the total effective browsing height.
Further, the first obtaining subunit is specifically configured to obtain an offset of the height of the effective browsing operation group, and determine the effective height of the effective browsing operation group according to the offset and the height value of the browsing window; obtaining the total effective browsing height of the ith plain text page according to the effective height; and the height of the effective browsing operation group is the distance between the effective browsing operation group on the ith plain text page and the initial position of the browsing window of the ith plain text page.
Further, the second obtaining subunit is specifically configured to obtain average attention values of the first i plain text pages in the N plain text pages; obtaining the attention degree parameter according to a configuration value of a time attenuation factor and an average attention degree value of the first i plain text pages, wherein the average attention degree value is used for representing the average attention degree of the user to the first i plain text pages, the time attenuation factor is used for representing the priority of the ith plain text page, and the time attenuation factor is greater than or equal to 0.
Further, the second obtaining subunit is specifically configured to obtain average attention values of the top i plain text pages based on the average attention values of the top i-1 plain text pages in the N plain text pages.
The method and the terminal for managing the browser history record provided by the embodiment of the invention have the advantages that the terminal determines N plain text pages in all history webpages browsed by a user, then obtains the attention parameter of the user to the ith plain text page, and sorts the webpage identifiers corresponding to the N plain text pages according to the attention parameter, the method can be known, the webpage identifiers are sorted according to the attention degree of the user to the history webpages, generally speaking, the history webpages the user wants to search are the webpages with higher attention degree, when the history webpages are more, the webpage identifiers with higher attention degree after being sorted are generally concentrated at one end of a webpage identifier sequence, and the history webpages corresponding to the webpage identifiers with higher attention degree are often wanted by the user, so the user can quickly and accurately find the history webpages the user wants to search in the webpages with higher attention degree, therefore, the technical problem that the historical record query efficiency of the browser is low in the prior art is effectively solved, the historical record query efficiency of the browser is further improved, a user can quickly and accurately find a historical webpage to be queried, and user experience is greatly improved.
Drawings
FIG. 1 is a flowchart illustrating a method for managing browser history according to an embodiment of the present invention;
fig. 2 is a schematic flow chart of a method for obtaining an attention parameter of an ith plain text page in N plain text pages according to an embodiment of the present invention;
FIG. 3 is a diagram illustrating an exemplary height offset corresponding to an effective browsing operation set according to an embodiment of the present invention;
fig. 4 is a schematic flowchart of a method for obtaining a focus parameter of an ith plain text page in the embodiment of the present invention;
fig. 5 is a flowchart illustrating a method for managing ten history webpages according to an embodiment of the present invention.
Detailed Description
The technical solution in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention.
The embodiment of the invention provides a management method of a browser history record, which is applied to a terminal, wherein the terminal can be a smart phone, a tablet computer, a super notebook and the like, the browser is installed on the terminal, and a user can browse a webpage displayed in a browser window, such as clicking a mouse, sliding the mouse and the like, without specific limitation.
Fig. 1 is a flowchart illustrating a method for managing a browser history according to an embodiment of the present invention, and referring to fig. 1, the method includes:
s101: determining N plain text pages in all historical webpages, wherein N is an integer greater than or equal to 2;
in the implementation, the timing of executing S101 may be, but is not limited to, the following two cases.
Firstly, when a user finishes browsing web pages and needs to query historical web pages, a terminal determines N plain text pages in all the historical web pages browsed by the user;
secondly, in the process of browsing the webpage by the user, the terminal firstly determines whether the webpage currently browsed by the user is a plain text page, and then determines N plain text pages from all historical webpages when the user inquires the historical webpages.
Specifically, for the first case, S101 may specifically be: obtaining the ratio of the length of the text with the link attribute to the length of the text with the non-link attribute in each historical webpage; and determining the N historical webpages of which the ratios meet preset conditions as the N plain text pages.
In practical application, all the history webpages may be all browsed webpages recorded by the browser from the beginning of using the browser by the user for the first time, or all browsed webpages within an effective time set by the user, where the effective time refers to a time range in which the user wants to view the history webpages, for example, the effective time set by the user is 15 days, and then the user can only view the history webpages within the 15 days, and accordingly, the terminal can only manage the history webpages within the 15 days, so as to improve the efficiency of managing the history webpages by the terminal.
Specifically, when the user queries the history webpages, the terminal may retrieve text nodes in a Document Object Model (DOM) tree of each history webpage by traversing the DOM tree, calculate a text length L1 having a link attribute and a text length L2 having a non-link attribute in each history webpage, obtain a ratio of L1 to L2, find N history webpages in which the ratio satisfies a preset condition among all history webpages, and determine the N history webpages as plain text pages, among all history webpages browsed by the user.
In practical applications, the preset condition may be set according to different application needs, for example, the preset condition is 1/8 < L1/L2 < 1/3, and when L1/L2 of a certain historical web page is 1/4, since the ratio satisfies the preset condition, the historical web page is determined as a plain text page, for example, four historical web pages among ten historical web pages browsed by a user satisfy the preset condition, and the four historical web pages are determined as plain text pages.
For the second case, before S101, the method further includes: obtaining the ratio of the length of the text with the link attribute to the length of the text with the non-link attribute in the current browsed webpage; when the ratio meets the preset condition, adding a plain text page identification to the currently browsed webpage;
specifically, the terminal obtains a text node in a DOM tree that can be retrieved by traversing the DOM tree of a web page currently browsed by a user, calculates a text length L1 having a link attribute and a text length L2 having a non-link attribute in the web page, respectively, obtains a ratio of L1 to L2, determines the web page as a plain text page when the ratio satisfies a preset condition, and adds a plain text page identifier to the web page.
At this time, correspondingly, S101 specifically includes: and determining the N historical webpages with the plain text page identifications as the N plain text pages.
Specifically, the terminal marks the webpages of the plain text pages in the process of browsing the webpages by the user, so that when the user inquires the historical webpages, the terminal can determine the N historical webpages with the plain text page identifications as the N plain text pages.
S102: obtaining an attention parameter of an ith plain text page in the N plain text pages, wherein i is an integer which is greater than 1 and less than or equal to N;
fig. 2 is a schematic flow chart of a method for obtaining a focus parameter of an ith plain text page in N plain text pages in the embodiment of the present invention, and as shown in fig. 2, S102 includes:
s201: determining an effective browsing operation group based on the time length between two adjacent browsing operations, wherein the effective browsing operation group refers to two adjacent browsing operations performed on the ith plain text page in the N plain text pages within a preset operation time length;
for example, the terminal determines four plain text pages in ten historical web pages, when the user opens the third plain text page, the terminal records the time for browsing the plain text page each time, assuming that the user browses the plain text page four times, the time is t1 ═ 10:05:00, t2 ═ 10:07:00, t3 ═ 10:07:10, t4 ═ 10:08:10, the range of the preset operation duration d is [30s, 300s ], respectively calculates the duration Δ t1 between t2 and t1 ═ 120s, the duration Δ t2 between t3 and t2 is 10s, the duration Δ t3 between t4 and t3 is 59660 s, since Δ t1 and Δ t3 are within the range of the preset operation duration, the value of t2 is not within the range of the preset operation, and the effective browsing time is determined as the browsing time from t 581 to t 5961, and determining two browsing operations within the time length from t3 to t4 as an effective browsing operation group 2, determining the time length delta t1 corresponding to the effective browsing operation group 1 as an effective browsing time length 1, and determining the time length delta t3 corresponding to the effective browsing operation group 2 as an effective time length 2.
S202: based on the effective browsing operation group, obtaining the total effective browsing duration of the ith plain text page, wherein the total effective browsing duration is used for representing the sum of effective attention durations of users to the ith plain text page;
specifically, when the user finishes browsing the third plain text page, the terminal accumulates the effective browsing durations determined in the above steps to obtain a total effective browsing duration for browsing the plain text page, for example, the effective duration is △ T1-120 s and △ T3-60 s, so that the total effective browsing duration for browsing the plain text page is △ T3General assembly=△t1+△t3=180s。
S203: based on the effective browsing operation group, obtaining the total effective browsing height of the ith plain text page; the total effective browsing height is used for representing the sum of effective attention areas of the user to the ith plain text page;
the method comprises the steps that firstly, based on the effective browsing operation, the offset of the heights of all effective browsing operation groups is obtained, wherein the heights of the effective browsing operation groups are the distance between the initial positions of the effective browsing operation groups on the ith plain text page and the browsing window of the ith plain text page;
for example, fig. 3 is a schematic diagram of the height offset amount corresponding to the effective browsing operation group in the embodiment of the present invention, and as shown in fig. 3, the height offset amount corresponding to the effective browsing operation group 1 is Δ h1 ═ 2cm, and the height offset amount corresponding to the effective browsing operation group 2 is Δ h2 ═ 4 cm.
Secondly, respectively comparing the offset of the height of each effective browsing operation group with the height value of the browsing window, and taking the minimum value of the two as the effective height of each effective browsing operation group to obtain the effective heights of all the effective browsing operation groups;
for example, the height value H of the browsing window is 4cm, the offset of the heights of the two effective browsing operation groups is Δ H1-2 cm, Δ H2-4 cm, the minimum value is taken in Δ H1 and H, that is, Δ H1, and the minimum value is taken in Δ H2 and H, that is, Δ H2, so that the effective height of the effective browsing operation group 1 is H1- Δ H1-2 cm, and the effective height of the effective browsing operation group 2 is H2- Δ H2-4 cm;
thirdly, accumulating the effective heights of all the effective browsing operation groups to obtain the total effective browsing height of the ith plain text page;
for example, the terminal adds the effective heights of the effective browsing operation group 1 and the effective browsing operation group 2 to obtain a total effective browsing height H3 of the third plain text page of the 4 plain text pagesGeneral assembly=H1+H2=6cm。
S204: and obtaining the attention parameter of the ith plain text page according to the total effective browsing duration and the total effective browsing height.
Specifically, firstly, the terminal calculates the ratio of the total effective browsing duration to the total effective browsing height according to the total effective browsing duration and the total effective browsing height;
for example, the total effective browsing duration △ T3 of the third plain text page by the userGeneral assemblyTotal effective browsing height H3 of 180sGeneral assembly6cm, then the ratio a of the total effective browsing duration to the total effective browsing height of the user for the third plain text page3=△T3General assembly/H3General assembly=30s/cm。
At this time, the terminal determines the ratio of the total effective browsing duration to the total effective browsing height of the ith plain text page as the attention parameter of the plain text page.
In practical application, in order to improve the priority of the recently browsed historical webpages, a user can set a time attenuation factor, and the terminal can also obtain the average attention value of the first i plain text pages based on the average attention value of the first i-1 plain text pages in the N plain text pages.
In the present embodiment, the average attention value of the first i plain text pages is obtained by formula (1).
Figure BDA0000564405950000101
Wherein E isiThe average attention value of the first i plain text pages is obtained; ei-1The average attention value of the first i-1 plain text pages; a. theiThe ratio of the total effective browsing duration to the total effective browsing height of the ith plain text page is obtained; when i is an integer of 1 or more and N or less, E is 22=(E1+A1) /2 wherein E1=A1
For example, the terminal obtains the average attention value E of the first two plain text pages in the four plain text pages2E33 s/cm, mixing2And A3Substituting equation (1) can calculate the average attention value of the first 3 plain text pages, i.e. E3=31s/cm。
And then, the terminal obtains the attention parameter of the ith plain text page based on the configuration value of the time attenuation factor and the average attention value of the first i plain text pages, wherein the time attenuation factor is used for representing the priority of the ith plain text page, and the time attenuation factor is greater than or equal to 0.
In the present embodiment, the attention parameter of the ith plain text page is obtained by formula (2).
Vi=Ai×(1-τ)+(T-TS)/(TN-TS)×Ei×τ (2)
Wherein, ViThe attention degree parameter is the attention degree parameter of the ith plain text page; a. theiThe ratio of the total effective browsing duration to the total effective browsing height of the ith plain text page is obtained; τ is a time decay factor; t is the time when the user browses the webpage; t isSAn effective time set for a user; t isNTime to initiate a history lookup for the user; eiIs the average attention value of the ith plain text page.
For example, the time decay factor τ set by the user is 0.5, the time when the user browses the web page is 8 months, that is, T is 8, the time when the user browses the web page for the first time is 1 month, TS1, current time 10 months, TNThen, the above parameters are substituted into the formula (2), and the attention parameter V of the third plain text page is calculated3=27.06s/cm。
In a specific implementation process, after obtaining the attention parameter of the ith plain text page through the S102, the attention parameter of the ith plain text page and the corresponding web page identifier may be stored in the history database, and when a user needs to query a history web page, the attention parameter of the user to the ith plain text history page may be directly obtained from the history database, so that the user may quickly find the history web page to be viewed.
S103: sequencing the webpage identifications corresponding to the N plain text pages according to the attention parameter;
in practical application, the terminal may sort the corresponding web page identifiers according to the sequence of the attention parameter from large to small, may also sort the corresponding web page identifiers according to the sequence of the attention parameter from small to large, and of course, may have other sorting manners, and the present invention is not limited specifically.
Further, after the webpage identifiers corresponding to the plain text pages are sorted in the step S103, the terminal can output the sorted webpage identifier sequences to the user, so that the user can conveniently check the attention degree of the historical webpages, and when the historical webpages to be checked are inquired, the search range can be narrowed according to the sorting result displayed by the terminal, so that the historical webpages to be searched can be quickly and accurately found.
The following describes a method for managing the browser history record by using a specific example.
Fig. 4 is a flowchart illustrating a method for managing ten historical webpages in an embodiment of the present invention, and referring to fig. 4, the method includes:
s401: the terminal respectively calculates L1 and L2 of each recorded historical webpage and calculates L1/L2;
the history web page 1 has L1/L2 of 1/5, the history web page 2 has L1/L2 of 1/4, the history web page has L1/L2 of 1/9, the history web page 4 has L1/L2 of 1/10, the history web page 5 has L1/L2 of 1/6, the history web page 6 has L1/L2 of 2/3, the history web page 7 has L1/L2 of 3/4, the history web page 8 has L1/L2 of 1/2, the history web page 9 has L1/L2 of 1/7, and the history web page 10 has L1/L2 of 4/5.
S402: the terminal determines the history webpages 1, 2, 5 and 9 which are larger than 1/8 and smaller than 1/3 in the L1/L2 as plain text pages;
s403: the terminal determines an effective browsing operation group and effective browsing duration thereof according to the duration of two adjacent browsing operations of a user on each plain text page aiming at the historical webpages 1, 2, 5 and 9;
firstly, for the history page 1, the terminal records the time of 4 browsing operations performed by the user, t1 being 10:05:00, t2 being 10:07:00, t3 being 10:07:10, t4 being 10:08:10, the value range of the preset time length d being [30s, 300s ], respectively calculating the time length Δ t1 between t2 and t1 being 120s, the time length Δ t2 between t3 and t2 being 10s, the time length Δ t3 between t4 and t3 being 60s, since Δ t1 and Δ t3 are within the range of the preset operation time period, Δ t2 is not within the range of the preset operation time period, therefore, the two browsing operations within the time length of t1 to t2 are determined as the effective browsing operation group 1, the two browsing operations within the time length of t3 to t4 are determined as the effective browsing operation group 2, and determining the time length delta t1 corresponding to the effective browsing operation group 1 as the effective browsing time length 1, and determining the time length delta t3 corresponding to the effective browsing operation group 2 as the effective time length 2.
By analogy, the effective browsing time lengths of the historical webpage 2 are respectively 110s and 90s for Δ t1 and Δ t 4; the effective browsing time lengths of the historical web pages 5 are respectively 50s, 60s and 40s, namely Δ t2, 4 and 5; the effective browsing time lengths of the history web pages 9 are respectively 80s and 137s for Δ t1 and Δ t 4.
S404, the effective browsing time lengths of the four historical webpages are respectively accumulated by the terminal, and the total effective browsing time length △ t1 of each historical webpage browsed by the user is obtainedGeneral assembly、△t2General assembly、△t5General assembly、△t9General assembly
In S403, the effective browsing durations of the history web page 1 obtained by the user are △ t 1-60S and △ t 3-120S, respectively, and the effective browsing durations are accumulated, that is, △ t1+ △ t 3-180S, so that the total effective browsing duration △ t1 of the history web page 1 browsed by the userGeneral assembly180s, and so on, the total effective browsing duration △ t2 of the user browsing the history web page 2General assembly200s, the total effective browsing time length △ t5 of the user browsing the historical webpage 5General assembly150s, the total effective browsing time △ t9 of the user browsing the historical webpage 9General assembly=217s。
S405: the terminal calculates the effective heights of the effective browsing operation groups based on the effective browsing operation groups, accumulates the effective heights, and obtains the total effective browsing height H1 of the user browsing the 4 historical webpagesGeneral assembly、H2General assembly、H5General assembly、H9General assembly
Among them, in the above history web page 1, the height offset of the browsing operation group 1 is △ H1-2 cm, the height offset of the effective browsing operation group 2 is △ H3-4 cm, the height value of the browsing window is H-4 cm, the minimum value is taken from △ H1 and H, i.e. △ H1, and the minimum value is taken from △ H2 and H, i.e. △ H2, so that the effective height of the effective browsing operation group 1 is H1- △ H1-2 cm, the effective height of the effective browsing operation group 2 is H2- △ H2-4 cm, and therefore, the total effective browsing height H1 of the user browsing history web page 1 is H1General assembly=6cm。
By analogy, the total effective height of the historical webpage 2 is H2General assemblyThe total effective height of the historical webpage 5 is H5 when the length of the webpage is 8cmGeneral assemblyThe total effective height of the historical webpage 9 is H9 at 5cmGeneral assembly=7cm。
S406, the terminal respectively calculates △ t of the four historical webpages according to the obtained total effective browsing duration and the total effective browsing heightGeneral assemblyAnd HGeneral assemblyRatio A between1、A2、A3、A4
Wherein, the total effective browsing time length △ t1 of the historical webpage 1General assemblyTotal effective browsing height H1 of 180sGeneral assembly6cm, then A130 s/cm; by analogy to this, historyA of Web Page 2225s/cm, A of historical webpage 5330s/cm, a of the history web page 94=31s/cm。
S407: the terminal respectively calculates the average attention value E of the first i plain text pages in the four historical webpagesiWherein the value of i is 1, 2, 3 and 4;
wherein E is1=A1Obtaining the average attention value E of the previous 2 plain text pages, namely the historical web pages 1 and 2 according to the formula (1) after 30s/cm227.5cm/s, and so on, E3=28.3cm/s,E4=29cm/s。
S408: the terminal is based on the time attenuation factor tau and the average attention value E1、E2、E3、E4Obtaining the attention parameter V of the four historical webpages1、V2、V3、V4
The time when the user browses the webpage is 8 months, namely T is 8, the time when the user browses the webpage for the first time is 1 month, and TS1, current time 10 months, TNWhen the time attenuation factor is set to τ 0.5 at 10, V is calculated according to equation (2)126.67s/cm, namely, the attention parameter of the user to the historical webpage 1 is 26.67 s/cm. By analogy, the user focuses on the parameter V of the historical webpage 2223.19s/cm, the attention parameter V of the user to the historical webpage 5326.1s/cm, the user's attention parameter V to the history web page 94=26.78s/cm。
S409: and the terminal sorts the webpage identifications corresponding to the four historical webpages according to the attention parameter.
Wherein, the terminal is according to the attention degree parameter of the above-mentioned four historical webpages, namely V1、V2、V3、V4Arranged in descending order, i.e. V4、V1、V3、V2And sequencing the corresponding webpage identifiers according to the sequenced attention degree parameters to obtain webpage identifier sequences, namely a historical webpage 9, a historical webpage 1, a historical webpage 5 and a historical webpage 2.
Thus, the management process of the ten historical webpages is completed, wherein the terminal may not display the corresponding webpage identifiers except for the six historical webpages 1, 2, 5 and 9, and may also display the corresponding webpage identifiers after the webpage identifier sequence according to the time sequence. When the user needs to inquire the historical webpages, the historical webpages with higher attention degree can be quickly found out from the ten historical webpages.
Therefore, when a user inquires historical webpages, if the historical webpages are more, after the terminal sorts the historical webpages according to the attention parameter method, the webpage identifiers with higher attention are generally concentrated at one end of the webpage identifier sequence, so that the user can quickly and accurately find the historical webpages which the user wants to inquire in the historical webpages with higher attention, the inquiry efficiency of the historical records of the browser is improved, and the user experience is greatly improved.
Based on the same inventive concept, embodiments of the present invention provide a terminal, which is consistent with the terminal described in one or more embodiments above.
Fig. 5 is a schematic structural diagram of a terminal in an embodiment of the present invention, and referring to fig. 5, the terminal includes: a determination unit 51, an obtaining unit 52, and a sorting unit 53;
the determining unit 51 is configured to determine N plain text pages in all history webpages, where N is an integer greater than or equal to 2; an obtaining unit 52, configured to obtain, by the terminal, an attention parameter of an ith plain text page in the N plain text pages, where the attention parameter is used to represent an attention degree of a user to the ith plain text page, and i is an integer greater than 1 and less than or equal to N; and the sorting unit 53 is configured to sort, by the terminal, the web page identifiers corresponding to the N plain text pages according to the attention parameter.
Further, the determining unit 51 is specifically configured to obtain a ratio of a length of the text with the link attribute to a length of the text with the non-link attribute in each history webpage; and determining the N historical webpages of which the ratios meet the preset conditions as N plain text pages.
Further, the determining unit 51 is specifically configured to, before determining N plain text pages in all historical webpages, obtain a ratio between a length of a text having a link attribute and a length of a text having a non-link attribute in a currently browsed webpage; when the ratio meets a preset condition, adding a plain text page identification to the currently browsed webpage; and the method is also used for determining the N historical webpages with the plain text page identifications as N plain text pages.
Further, the obtaining unit 52 includes: determining a subunit, a first obtaining subunit and a second obtaining subunit; the determining subunit is configured to determine an effective browsing operation group based on a time length between two adjacent browsing operations, where the effective browsing operation group is two adjacent browsing operations performed on the ith plain text page within a preset operation time length; the first obtaining subunit is configured to obtain, based on the effective browsing operation group, a total effective browsing duration of the ith plain text page and a total effective browsing height of the ith plain text page, where the total effective browsing duration is used to represent a sum of effective attention durations of users to the ith plain text page, and the total effective browsing height is used to represent a sum of effective attention areas of users to the ith plain text page; and the second obtaining subunit is used for obtaining the attention parameter of the ith plain text page according to the total effective browsing duration and the total effective browsing height.
Further, the first obtaining subunit is specifically configured to obtain an offset of the height of the effective browsing operation group, and determine the effective height of the effective browsing operation group according to the offset and the height value of the browsing window; obtaining the total effective browsing height of the ith plain text page according to the effective height; the height of the effective browsing operation group is the distance between the initial position of the effective browsing operation group on the ith plain text page and the initial position of the browsing window of the ith plain text page.
Further, the second obtaining subunit is specifically configured to obtain average attention values of the first i plain text pages in the N plain text pages; and obtaining an attention parameter according to the configuration value of the time attenuation factor and the average attention value of the previous i plain text pages, wherein the average attention value is used for representing the average attention degree of the user to the previous i plain text pages, the time attenuation factor is used for representing the priority of the ith plain text page, and the time attenuation factor is more than or equal to 0.
Further, the second obtaining subunit is specifically configured to obtain average attention values of the first i plain text pages based on the average attention values of the first i-1 plain text pages in the N plain text pages.
The determining unit 51, the obtaining unit 52 and the sorting unit 53 may be disposed in a terminal, such as a CPU, an ARM, or a processor, such as an embedded controller or a system-on-chip, and the present invention is not limited in detail.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of a hardware embodiment, a software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, optical storage, and the like) having computer-usable program code embodied therein.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
The above description is only a preferred embodiment of the present invention, and is not intended to limit the scope of the present invention.

Claims (14)

1. A management method of browser history records is applied to a terminal, and is characterized by comprising the following steps:
determining N plain text pages in all historical webpages, wherein N is an integer greater than or equal to 2;
determining an effective browsing operation group based on the time length between two adjacent browsing operations, wherein the effective browsing operation group is the two adjacent browsing operations performed on the ith plain text page within a preset operation time length;
obtaining a total effective browsing duration of the ith plain text page based on the effective browsing operation group and obtaining a total effective browsing height of the ith plain text page based on a height of the effective browsing operation group, wherein the total effective browsing duration is used for representing a sum of effective attention durations of the user to the ith plain text page, and the total effective browsing height is used for representing a sum of effective attention areas of the user to the ith plain text page;
obtaining an attention parameter of the ith plain text page according to the total effective browsing duration and the total effective browsing height, wherein the attention parameter is used for representing the attention degree of a user to the ith plain text page, and i is an integer which is greater than 1 and less than or equal to N;
and sequencing the webpage identifications corresponding to the N plain text pages according to the attention parameter.
2. The method of claim 1, wherein determining N plain text pages among all historical web pages comprises:
obtaining the ratio of the length of the text with the link attribute to the length of the text with the non-link attribute in each historical webpage;
and determining the N historical webpages of which the ratios meet preset conditions as the N plain text pages.
3. The method of claim 1, wherein before said determining N plain text pages among all historical web pages, the method further comprises:
obtaining the ratio of the length of the text with the link attribute to the length of the text with the non-link attribute in the current browsed webpage;
when the ratio meets a preset condition, adding a plain text page identifier to the currently browsed webpage;
determining N plain text pages in all historical webpages, wherein the N plain text pages comprise:
and determining the N historical webpages with the plain text page identifications as the N plain text pages.
4. The method of claim 1, wherein obtaining the total effective browsing height of the ith plain text page comprises:
obtaining an offset of a height of the effective browsing operation group, wherein the height of the effective browsing operation group is a distance between the ith plain text page and an initial position of a browsing window of the ith plain text page;
determining the effective height of the effective browsing operation group according to the offset and the height value of the browsing window, wherein the effective height is an effective attention area when the user performs the effective browsing operation group on the ith plain text page;
and obtaining the total effective browsing height of the ith plain text page according to the effective height.
5. The method according to claim 1, wherein the obtaining the attention parameter of the ith plain text page comprises:
obtaining average attention values of the first i plain text pages in the N plain text pages, wherein the average attention values are used for representing the average attention degree of the user to the first i plain text pages;
and obtaining the attention parameter according to a configuration value of a time attenuation factor and the average attention value of the first i plain text pages, wherein the time attenuation factor is used for representing the priority of the ith plain text page, and the time attenuation factor is greater than or equal to 0.
6. The method of claim 5, wherein obtaining the average attention value of the first i plain text pages of the N plain text pages comprises:
and obtaining the average attention value of the first i plain text pages based on the average attention value of the first i-1 plain text pages in the N plain text pages.
7. The method according to claim 6, characterized in that the average attention value of the first i plain text pages is obtained by the following formula:
Figure FDA0002542826560000031
wherein E isiThe average attention value of the first i plain text pages is obtained; ei-1The average attention value of the first i-1 plain text pages is obtained; a. theiFor the total effective browsing time length and theThe ratio of the total effective browsing height.
8. The method according to claim 5, characterized in that the attention parameter is obtained in particular by the following formula:
Vi=Ai×(1-τ)+((T-TS)/(TN-TS))×Ei×τ
wherein, ViThe attention degree parameter of the ith plain text page is obtained; a. theiThe ratio of the total effective browsing duration to the total effective browsing height of the ith plain text page is obtained; τ is the time decay factor; t is the time when the user browses the webpage; t isSA valid time set for the user; t isNTime to initiate a history lookup for the user; eiIs the average attention value of the ith plain text page.
9. A terminal, characterized in that the terminal comprises: the device comprises a determining unit, an obtaining unit and a sorting unit;
the determining unit is used for determining N plain text pages in all historical webpages, wherein N is an integer greater than or equal to 2;
the obtaining unit is configured to determine an effective browsing operation group based on a time length between two adjacent browsing operations, where the effective browsing operation group is the two adjacent browsing operations performed on the ith plain text page within a preset operation time length; obtaining a total effective browsing duration of the ith plain text page based on the effective browsing operation group and obtaining a total effective browsing height of the ith plain text page based on a height of the effective browsing operation group, wherein the total effective browsing duration is used for representing a sum of effective attention durations of the user to the ith plain text page, and the total effective browsing height is used for representing a sum of effective attention areas of the user to the ith plain text page; obtaining an attention parameter of the ith plain text page according to the total effective browsing duration and the total effective browsing height, wherein the attention parameter is used for representing the attention degree of a user to the ith plain text page, and i is an integer which is greater than 0 and less than or equal to N;
and the sequencing unit is used for sequencing the webpage identifications corresponding to the N plain text pages according to the attention parameter.
10. The terminal according to claim 9, wherein the determining unit is specifically configured to obtain a ratio of a length of a text having a link attribute to a length of a text having a non-link attribute in each history webpage; and determining the N historical webpages of which the ratios meet preset conditions as the N plain text pages.
11. The terminal according to claim 9, wherein the determining unit is specifically configured to, before determining N plain text pages in all the historical web pages, obtain a ratio between a length of a text having a link attribute and a length of a text having a non-link attribute in a currently browsed web page; when the ratio meets a preset condition, adding a plain text page identifier to the currently browsed webpage; and the server is further used for determining the N historical webpages with the plain text page identifications as the N plain text pages.
12. The terminal according to claim 9, wherein the first obtaining subunit is specifically configured to obtain an offset of a height of the effective browsing operation group, and determine an effective height of the effective browsing operation group according to the offset and a height value of the browsing window; obtaining the total effective browsing height of the ith plain text page according to the effective height; and the height of the effective browsing operation group is the distance between the effective browsing operation group on the ith plain text page and the initial position of the browsing window of the ith plain text page.
13. The terminal according to claim 9, wherein the second obtaining subunit is specifically configured to obtain an average attention value of the first i plain text pages in the N plain text pages; obtaining the attention degree parameter according to a configuration value of a time attenuation factor and an average attention degree value of the first i plain text pages, wherein the average attention degree value is used for representing the average attention degree of the user to the first i plain text pages, the time attenuation factor is used for representing the priority of the ith plain text page, and the time attenuation factor is greater than or equal to 0.
14. The terminal according to claim 13, wherein the second obtaining subunit is configured to obtain the average attention value of the first i plain text pages based on the average attention value of the first i-1 plain text pages in the N plain text pages.
CN201410444334.8A 2014-09-02 2014-09-02 Management method and terminal for browser history records Active CN105373570B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410444334.8A CN105373570B (en) 2014-09-02 2014-09-02 Management method and terminal for browser history records

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410444334.8A CN105373570B (en) 2014-09-02 2014-09-02 Management method and terminal for browser history records

Publications (2)

Publication Number Publication Date
CN105373570A CN105373570A (en) 2016-03-02
CN105373570B true CN105373570B (en) 2020-09-15

Family

ID=55375777

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410444334.8A Active CN105373570B (en) 2014-09-02 2014-09-02 Management method and terminal for browser history records

Country Status (1)

Country Link
CN (1) CN105373570B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105843929A (en) * 2016-03-29 2016-08-10 乐视控股(北京)有限公司 Browsing history ordering method and apparatus
CN105930513A (en) * 2016-05-16 2016-09-07 北京京东尚科信息技术有限公司 Browser history record sorting method and apparatus
CN108345601B (en) * 2017-01-23 2020-11-20 腾讯科技(深圳)有限公司 Search result ordering method and device
CN114816179A (en) * 2021-01-18 2022-07-29 腾讯科技(深圳)有限公司 Historical browsing content display method and device, electronic equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101373485A (en) * 2008-09-25 2009-02-25 北京搜狗科技发展有限公司 Method and apparatus for providing web page access entrance
CN102609474A (en) * 2012-01-18 2012-07-25 北京搜狗信息服务有限公司 Access information providing method and system
CN102622445A (en) * 2012-03-15 2012-08-01 华南理工大学 User interest perception based webpage push system and webpage push method
CN103092839A (en) * 2011-10-28 2013-05-08 腾讯科技(深圳)有限公司 Management method and device for recording historical information
CN103309862A (en) * 2012-03-07 2013-09-18 腾讯科技(深圳)有限公司 Webpage type recognition method and system
CN103793426A (en) * 2012-11-01 2014-05-14 腾讯科技(深圳)有限公司 Method and device for keeping web page access records

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120143927A1 (en) * 2010-12-05 2012-06-07 Unisys Corp. Efficient storage of information from markup language documents

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101373485A (en) * 2008-09-25 2009-02-25 北京搜狗科技发展有限公司 Method and apparatus for providing web page access entrance
CN103092839A (en) * 2011-10-28 2013-05-08 腾讯科技(深圳)有限公司 Management method and device for recording historical information
CN102609474A (en) * 2012-01-18 2012-07-25 北京搜狗信息服务有限公司 Access information providing method and system
CN103309862A (en) * 2012-03-07 2013-09-18 腾讯科技(深圳)有限公司 Webpage type recognition method and system
CN102622445A (en) * 2012-03-15 2012-08-01 华南理工大学 User interest perception based webpage push system and webpage push method
CN103793426A (en) * 2012-11-01 2014-05-14 腾讯科技(深圳)有限公司 Method and device for keeping web page access records

Also Published As

Publication number Publication date
CN105373570A (en) 2016-03-02

Similar Documents

Publication Publication Date Title
CN102930059B (en) Method for designing focused crawler
TWI522942B (en) User favorites data processing method and device, user favorite data searching method and device, and user favorite system
US9405746B2 (en) User behavior models based on source domain
CN105373570B (en) Management method and terminal for browser history records
CN101329687B (en) Method for positioning news web page
US20170075513A1 (en) Surf Software
CN107169010A (en) A kind of determination method and device of recommendation search keyword
CN106850750B (en) Method and device for pushing information in real time
CN102761627A (en) Cloud website recommending method and system based on terminal access statistics as well as related equipment
CN104252348B (en) A kind of web page access statistical method and device based on browser
CN104361042A (en) Information retrieval method and device
CN103577490A (en) Method and device of showing web browsing history
CN102184199A (en) Network information recommending method and system
CN106126544B (en) Internet content delivery method and device
CN103729439B (en) A kind of webpage preloads method and apparatus
CN107277115A (en) A kind of content delivery method and device
CN102411617A (en) Method for storing and inquiring a large quantity of URLs
CN104050183A (en) Content matching result prompting method and device for browser input frame
CN102508884A (en) Method and device for acquiring hotpot events and real-time comments
CN110008393B (en) Method and equipment for acquiring website information
KR20180017182A (en) Automated Information Retrieval
CN111259274A (en) Information processing method, device, equipment and information display device
CN109948034B (en) Method and device for extracting page information based on filtering session
CN107463581B (en) Application download amount acquisition method and device and terminal equipment
KR20150068256A (en) Method and apparatus for managing web browser history

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant