CN111898015A - Book heat value acquisition method and device, terminal device and storage medium - Google Patents

Book heat value acquisition method and device, terminal device and storage medium Download PDF

Info

Publication number
CN111898015A
CN111898015A CN202010886961.2A CN202010886961A CN111898015A CN 111898015 A CN111898015 A CN 111898015A CN 202010886961 A CN202010886961 A CN 202010886961A CN 111898015 A CN111898015 A CN 111898015A
Authority
CN
China
Prior art keywords
target
book
name
author
author name
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN202010886961.2A
Other languages
Chinese (zh)
Inventor
刘晓聪
高心怡
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Oppo Mobile Telecommunications Corp Ltd
Shenzhen Huantai Technology Co Ltd
Original Assignee
Guangdong Oppo Mobile Telecommunications Corp Ltd
Shenzhen Huantai Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Oppo Mobile Telecommunications Corp Ltd, Shenzhen Huantai Technology Co Ltd filed Critical Guangdong Oppo Mobile Telecommunications Corp Ltd
Priority to CN202010886961.2A priority Critical patent/CN111898015A/en
Publication of CN111898015A publication Critical patent/CN111898015A/en
Withdrawn legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9538Presentation of query results

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The application is applicable to the technical field of data search, and provides a book heat value acquisition method, a device, a terminal device and a storage medium, and the method comprises the following steps: acquiring a target book word list, wherein the target book word list comprises book names and author names of M books, and M is an integer greater than zero; acquiring historical search sentences of a target book in the M books, wherein the target book is selected from the M books; obtaining the access amount of historical search sentences of a target book; and acquiring a target heat value of the target book according to the access amount of the historical search statement of the target book. Through the method and the device, the influence of the book resource platform on the book heat value can be reduced when the book heat value is acquired.

Description

Book heat value acquisition method and device, terminal device and storage medium
Technical Field
The application belongs to the technical field of data search, and particularly relates to a book popularity value acquisition method and device, a terminal device and a storage medium.
Background
In a mobile search scene, the search of various books such as textbooks, tool books, novels and the like occupies a large part of the search requirements of users, and in order to make the search experience of the users better, the resources of a plurality of book resource platforms are generally fused to provide the search experience of the book resources with rich contents for the users.
The book popularity value reflects the popularity of the books, and in a mobile search scene, the book popularity value is important and directly influences the book search result. In the prior art, book heat values are calculated in a weighting manner by using characteristics of the number of comments, the number of reads and the like of books provided by a book resource platform, and the method needs to align resources of a plurality of book resource platforms and is greatly influenced by the book resource platforms.
Disclosure of Invention
The application provides a book heat value acquisition method and device, terminal equipment and a storage medium, so that when the book heat value is acquired, the influence of a book resource platform on the book heat value is reduced.
In a first aspect, an embodiment of the present application provides a book heat value obtaining method, where the book heat value obtaining method includes:
acquiring a target book word list, wherein the target book word list comprises book names and author names of M books, and M is an integer greater than zero;
acquiring historical search sentences of a target book in the M books, wherein the target book is selected from the M books;
obtaining the access amount of the historical search statement of the target book;
and acquiring a target heat value of the target book according to the access amount of the historical search statement of the target book.
In a second aspect, an embodiment of the present application provides a book heat value obtaining apparatus, including:
the target word list obtaining module is used for obtaining a target book word list, wherein the target book word list comprises book names and author names of M books, and M is an integer larger than zero;
a search sentence acquisition module, configured to acquire a historical search sentence of a target book in the M books, where the target book is selected from the M books;
the access amount acquisition module is used for acquiring the access amount of the historical search statement of the target book;
and the heat value acquisition module is used for acquiring the target heat value of the target book according to the access amount of the historical search statement of the target book.
In a third aspect, an embodiment of the present application provides a terminal device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor implements the steps of the book popularity value obtaining method according to the first aspect when executing the computer program.
In a fourth aspect, an embodiment of the present application provides a computer-readable storage medium, where a computer program is stored, and the computer program, when executed by a processor, implements the steps of the book popularity value obtaining method according to the first aspect.
In a fifth aspect, an embodiment of the present application provides a computer program product, which, when running on a terminal device, causes the terminal device to execute the steps of the book hotness value obtaining method according to the first aspect.
Therefore, the target heat value of the target book can be obtained according to the access amount of the historical search statement of the target book by acquiring the access amount of the historical search statement of the target book. Namely, according to the access amount of the historical search sentences of the books, the book heat value can be obtained, the characteristics of comment data, reading quantity and the like of the user on the books do not need to be acquired from the book resource platform, and the influence of the book resource platform on the book heat value is reduced.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present application, the drawings needed to be used in the embodiments or the prior art descriptions will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present application, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without inventive exercise.
Fig. 1 is a schematic flowchart illustrating an implementation process of a book heat value obtaining method according to an embodiment of the present application;
FIG. 2 is an exemplary diagram of a target book vocabulary;
fig. 3 is a schematic flow chart illustrating an implementation of a book heat value obtaining method according to a second embodiment of the present application;
fig. 4 is a schematic flow chart illustrating an implementation of a book heat value obtaining method according to a third embodiment of the present application;
fig. 5 is a schematic structural diagram of a book heat value acquisition device according to a fourth embodiment of the present application;
fig. 6 is a schematic structural diagram of a terminal device provided in the fifth embodiment of the present application;
fig. 7 is a schematic structural diagram of a terminal device according to a sixth embodiment of the present application.
Detailed Description
In the following description, for purposes of explanation and not limitation, specific details are set forth, such as particular system structures, techniques, etc. in order to provide a thorough understanding of the embodiments of the present application. It will be apparent, however, to one skilled in the art that the present application may be practiced in other embodiments that depart from these specific details. In other instances, detailed descriptions of well-known systems, devices, circuits, and methods are omitted so as not to obscure the description of the present application with unnecessary detail.
It will be understood that the terms "comprises" and/or "comprising," when used in this specification and the appended claims, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.
In particular implementations, the terminal devices described in embodiments of the present application include, but are not limited to, other portable devices such as mobile phones, laptop computers, or tablet computers having touch sensitive surfaces (e.g., touch screen displays and/or touch pads). It should also be understood that in some embodiments, the device is not a portable communication device, but is a desktop computer having a touch-sensitive surface (e.g., a touch screen display and/or touchpad).
In the discussion that follows, a terminal device that includes a display and a touch-sensitive surface is described. However, it should be understood that the terminal device may include one or more other physical user interface devices such as a physical keyboard, mouse, and/or joystick.
The terminal device supports various applications, such as one or more of the following: a drawing application, a presentation application, a word processing application, a website creation application, a disc burning application, a spreadsheet application, a gaming application, a telephone application, a video conferencing application, an email application, an instant messaging application, an exercise support application, a photo management application, a digital camera application, a web browsing application, a digital music player application, and/or a digital video player application.
Various applications that may be executed on the terminal device may use at least one common physical user interface device, such as a touch-sensitive surface. One or more functions of the touch-sensitive surface and corresponding information displayed on the terminal can be adjusted and/or changed between applications and/or within respective applications. In this way, a common physical architecture (e.g., touch-sensitive surface) of the terminal can support various applications with user interfaces that are intuitive and transparent to the user.
It should be understood that, the sequence numbers of the steps in this embodiment do not mean the execution sequence, and the execution sequence of each process should be determined by the function and the inherent logic of the process, and should not constitute any limitation to the implementation process of the embodiment of the present application.
In order to explain the technical solution described in the present application, the following description will be given by way of specific examples.
Referring to fig. 1, which is a schematic diagram of an implementation flow of a book heat value obtaining method provided in an embodiment of the present application, where the book heat value obtaining method is applied to a terminal device, as shown in the figure, the book heat value obtaining method may include the following steps:
step 101, obtaining a target book vocabulary.
The target book word list comprises book names and author names of M books, and M is an integer larger than zero. It should be noted that when M is an integer greater than 1, M books refer to M different books, at least two books with the same book name and different book contents are different books, and the author names of the at least two books with the same book name and different book contents are usually different, that is, at least two books with the same book name and different book contents can also be understood as books with the same book name and different book contents written by at least two different authors. The support of the M books includes, but is not limited to, paper, devices using optical, electrical, magnetic media, etc., and is not limited herein; the types of the M books include, but are not limited to, textbooks, tool books, novels, etc., and are not limited thereto.
Fig. 2 is an exemplary diagram of a target book vocabulary including six books with book names a, b, and c, and two books with book name a with author names a1 and a 2; the name of an author of the book is B; the author names of three books with the book name C are respectively C1, C2 and C3; namely, the target book vocabulary contains six pieces of data, and each piece of data includes a book name and an author name.
And 102, acquiring historical search sentences of a target book in the M books.
The target book is selected from M books, and may be any one of the M books, which is not limited herein.
For convenience of explanation, the present application takes a target book as an example to describe a manner of acquiring a heat value of each book, that is, in order to calibrate the heat value of each book in the M books, each book in the M books may be taken as a target book, so that the heat value of each book in the M books may be obtained through the present application.
The historical search statement may refer to a search statement input by a user before the current time, and the search statement may refer to a search statement input by the user in a search box, and is used to obtain a corresponding search result, for example, a "big lead" is input in a browser of a mobile phone, data related to the "big lead" may be obtained, the data related to the "big lead" is a search result of the "big lead", and the "big lead" is a search statement. It should be noted that the search term may also be referred to as query in the search field.
And 103, acquiring the access amount of the historical search sentences of the target book.
The access amount of the history search term may refer to the search frequency of the history search term, or may refer to the number of users who input the history search term, and is not limited herein. The search frequency of the historical search statement can be understood as Page View (PV), that is, if the same historical search statement is input for multiple times through the same device, the search frequency is accumulated; the number of users who input the history search sentence may be understood as a Unique Viewer (UV), that is, if the same history search sentence is input through the same device a plurality of times, the number of users is not accumulated, and one device is one user.
And 104, acquiring a target heat value of the target book according to the access amount of the historical search statement of the target book.
The target heat value of the target book is the acquired heat value of the target book according to the access amount of the historical search statement of the target book, and represents the popularity of the target book. Since the user usually searches for the target book by the book name, the target hot value of the target book may be understood as the hot value of the target book name, and the target book name may be the book name of the target book.
In this embodiment, according to the access amount of the historical search statement, the heat value of the book (i.e., the target heat value of the book) is obtained, the heat value can be calibrated for the book whose heat value has not been calibrated before, and the method is suitable for the cold start stage. For example, a Logistic Regression (LR) model is used to model important features such as the number of searches for books, click rate, exposure rate, and the like, a real-time engine is built at the bottom, and the heat value of the book calculated in real time is built into the real-time engine to complete the update of the heat value of the book that has been online.
In this embodiment, after the target popularity value of the target book is obtained, the target popularity value of the target book may be used as a popularity tag of the target book, and when a subsequent user searches the target book, the popularity value of the target book may be displayed in a search result page of the target book (i.e., a page on which relevant information of the target book is displayed), so that the user can know the popularity of the target book conveniently. In addition, for at least two books with the same book name and different book contents, when the book name is used as a search statement, the search result pages of the at least two books can be displayed according to the sequence from high to low of the heat value, so that a user can preferentially view the books with higher popularity.
According to the method and the device, the book heat value can be obtained according to the access amount of the historical search sentences of the book, the characteristics of comment data, reading quantity and the like of the user on the book do not need to be acquired from the book resource platform, and the influence of the book resource platform on the book heat value is reduced.
Referring to fig. 3, which is a schematic diagram of an implementation flow of a book heat value obtaining method provided in the second embodiment of the present application, where the book heat value obtaining method is applied to a terminal device, as shown in the figure, the book heat value obtaining method may include the following steps:
step 301, obtaining a target book vocabulary.
The step is the same as step 101, and reference may be made to the related description of step 101, which is not described herein again.
Step 302, obtaining a search statement of a target book name.
The search term of the target book name may refer to a search term using the target book name as a keyword, and the search term using the target book name as a keyword may refer to that the search term of the target book name includes the target book name and does not include other content, where the other content is content other than the target book name, and the target book name is a book name of the target book.
Step 303, a first search statement of at least one author name corresponding to the target book name is obtained.
The first search statement of an author name corresponding to the target book name is a search statement using the target book name and the author name as keywords, and the first search statement of the author name which uses the target book name and the author name as keywords may refer to the target book and the author name in the first search statement of the author name of the target book, and does not include other content, where the other content refers to content other than the target book name and the author name. Alternatively, the writing order of the target book name and the author name in the corresponding first search sentence is not limited, and may be, for example, the target book name, the author name (i.e., the target book name is written before and the author name is written after), or the author name, the target book name (i.e., the author name is written before and the target book name is written after).
In this embodiment, the search term of the target book name and the first search term of at least one author name corresponding to the target book name may be acquired from all search terms input by the user and acquired within the third preset time period. The at least one author name corresponding to the target book name may be understood as author names of all books with the book names of the target book names, for example, two authors with author names a1 and a2 write different books with the book names a, and then the author names a1 and a2 corresponding to the book name a. Alternatively, the third preset time period may refer to any preset time period, for example, 50 days.
Step 304, acquiring book resource platform data and first network search result data.
The book resource platform data includes a book name and an author name of K books, K is an integer greater than zero, the first network search result data refers to network search result data within a first preset time period, the network search result data refers to a search result including the book name and the author name, and the first preset time period may be any preset time period, for example, 45 days.
The book resource platform data may be data provided by at least one book resource platform, and the book resource platform may refer to a platform capable of providing book resources, such as a reading group, a starting point, a bean, and the like. The web search result data may refer to a search result including a book name and an author name obtained by searching a website (e.g., hundredths, horse, dog, etc.), one of the search results includes the book name and the author name corresponding to the book name, for example, the book name "leading" is input in a search box of a search engine, a plurality of search results are usually displayed, a search result arranged in the first L bits may be selected from the plurality of search results as the web search result data, and a search result arranged in the first L bits may be selected as the web search result data, instead of using all the search results as the web search result data, because the search result arranged in the first L bits usually can more accurately express the main intention of a search sentence, L is an integer greater than zero, for example, L is 10.
And 305, acquiring a second search statement of the name of the target author according to the book resource platform data and the first network search result data.
Wherein, the target author name refers to the author name of the target book.
In this embodiment, since the book resource platform data is data provided by at least one book resource platform, and usually includes book names and author names of a large number of books, the book resource platform data can supplement a large number of second search sentences for the books on the basis of the second search sentences obtained from the first web search result data.
Optionally, the obtaining a second search statement of the name of the target author according to the book resource platform data and the first network search result data includes:
if the book resource platform data and the first network search result data both comprise the target book name and the target author name, determining that a search statement corresponding to the target search result is a second search statement of the target author name, wherein the target search result is a search result of the first network search result data comprising the target book name and the target author name;
and if the book resource platform data comprises the target book name and the target author name and the first network search result data does not have the target search result, determining a second search statement taking the target book name and the target author name as keywords as the search statement of the target author name.
The book resource platform data includes a target book name and a target author name, which means that there is a piece of data in the book resource platform data, where the book name is the target book name and the author name corresponding to the book name is the target author name, for example, the target book name is a and the target author name is a1, and if data is found in the book resource platform data, where the book name is a and the author name corresponding to the book name is also a1, it is determined that the book resource platform data includes the target book name and the target author name. The first network search result data includes a target book name and a target author name, which means that there is a search result in the first network search result data in which a book name is a target book name and an author name corresponding to the book name is a target author name, for example, the target book name is a and the target author name is a1, and if a search result in which a book name is a and an author name corresponding to the book name is also a1 is found in the first network search result data, it is determined that the first network search result data includes the target book name and the target author name.
Step 306, obtaining the access amount of the search statement of the target book name, the access amount of the first search statement of at least one author name corresponding to the target book name, and the access amount of the second search statement of the target author name.
The access amount of the search statement of the target book name, the access amount of the first search statement of at least one author name corresponding to the target book name, and the access amount of the second search statement of the target author name may all be search frequency, or may all be user number, which is not limited herein.
Step 307, obtaining a first initial heat value of the target book according to the access amount of the search statement of the target book name and the access amount of the first search statement of at least one author name corresponding to the target book name.
In this embodiment, since the access amount of the search term of the target book name is the access amount of the search term using the target book name as a keyword, and the author name corresponding to the target book name is usually at least one, according to the access amount of the search term of the target book name and the access amount of the first search term of at least one author name of the target book, the ratio of the access amount of the search term of the target book name corresponding to each author name in the at least one author name in the access amounts of the search terms of the target book names corresponding to all authors can be obtained, and then according to the access amount of the search term of the target book name, the ratio can be converted into the access amount, so as to obtain the access amount of the book corresponding to each author name, since the at least one author name includes the target author name, therefore, the access amount of the target book corresponding to the target author name can be obtained. For example, if the target book name a, the target author name a1, the author with author name a1 writes a book L1 with target book name a (i.e., the target book), the author with author name a2 writes a book L2 with target book name a, and the contents of book L1 and book L2 are different, the target book name a corresponds to two different authors with author names a1 and a2, respectively, and the access amount of book L1 and the access amount of book L2 can be calculated according to the access amount of the search term of the target book name a, the access amount of the first search term of a1, and the access amount of the first search term of a 2.
Optionally, the obtaining the first initial hot value of the target book according to the access amount of the search statement of the target book name and the access amount of the first search statement of the at least one author name corresponding to the target book name includes:
calculating the access amount of the target book corresponding to the target author name according to the access amount of the search statement of the target book name and the access amount of the first search statement of at least one author name corresponding to the target book name
Figure BDA0002655863970000101
Wherein, V2The access amount, V, of the first search statement representing the name of the target author1Access amount, V, of search term representing name of target book2nRepresenting the access amount of a first search statement of an nth author name in at least one author name corresponding to the target book name, wherein N represents the total amount of the at least one author name corresponding to the target book name, and is an integer greater than zero;
detecting whether a first book with an author name as a target author name exists in a target book word list, wherein the first book is a book with the author name as the target author name except the target book;
if a first book with the author name as the target author name exists, acquiring the access amount of the first book corresponding to the target author name;
accumulating the access amount of the first book corresponding to the target author name and the access amount of the target book corresponding to the target author name to obtain a second accumulated value, and determining that the second accumulated value is the sum of the access amounts of all books corresponding to the target author name;
if the target book name corresponds to other author names, acquiring the sum of the access volumes of all books corresponding to the other author names, wherein the other author names are author names except the target author name;
acquiring the maximum value of the sum of the access amounts of all books corresponding to other author names and the sum of the access amounts of all books corresponding to the target author name;
calculating an initial heat value of the target author name according to the sum of the access amounts of all books corresponding to the target author name and the maximum value of the sum of the access amounts;
if the target book name does not correspond to other author names, determining that the initial heat value of the target author name is a preset numerical value;
respectively carrying out first preprocessing on the access quantity of the first search statement of the target author name and the access quantity of the target book corresponding to the target author name, unifying the access quantity of the first search statement of the target author name and the dimension of the access quantity of the target book corresponding to the target author name through the first preprocessing, and obtaining the access quantity of the first search statement of the target author name after unifying the dimension and the access quantity of the target book corresponding to the target author name after unifying the dimension;
and acquiring a first initial heat value of the target book according to the initial heat value of the target author name, the access quantity of the first search statement of the target author name after dimension unification and the access quantity of the target book corresponding to the target author name after dimension unification.
Wherein the above formula
Figure BDA0002655863970000111
In (1)
Figure BDA0002655863970000112
The sum of the access amounts of the first search sentences of all the author names corresponding to the target book name can be represented, and the access amount of the first search sentence of each author name corresponding to the target book name can be corrected through the formula, so that the situation that the hot value of a book corresponding to an incorrect author name is higher due to the fact that the hot value of the book is too high is avoided.
For all books corresponding to the target author name, including the first book and the target book with the author name as the target author name, taking each book corresponding to the target author name as the target book, the above formula can be used
Figure BDA0002655863970000121
Calculating to obtain respective access amount of all books corresponding to the target author name, and calculating all the books corresponding to the target author nameThe respective access amounts of the books are accumulated, and the accumulated value is
Figure BDA0002655863970000122
I.e. the sum of the access volumes, V, of all books corresponding to the target author name3kRepresenting the access amount of the kth book corresponding to the target author name, Q representing the total number of all books corresponding to the target author name, and taking each author name in all author names corresponding to the target book name as the target author name, then the formula can be used according to
Figure BDA0002655863970000123
Calculating to obtain the total access amount of all books corresponding to all author names corresponding to the target book name, and selecting the maximum access amount total value from the total access amount of all books corresponding to all author names, wherein max (V) can be used4n) Indicates the maximum value of the access amount sum, V4nThe sum of the access amounts of all books corresponding to the nth author name is represented, the value range of N is 1 to N, and the initial heat value of the target author name can be represented as
Figure BDA0002655863970000124
P (i.e. a preset value) is a positive number, and a user can set a specific value of P by himself according to actual needs, for example, P is 5.
In this embodiment, by unifying the dimensions of the access amount of the first search statement of the target author name and the access amount of the target book corresponding to the target author name through the first preprocessing, the access amount of the first search statement of the target author name and the access amount of the target book corresponding to the target author name can be enabled to have the same dimensions, and the contribution of the access amount of the first search statement and the access amount of the target book in the first initial heat value calculation process is balanced.
The first preprocessing of the access amount of the first search statement of the target author name may include: acquiring the respective access quantity of first search sentences of all author names of M books; according to the sequence of the visit quantity of the first search statement from high to low, the places of the M books are searchedOrdering the access amount of the first search statement with the name of the author; the access amount of the first search statement arranged at the first bit may be taken as a first target access amount; since there may be an abnormal first search term access amount in the access amounts of the first search terms of all author names of the M books, in order to reduce the influence of the abnormal first search term access amount on the uniform dimension, a median value of the access amounts of all first search terms arranged at the top H position may also be used as the first target access amount, and H may be an integer greater than 1 and generally less than or equal to 10; calculating the ratio of the visit quantity of the first search statement of the target author name to the first target visit quantity; calculating a product of the ratio and a preset dimension, wherein the product is an access quantity of the first search statement of the target author name after the dimension is unified, and the access quantity of the first search statement of the target author name after the dimension is unified can be expressed as
Figure BDA0002655863970000131
Figure BDA0002655863970000132
The first target access amount is represented, F represents a preset dimension, and the preset dimension may refer to any preset dimension, such as 1000, 5000, and the like, and is not limited herein.
The first preprocessing of the access amount of the target book corresponding to the target author name may include: acquiring the access amount of books corresponding to all author names of the M books respectively; sequencing the access volumes of books corresponding to all author names of the M books according to the sequence of the access volumes of the books from high to low; the access amount of the book arranged at the first place may be taken as a second target access amount; since there may be an access amount of an abnormal book among access amounts of books corresponding to all author names of the M books, in order to reduce the access amount of the abnormal book, a median value of the access amounts of all books arranged at the top H may be used as the second target access amount; calculating the ratio of the access quantity of the target book corresponding to the target author name to the second target access quantity; calculating the product of the ratio and a predetermined dimension, the productThat is, the access amount of the target book corresponding to the target author name after the unification of the dimensions may be expressed as the access amount of the target book corresponding to the target author name after the unification of the dimensions
Figure BDA0002655863970000133
Figure BDA0002655863970000134
Representing a second target access quantity.
When the first initial heat value of the target book is obtained, the access amount of the first search statement of the target author name and the access amount of the target book corresponding to the target author name may be added, whether the added value is greater than zero or not is judged, and if the added value is greater than zero, the first initial heat value of the target book is obtained
Figure BDA0002655863970000135
If the value obtained after the addition is less than or equal to zero, the second initial heat value of the target book
Figure BDA0002655863970000141
Wherein x is1And x2Are all adjustment coefficients, x1And x2All of which are greater than zero, e.g. x1Is 0.8, x2Is 0.2.
Optionally, after calculating the access amount of the target book corresponding to the target author name, the embodiment further includes:
judging whether the access amount of the target book corresponding to the target author name is abnormal or not according to the access amount of the target book corresponding to the target author name or according to the access amount of the first search statement of the target author name and the access amount of the target book corresponding to the target author name;
and if the access volume of the target book corresponding to the target author name is abnormal, updating the access volume of the target book corresponding to the target author name according to the access volume of the first search statement of the target author name.
Taking the novel as an example, since many movies and television shows are recomposed from the novel, hot-play of the movies and television shows may raise the heat value of a part of the novel, that is, a search sentence which is not intended by the novel, such as a hot-play movie and television show, is determined as a search sentence with the intention of the novel, and thus the heat value of the novel is raised. In order to reduce the influence of the hot-broadcast movie and television play on the hot value of the target novel and improve the accuracy of the hot value of the target novel, whether the access quantity of the target novel corresponding to the name of the target author is abnormal or not can be judged according to the access quantity of the target novel corresponding to the name of the target author; or judging whether the access quantity of the target novel corresponding to the target author name is abnormal or not according to the access quantity of the first search statement of the target author name and the access quantity of the target novel corresponding to the target author name; if the access volume of the target novel corresponding to the target author name is abnormal, the access volume of the target novel corresponding to the target author name can be reduced by updating the access volume of the target novel corresponding to the target author name.
The determining whether the access volume of the target book corresponding to the target author name is abnormal according to the access volume of the target book corresponding to the target author name may include: comparing the access quantity of the target book corresponding to the target author name with an access quantity threshold, and if the access quantity of the target book corresponding to the target author name is larger than the access quantity threshold, determining that the access quantity of the target book corresponding to the target author name is abnormal; and if the access quantity of the target book corresponding to the target author name is smaller than or equal to the first threshold value, determining that no abnormity exists in the access quantity of the target book corresponding to the target author name.
The determining whether the access amount of the target book corresponding to the target author name is abnormal or not according to the access amount of the first search statement of the target author name and the access amount of the target book corresponding to the target author name may include: calculating the ratio of the access quantity of the target book corresponding to the target author name to the access quantity of the first search statement of the target author name; taking the preset quantile of the ratio as a second threshold value; judging whether the ratio is larger than a second threshold value; if the ratio is larger than a second threshold value, determining that the access quantity of the target book corresponding to the target author name is abnormal; and if the ratio is smaller than or equal to the second threshold, determining that the access quantity of the target book corresponding to the target author name is not abnormal. The preset quantile may refer to any preset quantile, for example, a 90% quantile, and is not limited herein.
Optionally, updating the access amount of the target book corresponding to the target author name according to the access amount of the first search statement of the target author name includes:
determining an access amount range according to the access amount of the first search statement of the target author name;
determining a second book according to the access amount range, wherein the second book is a book, except the target book, of which the access amount of the first search statement is within the access amount range;
acquiring the access amount of a second book;
and updating the access amount of the target book corresponding to the target author name according to the access amount of the second book.
When determining that the access amount of the target book corresponding to the target author name is abnormal according to the access amount of the target book corresponding to the target author name, the first adjustment parameter may be subtracted from the access amount of the first search term of the target author name, the obtained difference is determined to be a starting value of the access amount range, the access amount of the first search term of the target author name is added to the first adjustment parameter, and the added value is determined to be an end value of the access amount range, so that the access amount range is obtained. Where the first adjustment parameter is a positive number, the user may preset the first adjustment parameter, for example, the first adjustment parameter is 500, and then the access amount range may be represented as V2-500,V2+500]。
When it is determined that there is an abnormality in the access amount of the target book corresponding to the target author name based on the access amount of the first search sentence of the target author name and the access amount of the target book corresponding to the target author name, a product of the second adjustment coefficient and the access amount of the first search sentence of the target author name may be used as a third adjustment coefficient, the third adjustment parameter may be subtracted from the access amount of the first search sentence of the target author name, and a difference obtained by determining the difference as the start of the access amount rangeAnd adding the access amount of the first search statement of the target author name and the third adjustment parameter, and determining that the added value is the end value of the access amount range, thereby obtaining the access amount range. Where the second adjustment parameter is a positive number smaller than 1, the user may preset the second adjustment parameter, for example, the second adjustment parameter is 0.2, and then the access amount range may be represented as V2-0.2*V,V2+0.2*V]。
In this embodiment, the number of the second books may be one or at least two, and when the number of the second books is at least two, the access amount of the second book refers to the access amount of each of the at least two second books.
According to the access amount of the second book, updating the access amount of the target book corresponding to the target author name may include: if the number of the second books is one, updating the access amount of the second book corresponding to the target author name to the access amount of the second book, namely, taking the access amount of the second book as the access amount of the target book corresponding to the target author name; if the number of the second books is at least two, acquiring a median of the access amounts of the at least two second books, and updating the access amount of the target book corresponding to the target author name to the median, namely, taking the median as the access amount of the target book corresponding to the target author name.
Step 308, determining the access amount of the second search statement of the target author name as a second initial heat value of the target book.
Step 309, obtaining a target heat value of the target book according to the first initial heat value of the target book and the second initial heat value of the target book.
In this embodiment, the first initial heat value of the target book and the second initial heat value of the target book may be weighted and averaged, and a value obtained by weighted and averaged may be determined as the target heat value of the target book. Alternatively, the user may set the weight of the first initial heating value and the weight of the second initial heating value by himself or herself according to actual needs, for example, two weights are set to be 1.
Optionally, the obtaining the target heat value of the target book according to the first initial heat value of the target book and the second initial heat value of the target book includes:
performing first pretreatment on the second initial heat value of the target book to obtain a second initial heat value of the pretreated target book, wherein the dimension of the second initial heat value of the pretreated target book is the same as that of the first initial heat value of the target book;
and acquiring a target heat value of the target book according to the second initial heat value of the preprocessed target book and the first initial heat value of the target book.
In this embodiment, since the first pre-processing is performed in the process of calculating the first initial heat value of the target book, in order to unify the dimensions of the first initial heat value and the second initial heat value of the target book, the first pre-processing may be performed on the second initial heat value of the target book, so that the first initial heat value and the second initial heat value of the target book have the same dimensions, and the contribution of the two initial heat values in the process of calculating the target heat value is balanced.
In step 310, it is detected whether a first book with the author name as the target author name exists in the target book vocabulary.
The first book title is a book with the name of the author being the name of the target author except the target book.
Step 311, if there is a first book with the author name being the target author name, acquiring a target heat value of the first book, accumulating the target heat value of the first book and the target heat value of the target book to obtain a first accumulated value, and determining that the first accumulated value is the target heat value of the target author.
The target author refers to an author of the target book, and the name of the target author is the name of the target author.
In this embodiment, each of all books corresponding to the target author name may be taken as a target book, and according to steps 301 to 309, respective target heat values of all books corresponding to the target author name are obtained, where all books corresponding to the target author name may refer to names of all books written by the target author, including the first book and the target book, and the target author refers to an author whose name is the target author name.
The target author may write a plurality of different books, and in order to more accurately obtain the heat value of the target author, the target heat values of all the books written by the target author may be accumulated, and the accumulated value (i.e., the first accumulated value) may be used as the heat value of the target author.
In step 312, if there is no first book with the author name as the target author name, the target heat value of the target book is determined to be the target heat value of the target author.
In this embodiment, if there is no first book with the author name being the target author name in the target book vocabulary, it is determined that the target author writes one book, i.e., the target book, but does not write other books, so that the target heat value of the target book can be directly used as the target heat value of the target author.
According to the method and the device, the popularity of the target book and the popularity of the target author can be obtained by calculating the popularity of the target book and the popularity of the target author through the access amount of various different historical search sentences, and the accuracy of the popularity of the target book and the popularity of the target author is improved.
Referring to fig. 4, which is a schematic diagram of an implementation flow of a book heat value obtaining method provided in the third embodiment of the present application, where the book heat value obtaining method is applied to a terminal device, as shown in the figure, the book heat value obtaining method may include the following steps:
step 401, obtaining second network search result data.
The second network search result data refers to network search result data within a second preset time period, the network search result data refers to search results including book names and author names, and the second preset time period may refer to any preset time period, such as a day.
In this embodiment, the second network search result data is combined with book resource platform data provided by a plurality of book resource platforms, so that the book name and the author name can be extracted offline, and the book name and the author name can be understood as a book entity. It should be noted that, when extracting book entities, many book entities can be extracted by combining the book resource platform data provided by the plurality of book resource platforms, and the search requirements of the user are greatly met.
In this embodiment, the second network search result data may be obtained by building a crawler system, where the input of the crawler system is a search statement of a user in a mobile search scene, that is, a historical search statement of the user is a random seed of the crawler system, and a website of the crawler may be at least one of mainstream search websites such as Baidu, Shenma, and Saogue.
Step 402, extracting the book name and the author name from the second network search result data.
For a search statement, the crawler system usually has a plurality of search results obtained from the mainstream search website for the search statement, and may select the search result arranged at the first L bits from the plurality of search results as the second web search result data, and select the search result arranged at the first L bits as the second web search result data, instead of using all the search results as the second web search result data, because the search result arranged at the first L bits usually can more accurately express the main intention of the search statement, L is an integer greater than zero, for example, L is 10.
Step 403, generating a standby book vocabulary according to the extracted book name and author name.
In this embodiment, the second search result data generally includes a plurality of search results, a search result including a specific mark may be selected from the plurality of search results, an author name may be extracted from the search result including the specific mark, a book name may be extracted from a title or an abstract of the search result including the specific mark, the extracted book name may correspond to the extracted author name, and the extracted book name and the author name may be regularized, the regularization including but not limited to case transition, full half-angle transition, simplified form transition, punctuation mark and space removal, etc. by which the book name, the author name, etc. included in different search sentences may be unified into the same format, for example, letters in the book name are capital letters, words are simplified, etc., after the extracted book name and the author name are regularized, the establishment of the corresponding relationship between the search statement of the search result containing the specific mark and the extracted book name and author name facilitates the subsequent return and search of the search statement corresponding to the book name and/or author name with abnormality when the book name and/or author name extracted based on the second search result data is abnormal. The specific mark may refer to a mark containing an author name, for example, "start reading author: ".
Step 404, obtain an initial book vocabulary.
The initial book vocabulary may refer to a target book vocabulary generated before the current time, for example, the target book vocabulary is updated every day, and then the initial book vocabulary may be understood as the target book vocabulary generated the previous day.
And 405, combining the standby book vocabulary, the initial book vocabulary and the book resource platform data to obtain a combined book vocabulary.
And 406, performing second preset processing on the combined book vocabulary to generate a target book vocabulary.
Wherein the second preset processing includes deduplication processing and/or deletion of data including the specific word. The deduplication processing may be that at least two pieces of identical data exist in the merged book vocabulary, and one piece of data of the at least two pieces of identical data is retained in the target book vocabulary, and one piece of data in the vocabulary may understand one book name and one author name corresponding to the book name. The specific word is used for deleting data of book names, non-book names and/or author names, non-author names in the merged book vocabulary. Alternatively, the user may set a specific word (e.g., "chapter") according to actual needs, and the present invention is not limited thereto.
Step 407, obtaining historical search sentences of the target books in the M books.
The step is the same as step 102, and reference may be made to the related description of step 102, which is not repeated herein.
And step 408, acquiring the access amount of the historical search sentences of the target book.
The step is the same as step 103, and reference may be made to the related description of step 103, which is not described herein again.
And 409, acquiring a target heat value of the target book according to the access amount of the historical search statement of the target book.
The step is the same as step 104, and reference may be made to the related description of step 104, which is not described herein again.
According to the method and the device, the initial book vocabulary can be updated in real time according to the network search result data, the initial book vocabulary and the book resource platform data, and the target book vocabulary with abundant data is obtained.
Fig. 5 is a schematic structural diagram of a book calorific value acquisition apparatus according to a fourth embodiment of the present application, and for convenience of description, only a part related to the embodiment of the present application is shown.
The book hotness value acquisition device comprises:
a target vocabulary obtaining module 51, configured to obtain a target book vocabulary, where the target book vocabulary includes book names and author names of M books, and M is an integer greater than zero;
a search sentence acquisition module 52, configured to acquire a historical search sentence of a target book in the M books, where the target book is selected from the M books;
an access amount obtaining module 53, configured to obtain an access amount of a history search statement of a target book;
and a hot value obtaining module 54, configured to obtain a target hot value of the target book according to the access amount of the history search statement of the target book.
Optionally, the book hotness value obtaining device further includes:
the book detection module is used for detecting whether a first book with the author name as the target author name exists in the target book vocabulary, wherein the first book is a book with the author name as the target author name except the target book;
the target value acquisition module is used for acquiring a target heat value of a first book with an author name as a target author name if the first book exists;
the popularity accumulation module is used for accumulating the target popularity value of the first book and the target popularity value of the target book to obtain a first accumulated value and determining that the first accumulated value is the popularity value of the name of a target author, wherein the target author refers to the author of the target book;
and the heat determining module is used for determining that the target heat value of the target book is the target heat value of the target author if the first book with the author name as the target author name does not exist.
Optionally, the search sentence acquisition module 52 includes:
the system comprises a first acquisition unit, a second acquisition unit and a third acquisition unit, wherein the first acquisition unit is used for acquiring a search sentence of a target book name, the search sentence of the target book name is a search sentence taking the target book name as a keyword, and the target book name is the book name of the target book;
the second acquisition unit is used for acquiring a first search statement of at least one author name corresponding to a target book name, wherein the first search statement of one author name corresponding to the target book name is a search statement taking the target book name and the author name as keywords;
the third acquiring unit is used for acquiring book resource platform data and first network search result data, wherein the book resource platform data comprises a book name and an author name of K books, K is an integer greater than zero, the first network search result data is network search result data in a first preset time period, and the network search result data is a search result comprising the book name and the author name;
a fourth obtaining unit, configured to obtain a second search statement of a target author name according to the book resource platform data and the first network search result data, where the target author name is an author name of the target book;
the sentence determining unit is used for determining that a search sentence of a target book name, a first search sentence of at least one author name corresponding to the target book name and a second search sentence of the target author name are historical search sentences of the target book;
correspondingly, the access amount obtaining module 53 is specifically configured to:
and acquiring the access quantity of the search statement of the target book name, the access quantity of the first search statement of at least one author name corresponding to the target book name and the access quantity of the second search statement of the target author name.
Optionally, the fourth obtaining unit is specifically configured to:
if the book resource platform data and the first network search result data both comprise a target book name and a target author name, determining that a search statement corresponding to a target search result is a second search statement of the target author name, wherein the target search result is a search result of the first network search result data comprising the target book name and the target author name;
and if the book resource platform data comprises the target book name and the target author name and the first network search result data does not have the target search result, determining a second search statement taking the target novel name and the target author name as the search statement of the keyword as the target author name.
Optionally, the heat value obtaining module 54 includes:
a fifth obtaining unit, configured to obtain a first initial heat value of the target book according to an access amount of the search term of the target book name and an access amount of a first search term of at least one author name corresponding to the target book name;
the popularity determining unit is used for determining the access amount of a second search statement of the target author name as a second initial popularity value of the target book;
and the sixth acquisition unit is used for acquiring the target heat value of the target book according to the first initial heat value of the target book and the second initial heat value of the target book.
Optionally, the fifth obtaining unit is specifically configured to:
a first calculating subunit for calculating the access amount of the search sentence according to the target book nameThe access amount of the first search statement of at least one author name corresponding to the target book name is calculated, and the access amount of the target book corresponding to the target author name is calculated
Figure BDA0002655863970000221
Wherein, V2The access amount, V, of the first search statement representing the name of the target author1Access amount, V, of search term representing name of target book2nRepresenting the access amount of a first search statement of an nth author name in at least one author name corresponding to the target book name, wherein N represents the total amount of the at least one author name corresponding to the target book name, and is an integer greater than zero;
a name detection subunit, configured to detect whether a first book with an author name as a target author name exists in a target book vocabulary, where the first book is a book with an author name as a target author name except the target book;
the first obtaining subunit is configured to obtain, if a first book with an author name being a target author name exists, an access amount of the first book corresponding to the target author name;
the accumulation subunit is configured to accumulate the access volume of the first book corresponding to the target author name and the access volume of the target book corresponding to the target author name to obtain a second accumulated value, determine that the second accumulated value is the access volume sum of all books corresponding to the target author name, and obtain the access volume sum of all books corresponding to all author names of the target book;
a sum obtaining subunit, configured to obtain, if the target book name corresponds to another author name, a sum of access amounts of all books corresponding to the another author name, where the another author name is an author name other than the target author name;
a second obtaining subunit, configured to obtain a maximum access amount sum from the access amount sum of all books corresponding to other author names and the access amount sum of all books corresponding to the target author name;
the second calculating subunit is used for calculating the initial heat value of the target author name according to the sum of the access amounts of all books corresponding to the target author name and the maximum value of the sum of the access amounts;
the hot value determining subunit is used for determining the initial hot value of the target author name as a preset numerical value if the target book name does not correspond to other author names;
the preprocessing subunit is used for respectively performing first preprocessing on the access quantity of the first search statement of the target author name and the access quantity of the target book corresponding to the target author name, unifying the access quantity of the first search statement of the target author name and the dimension of the access quantity of the target book corresponding to the target author name through the first preprocessing, and obtaining the access quantity of the first search statement of the target author name after unifying the dimension and the access quantity of the target book corresponding to the target author name after unifying the dimension;
and the third acquiring subunit is used for acquiring the first initial heat value of the target book according to the initial heat value of the target author name, the access quantity of the first search statement of the target author name after dimension unification and the access quantity of the target book corresponding to the target author name after dimension unification.
Optionally, the fifth obtaining unit is further configured to:
the abnormality judgment subunit is configured to judge whether the access amount of the target book corresponding to the target author name is abnormal or not according to the access amount of the target book corresponding to the target author name or according to the access amount of the first search statement of the target author name and the access amount of the target book corresponding to the target author name;
and the updating subunit is used for updating the access amount of the target book corresponding to the target author name according to the access amount of the first search statement of the target author name if the access amount of the target book corresponding to the target author name is abnormal.
Optionally, the update subunit is specifically configured to:
determining an access amount range according to the access amount of the first search statement of the target author name;
determining a second book according to the access amount range, wherein the second book is a book, except the target book, of which the access amount of the first search statement is within the access amount range;
acquiring the access amount of a second book;
and updating the access amount of the target book corresponding to the target author name according to the access amount of the second book.
Optionally, the sixth obtaining unit is specifically configured to:
performing first pretreatment on the second initial heat value of the target book to obtain a second initial heat value of the pretreated target book, wherein the dimension of the second initial heat value of the pretreated target book is the same as that of the first initial heat value of the target book;
and acquiring a target heat value of the target book according to the second initial heat value of the preprocessed target book and the first initial heat value of the target book.
Optionally, the target vocabulary acquiring module 51 is specifically configured to:
acquiring second network search result data, wherein the second network search result data refers to network search result data in a second preset time period, and the network search result data refers to a search result comprising a book name and an author name;
extracting a book name and an author name from the second network search result data;
generating a standby book word list according to the extracted book name and the author name;
acquiring an initial book word list;
merging the standby book word list, the initial book word list and the book resource platform data to obtain a merged book word list;
and performing second preset processing on the combined book word list to generate a target book word list, wherein the second preset processing comprises duplication elimination processing and/or deletion of data comprising specific words.
The book popularity value obtaining apparatus provided in the embodiment of the present application can be applied to the foregoing method embodiments, and for details, reference is made to the description of the foregoing method embodiments, which is not described herein again.
Fig. 6 is a schematic structural diagram of a terminal device according to a fifth embodiment of the present application. The terminal device as shown in the figure may include: one or more processors 601 (only one shown); one or more input devices 602 (only one shown), one or more output devices 603 (only one shown), and memory 604. The processor 601, the input device 602, the output device 603, and the memory 604 are connected by a bus 605. The memory 604 is used for storing instructions, and the processor 601 is used for implementing the steps in each of the book heat value obtaining method embodiments described above when the instructions stored in the memory 604 are executed.
It should be understood that, in the embodiment of the present Application, the Processor 601 may be a Central Processing Unit (CPU), and the Processor may also be other general purpose processors, Digital Signal Processors (DSPs), Application Specific Integrated Circuits (ASICs), Field Programmable Gate Arrays (FPGAs) or other Programmable logic devices, discrete Gate or transistor logic devices, discrete hardware components, and the like. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
The input device 602 may include a touch pad, a fingerprint sensor (for collecting fingerprint information of a user and direction information of the fingerprint), a microphone, a data receiving interface, and the like. The output device 603 may include a display (LCD, etc.), speakers, a data transmission interface, and the like.
The memory 604 may include both read-only memory and random access memory, and provides instructions and data to the processor 601. A portion of the memory 604 may also include non-volatile random access memory. For example, the memory 604 may also store device type information.
In a specific implementation, the processor 601, the input device 602, the output device 603, and the memory 604 described in this embodiment of the present application may execute the implementation described in the embodiment of the book hotness value obtaining method provided in this embodiment of the present application, or may execute the implementation described in the fourth book hotness value obtaining apparatus in this embodiment, which is not described herein again.
Fig. 7 is a schematic diagram of a terminal device provided in the fifth embodiment of the present application. As shown in fig. 7, the terminal device 7 of this embodiment includes: one or more processors 70 (only one of which is shown), a memory 71, and a computer program 72 stored in the memory 71 and executable on the at least one processor 70. The processor 70 implements the steps in the various book heat value acquisition method embodiments described above when executing the computer program 72.
The terminal device 7 may be a desktop computer, a notebook, a palm computer, a cloud server, or other computing devices. The terminal device may include, but is not limited to, a processor 70, a memory 71. It will be appreciated by those skilled in the art that fig. 7 is merely an example of a terminal device 7 and does not constitute a limitation of the terminal device 7 and may include more or less components than those shown, or some components may be combined, or different components, e.g. the terminal device may also include input output devices, network access devices, buses, etc.
The processor 70 may be a central processing unit CPU, but may also be other general purpose processors, digital signal processors DSP, application specific integrated circuits ASIC, off-the-shelf programmable gate arrays FPGA or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
The storage 71 may be an internal storage unit of the terminal device 7, such as a hard disk or a memory of the terminal device 7. The memory 71 may also be an external storage device of the terminal device 7, such as a plug-in hard disk provided on the terminal device 7, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), and the like. Further, the memory 71 may also include both an internal storage unit of the terminal device 7 and an external storage device. The memory 71 is used for storing computer programs and other programs and data required by the terminal device. The memory 71 may also be used to temporarily store data that has been output or is to be output.
It will be apparent to those skilled in the art that, for convenience and brevity of description, only the above-mentioned division of the functional units and modules is illustrated, and in practical applications, the above-mentioned function distribution may be performed by different functional units and modules according to needs, that is, the internal structure of the apparatus is divided into different functional units or modules, so as to perform all or part of the functions described above. Each functional unit and module in the embodiments may be integrated in one processing unit, or each unit may exist alone physically, or two or more units are integrated in one unit, and the integrated unit may be implemented in a form of hardware, or in a form of software functional unit. In addition, specific names of the functional units and modules are only for convenience of distinguishing from each other, and are not used for limiting the protection scope of the present application. The specific working processes of the units and modules in the system may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
In the above embodiments, the descriptions of the respective embodiments have respective emphasis, and reference may be made to the related descriptions of other embodiments for parts that are not described or illustrated in a certain embodiment.
Those of ordinary skill in the art will appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware or combinations of computer software and electronic hardware. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the implementation. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present application.
In the embodiments provided in the present application, it should be understood that the disclosed apparatus/terminal device and method may be implemented in other ways. For example, the above-described embodiments of the apparatus/terminal device are merely illustrative, and for example, a module or a unit may be divided into only one logical function, and may be implemented in other ways, for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form.
Units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
The integrated modules/units, if implemented in the form of software functional units and sold or used as separate products, may be stored in a computer readable storage medium. Based on such understanding, all or part of the flow in the method according to the embodiments described above may be implemented by a computer program, which is stored in a computer readable storage medium and used by a processor to implement the steps of the embodiments of the methods described above. Wherein the computer program comprises computer program code, which may be in the form of source code, object code, an executable file or some intermediate form, etc. The computer readable medium may include: any entity or device capable of carrying computer program code, recording medium, U.S. disk, removable hard disk, magnetic disk, optical disk, computer Memory, Read-Only Memory (ROM), Random Access Memory (RAM), electrical carrier wave signals, telecommunications signals, software distribution media, and the like. It should be noted that the computer readable medium may contain other components which may be suitably increased or decreased as required by legislation and patent practice in jurisdictions, for example, in some jurisdictions, in accordance with legislation and patent practice, the computer readable medium does not include electrical carrier signals and telecommunications signals.
When the computer program product runs on the terminal device, the steps in the method embodiments can be realized when the terminal device executes the computer program product.
The above embodiments are only used to illustrate the technical solutions of the present application, and not to limit the same; although the present application has been described in detail with reference to the foregoing embodiments, it should be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; such modifications and substitutions do not substantially depart from the spirit and scope of the embodiments of the present application and are intended to be included within the scope of the present application.

Claims (13)

1. A book popularity value acquisition method is characterized by comprising the following steps:
acquiring a target book word list, wherein the target book word list comprises book names and author names of M books, and M is an integer greater than zero;
acquiring historical search sentences of a target book in the M books, wherein the target book is selected from the M books;
obtaining the access amount of the historical search statement of the target book;
and acquiring a target heat value of the target book according to the access amount of the historical search statement of the target book.
2. The book calorific value acquisition method of claim 1, further comprising:
detecting whether a first book with an author name as a target author name exists in the target book word list, wherein the target author name refers to the author name of the target book, and the first book with the author name as the target author name is a book except the target book;
if a first book with the author name as the target author name exists, acquiring a target heat value of the first book;
accumulating the target heat value of the first book and the target heat value of the target book to obtain a first accumulated value, and determining that the first accumulated value is the target heat value of a target author, wherein the target author refers to an author of the target book;
and if the first book with the author name as the target author name does not exist, determining the target heat value of the target book as the target heat value of the target author.
3. The book calorific value acquisition method according to claim 1, wherein said acquiring the historical search sentence of the target book among the M books comprises:
acquiring a search sentence of a target book name, wherein the search sentence of the target book name is a search sentence taking the target book name as a keyword, and the target book name is a book name of the target book;
acquiring a first search statement of at least one author name corresponding to the target book name, wherein the first search statement of the author name corresponding to the target book name is a search statement taking the target book name and the author name as keywords;
acquiring book resource platform data and first network search result data, wherein the book resource platform data comprise a book name and an author name of K books, K is an integer greater than zero, the first network search result data are network search result data in a first preset time period, and the network search result data are search results comprising the book name and the author name;
acquiring a second search statement of a target author name according to the book resource platform data and the first network search result data, wherein the target author name refers to an author name of the target book;
determining a search statement of the target book name, a first search statement of at least one author name corresponding to the target book name, and a second search statement of the target author name as a historical search statement of the target book;
correspondingly, the obtaining of the access amount of the historical search statement of the target book comprises:
and acquiring the access amount of the search statement of the target book name, the access amount of the first search statement of at least one author name corresponding to the target book name and the access amount of the second search statement of the target author name.
4. The book popularity value acquisition method of claim 3, wherein acquiring the second search statement of the target author name based on the book resource platform data and the first web search result data comprises:
if the book resource platform data and the first network search result data both include the target book name and the target author name, determining that a search statement corresponding to a target search result is a second search statement of the target author name, where the target search result refers to a search result of the first network search result data that includes the target book name and the target author name;
and if the book resource platform data comprises the target book name and the target author name and the first network search result data does not have the target search result, determining a search statement using the target novel name and the target author name as keywords as a second search statement of the target author name.
5. The book popularity value acquisition method according to claim 3, wherein the acquiring a target popularity value of the target book according to an access amount of a history search sentence of the target book includes:
acquiring a first initial heat value of the target book according to the access amount of the search statement of the name of the target book and the access amount of a first search statement of at least one author name corresponding to the name of the target book;
determining the visit quantity of a second search statement of the target author name as a second initial heat value of the target book;
and acquiring a target heat value of the target book according to the first initial heat value of the target book and the second initial heat value of the target book.
6. The book popularity value acquisition method of claim 5, wherein the acquiring a first initial popularity value of the target book according to the access amount of the search term of the target book name and the access amount of the first search term of the at least one author name corresponding to the target book name comprises:
calculating the access amount of the target book corresponding to the target author name according to the access amount of the search statement of the target book name and the access amount of the first search statement of at least one author name corresponding to the target book name
Figure FDA0002655863960000031
Wherein, V2An amount of access, V, of a first search statement representing the name of the target author1Access volume, V, of search sentence representing name of target book2nRepresenting the access amount of a first search statement of an nth author name in at least one author name corresponding to the target book name, wherein N represents the total number of the at least one author name corresponding to the target book name, and is an integer greater than zero;
detecting whether a first book with an author name being the target author name exists in the target book vocabulary, wherein the first book is a book with an author name being the target author name except the target book;
if a first book with the author name as the target author name exists, acquiring the access amount of the first book corresponding to the target author name;
accumulating the access amount of the first book corresponding to the target author name and the access amount of the target book corresponding to the target author name to obtain a second accumulated value, and determining that the second accumulated value is the sum of the access amounts of all books corresponding to the target author name;
if the target book name corresponds to other author names, acquiring the sum of the access amounts of all books corresponding to the other author names, wherein the other author names are author names except the target author name;
acquiring the maximum value of the sum of the access amounts of all books corresponding to the other author names and the sum of the access amounts of all books corresponding to the target author name;
calculating an initial heat value of the target author name according to the sum of the access amounts of all books corresponding to the target author name and the maximum value of the sum of the access amounts;
if the target book name does not correspond to other author names, determining that the initial heat value of the target author name is a preset numerical value;
respectively carrying out first preprocessing on the access quantity of the first search statement of the target author name and the access quantity of the target book corresponding to the target author name, unifying the access quantity of the first search statement of the target author name and the dimension of the access quantity of the target book corresponding to the target author name through the first preprocessing, and obtaining the access quantity of the first search statement of the target author name after unifying the dimension and the access quantity of the target book corresponding to the target author name after unifying the dimension;
and acquiring a first initial heat value of the target book according to the initial heat value of the target author name, the access quantity of the first search statement of the target author name after the unified dimension and the access quantity of the target book corresponding to the target author name after the unified dimension.
7. The book calorific value acquisition method according to claim 6, further comprising, after calculating an access amount of the target book corresponding to the target author name:
judging whether the access amount of the target book corresponding to the target author name is abnormal or not according to the access amount of the target book corresponding to the target author name or according to the access amount of a first search statement of the target author name and the access amount of the target book corresponding to the target author name;
and if the access volume of the target book corresponding to the target author name is abnormal, updating the access volume of the target book corresponding to the target author name according to the access volume of the first search statement of the target author name.
8. The book popularity value acquisition method of claim 7, wherein the updating of the access volume of the target book corresponding to the target author name according to the access volume of the first search statement of the target author name comprises:
determining an access amount range according to the access amount of the first search statement of the target author name;
determining a second book according to the access amount range, wherein the second book is a book, except the target book, of which the access amount of the first search statement is within the access amount range;
acquiring the access amount of the second book;
and updating the access amount of the target book corresponding to the target author name according to the access amount of the second book.
9. The book calorific value acquisition method of claim 6, wherein said acquiring a target calorific value of the target book based on a first initial calorific value of the target book and a second initial calorific value of the target book comprises:
performing first pretreatment on the second initial heat value of the target book to obtain a pretreated second initial heat value of the target book, wherein the dimension of the pretreated second initial heat value of the target book is the same as that of the first initial heat value of the target book;
and acquiring a target heat value of the target book according to the second initial heat value of the preprocessed target book and the first initial heat value of the target book.
10. The book calorific value acquisition method of any one of claims 1 to 9, wherein said acquiring a target book vocabulary comprises:
acquiring second network search result data, wherein the second network search result data refers to network search result data in a second preset time period, and the network search result data refers to a search result comprising a book name and an author name;
extracting a book name and an author name from the second network search result data;
generating a standby novel word list according to the extracted book name and the author name;
acquiring an initial novel word list;
merging the standby novel vocabulary, the initial novel vocabulary and the book resource platform data to obtain a merged book vocabulary;
and performing second preset processing on the combined book word list to generate the target book word list, wherein the second preset processing comprises duplication elimination processing and/or deletion of data comprising specific words.
11. A book hotness value acquisition device, comprising:
the target word list obtaining module is used for obtaining a target book word list, wherein the target book word list comprises book names and author names of M books, and M is an integer larger than zero;
a search sentence acquisition module, configured to acquire a historical search sentence of a target book in the M books, where the target book is selected from the M books;
the access amount acquisition module is used for acquiring the access amount of the historical search statement of the target book;
and the heat value acquisition module is used for acquiring the target heat value of the target book according to the access amount of the historical search statement of the target book.
12. A terminal device comprising a memory, a processor and a computer program stored in the memory and executable on the processor, wherein the processor implements the steps of the book heat value acquisition method according to any one of claims 1 to 10 when executing the computer program.
13. A computer-readable storage medium storing a computer program, wherein the computer program is executed by a processor to implement the steps of the book heat value acquisition method according to any one of claims 1 to 10.
CN202010886961.2A 2020-08-28 2020-08-28 Book heat value acquisition method and device, terminal device and storage medium Withdrawn CN111898015A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010886961.2A CN111898015A (en) 2020-08-28 2020-08-28 Book heat value acquisition method and device, terminal device and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010886961.2A CN111898015A (en) 2020-08-28 2020-08-28 Book heat value acquisition method and device, terminal device and storage medium

Publications (1)

Publication Number Publication Date
CN111898015A true CN111898015A (en) 2020-11-06

Family

ID=73225513

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010886961.2A Withdrawn CN111898015A (en) 2020-08-28 2020-08-28 Book heat value acquisition method and device, terminal device and storage medium

Country Status (1)

Country Link
CN (1) CN111898015A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103530487A (en) * 2012-07-02 2014-01-22 纽海信息技术(上海)有限公司 Data statistical method capable of reflecting book popularity variation
CN107885873A (en) * 2017-11-28 2018-04-06 百度在线网络技术(北京)有限公司 Method and apparatus for output information
CN109657144A (en) * 2018-12-17 2019-04-19 北京百度网讯科技有限公司 Methods of marking, device, storage medium and the terminal device of works

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103530487A (en) * 2012-07-02 2014-01-22 纽海信息技术(上海)有限公司 Data statistical method capable of reflecting book popularity variation
CN107885873A (en) * 2017-11-28 2018-04-06 百度在线网络技术(北京)有限公司 Method and apparatus for output information
CN109657144A (en) * 2018-12-17 2019-04-19 北京百度网讯科技有限公司 Methods of marking, device, storage medium and the terminal device of works

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
高并发: "如何使用redis实现微博热搜榜", pages 1 - 3, Retrieved from the Internet <URL:https://blog.csdn.net/weixin_39025362/article/details/105753064> *

Similar Documents

Publication Publication Date Title
US10796094B1 (en) Extracting keywords from a document
US9342233B1 (en) Dynamic dictionary based on context
AU2019366858B2 (en) Method and system for decoding user intent from natural language queries
CN108241741B (en) Text classification method, server and computer readable storage medium
CN111708934B (en) Knowledge content evaluation method, device, electronic equipment and storage medium
CN112668320B (en) Model training method and device based on word embedding, electronic equipment and storage medium
CN107526846B (en) Method, device, server and medium for generating and sorting channel sorting model
US20130173655A1 (en) Selective fetching of search results
GB2570751A (en) Predicting style breaches within textual content
JP2015505101A (en) Server, display device, scrolling method, and method of generating heat map
CN113688310B (en) Content recommendation method, device, equipment and storage medium
CN111814481B (en) Shopping intention recognition method, device, terminal equipment and storage medium
CN104516635A (en) Content display management
CN112181386B (en) Code construction method, device and terminal based on software continuous integration
CN113806660A (en) Data evaluation method, training method, device, electronic device and storage medium
JP7204903B2 (en) INFORMATION PUSH METHOD, DEVICE, DEVICE AND STORAGE MEDIUM
US10241988B2 (en) Prioritizing smart tag creation
CN113626576A (en) Method and device for extracting relational characteristics in remote supervision, terminal and storage medium
US9705972B2 (en) Managing a set of data
CN111275683A (en) Image quality grading processing method, system, device and medium
US9946762B2 (en) Building a domain knowledge and term identity using crowd sourcing
CN113657116B (en) Social media popularity prediction method and device based on visual semantic relationship
US20150269177A1 (en) Method and system for determining user interest in a file
CN111898015A (en) Book heat value acquisition method and device, terminal device and storage medium
CN114020867A (en) Method, device, equipment and medium for expanding search terms

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication

Application publication date: 20201106

WW01 Invention patent application withdrawn after publication