WO2023151426A1 - 直播间人数统计方法及装置 - Google Patents

直播间人数统计方法及装置 Download PDF

Info

Publication number
WO2023151426A1
WO2023151426A1 PCT/CN2022/144350 CN2022144350W WO2023151426A1 WO 2023151426 A1 WO2023151426 A1 WO 2023151426A1 CN 2022144350 W CN2022144350 W CN 2022144350W WO 2023151426 A1 WO2023151426 A1 WO 2023151426A1
Authority
WO
WIPO (PCT)
Prior art keywords
room
users
abnormal
user
rooms
Prior art date
Application number
PCT/CN2022/144350
Other languages
English (en)
French (fr)
Inventor
孙袁袁
Original Assignee
上海哔哩哔哩科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 上海哔哩哔哩科技有限公司 filed Critical 上海哔哩哔哩科技有限公司
Publication of WO2023151426A1 publication Critical patent/WO2023151426A1/zh

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/21Server components or server architectures
    • H04N21/218Source of audio or video content, e.g. local disk arrays
    • H04N21/2187Live feed
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/258Client or end-user data management, e.g. managing client capabilities, user preferences or demographics, processing of multiple end-users preferences to derive collaborative data
    • H04N21/25866Management of end-user data
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/442Monitoring of processes or resources, e.g. detecting the failure of a recording device, monitoring the downstream bandwidth, the number of times a movie has been viewed, the storage space available from the internal hard disk
    • H04N21/44213Monitoring of end-user related data
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/4508Management of client data or end-user data
    • H04N21/4532Management of client data or end-user data involving end-user characteristics, e.g. viewer profile, preferences
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/478Supplemental services, e.g. displaying phone caller identification, shopping application
    • H04N21/4784Supplemental services, e.g. displaying phone caller identification, shopping application receiving rewards
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/478Supplemental services, e.g. displaying phone caller identification, shopping application
    • H04N21/4788Supplemental services, e.g. displaying phone caller identification, shopping application communicating with other users, e.g. chatting

Definitions

  • This application relates to the technical field of webcasting, in particular to a method for counting the number of people in a livestreaming room.
  • the present application also relates to a device for counting people in a live broadcast room, a computing device, and a computer-readable storage medium.
  • the number of visitors in a normal room is the real number of people, but the number of visitors is not the real number of people in a room with a brush amount.
  • the number of visitors can be determined through the anti-splash strategy, but after the room with a brush amount is determined, the number of visitors cannot be obtained The real number of people in the room. Therefore, how to ensure the correctness of the popularity data in the live broadcast room is an urgent problem to be solved.
  • an embodiment of the present application provides a method for counting the number of people in a live broadcast room.
  • the present application also relates to a device for counting the number of people in a live broadcast room, a computing device, and a computer-readable storage medium, so as to solve the problem in the prior art that the number of viewers in a live broadcast room cannot be accurately obtained.
  • a method for counting the number of people in a live broadcast room including:
  • the number of real users corresponding to each browsing room is counted.
  • a device for counting the number of people in a live broadcast room including:
  • the obtaining module is configured to obtain the room attribute information corresponding to each scanning room in the multiple scanning rooms;
  • the first determining module is configured to determine the set of visiting users corresponding to each browsing room according to the room attribute information
  • the second determination module is configured to determine an abnormal user set according to the plurality of access user sets
  • the statistical module is configured to count the number of real users corresponding to each browsing room according to the visiting user set corresponding to each browsing room and the abnormal user set.
  • a computing device including a memory, a processor, and computer instructions stored in the memory and operable on the processor.
  • the processor executes the computer instructions, the computer instructions are implemented. Describe the steps of the method for counting the number of people in the live broadcast room.
  • a computer-readable storage medium which stores computer instructions, and when the computer instructions are executed by a processor, the steps of the method for counting people in a live broadcast room are implemented.
  • the method for counting the number of people in a live broadcast room includes: obtaining room attribute information corresponding to each of the multiple browsing rooms; and determining a set of visiting users corresponding to each browsing room according to the room attribute information; Determining an abnormal user set according to the plurality of visiting user sets; counting the number of real users corresponding to each browsing room according to the visiting user set corresponding to each visiting user set and the abnormal user set.
  • An embodiment of the present application realizes that by obtaining the room attribute information of at least two scrolling rooms, determining the set of visiting users corresponding to each scrolling room according to the room attribute information of each scrolling room, and according to the access user set of each scrolling room The user set determines the abnormal user set. After the abnormal user set is determined, the number of real users corresponding to each browsing room can be counted according to the visiting user set and abnormal user set corresponding to each browsing room, and the number of people can be counted in the future. Among them, the number of heartbeats in the abnormal user collection is automatically discarded, which realizes the real-time removal of the number of brushes and ensures the correctness of the popularity data in the live broadcast room.
  • Fig. 1 is a flow chart of a method for counting the number of people in a live broadcast room provided by an embodiment of the present application
  • Fig. 2 is a processing flow chart of a method for counting the number of people in a live broadcast room applied to counting the real number of people online provided by an embodiment of the present application;
  • Fig. 3 is a schematic structural diagram of a device for counting the number of people in a live broadcast room provided by an embodiment of the present application;
  • Fig. 4 is a structural block diagram of a computing device provided by an embodiment of the present application.
  • first, second, etc. may be used to describe various information in one or more embodiments of the present application, the information should not be limited to these terms. These terms are only used to distinguish information of the same type from one another. For example, first may also be referred to as second, and similarly, second may also be referred to as first, without departing from the scope of one or more embodiments of the present application. Depending on the context, the word “if” as used herein may be interpreted as “at” or “when” or “in response to a determination.”
  • Live Popularity The number of viewers, the number of barrage connections, the number of gifts, etc. are calculated according to a certain ratio, and are used to rank the live broadcast platform according to the popularity.
  • Number of Live Broadcasters The real number of people watching the live broadcast room in real time.
  • Swipe volume By simulating normal user visits, a large number of false viewings are generated, that is, simulating watching live broadcast rooms through illegal means.
  • Swiping room A live broadcast room that generates a large amount of false popularity through illegal means. Many illegal users in the swiping room use illegal means to generate a large number of viewers, barrage and other false data for the swiping room.
  • Abnormal users Users who use illegal means to collect a large amount of false popularity information and barrage information for the live broadcast room.
  • Anti-swiping Through technical means, identify the request for illegal access, that is, identify the access request of the blacklist address of the amount of brushing, and reject the illegal request.
  • Heartbeat summary service Real-time summary of the heartbeat information of each live room.
  • the heartbeat information can be the number of viewers in the live room, the number of bullet screens in the live room, and other information. Since the heartbeat summary service only does one thing, which is to summarize the heartbeat information of each room, the heartbeat summary service can achieve real-time summary.
  • the number of people in the live broadcast room is one of the important parameters to measure the popularity of the live broadcast room, and the popularity of the live broadcast room can be used to rank the rooms on the live broadcast platform.
  • the general process for a user to watch a live broadcast room is as follows: the user clicks on the live room he wants to watch, chooses to enter the live room, the client requests the broadcast address from the scheduling service of the live broadcast platform, and starts to play the live broadcast room for the user after getting the broadcast address. Live content. After that, the client regularly reports the playback log to the database of the live broadcast platform in the background.
  • the specific information of the playback log includes: live room identification information + user identification information + other information, and the client will also regularly report to the heartbeat summary service of the live broadcast platform
  • Heartbeat information includes: room identification information + user identification information. Based on the heartbeat information reported by each viewing user client, the heartbeat summary service can summarize the number of viewers in each live room in real time.
  • the playback log is not reported in real time, because the playback log is only used to analyze data and does not require high real-time performance, so the client is allowed to delay reporting.
  • the database that receives playback logs receives a large number of playback logs or information reports, which are generally in the form of n+1, that is, the report is reported on the nth day, and after the database classifies the logs, the relevant information can be queried on the n+1st day Play log for business.
  • a method for counting the number of people in a live broadcast room is provided, which is used to solve the problem of the above-mentioned anti-brushing strategy.
  • the present application also relates to a device for counting people in a live broadcast room, a computing device, and a computer-readable storage medium, which will be described in detail in the following embodiments.
  • Figure 1 shows a flow chart of a method for counting the number of people in a live broadcast room according to an embodiment of the present application, which specifically includes the following steps:
  • Step 102 Obtain the room attribute information corresponding to each brushed room in the multiple brushed rooms.
  • the room attribute information can be understood as the attribute information corresponding to each live broadcast room.
  • the room attribute information can be the room identifier of the live broadcast room, such as the room number of the live broadcast room, the host ID of the live broadcast room, etc.
  • the room attribute information can also include the live room Information such as the current number of viewers, the number of bullet screens, etc.
  • the preset anti-spam rules can be understood as the rules for judging whether a live broadcast room is a spam room. According to the preset anti-splash rules, it can be accurately determined whether a live room is spam, and the spam rooms can be understood as generated by illegal means. Live room with a lot of fake viewership.
  • each live broadcast room on the live broadcast platform can be judged by preset anti-spoofing rules to screen out the spam rooms.
  • the preset anti-scrap rules can be used to judge whether the live broadcast room is a spam room from multiple dimensions. For example, it can be judged based on the parameters of the target live broadcast room, such as the number of barrage connections and the number of followers.
  • There are many methods for judging whether a live broadcast room is a spamming room and this application does not make specific limitations here, and the corresponding anti-brushing rules can be selected according to the actual situation.
  • the number of clicks corresponding to each live broadcast room is calculated through parameters such as the number of barrage connections and the number of followers of each live broadcast room. If If the swiping score of the current live room exceeds the preset score, it is judged that the current live room is a swiping room; correspondingly, if the swiping score of the current live room does not exceed the preset score, it is judged that the current live room is normal Room. Through the preset anti-scrap rules, it is finally determined that among the 1,000 live broadcast rooms on the current live broadcast platform, 400 live broadcast rooms are spam rooms.
  • Step 104 Determine the set of visiting users corresponding to each swiping room according to the room attribute information.
  • the set of visiting users can be understood as the set of users who watch the live broadcast room.
  • the users in the set of visiting users may include normal users or users who use the amount of users (abnormal users).
  • the set of visiting users may include user IDs, usernames, User IP, etc.
  • the set of accessing users may be a set of online viewing users of a live broadcast room, or a collection of historical viewing users within a preset historical time period of the current live broadcast room.
  • the historical access user set of the current live broadcast room can be obtained from the database of the live broadcast platform. It should be noted that, in order to ensure the accuracy of subsequent judgments, when the visiting user collection is a historical visiting user collection, the historical visiting user collection of each browsing room should be uniformly obtained; when the visiting user collection is an online visiting user collection In this case, the set of online access users for each swiping room should be obtained uniformly.
  • the current online viewer ID of each scrolling room that is being broadcast is obtained, and according to the current online viewing user ID of each scrolling room Watch the user ID online, and generate a collection of visiting users corresponding to each browsing room.
  • the set of visiting users corresponding to each swiping room is determined, including:
  • the target scanning room can be understood as a scanning room selected from all scanning rooms.
  • Accessing the user database can be understood as a database storing user data in the live broadcast platform, and the database can store relevant information of each live room according to the room attribute information of each live room. For example, according to the room number of each live broadcast room, the number of viewers of the live broadcast, the number of live barrage, the number of gifts, etc. are stored sequentially.
  • determining the set of visiting users since there are many browsing rooms, it needs to be determined one by one. Firstly, determine the target browsing room among many browsing rooms, and then determine the set of visiting users corresponding to the target browsing room according to the room attribute information of the target browsing room. Then select the next scanning room as the target scanning room, and perform the same processing operations in turn until the set of visiting users corresponding to each scanning room is determined.
  • a target reading room A is determined among 400 reading rooms, and the target reading number is obtained in the access user database of the live broadcast platform according to the room number of the target reading room A
  • the set of visiting users corresponding to room A is the set of visiting users of the target brushing room A in the past 3 days.
  • the collection of visiting users for a room can be processed in parallel to improve processing efficiency.
  • a target browsing room A is determined among 400 browsing rooms, the current online viewing user information of the target browsing room A is obtained, and the target browsing room A is generated Corresponding access user set.
  • Step 106 Determine an abnormal user set according to the plurality of access user sets.
  • the set of abnormal users can be understood as a set of users who scan the amount of users, and the set of abnormal users can include the user names, user IDs, and user IPs of the abnormal users.
  • the abnormal user set can be determined according to the visiting user sets corresponding to at least two browsing rooms. Specifically, before determining the set of abnormal users, it is necessary to determine whether each visiting user in the set of visiting users is an abnormal user, which can be determined according to the visit frequency of the abnormal user. When the user's access frequency exceeds the preset access frequency, it is determined that the current access user is an abnormal user, otherwise it is a normal user.
  • the number of swiped rooms is 100
  • the number of visits by user A is 25, and the visit frequency of user A is 25%
  • the preset visit frequency does not exceed 30%
  • user A is a normal user
  • the access frequency of user B is 40%, and if it exceeds the preset access frequency by 30%, then user B is an abnormal user.
  • determining an abnormal user set according to multiple sets of visiting users includes:
  • An intersection of at least two access user sets to be processed is determined, and an abnormal user set is determined according to the intersection, wherein the abnormal user set includes users with abnormal operations.
  • determining at least two sets of visiting users to be processed in a plurality of sets of visiting users can be understood as determining at least two sets of target visiting users (sets of visiting users to be processed) for browsing rooms in all sets of visiting users, For example, if 20 user sets to be processed are determined among 100 sets of visiting users in a room that scan the volume, it should be noted that the number of user sets to be processed can be equal to the total number of user sets that visit the room, that is, 100 volumes A set of 100 pending access users can be identified in the room. Determining the intersection of each pending access user set can be understood as judging which users have viewed the swiping rooms corresponding to these pending access user sets.
  • the purpose of determining the intersection of at least two sets of accessing users to be processed is to determine the set of abnormal users based on the intersection.
  • the reason for finding the intersection is that each abnormal user will not only receive One anchor or one live broadcast room’s brush order, one abnormal user will brush multiple live rooms. Since the server of each abnormal user is fixed and the server itself will not be changed at will, the exit IP of each abnormal user is also fixed, and it is easy for the same IP to access multiple rooms.
  • 100 pending visiting user sets are determined from the 400 visiting user sets, and the visiting users corresponding to the 100 browsing rooms are found according to the 100 pending visiting user sets
  • the preset times can be set according to the number of swiping rooms. For example, if there are 100 browsing rooms corresponding to the user set to be processed, the preset number of times can be set to 10% of the number of browsing rooms, that is, the preset number of times is 10. Since the collection of abnormal users is screened in the collection of pending access users, compared with the overall statistics of abnormal users, the probability of accidental injury is low and the pertinence is strong.
  • each user in each pending visiting user set query each user The number of visits to the live broadcast rooms in the counting rooms corresponding to the 100 pending access user sets.
  • the number of times user A visits the live broadcasting rooms in the 100 counting rooms is 20 times. If the preset number of times exceeds 10 times, user A For abnormal users, complete the detection of each user in turn to filter out abnormal users, and generate a collection of abnormal users.
  • the access user set is a user name and a user ID
  • An abnormal user set is determined according to the abnormal user attribute information set.
  • the set of user attribute information can be understood as the set of attribute information corresponding to each user.
  • the set of user attribute information can include information such as user IP, the number of barrage messages sent by the user, and the number of gifts sent by the user.
  • the visiting users of each room can be obtained in the visiting user database according to the attribute information of each room, that is, when the user name of each visiting user is obtained, it is necessary to obtain the user name of each visiting user name to obtain the user IP corresponding to each access user.
  • the user attribute information of the browsing room corresponding to each set of visiting users to be processed is obtained according to the 100 sets of visiting users to be processed Set, the user attribute information set includes the access IP of each visiting user, and the abnormal user attribute information set containing the abnormal user IP is determined according to the user attribute information set corresponding to each browsing room, and the abnormal user attribute information set is determined according to the abnormal user attribute information set An exception user collection containing user IDs and/or usernames.
  • Step 108 Count the number of real users corresponding to each browsing room according to the visiting user set corresponding to each browsing room and the abnormal user collection.
  • the real number of users can be understood as the real number of people watching the live broadcast room, and the real number of users can be the number of users in the visiting user set minus the number of abnormal users that exist.
  • the real number of users can be the real number of viewers in the past n days of browsing the room, or the real number of viewers currently watching in the browsing room.
  • the authenticity of the number of viewers in a room can be calculated by counting the actual number of viewers corresponding to each browsing room through the visiting user set and abnormal user set corresponding to each browsing room. For example, if there are 100 users in the visiting user collection and 20 users in the abnormal user collection, and these 20 abnormal users are all in the visiting user collection, then the actual number of viewers is 80.
  • the above example is followed, and the number of real users corresponding to the 100 visiting rooms is counted according to the visiting user set and abnormal user set corresponding to the 100 visiting rooms.
  • the method before counting the number of real users corresponding to each room that visits the room according to the set of visiting users corresponding to each browsing room and the set of abnormal users, the method further includes: obtaining the number of abnormal users according to the set of abnormal users.
  • the number of abnormal users may be understood as the number of abnormal users counted according to the abnormal user set.
  • the abnormal user set may include user ID, user name, user IP, etc., and the number of abnormal users can be counted according to the abnormal user set.
  • the number of abnormal users in 100 brushing rooms is 20.
  • the number of real users corresponding to the target browsing room is counted.
  • the target traffic room can be understood as a target room determined among multiple traffic rooms, and further processing is performed on the target traffic room.
  • the number of visiting users can be understood as the total number of people who visit the target scrolling room, and the total number includes the number of viewers of normal users and the number of surfers of abnormal users.
  • the visiting user set of each browsing room includes user information of normal users and user information of abnormal users.
  • the number of visiting users obtained according to the visiting user set is normal users plus the number of abnormal users.
  • the number of visiting users and the number of abnormal users count the real number of users in each swiping room.
  • the target traffic room R1 is determined among 100 traffic volume rooms, and the number of visits corresponding to the target traffic volume room R1 is obtained according to the set of visiting users corresponding to the target traffic volume room R1. According to the number of visiting users and the number of abnormal users, the number of real users 80 corresponding to the target swiping volume room is counted.
  • counting the number of real users corresponding to the target swiping room including:
  • the number of real users corresponding to the target browsing room is counted.
  • the number of target abnormal users can be understood as the number of abnormal users who watch the target browsing volume room.
  • each abnormal user in the abnormal user set may not have visited each browsing volume room. Therefore, when counting the target browsing volume Before counting the real number of users in the room, it is necessary to determine how many abnormal users there are in the target swiping room. For example, if there are 100 users in the visiting user set and 20 users in the abnormal user set, but only 10 abnormal users are in the visiting user set, then the real number of viewers is 90.
  • the number of users visiting the target reading room R1 is 100, and it is determined that only 5 of the 20 abnormal users are among the number of visiting users. Therefore, the number of visiting users in the target reading room R1 The number of real users is 95.
  • the number of abnormal users and the collection of abnormal users can be sent to the heartbeat summary service of the live broadcast platform.
  • the heartbeats reported by abnormal users in the abnormal user collection can be directly ignored, so as to accurately count the number of people in other live broadcast rooms, and play a role in preventing brushing, and there is no need to judge whether the live broadcast room is swiping again.
  • the number of rooms is determined according to the preset anti-blog rules. Since the preset anti-blog rules are artificially set, there will be some rooms that are missed.
  • the access records of each abnormal user can be queried in the access user database of the live broadcast platform according to the collection of abnormal users, and the access records of each abnormal user can be obtained according to the access records. Generate a list of which live rooms have been passed, and judge whether the live rooms in the list are the rooms that we have determined to be used. If so, do not process;
  • the method also includes:
  • the set of rooms to be confirmed to be checked can be understood as a set of suspected rooms to be checked, that is, whether these rooms to be confirmed are rooms to be checked.
  • the preset anti-spam rule is a rule for judging whether the current live broadcast room is a sparring room. Adjusting the preset anti-spam rule can be understood as adjusting the preset standard value in the preset anti-spam rule.
  • the suspected value in the current live broadcast room is greater than When the standard value is preset, it is determined that the current live room is a brush room; when the suspected value of the current live room is less than or equal to the preset standard value, it is determined that the current live room is a normal room.
  • the preset anti-spam rules for judging whether the live room is a spam room can be adjusted accordingly, so that the spam rooms can be more accurately judged according to the preset anti-spam rules.
  • the access records of these 20 abnormal users are obtained to determine the set of rooms to be confirmed.
  • the set of rooms to be confirmed there are There are 1,000 rooms to be confirmed, and the preset anti-scanning rules are adjusted according to the collection of rooms to be confirmed.
  • the set of rooms to be confirmed is acquired, including:
  • the target room to be reviewed is a room for brushing.
  • the set of rooms to be reviewed can be understood as the set of rooms visited by all abnormal users.
  • the live broadcast rooms in the set of rooms to be reviewed there may be rooms that have been confirmed to be used and unknown rooms to be determined.
  • the access room set of all abnormal users is obtained, which is the set of rooms to be reviewed, and it is determined whether there is a missed judgment according to the live room in the set of rooms to be reviewed.
  • the set of rooms to be reviewed is obtained according to the set of abnormal users, the target room R2 to be reviewed is determined in the set of rooms to be reviewed, and it is judged whether the target room R2 to be reviewed is in the previous brushing room In the set, if the judgment result is Yes, it means that the target room R2 to be reviewed is a confirmed brushing room.
  • the collection of rooms to be reviewed is obtained according to the collection of abnormal users
  • the target room to be reviewed R2 is determined in the collection of rooms to be reviewed, and it is judged whether the target room to be reviewed R2 is within the previous swiping volume In the room set, if the judgment result is absent, it means that the target room R2 to be reviewed may be an undetected brushed room or a normal room.
  • obtaining the room set to be reviewed according to the abnormal user set includes:
  • the target abnormal user can be understood as an abnormal user selected from the abnormal user set, and the target abnormal user is the next processing object
  • the target access information can be understood as the access record of the target abnormal user, and the access information can include access Time, visiting address (visiting room number) and other information.
  • each abnormal user in the abnormal user set is processed, the target abnormal user in the abnormal user set is selected, and the access information of each abnormal user is obtained according to the access information of the target abnormal user.
  • the review room set may be a set of access rooms determined by all abnormal users according to a preset time. For example, if the preset time is 3 days, the collection of rooms visited by all abnormal users in the past 3 days is obtained, and the collection of rooms to be reviewed is determined.
  • the sub-sets of rooms to be reviewed can be obtained according to the access information of each abnormal user, and finally the set of rooms to be reviewed can be generated by summarizing, or the access information of each abnormal user can be summarized first, and the set of rooms to be reviewed can be determined according to the total access information.
  • the target abnormal user A is determined in the abnormal user set, and the target access information of the target abnormal user A is obtained. According to the target access information, the sub-set of rooms to be reviewed by the target abnormal user can be obtained, and each A sub-set of pending rooms for abnormal users, and a set of pending rooms is generated according to each sub-set of pending rooms.
  • determine the target abnormal user A in the abnormal user set obtain the target access information of the target abnormal user A, determine the next target abnormal user, and obtain the corresponding target access information, Summarize the total visit information according to the target visit information corresponding to each abnormal user, and determine the room information to be reviewed according to the total visit information.
  • a method for counting the number of people in a live broadcast room comprising: obtaining room attribute information corresponding to each of the multiple browsing rooms; and determining the visiting user corresponding to each browsing room according to the room attribute information set; determine the abnormal user set according to the plurality of visiting user sets; count the number of real users corresponding to each browsing room according to the visiting user set corresponding to each visiting user set and the abnormal user set.
  • the room attribute information of at least two counting rooms determine the set of visiting users corresponding to each counting room according to the room attribute information of each counting room, and determine the set of abnormal users according to the set of visiting users of each counting room , after the abnormal user set is determined, the number of real users corresponding to each browsing room can be counted according to the visiting user set and abnormal user set corresponding to each scanning room, and the abnormal user set can be automatically discarded in the subsequent population statistics
  • the number of heartbeats in the real-time removal of the number of users to ensure the correctness of the popularity data in the live broadcast room.
  • it is also possible to dynamically adjust the preset anti-scraping rules increase the number of rooms that are judged to be swiped, and reduce missed judgments on rooms that are swiped.
  • FIG. 2 shows a processing flow chart of a method for counting the number of people in a live broadcast room applied to count the real number of people online provided by an embodiment of the present application, which specifically includes the following steps:
  • Step 202 Determine 500 brushing rooms according to the preset anti-splashing rules.
  • Step 204 Obtain room attribute information of 500 brushed rooms.
  • the room IDs of 500 brushed rooms are acquired.
  • Step 206 Determine the set of visiting users corresponding to each swiping room according to the attribute information of each room.
  • the user set corresponding to each browsing room is determined according to the room ID of each browsing room.
  • Step 208 Determine at least two sets of visiting users to be processed in the 500 brushing rooms, determine the intersection of each set of visiting users to be processed, and determine the set of abnormal users according to the intersection.
  • Step 210 Obtain the number of abnormal users according to the set of abnormal users, determine the target room, obtain the number of users visiting the target room according to the set of visiting users corresponding to the target room, and calculate the target according to the number of visiting users and the number of abnormal users The number of real users corresponding to the swiping room.
  • the number of abnormal users is determined to be 100 according to the set of abnormal users, and the target browsing room m1 is determined among 500 reading rooms, and the number of online viewers in the target reading room m1 is obtained as 1500, according to the number of visiting users and the number of abnormal users, the number of real online viewers corresponding to the target brushing volume room is 1400.
  • Step 212 Obtain a set of rooms to be confirmed with the amount of browsing according to the set of abnormal users, and adjust the preset anti-swiping rules according to the set of rooms to be confirmed with the amount of usage.
  • the set of rooms to be confirmed is obtained, and it is judged whether each live room in the set of rooms to be confirmed is a room that has been confirmed. If the collection of rooms with data collection is all rooms with data collection, the preset anti-spoofing rules will not be adjusted; if there are unconfirmed rooms with suspected data collection in the rooms to be confirmed, the preset anti-spoofing rules will be lowered according to the set of rooms with suspected data collection The preset standard score in .
  • This application provides a method for counting the number of people in a live broadcast room, which is applied to counting the real number of people online, including: determining 500 rooms with swipes according to the preset anti-blogging rules; obtaining the room attribute information of the 500 swipes rooms; Determine at least two pending access user sets in the room, determine the intersection of each pending access user set, and determine the abnormal user set based on the intersection; obtain the number of abnormal users according to the abnormal user set, determine the target browsing volume room, and according to the target browsing volume
  • the visiting user set corresponding to the room obtains the number of visiting users of the target reading room, and counts the real number of users corresponding to the target reading room according to the number of visiting users and the abnormal user number; obtains the room set to be confirmed according to the abnormal user collection, and according to Adjust the preset anti-spam rules for the set of rooms to be confirmed.
  • Use the preset anti-scraping rules to determine the spam room reversely query the user sets of all the spam rooms according to the spam room, find the intersection of at least two access user sets, obtain the abnormal user set, and remove the spam in real time according to the abnormal user set Measure the number of users, obtain the real number of visitors to each live room, and dynamically adjust the standard of the preset anti-scrap rules, so as to more accurately judge whether the live room is a sparring room.
  • FIG. 3 shows a schematic structural diagram of a device for counting people in a live broadcast room provided by an embodiment of the present application.
  • the device includes:
  • the acquiring module 302 is configured to acquire the room attribute information corresponding to each brushing room in the multiple brushing rooms;
  • the first determining module 304 is configured to determine, according to the room attribute information, the set of visiting users corresponding to each browsing room;
  • the second determination module 306 is configured to determine an abnormal user set according to the plurality of access user sets
  • the statistical module 308 is configured to count the number of real users corresponding to each browsing room according to the visiting user set corresponding to each browsing room and the abnormal user set.
  • the first determination module 304 is further configured to:
  • the second determining module 306 is further configured to:
  • An intersection of at least two access user sets to be processed is determined, and an abnormal user set is determined according to the intersection, wherein the abnormal user set includes users with abnormal operations.
  • the second determining module 306 is further configured to:
  • An abnormal user set is determined according to the abnormal user attribute information set.
  • the statistics module 308 is also configured to:
  • the statistical module 308 is further configured to:
  • the number of real users corresponding to the target browsing room is counted.
  • the acquiring module 302 is further configured to:
  • the device also includes:
  • the adjustment module is configured to obtain a set of rooms to be confirmed for brushing according to the set of abnormal users;
  • the adjustment module is further configured to:
  • the target room to be reviewed is a room for brushing.
  • the adjustment module is further configured to:
  • a device for counting the number of people in a live broadcast room includes: an acquisition module configured to acquire room attribute information corresponding to each room in a plurality of rooms that have been checked; a first determination module configured to The attribute information determines the visiting user set corresponding to each browsing room; the second determination module is configured to determine the abnormal user set according to the plurality of visiting user sets; the statistical module is configured to correspond to each browsing room The access user set and the abnormal user set count the number of real users corresponding to each swiping room.
  • the room attribute information of at least two counting rooms determine the set of visiting users corresponding to each counting room according to the room attribute information of each counting room, and determine the set of abnormal users according to the set of visiting users of each counting room , after the abnormal user set is determined, the number of real users corresponding to each browsing room can be counted according to the visiting user set and abnormal user set corresponding to each scanning room, and the abnormal user set can be automatically discarded in the subsequent population statistics
  • the number of heartbeats in the real-time removal of the number of users to ensure the correctness of the popularity data in the live broadcast room.
  • Fig. 4 shows a structural block diagram of a computing device 400 provided according to an embodiment of the present application.
  • Components of the computing device 400 include, but are not limited to, a memory 410 and a processor 420 .
  • the processor 420 is connected to the memory 410 through the bus 430, and the database 450 is used for storing data.
  • Computing device 400 also includes an access device 440 that enables computing device 400 to communicate via one or more networks 460 .
  • networks include the Public Switched Telephone Network (PSTN), Local Area Network (LAN), Wide Area Network (WAN), Personal Area Network (PAN), or a combination of communication networks such as the Internet.
  • Access device 440 may include one or more of any type of network interface (e.g., a network interface card (NIC)), wired or wireless, such as an IEEE 802.11 wireless local area network (WLAN) wireless interface, Worldwide Interoperability for Microwave Access ( Wi-MAX) interface, Ethernet interface, Universal Serial Bus (USB) interface, cellular network interface, Bluetooth interface, Near Field Communication (NFC) interface, etc.
  • NIC network interface card
  • the above-mentioned components of the computing device 400 and other components not shown in FIG. 4 may also be connected to each other, such as through a bus. It should be understood that the structural block diagram of the computing device shown in FIG. 4 is only for the purpose of illustration, rather than limiting the scope of the application. Those skilled in the art can add or replace other components as needed.
  • Computing device 400 may be any type of stationary or mobile computing device, including mobile computers or mobile computing devices (e.g., tablet computers, personal digital assistants, laptop computers, notebook computers, netbooks, etc.), mobile telephones (e.g., smartphones), ), wearable computing devices (eg, smart watches, smart glasses, etc.), or other types of mobile devices, or stationary computing devices such as desktop computers or PCs.
  • Computing device 400 may also be a mobile or stationary server. Wherein, when the processor 420 executes the computer instructions, the steps of the method for counting the number of people in the live broadcast room are realized.
  • An embodiment of the present application also provides a computer-readable storage medium, which stores computer instructions, and when the computer instructions are executed by a processor, the steps of the aforementioned method for counting people in a live broadcast room are implemented.
  • the computer instructions include computer program code, which may be in source code form, object code form, executable file or some intermediate form or the like.
  • the computer-readable medium may include: any entity or device capable of carrying the computer program code, a recording medium, a USB flash drive, a removable hard disk, a magnetic disk, an optical disk, a computer memory, and a read-only memory (ROM, Read-Only Memory) , Random Access Memory (RAM, Random Access Memory), electrical carrier signal, telecommunication signal and software distribution medium, etc.
  • ROM Read-Only Memory
  • RAM Random Access Memory
  • electrical carrier signal telecommunication signal and software distribution medium, etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Social Psychology (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Computer Graphics (AREA)
  • Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

本申请提供直播间人数统计方法及装置,其中直播间人数统计方法包括:获取多个刷量房间中每个刷量房间对应的房间属性信息(102);根据房间属性信息确定每个刷量房间对应的访问用户集合(104);根据多个访问用户集合确定异常用户集合(106);根据每个刷量房间对应的访问用户集合和异常用户集合统计每个刷量房间对应的真实用户数(108)。通过反向查询刷量房间的访问用户集合,根据访问用户集合确定异常用户集合,根据异常用户集合去除刷量用户人数,从而获得直播间的真实访问人数,并且可以在以后的人数统计中,自动丢弃异常用户集合中的心跳人数,实现了实时去除刷量人数,保证了每个直播间人气数据的正确性。

Description

直播间人数统计方法及装置
本申请要求于2022年02月09日提交中国专利局、申请号为202210122950.6、发明名称为“直播间人数统计方法及装置”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
技术领域
本申请涉及网络直播技术领域,特别涉及一种直播间人数统计方法。本申请同时涉及一种直播间人数统计装置,一种计算设备,以及一种计算机可读存储介质。
背景技术
随着网络通信技术的进步和宽带网络的提速,直播得到了越来越多的发展和应用。在现有直播体系中,人气是用于直播平台各个房间排名的重要指标,一般而言人气越高,排名越靠前,主播越有可能被用户观看。人气计算中直播间实时观看人数是关键一环,而一些主播为了提高人气,会通过非法手段模拟观看直播间,伪造直播间的在线观看人数,即通过刷量提高人气排名。
在正常房间中的访问人数即为真实人数,但在刷量房间中,访问人数不是真实人数,目前可以通过防刷策略确定出刷量房间,但是在确定出刷量房间之后,无法获得刷量房间的真实人数。因此,如何确保直播间人气数据的正确性是目前亟需解决的问题。
发明内容
有鉴于此,本申请实施例提供了一种直播间人数统计方法。本申请同时涉及一种直播间人数统计装置,一种计算设备,以及一种计算机可读存储介质,以解决现有技术中存在的无法准确的获取直播间观看人数的问题。
根据本申请实施例的第一方面,提供了一种直播间人数统计方法,包括:
获取多个刷量房间中每个刷量房间对应的房间属性信息;
根据所述房间属性信息确定所述每个刷量房间对应的访问用户集合;
根据多个所述访问用户集合确定异常用户集合;
根据每个刷量房间对应的访问用户集合和所述异常用户集合统计每个刷量房间对应的真实用户数。
根据本申请实施例的第二方面,提供了一种直播间人数统计装置,包括:
获取模块,被配置为获取多个刷量房间中每个刷量房间对应的房间属性信息;
第一确定模块,被配置为根据所述房间属性信息确定所述每个刷量房间对应的访问用户集合;
第二确定模块,被配置为根据多个所述访问用户集合确定异常用户集合;
统计模块,被配置为根据每个刷量房间对应的访问用户集合和所述异常用户集合统计每个刷量房间对应的真实用户数。
根据本申请实施例的第三方面,提供了一种计算设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机指令,所述处理器执行所述计算机指令时实现所述直播间人数统计方法的步骤。
根据本申请实施例的第四方面,提供了一种计算机可读存储介质,其存储有计算机指令,该计算机指令被处理器执行时实现所述直播间人数统计方法的步骤。
本申请提供的直播间人数统计方法,包括:获取多个刷量房间中每个刷量房间对应的房间属性信息;根据所述房间属性信息确定所述每个刷量房间对应的访问用户集合;根据多个所述访问用户集合确定异常用户集合;根据每个刷量房间对应的访问用户集合和所述异常用户集合统计每个刷量房间对应的真实用户数。
本申请一实施例实现了通过获取至少两个刷量房间的房间属性信息,根据每个刷量房间的房间属性信息确定每个刷量房间对应的访问用户集合,根据每个刷量房间的访问用户集合确定出异常用户集合,确定出异常用户集合之后,可以根据每个刷量房间对应的访问用户集合和异常用户集合统计每个刷量房间对应的真实用户数,并且可以在以后的人数统计中,自动丢弃异常用户集合中的心跳人数,实现了实时去除刷量人数,保证了直播间人气数据的正确性。
附图说明
图1是本申请一实施例提供的一种直播间人数统计方法的流程图;
图2是本申请一实施例提供的一种应用于统计在线真实人数的直播间人数统计方法的处理流程图;
图3是本申请一实施例提供的一种直播间人数统计装置的结构示意图;
图4是本申请一实施例提供的一种计算设备的结构框图。
具体实施方式
在下面的描述中阐述了很多具体细节以便于充分理解本申请。但是本申请能够以很多不同于在此描述的其它方式来实施,本领域技术人员可以在不违背本申请内涵的情况下做类似推广,因此本申请不受下面公开的具体实施的限制。
在本申请一个或多个实施例中使用的术语是仅仅出于描述特定实施例的目的,而非旨在限制本申请一个或多个实施例。在本申请一个或多个实施例和所附权利要求书中所使用的单数形式的“一种”、“所述”和“该”也旨在包括多数形式,除非上下文清楚地表示其他含义。还应当理解,本申请一个或多个实施例中使用的术语“和/或”是指并包含一个或多个相关联的列出项目的任何或所有可能组合。
应当理解,尽管在本申请一个或多个实施例中可能采用术语第一、第二等来描述各种信息,但这些信息不应限于这些术语。这些术语仅用来将同一类型的信息彼此区分开。例如,在不脱离本申请一个或多个实施例范围的情况下,第一也可以被称为第二,类似地,第二也可以被称为第一。取决于语境,如在此所使用的词语“如果”可以被解释成为“在……时”或“当……时”或“响应于确定”。
首先,对本申请一个或多个实施例涉及的名词术语进行解释。
直播人气:综合观看人数、弹幕连接数、礼物数等按照一定比例算出的数值,用于在直播平台按照人气的高低进行排名。
直播人数:实时观看直播间的真实人数。
刷量:通过模拟正常用户访问,产生大量虚假观看的情况,即通过非法手段模拟观看直播间。
刷量房间:通过非法手段产生大量虚假人气的直播间,刷量房间中有许多非法用户通过非法手段为刷量房间产生大量观看人数、弹幕等虚假数据。
异常用户:通过非法手段为直播间刷取大量虚假人气信息、弹幕信息的用户。
防刷:通过技术手段,识别非法访问的请求,即识别刷量的黑名单地址的访问请求,并拒绝该非法请求。
心跳汇总服务:实时汇总每个直播间的心跳信息,心跳信息可以是直播间的观看人数、直播间的弹幕数等信息。由于心跳汇总服务只做一件事情就是汇总每个房间的心跳信息,所以心跳汇总服务可以做到实时汇总。
在现有直播体系中,直播间人数是衡量直播间人气的重要参数之一,直播间人气可用于对直播平台 房间排名。一般而言,人气越高排名越靠前,主播越有可能被用户观看,因此,直播间的观看人数对于该直播间尤为重要。
一般的用户观看直播间的流程为:用户点击自己想要观看的直播间,选择进入直播间,客户端向直播平台的调度服务请求播放地址,拿到播放地址后开始为用户播放该直播间的直播内容。此后,客户端在后台定时的向直播平台的数据库上报播放日志,播放日志的具体信息包括:直播间标识信息+用户标识信息+其他信息,同时客户端也会向直播平台的心跳汇总服务定时上报心跳信息,心跳信息包括:房间标识信息+用户标识信息。心跳汇总服务基于每个观看用户客户端上报的心跳信息,可以实时汇总每个直播间的观看人数信息。
由于目前有部分非法人员通过模拟正常用户访问,为刷量房间产生大量虚假观看的情况,以此获取非法利益。因此,现在很多直播平台都有各自的防刷策略来统计直播间真实人数,例如,一般都会基于IP的防刷,利用播放器的播放日志,全盘统计所有IP出现的次数,在同一IP出现的次数超过预设频率的情况下,即将该IP设置为非法IP,该IP的用户为异常用户,在下一次该IP上报人数心跳信息时,心跳汇总服务默认选择丢弃该人数,不进行人数统计以起到防刷的作用。但是,上述防刷策略具有以下几点弊端:
误伤概率大:很多小区/学校/大型办公楼的出口IP是固定的一个或者多个,因此很容易出现IP访问过多,即IP超过预设频率的情况。
针对性低:由于上述防刷策略是从全盘的角度统计IP的出现次数,在非法人员拥有多个IP且访问频率低的情况下,这些非法IP很容易出现的次数达不到很高的频率,会出现漏判的情况,如果降低预设频率又容易出现误伤的情况。
延迟性高:播放日志并不是实时上报的,因为播放日志只是用来分析数据,对实时性要求不高,所以允许客户端可以延迟上报。接收播放日志的数据库由于接收的播放日志或者信息上报非常多,一般都是n+1的形式,即在第n天上报,数据库进行日志的分类处理后,在第n+1天才可以查询到相关业务的播放日志。
基于此在本申请中,提供了一种直播间人数统计方法,用于解决上述防刷策略的问题。本申请同时涉及一种直播间人数统计装置,一种计算设备,以及一种计算机可读存储介质,在下面的实施例中逐一进行详细说明。
图1示出了根据本申请一实施例提供的一种直播间人数统计方法的流程图,具体包括以下步骤:
步骤102:获取多个刷量房间中每个刷量房间对应的房间属性信息。
其中,房间属性信息可以理解为每个直播间对应的属性信息,房间属性信息可以是直播间的房间标识,例如直播间的房间号、直播间的主播ID等,房间属性信息还可以包括直播间的当前观看人数、弹幕数等信息。
在本申请一具体实施例中,确定出刷量房间有100个,获取这100个刷量房间的房间号。
为了确定出异常用户,需要先确定出当前线上的所有刷量房间,具体地,获取至少两个刷量房间的房间属性信息之前,还包括:
根据预设防刷规则确定至少两个刷量房间。
其中,预设防刷规则可以理解为用于判断一个直播间是否为刷量房间的规则,根据预设防刷规则可以精准地确定直播间是否刷量,刷量房间可以理解为通过非法手段产生大量虚假观看人数的直播房间。
在实际应用中,一个直播平台中可能有许多直播间通过非法技术手段产生大量虚假观看人数,可以通过预设防刷规则对直播平台中的每个直播间都进行判断,筛选出刷量房间。预设防刷规则可以是从多维度判断直播间是否为刷量房间,例如,可以综合目标直播间的弹幕连接数、关注数等维度的参数来判断。判断直播间是否为刷量房间的方法有很多种,本申请在此不作具体限制,可根据实际情况选择相应的防刷规则。
在本申请一具体实施例中,当前直播平台中有1000个直播间正在直播,通过每个直播间的弹幕连 接数、关注数等参数计算出每个直播间对应的刷量分值,若当前直播间的刷量分值超过预设分值,则判断当前直播间为刷量房间;相应地,若当前直播间的刷量分值未超过预设分值,则判断当前直播间为正常房间。通过预设防刷规则最终判断出当前直播平台的1000个直播间中,有400个直播间为刷量房间。
步骤104:根据所述房间属性信息确定所述每个刷量房间对应的访问用户集合。
其中,访问用户集合可以理解为观看直播间的用户集合,访问用户集合中的用户可能有正常用户也可能有刷量用户(异常用户),访问用户集合中可以包括有用户的ID、用户名、用户IP等。访问用户集合可以是正在直播的直播间的在线观看用户的集合,也可以是获取当前直播间的预设历史时间段内的历史观看用户的集合。
在实际应用中,根据当前直播间的房间属性信息可以从直播平台的数据库中获取到当前直播间的历史访问用户集合,若当前直播间正在直播也可以获取到当前正在观看的访问用户集合。需要注意的是,为了保证后续判断的准确性,在访问用户集合为历史访问用户集合的情况下,应该统一获取每个刷量房间的历史访问用户集合;在访问用户集合为在线访问用户集合的情况下,应该统一获取每个刷量房间的在线访问用户集合。
在本申请一具体实施例中,沿用上例,根据400个刷量房间各自的房间号,在直播平台中的数据库中查找每个刷量房间过去7天内的所有访问用户ID,根据每个刷量房间的所有的访问用户ID,生成每个刷量房间各自对应的访问用户集合。
在本申请另一具体实施例中,沿用上例,根据400个刷量房间各自的房间号,获取到每个正在直播的刷量房间的当前在线观看用户ID,根据每个刷量房间的当前在线观看用户ID,生成每个刷量房间各自对应的访问用户集合。
具体地,根据所述房间属性信息确定所述每个刷量房间对应的访问用户集合,包括:
确定目标刷量房间;
根据所述目标刷量房间的房间属性信息在访问用户数据库中获取所述目标刷量房间对应的访问用户集合。
其中,目标刷量房间可以理解为在所有刷量房间中选择出的一个刷量房间。访问用户数据库可以理解为直播平台中存储用户数据的数据库,数据库中可以按照每个直播间的房间属性信息分别将每个直播间的相关信息进行存储。例如,按照每个直播间的房间号,依次存储各自的直播观看人数、直播弹幕数、礼物数等。
在实际应用中,在确定访问用户集合中,由于存在许多刷量房间,因而需要进行逐个处理确定。首先在众多刷量房间中确定出目标刷量房间,在根据目标刷量房间的房间属性信息确定出目标刷量房间对应的访问用户集合。再选择下一个刷量房间作为目标刷量房间,依次进行相同的处理操作,直到确定出每个刷量房间各自对应的访问用户集合。
在本申请一具体实施例中,沿用上例,在400个刷量房间中确定出一个目标刷量房间A,根据目标刷量房间A的房间号在直播平台的访问用户数据库中获取目标刷量房间A对应的访问用户集合,访问用户集合中为目标刷量房间A过去3天的访问用户集合,相应地,获取每个刷量房间各自对应的访问用户集合,其中,获取每个目标刷量房间的访问用户集合可以并行处理以提高处理效率。
在本申请另一具体实施例中,沿用上例,在400个刷量房间中确定出一个目标刷量房间A,获取目标刷量房间A的当前在线观看用户信息,并生成目标刷量房间A对应的访问用户集合。
步骤106:根据多个所述访问用户集合确定异常用户集合。
其中,异常用户集合可以理解为刷量用户的集合,异常用户集合中可以包括有异常用户的用户名、用户ID、用户IP等。
在实际应用中,在多个刷量房间的访问用户集合中必然存在异常用户,根据至少两个刷量房间对应的访问用户集合可以确定出异常用户集合。具体地,确定异常用户集合之前,需要判断访问用户集合中的每个访问用户是否为异常用户,可以根据异常用户的访问频率确定,访问频率可以是用户访问次数除 以刷量房间数,在访问用户的访问频率超过预设访问频率时,则确定当前访问用户为异常用户,反之则为正常用户。例如,刷量房间数为100,访问用户A的访问次数为25,访问用户A的访问频率为25%,没有超过预设访问频率30%,则访问用户A为正常用户;访问用户B的访问次数40,访问用户B的访问频率为40%,超过预设访问频率30%,则访问用户B为异常用户。
在本申请一具体实施例中,沿用上例,目前有400个刷量房间以及每个刷量房间对应的访问用户集合,确定每个访问用户集合中的访问用户的访问频率,判断每个访问用户的访问频率是否超过预设访问频率20%,将超过预设访问频率的访问用户添加至异常用户集合中。
具体地,根据多个所述访问用户集合确定异常用户集合,包括:
在多个所述访问用户集合中确定至少两个待处理访问用户集合;
确定至少两个待处理访问用户集合的交集,并根据所述交集确定异常用户集合,其中异常用户集合中包括有异常操作的用户。
其中,在多个所述访问用户集合中确定至少两个待处理访问用户集合可以理解为在所有访问用户集合中确定出至少两个刷量房间的目标访问用户集合(待处理访问用户集合),例如在100个刷量房间的访问用户集合中确定出20个待处理访问用户集合,需要注意的是待处理访问用户集合的数量可以与刷量房间的访问用户集合总数相等,即100个刷量房间中可以确定出100个待处理访问用户集合。确定每个待处理访问用户集合的交集可以理解为判断哪些用户均看过这些待处理访问用户集合对应的刷量房间。
在实际应用中,确定至少两个待处理访问用户集合的交集的目的是根据交集确定出异常用户集合,求交集的原因是:每个异常用户在一个直播平台进行刷量时必然不会只接收一个主播或一个直播间的刷量订单,一个异常用户会对多个直播间进行刷量。由于每个异常用户的服务器均是固定不变且服务器本身不会随意改变,因此每个异常用户的出口IP也是固定不变的,很容易出现同一个IP访问多个房间的情况。
在本申请一具体实施例中,沿用上例,在400个访问用户集合中确定出100个待处理访问用户集合,根据100个待处理访问用户集合找出这个100个刷量房间对应的访问用户集合的交集,交集中有10个用户IP,这10个IP对应的用户在过去3天内均访问过这100个刷量房间,则将这10个用户IP作为异常IP,生成一个异常用户集合。
在实际应用中,也可以在多个刷量房间的访问用户集合中确定出一定数量的待处理访问用户集合,在这些待处理访问用户集合中,查找每个用户访问这些待处理访问用户集合对应的刷量房间的次数超过预设次数。预设次数可以根据刷量房间的数量设定。例如,待处理访问用户集合对应的刷量房间为100个,则预设次数可以设置为刷量房间数量的10%,即预设次数为10次。由于是在待处理访问用户集合中筛选异常用户集合,因此相较于全盘统计异常用户误伤概率低、针对性强。
在本申请一具体实施例中,在400个刷量房间对应的访问用户集合中确定出100个待处理访问用户集合之后,针对每个待处理访问用户集合里的每个用户,查询每个用户访问这100个待处理访问用户集合对应的刷量房间中的直播间的次数,用户A访问这100个刷量房间中的直播间的次数为20次,超过预设次数10次,则用户A为异常用户,依次完成对每个用户的检测以筛选出异常用户,并生成异常用户集合。
在访问用户集合中为用户名、用户ID的情况下,需要根据用户名和/或用户ID获取到每个用户对应的用户属性信息集合。具体地,确定至少两个待处理访问用户集合的交集,并根据所述交集确定异常用户集合,包括:
根据每个待处理访问用户集合获取每个所述待处理访问用户集合对应的用户属性信息集合;
确定每个待处理访问用户集合的用户属性信息集合的交集,确定异常用户属性信息集合;
根据所述异常用户属性信息集合确定异常用户集合。
其中,用户属性信息集合可以理解为包含每个用户对应的属性信息的集合,用户属性信息集合里可 以包括有用户IP、用户发送弹幕数、用户发送礼物数等信息。
在实际应用中,根据每个房间属性信息可以在访问用户数据库中获取到每个房间的访问用户,即在获取到每个房间访问用户的用户名的情况下,需要根据每个访问用户的用户名,获取到每个访问用户对应的用户IP。
在本申请一具体实施例中,沿用上例,确定出100个待处理访问用户集合之后,根据100个待处理访问用户集合获取到每个待处理访问用户集合对应的刷量房间的用户属性信息集合,用户属性信息集合中包括有每个访问用户的访问IP,根据每个刷量房间对应的用户属性信息集合确定出包含有异常用户IP的异常用户属性信息集合,根据异常用户属性信息集合确定包含有用户ID和/或用户名的异常用户集合。
步骤108:根据每个刷量房间对应的访问用户集合和所述异常用户集合统计每个刷量房间对应的真实用户数。
其中,真实用户数可以理解为观看直播间的真实人数,真实用户数可以是访问用户集合中的用户数量减去存在的异常用户数量。真实用户数可以是刷量房间过去n天的真实观看人数,也可以是刷量房间当前正在观看的真实观看人数。
在实际应用中,每个刷量房间的访问用户集合中必然有异常用户,这些异常用户通过非法技术手段模拟正常用户观看,为刷量房间提供大量虚假的观看人数,因此为了保证每个刷量房间的观看人数的真实性,可以通过每个刷量房间对应的访问用户集合和异常用户集合统计每个刷量房间对应的真实观看人数。例如,访问用户集合中有100个用户,异常用户集合中有20个用户,这20个异常用户都在访问用户集合中,则真实观看人数为80。
在本申请一具体实施例中,沿用上例,根据100个刷量房间对应的访问用户集合和异常用户集合统计该100个刷量房间对应的真实用户数。
具体地,在根据每个刷量房间对应的访问用户集合和所述异常用户集合统计每个刷量房间对应的真实用户数之前,还包括:根据所述异常用户集合获取异常用户数。
其中,异常用户数可以理解为根据异常用户集合统计出的异常用户数量。
在实际应用中,异常用户集合中可能包括有用户ID、用户名、用户IP等,则可以根据异常用户集合统计出异常用户的数量。
在本申请一具体实施例中,沿用上例,在根据每个刷量房间对应的访问用户集合和异常用户集合统计每个刷量房间对应的真实用户数之前,需要根据异常用户集合获取到这100个刷量房间的异常用户数量,异常用户数量为20。
在根据异常用户集合获取异常用户数之后,则需要根据每个刷量房间对应的访问用户集合获取每个刷量房间对应的访问用户数,具体地,根据每个刷量房间对应的访问用户集合和所述异常用户集合统计每个刷量房间对应的真实用户数,包括:
确定目标刷量房间,根据所述目标刷量房间对应的访问用户集合获取所述目标刷量房间的访问用户数;
根据所述访问用户数和所述异常用户数统计目标刷量房间对应的真实用户数。
其中,目标刷量房间可以理解为在多个刷量房间中确定出的一个目标房间,针对该目标刷量房间进行进一步的处理。访问用户数可以理解为访问目标刷量房间的总人数,总人数中包括有正常用户的观看人数和异常用户的刷量人数。
在实际应用中,每个刷量房间的访问用户集合中包括有正常用户的用户信息和异常用户的用户信息,根据访问用户集合获取的访问用户数为正常用户加上异常用户数,因而可以根据访问用户数和异常用户数统计出每个刷量房间的真实用户数。
在本申请一具体实施例中,沿用上例,在100个刷量房间中确定出目标刷量房间R1,根据目标刷量房间R1对应的访问用户集合获取目标刷量房间R1对应的访问数100,根据访问用户数和异常用户数统计出目标刷量房间对应的真实用户数80。
具体地,根据所述访问用户数和所述异常用户数统计目标刷量房间对应的真实用户数,包括:
根据所述访问用户数和所述异常用户数统计目标刷量房间对应的目标异常用户数;
根据所述访问用户数和所述目标异常用户数统计目标刷量房间对应的真实用户数。
其中,目标异常用户数可以理解为观看目标刷量房间的异常用户数量,在实际应用中,异常用户集合中的每个异常用户可能没有访问过每个刷量房间,因此,在统计目标刷量房间的真实用户数之前,需要确定目标刷量房间中有多少异常用户。例如,访问用户集合中有100个用户,异常用户集合中有20个用户,但是只有10个异常用户在访问用户集合中,则真实观看人数为90。
在本申请一具体实施例中,沿用上例,确定目标刷量房间R1的访问用户数为100,确定异常用户数20人中只有5人在访问用户数中,因此,目标刷量房间R1的真实用户数为95。
在实际应用中,获取到异常用户数之后可以将异常用户数以及异常用户集合发送至直播平台的心跳汇总服务,在直播平台汇总出本次确定的刷量房间之外的直播间的人数时,可以直接忽略异常用户集合中异常用户上报的心跳,从而准确地统计其他直播间的人数,起到防刷的作用,并且无需再次进行直播间是否刷量的判断。
在上述直播间人数统计方法中,刷量房间是根据预设防刷规则确定的,由于预设防刷规则本身是人为设定的,因此会有一些漏判的刷量房间。基于刷量用户会访问多个直播间进行刷量的现象,可以根据异常用户集合在直播平台的访问用户数据库中查询到每个异常用户的访问记录,根据访问记录可以获取到每个异常用户访问过哪些直播间,生成一个列表,判断列表中的直播间是否为我们已经确定的刷量房间,若是,则不做处理;若否,则将其添加至疑似刷量房间列表中。
具体地,所述方法还包括:
根据所述异常用户集合获取待确认刷量房间集合;
根据所述待确认刷量房间集合调整所述预设防刷规则。
其中,待确认刷量房间集合可以理解为疑似刷量房间集合,即待确认这些房间是否为刷量房间。预设防刷规则是用于判断当前直播间是否为刷量房间的规则,调整预设防刷规则可以理解为调整预设防刷规则中的预设标准值,在当前直播间的疑似值大于预设标准值时,确定当前直播间为刷量房间;在当前直播间的疑似值小于或等于预设标准值时,确定当前直播间为正常房间。
在实际应用中,在获取异常用户集合之后,还可以相应的调整判断直播间是否为刷量房间的预设防刷规则,使得根据预设防刷规则能够更准确的判断出刷量房间。
在本申请一具体实施例中,沿用上例,根据异常用户集合中的20名异常用户,获取这20名异常用户的访问记录从而确定待确认刷量房间集合,待确认刷量房间集合中有1000个待确认刷量房间,根据待确认刷量房间集合调整预设防刷规则。
具体地,根据所述异常用户集合获取待确认刷量房间集合,包括:
根据所述异常用户集合获取待审核房间集合;
在所述待审核房间集合中确定目标待审核房间;
判断所述至少两个刷量房间中是否包括所述目标待审核房间;
若否,则将所述目标待审核房间添加至待确认刷量房间集合;
若是,则所述目标待审核房间为刷量房间。
其中,待审核房间集合可以理解为所有异常用户访问的房间集合,待审核房间集合中的直播间中可能有已确定为刷量的刷量房间和待确定的未知房间。
在实际应用中,根据异常用户集合获取所有异常用户的访问房间集合,该集合即为待审核房间集合,根据待审核房间集合中的直播间确定是否有漏判的情况发生。
在本申请一具体实施例中,沿用上例,根据异常用户集合获取待审核房间集合,在待审核房间集合中确定出目标待审核房间R2,判断目标待审核房间R2是否在之前的刷量房间集合中,判断结果为在,则说明目标待审核房间R2为已经确认的刷量房间。
在本申请另一具体实施例中,沿用上例,根据异常用户集合获取待审核房间集合,在待审核房间集合中确定出目标待审核房间R2,判断目标待审核房间R2是否在之前的刷量房间集合中,判断结果为不在,则说明目标待审核房间R2有可能是未检测到的刷量房间也有可能是正常房间。
具体地,根据所述异常用户集合获取待审核房间集合,包括:
在所述异常用户集合中确定目标异常用户;
获取所述目标异常用户的目标访问信息;
根据每个异常用户对应的目标访问信息获取待审核房间集合。
其中,目标异常用户可以理解为在异常用户集合中选择出的一个异常用户,目标异常用户为接下来的处理对象,目标访问信息可以理解为目标异常用户的访问记录,访问信息中可以包括有访问时间、访问地址(访问房间号)等信息。
在实际应用中,针对异常用户集合中的每一个异常用户进行处理,选择出异常用户集合中的目标异常用户,根据目标异常用户的访问信息,以此类推获取每个异常用户的访问信息,待审核房间集合可以是所有异常用户根据预设时间确定出的访问房间集合。例如,预设时间为3天,则获取所有异常用户过去3天的访问房间集合,确定待审核房间集合。可以根据每个异常用户的访问信息获得各自的待审核房间子集合,最后汇总生成待审核房间集合,也可以先汇总每个异常用户的访问信息,根据总的访问信息确定待审核房间集合。
在本申请一具体实施例中,在异常用户集合中确定目标异常用户A,获取目标异常用户A的目标访问信息,根据目标访问信息可以获取到目标异常用户的待审核房间子集合,获取到每个异常用户的待审核房间子集合,根据每个待审核房间子集合生成待审核房间集合。
在本申请另一具体实施例中,沿用上例,在异常用户集合中确定目标异常用户A,获取目标异常用户A的目标访问信息,并确定下一个目标异常用户,获取对应的目标访问信息,根据每个异常用户对应的目标访问信息汇总成总访问信息,根据总访问信息确定待审核房间信息。
本申请提供的一种直播间人数统计方法,包括:获取多个刷量房间中每个刷量房间对应的房间属性信息;根据所述房间属性信息确定所述每个刷量房间对应的访问用户集合;根据多个所述访问用户集合确定异常用户集合;根据每个刷量房间对应的访问用户集合和所述异常用户集合统计每个刷量房间对应的真实用户数。通过获取至少两个刷量房间的房间属性信息,根据每个刷量房间的房间属性信息确定每个刷量房间对应的访问用户集合,根据每个刷量房间的访问用户集合确定出异常用户集合,确定出异常用户集合之后,可以根据每个刷量房间对应的访问用户集合和异常用户集合统计每个刷量房间对应的真实用户数,并且可以在以后的人数统计中,自动丢弃异常用户集合中的心跳人数,实现了实时去除刷量人数,保证了直播间人气数据的正确性。并且,还可以动态调整预设防刷规则,提高判断刷量房间数量,减少对刷量房间的漏判。
下述结合附图2,以本申请提供的直播间人数统计方法在统计在线真实人数的应用为例,对所述直播间人数统计方法进行进一步说明。其中,图2示出了本申请一实施例提供的一种应用于统计在线真实人数的直播间人数统计方法的处理流程图,具体包括以下步骤:
步骤202:根据预设防刷规则确定500个刷量房间。
在本申请一具体实施例中,根据预设的防刷规则确定出直播平台中当前正在直播的直播间中有500个刷量房间。
步骤204:获取500个刷量房间的房间属性信息。
在本申请一具体实施例中,沿用上例,获取500个刷量房间的房间ID。
步骤206:根据每个房间属性信息确定每个刷量房间对应的访问用户集合。
在本申请一具体实施例中,沿用上例,根据每个刷量房间的房间ID确定出每个刷量房间对应的访问用户集合。
步骤208:在500个刷量房间中确定至少两个待处理访问用户集合,确定每个待处理访问用户集合 的交集,并根据交集确定异常用户集合。
在本申请一具体实施例中,沿用上例,在500个刷量房间中确定出500个待处理访问用户集合,确定这500个刷量房间的待处理访问用户集合的交集,求交集的结果为异常用户集合。
步骤210:根据异常用户集合获取异常用户数,确定目标刷量房间,根据目标刷量房间对应的访问用户集合获取目标刷量房间的访问用户数,根据访问用户数和所述异常用户数统计目标刷量房间对应的真实用户数。
在本申请一具体实施例中,沿用上例,根据异常用户集合确定出异常用户数为100,在500个刷量房间中确定目标刷量房间m1,获取目标刷量房间m1的在线观看人数为1500,根据访问用户数和异常用户数统计出目标刷量房间对应的在线真实观看人数为1400。
步骤212:根据异常用户集合获取待确认刷量房间集合,根据待确认刷量房间集合调整所述预设防刷规则。
在本申请一具体实施例中,沿用上例,根据异常用户集合获取待确认刷量房间集合,判断待确认刷量房间集合中的每个直播间是否为已经确定的刷量房间,若待确认刷量房间集合中都是刷量房间,则不调整预设防刷规则;若待确认刷量房间中有未确认的疑似刷量房间,则根据疑似刷量房间集合调低预设防刷规则中的预设标准分。
本申请提供的一种应用于统计在线真实人数的直播间人数统计方法,包括:根据预设防刷规则确定500个刷量房间;获取500个刷量房间的房间属性信息;在500个刷量房间中确定至少两个待处理访问用户集合,确定每个待处理访问用户集合的交集,并根据交集确定异常用户集合;根据异常用户集合获取异常用户数,确定目标刷量房间,根据目标刷量房间对应的访问用户集合获取目标刷量房间的访问用户数,根据访问用户数和所述异常用户数统计目标刷量房间对应的真实用户数;根据异常用户集合获取待确认刷量房间集合,根据待确认刷量房间集合调整所述预设防刷规则。通过预设防刷规则确定刷量房间,根据刷量房间反向查询所有刷量房间的访问用户集合,求取至少两个访问用户集合的交集,获取异常用户集合,根据异常用户集合实时去除刷量用户人数,获得每个直播间的真实访问人数,并且动态调整预设防刷规则的标准,从而能够更加准确的判断直播间是否为刷量房间。
与上述方法实施例相对应,本申请还提供了直播间人数统计装置实施例,图3示出了本申请一实施例提供的一种直播间人数统计装置的结构示意图。如图3所示,该装置包括:
获取模块302,被配置为获取多个刷量房间中每个刷量房间对应的房间属性信息;
第一确定模块304,被配置为根据所述房间属性信息确定所述每个刷量房间对应的访问用户集合;
第二确定模块306,被配置为根据多个所述访问用户集合确定异常用户集合;
统计模块308,被配置为根据每个刷量房间对应的访问用户集合和所述异常用户集合统计每个刷量房间对应的真实用户数。
可选地,所述第一确定模块304,进一步被配置为:
确定目标刷量房间;
根据所述目标刷量房间的房间属性信息在访问用户数据库中获取所述目标刷量房间对应的访问用户集合。
可选地,所述第二确定模块306,进一步被配置为:
在多个所述访问用户集合中确定至少两个待处理访问用户集合;
确定至少两个待处理访问用户集合的交集,并根据所述交集确定异常用户集合,其中异常用户集合中包括有异常操作的用户。
可选地,所述第二确定模块306,进一步被配置为:
根据每个待处理访问用户集合获取每个所述待处理访问用户集合对应的用户属性信息集合;
确定每个待处理访问用户集合的用户属性信息集合的交集,确定异常用户属性信息集合;
根据所述异常用户属性信息集合确定异常用户集合。
可选地,所述统计模块308,还被配置为:
根据所述异常用户集合获取异常用户数。
可选地,所述统计模块308,进一步被配置为:
确定目标刷量房间,根据所述目标刷量房间对应的访问用户集合获取所述目标刷量房间的访问用户数;
根据所述访问用户数和所述异常用户数统计目标刷量房间对应的真实用户数。
可选地,所述获取模块302,还被配置为:
根据预设防刷规则确定至少两个刷量房间。
可选地,所述装置还包括:
调整模块,被配置为根据所述异常用户集合获取待确认刷量房间集合;
根据所述待确认刷量房间集合调整所述预设防刷规则。
可选地,所述调整模块,还被配置为:
根据所述异常用户集合获取待审核房间集合;
在所述待审核房间集合中确定目标待审核房间;
判断所述至少两个刷量房间中是否包括所述目标待审核房间;
若否,则将所述目标待审核房间添加至待确认刷量房间集合;
若是,则所述目标待审核房间为刷量房间。
可选地,所述调整模块,还被配置为:
在所述异常用户集合中确定目标异常用户;
获取所述目标异常用户的目标访问信息;
根据每个异常用户对应的目标访问信息获取待审核房间集合。
本申请提供的一种直播间人数统计装置,包括:获取模块,被配置为获取多个刷量房间中每个刷量房间对应的房间属性信息;第一确定模块,被配置为根据所述房间属性信息确定所述每个刷量房间对应的访问用户集合;第二确定模块,被配置为根据多个所述访问用户集合确定异常用户集合;统计模块,被配置为根据每个刷量房间对应的访问用户集合和所述异常用户集合统计每个刷量房间对应的真实用户数。通过获取至少两个刷量房间的房间属性信息,根据每个刷量房间的房间属性信息确定每个刷量房间对应的访问用户集合,根据每个刷量房间的访问用户集合确定出异常用户集合,确定出异常用户集合之后,可以根据每个刷量房间对应的访问用户集合和异常用户集合统计每个刷量房间对应的真实用户数,并且可以在以后的人数统计中,自动丢弃异常用户集合中的心跳人数,实现了实时去除刷量人数,保证了直播间人气数据的正确性。
上述为本实施例的一种直播间人数统计装置的示意性方案。需要说明的是,该直播间人数统计装置的技术方案与上述的直播间人数统计方法的技术方案属于同一构思,直播间人数统计装置的技术方案未详细描述的细节内容,均可以参见上述直播间人数统计方法的技术方案的描述。
图4示出了根据本申请一实施例提供的一种计算设备400的结构框图。该计算设备400的部件包括但不限于存储器410和处理器420。处理器420与存储器410通过总线430相连接,数据库450用于保存数据。
计算设备400还包括接入设备440,接入设备440使得计算设备400能够经由一个或多个网络460通信。这些网络的示例包括公用交换电话网(PSTN)、局域网(LAN)、广域网(WAN)、个域网(PAN)或诸如因特网的通信网络的组合。接入设备440可以包括有线或无线的任何类型的网络接口(例如,网络接口卡(NIC))中的一个或多个,诸如IEEE802.11无线局域网(WLAN)无线接口、全球微波互联接入(Wi-MAX)接口、以太网接口、通用串行总线(USB)接口、蜂窝网络接口、蓝牙接口、近场通信(NFC)接口,等等。
在本申请的一个实施例中,计算设备400的上述部件以及图4中未示出的其他部件也可以彼此相连 接,例如通过总线。应当理解,图4所示的计算设备结构框图仅仅是出于示例的目的,而不是对本申请范围的限制。本领域技术人员可以根据需要,增添或替换其他部件。
计算设备400可以是任何类型的静止或移动计算设备,包括移动计算机或移动计算设备(例如,平板计算机、个人数字助理、膝上型计算机、笔记本计算机、上网本等)、移动电话(例如,智能手机)、可佩戴的计算设备(例如,智能手表、智能眼镜等)或其他类型的移动设备,或者诸如台式计算机或PC的静止计算设备。计算设备400还可以是移动式或静止式的服务器。其中,处理器420执行所述计算机指令时实现所述的直播间人数统计方法的步骤。
上述为本实施例的一种计算设备的示意性方案。需要说明的是,该计算设备的技术方案与上述的直播间人数统计方法的技术方案属于同一构思,计算设备的技术方案未详细描述的细节内容,均可以参见上述直播间人数统计方法的技术方案的描述。
本申请一实施例还提供一种计算机可读存储介质,其存储有计算机指令,该计算机指令被处理器执行时实现如前所述直播间人数统计方法的步骤。
上述为本实施例的一种计算机可读存储介质的示意性方案。需要说明的是,该存储介质的技术方案与上述的直播间人数统计方法的技术方案属于同一构思,存储介质的技术方案未详细描述的细节内容,均可以参见上述直播间人数统计方法的技术方案的描述。
上述对本申请特定实施例进行了描述。其它实施例在所附权利要求书的范围内。在一些情况下,在权利要求书中记载的动作或步骤可以按照不同于实施例中的顺序来执行并且仍然可以实现期望的结果。另外,在附图中描绘的过程不一定要求示出的特定顺序或者连续顺序才能实现期望的结果。在某些实施方式中,多任务处理和并行处理也是可以的或者可能是有利的。
所述计算机指令包括计算机程序代码,所述计算机程序代码可以为源代码形式、对象代码形式、可执行文件或某些中间形式等。所述计算机可读介质可以包括:能够携带所述计算机程序代码的任何实体或装置、记录介质、U盘、移动硬盘、磁碟、光盘、计算机存储器、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、电载波信号、电信信号以及软件分发介质等。需要说明的是,所述计算机可读介质包含的内容可以根据司法管辖区内立法和专利实践的要求进行适当的增减,例如在某些司法管辖区,根据立法和专利实践,计算机可读介质不包括电载波信号和电信信号。
需要说明的是,对于前述的各方法实施例,为了简便描述,故将其都表述为一系列的动作组合,但是本领域技术人员应该知悉,本申请并不受所描述的动作顺序的限制,因为依据本申请,某些步骤可以采用其它顺序或者同时进行。其次,本领域技术人员也应该知悉,说明书中所描述的实施例均属于优选实施例,所涉及的动作和模块并不一定都是本申请所必须的。
在上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述的部分,可以参见其它实施例的相关描述。
以上公开的本申请优选实施例只是用于帮助阐述本申请。可选实施例并没有详尽叙述所有的细节,也不限制该发明仅为所述的具体实施方式。显然,根据本申请的内容,可作很多的修改和变化。本申请选取并具体描述这些实施例,是为了更好地解释本申请的原理和实际应用,从而使所属技术领域技术人员能很好地理解和利用本申请。本申请仅受权利要求书及其全部范围和等效物的限制。

Claims (14)

  1. 一种直播间人数统计方法,其特征在于,包括:
    获取多个刷量房间中每个刷量房间对应的房间属性信息;
    根据所述房间属性信息确定所述每个刷量房间对应的访问用户集合;
    根据多个所述访问用户集合确定异常用户集合;
    根据每个刷量房间对应的访问用户集合和所述异常用户集合统计每个刷量房间对应的真实用户数。
  2. 如权利要求1所述的直播间人数统计方法,其特征在于,根据所述房间属性信息确定所述每个刷量房间对应的访问用户集合,包括:
    确定目标刷量房间;
    根据所述目标刷量房间的房间属性信息在访问用户数据库中获取所述目标刷量房间对应的访问用户集合。
  3. 如权利要求1所述的直播间人数统计方法,其特征在于,根据多个所述访问用户集合确定异常用户集合,包括:
    在多个所述访问用户集合中确定至少两个待处理访问用户集合;
    确定至少两个待处理访问用户集合的交集,并根据所述交集确定异常用户集合,其中,异常用户集合中包括有异常操作的用户。
  4. 如权利要求3所述的直播间人数统计方法,其特征在于,确定至少两个待处理访问用户集合的交集,并根据所述交集确定异常用户集合,包括:
    根据每个待处理访问用户集合获取每个所述待处理访问用户集合对应的用户属性信息集合;
    确定每个待处理访问用户集合的用户属性信息集合的交集,确定异常用户属性信息集合;
    根据所述异常用户属性信息集合确定异常用户集合。
  5. 如权利要求1所述的直播间人数统计方法,其特征在于,在根据每个刷量房间对应的访问用户集合和所述异常用户集合统计每个刷量房间对应的真实用户数之前,还包括:根据所述异常用户集合获取异常用户数。
  6. 如权利要求5所述的直播间人数统计方法,其特征在于,根据每个刷量房间对应的访问用户集合和所述异常用户集合统计每个刷量房间对应的真实用户数,包括:
    确定目标刷量房间,根据所述目标刷量房间对应的访问用户集合获取所述目标刷量房间的访问用户数;
    根据所述访问用户数和所述异常用户数统计目标刷量房间对应的真实用户数。
  7. 如权利要求1所述的直播间人数统计方法,其特征在于,获取至少两个刷量房间的房间属性信息之前,还包括:
    根据预设防刷规则确定至少两个刷量房间。
  8. 如权利要求7所述的直播间人数统计方法,其特征在于,所述方法还包括:
    根据所述异常用户集合获取待确认刷量房间集合;
    根据所述待确认刷量房间集合调整所述预设防刷规则。
  9. 如权利要求8所述的直播间人数统计方法,其特征在于,根据所述异常用户集合获取待确认刷量房间集合,包括:
    根据所述异常用户集合获取待审核房间集合;
    在所述待审核房间集合中确定目标待审核房间;
    判断所述至少两个刷量房间中是否包括所述目标待审核房间;
    若否,则将所述目标待审核房间添加至待确认刷量房间集合;
    若是,则所述目标待审核房间为刷量房间。
  10. 如权利要求9所述的直播间人数统计方法,其特征在于,根据所述异常用户集合获取待审核房间集合,包括:
    在所述异常用户集合中确定目标异常用户;
    获取所述目标异常用户的目标访问信息;
    根据每个异常用户对应的目标访问信息获取待审核房间集合。
  11. 如权利要求1所述的直播间人数统计方法,其特征在于,所述房间属性信息包括房间标识信息、房间互动信息。
  12. 一种直播间人数统计装置,其特征在于,包括:
    获取模块,被配置为获取多个刷量房间中每个刷量房间对应的房间属性信息;
    第一确定模块,被配置为根据所述房间属性信息确定所述每个刷量房间对应的访问用户集合;
    第二确定模块,被配置为根据多个所述访问用户集合确定异常用户集合;
    统计模块,被配置为根据每个刷量房间对应的访问用户集合和所述异常用户集合统计每个刷量房间对应的真实用户数。
  13. 一种计算设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机指令,其特征在于,所述处理器执行所述计算机指令时实现权利要求1-11任意一项所述方法的步骤。
  14. 一种计算机可读存储介质,其存储有计算机指令,其特征在于,该计算机指令被处理器执行时实现权利要求1-11任意一项所述方法的步骤。
PCT/CN2022/144350 2022-02-09 2022-12-30 直播间人数统计方法及装置 WO2023151426A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202210122950.6 2022-02-09
CN202210122950.6A CN114466214B (zh) 2022-02-09 2022-02-09 直播间人数统计方法及装置

Publications (1)

Publication Number Publication Date
WO2023151426A1 true WO2023151426A1 (zh) 2023-08-17

Family

ID=81413174

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/144350 WO2023151426A1 (zh) 2022-02-09 2022-12-30 直播间人数统计方法及装置

Country Status (2)

Country Link
CN (1) CN114466214B (zh)
WO (1) WO2023151426A1 (zh)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114466214B (zh) * 2022-02-09 2023-05-02 上海哔哩哔哩科技有限公司 直播间人数统计方法及装置

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016061898A1 (zh) * 2014-10-20 2016-04-28 广州华多网络科技有限公司 直播间的频道访问方法和***
CN111339495A (zh) * 2020-02-18 2020-06-26 北京字节跳动网络技术有限公司 直播间在线人数的统计方法、装置、电子设备及存储介质
CN112995689A (zh) * 2021-02-24 2021-06-18 上海哔哩哔哩科技有限公司 确定直播间刷量的方法及装置
WO2021254080A1 (zh) * 2020-06-19 2021-12-23 百果园技术(新加坡)有限公司 直播补位的方法、装置、服务器和存储介质
CN114020996A (zh) * 2021-09-26 2022-02-08 武汉斗鱼网络科技有限公司 一种识别异常观看用户的方法、装置、介质及计算机设备
CN114466214A (zh) * 2022-02-09 2022-05-10 上海哔哩哔哩科技有限公司 直播间人数统计方法及装置

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109429082B (zh) * 2017-08-31 2020-10-16 武汉斗鱼网络科技有限公司 直播人气检测方法、存储介质、电子设备及***
CN113383362B (zh) * 2019-06-24 2022-05-13 深圳市欢太科技有限公司 用户识别方法及相关产品
CN110519263B (zh) * 2019-08-26 2022-05-17 北京百度网讯科技有限公司 防刷量方法、装置、设备及计算机可读存储介质
CN113947139A (zh) * 2021-10-13 2022-01-18 咪咕视讯科技有限公司 一种用户的识别方法、装置及设备

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016061898A1 (zh) * 2014-10-20 2016-04-28 广州华多网络科技有限公司 直播间的频道访问方法和***
CN111339495A (zh) * 2020-02-18 2020-06-26 北京字节跳动网络技术有限公司 直播间在线人数的统计方法、装置、电子设备及存储介质
WO2021254080A1 (zh) * 2020-06-19 2021-12-23 百果园技术(新加坡)有限公司 直播补位的方法、装置、服务器和存储介质
CN112995689A (zh) * 2021-02-24 2021-06-18 上海哔哩哔哩科技有限公司 确定直播间刷量的方法及装置
CN114020996A (zh) * 2021-09-26 2022-02-08 武汉斗鱼网络科技有限公司 一种识别异常观看用户的方法、装置、介质及计算机设备
CN114466214A (zh) * 2022-02-09 2022-05-10 上海哔哩哔哩科技有限公司 直播间人数统计方法及装置

Also Published As

Publication number Publication date
CN114466214B (zh) 2023-05-02
CN114466214A (zh) 2022-05-10

Similar Documents

Publication Publication Date Title
EP2506522B1 (en) Method and device for pushing data
JP4837954B2 (ja) インテリジェントデータブロードキャスト
CN106033419B (zh) 一种消息实时推送方法、装置及***
US9355138B2 (en) Methods and apparatus to obtain anonymous audience measurement data from network server data for particular demographic and usage profiles
CN111371672B (zh) 消息推送方法及装置
US8396874B2 (en) System and method for using topic messages to understand media relating to an event
CN103716338B (zh) 一种信息推送方法和装置
WO2019169978A1 (zh) 资源推荐方法及装置
AU2013204865A1 (en) Methods and apparatus to share online media impressions data
WO2022193835A1 (zh) 数据处理方法、直播方法、鉴权服务器及直播数据服务器
JP6694962B2 (ja) メディア情報提示方法、サーバ及び記憶媒体
CN107733786B (zh) 一种好友推荐的方法及设备
US10331677B1 (en) Contextual search using database indexes
CN113055701B (zh) 数据处理方法及装置
CN108600083B (zh) 消息提醒方法及设备
TWI629899B (zh) Method and device for evaluating quality of multimedia resources
WO2023151426A1 (zh) 直播间人数统计方法及装置
WO2023077813A1 (zh) 确定直播间刷量的方法及装置
CN109688217B (zh) 一种消息推送方法、装置及电子设备
US20220198050A1 (en) Privacy protected database querying
CN111078651A (zh) 统计对象存储的使用量的方法及装置
WO2021104513A1 (zh) 对象展示方法、装置、电子设备及存储介质
WO2023179162A1 (zh) 数据处理方法及装置
WO2013097622A1 (zh) 在移动搜索***中展示移动媒介信息的方法、装置以及机器可读介质
CN113840157B (zh) 访问检测方法、***及装置

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22925775

Country of ref document: EP

Kind code of ref document: A1