CN109361564A - Internet data acquisition method and device based on the passive data fusion of master - Google Patents

Internet data acquisition method and device based on the passive data fusion of master Download PDF

Info

Publication number
CN109361564A
CN109361564A CN201811294367.3A CN201811294367A CN109361564A CN 109361564 A CN109361564 A CN 109361564A CN 201811294367 A CN201811294367 A CN 201811294367A CN 109361564 A CN109361564 A CN 109361564A
Authority
CN
China
Prior art keywords
data
layer data
target user
behavior
content
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811294367.3A
Other languages
Chinese (zh)
Inventor
袁振龙
王嘉正
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Duoxing Technology Co Ltd
Tsinghua University
Original Assignee
Beijing Duoxing Technology Co Ltd
Tsinghua University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Duoxing Technology Co Ltd, Tsinghua University filed Critical Beijing Duoxing Technology Co Ltd
Priority to CN201811294367.3A priority Critical patent/CN109361564A/en
Publication of CN109361564A publication Critical patent/CN109361564A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/06Generation of reports
    • H04L43/062Generation of reports related to network traffic
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/06Generation of reports
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/06Generation of reports
    • H04L43/067Generation of reports using time frame reporting

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The embodiment of the present invention provides a kind of internet data acquisition method and device based on the passive data fusion of master, the described method includes: obtaining the behavior layer data of target user, and obtain the content layer data of the target user, the behavior layer data include that behavior executes the time, and the content layer data include data generation time;Time and the data generation time are executed according to the identity of the target user, the behavior, by the behavior layer data and the content layer data fusion at complete data.Internet data acquisition method and device provided in an embodiment of the present invention based on the passive data fusion of master, the content layer data effective integration that the behavior layer data of passive data acquisition and active data are obtained, under the premise of not being related to user content privacy, realize the complete acquisition to internet behavior layer data and content layer data, the data value of collected user data is improved, more valuable data source is provided for big data analysis.

Description

Internet data acquisition method and device based on the passive data fusion of master
Technical field
The present embodiments relate to network securitys and big data technical field, more particularly to a kind of passive data of master that are based on to melt The internet data acquisition method and device of conjunction.
Background technique
It is further perfect with entire big data industry, internet big data be increasingly becoming influence current digital society with The keystone resources of economic development.The key of big data analysis is to acquire valuable data from the data of magnanimity, then The data of acquisition can be analyzed.In view of each Internet company is to the closure of oneself possessed mass data, third party Data acquisition becomes the key element for influencing internet big data Industry Quick Development.The depth according to representated by data and meaning Justice, third party's data inside are mainly divided into behavior layer data and content layer data, wherein behavior layer data refers to internet Behavior (Action) data of user, for example click some button or browse some page;Content layer data is referred to mutually The data such as contents attribute, the field generated in on-line customer's behavior generating process, such as title, the note of content, article commented on The content etc. of son.Behavior layer data only relates to behavior of user itself, is not related to the content layer data of user.
In the prior art, there are two types of the modes for obtaining third party's internet data, first is that the side obtained by active data Formula crawls the public data of different Internet companies using web crawlers as technical way;Second is that passing through passive data acquisition Mode parse the communication data of different Internet companies using network packet detection as technical way.For example, Active data obtains or passive data acquisition, can acquire the internet data money of Sina weibo, today's tops etc. Source, difference mainly have two o'clock: (1) active data acquisition can only collect the content-data published on the internet, nothing The passive data acquisition of the image of Buddha is the same, collects the behavioral datas such as the background user click transmitted in communication network or browsing, because In the level of behavioral data acquisition, passive data acquisition occupies advantage for this;(2) it is passed in recent years in the data based on network communication During defeated, the encrypted transmissions such as https are more and more common, and thus the mode of caused passive data acquisition, fundamentally can not Continue the data resource for being resolved to internet content level in such a way that depth data detective is surveyed, Internet user's row can only be obtained For the behavioral data resource of level, for example any operation is performed in wechat, beat voip phone and still browse circle of friends, that is, It can only acquire whether behavior act occurs, VoIP voice content and circle of friends message can not be parsed, executed in microblogging for another example What operation, posts or uploading pictures, that is, can only acquire whether behavior act occurs, can not parse model and picture Content.Therefore, in the level of content-data acquisition, active data obtains often more dominant.
For to sum up, active data obtains the data acquisition for being often suitble to internet content data plane, can not obtain row For the data of data plane;Passive data acquisition is often suitble to the data acquisition of internet behavioral data level, can not obtain interior Hold the data of data plane.The mode of active and passive data acquisition respectively has superiority and inferiority, a kind of simple data acquisition completeness of mode compared with Difference.
Summary of the invention
A kind of overcome the above problem the purpose of the embodiment of the present invention is that providing or at least be partially solved the above problem Internet data acquisition method and device based on the passive data fusion of master.
In order to solve the above-mentioned technical problem, on the one hand, the embodiment of the present invention provides a kind of based on the passive data fusion of master Internet data acquisition method, comprising:
The behavior layer data of target user is obtained, and obtains the content layer data of the target user, the behavior number of plies The time is executed according to comprising behavior, the content layer data include data generation time;
Time and the data generation time are executed according to the identity of the target user, the behavior, it will be described Behavior layer data and the content layer data fusion are at complete data.
On the other hand, the embodiment of the present invention provides a kind of internet data acquisition device based on the passive data fusion of master, Include:
Module is obtained, for obtaining the behavior layer data of target user, and obtains the content layer data of the target user, The behavior layer data include that behavior executes the time, and the content layer data include data generation time;
Fusion Module executes the time for identity, the behavior according to the target user and the data produces The raw time, by the behavior layer data and the content layer data fusion at complete data.
In another aspect, the embodiment of the present invention provides a kind of electronic equipment, comprising:
Memory and processor, the processor and the memory complete mutual communication by bus;It is described to deposit Reservoir is stored with the program instruction that can be executed by the processor, and it is above-mentioned that the processor calls described program instruction to be able to carry out Method.
Another aspect, the embodiment of the present invention provide a kind of non-transient computer readable storage medium, are stored thereon with calculating Machine program realizes above-mentioned method when the computer program is executed by processor.
Internet data acquisition method and device provided in an embodiment of the present invention based on the passive data fusion of master, will be passive The content layer data effective integration that the behavior layer data and active data of data acquisition obtain, be not related to user content privacy Under the premise of, the complete acquisition to internet behavior layer data and content layer data is realized, collected user data is improved Data value, more valuable data source is provided for big data analysis.
Detailed description of the invention
Fig. 1 is the internet data acquisition method schematic diagram provided in an embodiment of the present invention based on the passive data fusion of master;
Fig. 2 is the internet data acquisition device schematic diagram provided in an embodiment of the present invention based on the passive data fusion of master;
Fig. 3 is the structural schematic diagram of electronic equipment provided in an embodiment of the present invention.
Specific embodiment
In order to keep the purposes, technical schemes and advantages of the embodiment of the present invention clearer, implement below in conjunction with the present invention Attached drawing in example, technical scheme in the embodiment of the invention is clearly and completely described, it is clear that described embodiment It is a part of the embodiment of the present invention, instead of all the embodiments.Based on the embodiment of the present invention, those of ordinary skill in the art Every other embodiment obtained without making creative work, shall fall within the protection scope of the present invention.
Fig. 1 is the internet data acquisition method schematic diagram provided in an embodiment of the present invention based on the passive data fusion of master, As shown in Figure 1, the embodiment of the present invention provides a kind of internet data acquisition method based on the passive data fusion of master, master is executed Body be the internet data acquisition device based on the passive data fusion of master, abbreviation data acquisition device, this method comprises:
Step S101, the behavior layer data of target user is obtained, and obtains the content layer data of the target user, it is described Behavior layer data include that behavior executes the time, and the content layer data include data generation time;
Step S102, when executing time and data generation according to the identity of the target user, the behavior Between, by the behavior layer data and the content layer data fusion at complete data.
Specifically, firstly, according to the identity of target user, the behavior layer data of target user is obtained, and is obtained The content layer data of the target user executes time etc. comprising user behavior and behavior in behavior layer data, in content layer data Include data content and data generation time.The behavior execution time refers to that the target user executes certain on targeted internet platform The time of one behavior, for example, click some button, post or browse some page etc. the time of behavior.Data generate Time refers to the time of the content of the target user publication of targeted internet platform record, for example, the content of the comment of record, The time of the content of the title of article, model etc. content.
Then, time and data generation time are executed according to the identity of target user, behavior, by behavior layer data and Content layer data correlation, and save.By same user, behavior executes time and identical two levels of data generation time Data fusion collect complete user data to realize the fusion of behavior layer data and content layer data.
For example, Sina weibo user Shanghai Ah sweet (ID:987654321) 38 divides at the morning 9 on the 10th in October Beijing time Sina weibo App is logged in;And 40 divides when 9 and perform the user behavior of one " posting ", but due to network communication encryption Reason can not find out this model content information posted.
After detecting above-mentioned " posting " behavior by passive network data acquisition mode, acquired by Active Networks data Mode, on the website of Sina weibo, tracking open Sina weibo user@Shanghai Ah sweet (ID:987654321) individual Homepage obtains the account and 40 divides the corresponding model issued at the morning 9 on the 10th in October Beijing time, and parses, acquires model Content-data, since the content-data of the model belongs to the full disclosure content of Sina weibo platform, it is not related to user's Private data.
After the completion of above-mentioned passive, active data acquisition step, the pet name (Shanghai@of the Sina weibo account can be passed through A Gan) or ID (987654321) closes the action behavior posted (include two behaviors: logging in and post) with data such as contents UNPROFOR is deposited, and realizes that the data based on the passive data fusion of master acquire purpose.Above-mentioned Overall Steps can be achieved automated data and adopt Collection, association and completion.
Internet data acquisition method provided in an embodiment of the present invention based on the passive data fusion of master, passive data are obtained The content layer data effective integration that the behavior layer data and active data taken obtains, be not related to the premise of user content privacy Under, the complete acquisition to internet behavior layer data and content layer data is realized, the number of collected user data is improved According to value, more valuable data source is provided for big data analysis.
On the basis of the above embodiments, further, the behavior layer data for obtaining target user, specifically includes:
By being detected and being parsed to network packet, the behavior layer data of the target user is obtained.
Specifically, passive network data acquisition mode obtains the behavior layer data of target user, mainly passes through feature The technological means of identification, detects network packet and is parsed, and acquires the complete of user's access target internet platform in real time Portion's network communication data is obtained comprising using the different user pet name/ID as the fine-grained user behavior of the different user of identity Data.
Internet data acquisition method provided in an embodiment of the present invention based on the passive data fusion of master, passive data are obtained The content layer data effective integration that the behavior layer data and active data taken obtains, be not related to the premise of user content privacy Under, the complete acquisition to internet behavior layer data and content layer data is realized, the number of collected user data is improved According to value, more valuable data source is provided for big data analysis.
It is further, described by being detected and being parsed to network packet on the basis of the above various embodiments, it obtains The behavior layer data for taking the target user, specifically includes:
Obtain overall network data on flows;
The total data of targeted internet platform, the targeted internet are filtered out from the overall network data on flows Platform is that the target user issues the internet platform used when the content layer data;
The identity of the target user is extracted from the total data of the targeted internet platform, and described in acquisition The behavior layer data of target user.
Specifically, by being detected and being parsed to network packet, the tool of the behavior layer data of target user is obtained Body step includes:
When needing to obtain the behavior layer data of user, firstly, capture user's overall network data on flows.
Then, the total data of targeted internet platform, target interconnection are filtered out from the overall network data on flows Net platform is that target user issues the internet platform used when content layer data.
Finally, extracting the identity of the target user from the total data of targeted internet platform, and obtain the mesh The behavior layer data for marking user, identifies the behavior of target user.
Internet data acquisition method provided in an embodiment of the present invention based on the passive data fusion of master, passive data are obtained The content layer data effective integration that the behavior layer data and active data taken obtains, be not related to the premise of user content privacy Under, the complete acquisition to internet behavior layer data and content layer data is realized, the number of collected user data is improved According to value, more valuable data source is provided for big data analysis.
On the basis of the above various embodiments, further, the content layer data for obtaining the target user, specifically Include:
The content layer data of the target user is obtained by web crawlers.
Specifically, Active Networks data acquisition modes obtain the content layer data of target user, mainly pass through network Crawler technology means crawl the content layer of target user when target user issues content layer data on the internet platform that uses Data.
Internet data acquisition method provided in an embodiment of the present invention based on the passive data fusion of master, passive data are obtained The content layer data effective integration that the behavior layer data and active data taken obtains, be not related to the premise of user content privacy Under, the complete acquisition to internet behavior layer data and content layer data is realized, the number of collected user data is improved According to value, more valuable data source is provided for big data analysis.
It is further, described that the interior of the target user is obtained by web crawlers on the basis of the above various embodiments Hold layer data, specifically include:
According to the identity of the target user, the target user disclosed number on targeted internet platform is locked According to the page, the targeted internet platform is that the target user issues the internet platform used when the content layer data;
The content layer data of the target user is parsed from the page of data.
Specifically, include: by the specific steps of the content layer data of web crawlers acquisition target user
Firstly, it is disclosed on targeted internet platform to lock the target user according to the identity of the target user Page of data, the targeted internet platform are that target user issues the internet platform used when content layer data.
Then, it in disclosed page of data, crawls on targeted internet platform from the target user, parse the target The content layer data of user.
Internet data acquisition method provided in an embodiment of the present invention based on the passive data fusion of master, passive data are obtained The content layer data effective integration that the behavior layer data and active data taken obtains, be not related to the premise of user content privacy Under, the complete acquisition to internet behavior layer data and content layer data is realized, the number of collected user data is improved According to value, more valuable data source is provided for big data analysis.
Fig. 2 is the internet data acquisition device schematic diagram provided in an embodiment of the present invention based on the passive data fusion of master, As shown in Fig. 2, the embodiment of the present invention provides a kind of internet data acquisition device based on the passive data fusion of master, for executing Any of the above-described method as described in the examples specifically includes and obtains module 201 and Fusion Module 202, in which:
The behavior layer data that module 201 is used to obtain target user is obtained, and obtains the content number of plies of the target user According to the behavior layer data include that behavior executes the time, and the content layer data include data generation time;Fusion Module 202 For executing time and the data generation time according to identity, the behavior of the target user, by the behavior Layer data and the content layer data fusion are at complete data.
Specifically, firstly, obtaining the row of target user by obtaining module 201 according to the identity of target user For layer data, and the content layer data of the target user is obtained, executes the time comprising user behavior and behavior in behavior layer data Deng comprising data content and data generation time in content layer data.The behavior execution time refers to that the target user is mutual in target The time of a certain behavior is executed in networked platforms, for example, click some button, post or browse some page etc. behavior Time.Data generation time refers to the time of the content of the target user publication of targeted internet platform record, for example, note The time of the content of the comment of record, the content of the title of article, model etc. content.
Then, when executing time and data generation according to identity, the behavior of target user by Fusion Module 202 Between, behavior layer data and content layer data is associated with, and saves.By same user, behavior executes the time and data generate The data fusion of time identical two levels collects complete to realize the fusion of behavior layer data and content layer data User data.
For example, Sina weibo user Shanghai Ah sweet (ID:987654321) 38 divides at the morning 9 on the 10th in October Beijing time Sina weibo App is logged in;And 40 divides when 9 and perform the user behavior of one " posting ", but due to network communication encryption Reason can not find out this model content information posted.
After detecting above-mentioned " posting " behavior by passive network data acquisition mode, acquired by Active Networks data Mode, on the website of Sina weibo, tracking open Sina weibo user@Shanghai Ah sweet (ID:987654321) individual Homepage obtains the account and 40 divides the corresponding model issued at the morning 9 on the 10th in October Beijing time, and parses, acquires model Content-data, since the content-data of the model belongs to the full disclosure content of Sina weibo platform, it is not related to user's Private data.
After the completion of above-mentioned passive, active data acquisition step, the pet name (Shanghai@of the Sina weibo account can be passed through A Gan) or ID (987654321) closes the action behavior posted (include two behaviors: logging in and post) with data such as contents UNPROFOR is deposited, and realizes that the data based on the passive data fusion of master acquire purpose.Above-mentioned Overall Steps can be achieved automated data and adopt Collection, association and completion.
The embodiment of the present invention provides a kind of internet data acquisition device based on the passive data fusion of master, for executing Method described in any embodiment is stated, the device provided through this embodiment executes above-mentioned a certain method as described in the examples Specific steps it is identical as above-mentioned corresponding embodiment, details are not described herein again.
Internet data acquisition device provided in an embodiment of the present invention based on the passive data fusion of master, passive data are obtained The content layer data effective integration that the behavior layer data and active data taken obtains, be not related to the premise of user content privacy Under, the complete acquisition to internet behavior layer data and content layer data is realized, the number of collected user data is improved According to value, more valuable data source is provided for big data analysis.
Fig. 3 is the structural schematic diagram of electronic equipment provided in an embodiment of the present invention, as shown in figure 3, the equipment includes: place Manage device 301, memory 302 and bus 303;
Wherein, processor 301 and memory 302 complete mutual communication by the bus 303;
Processor 301 is used to call the program instruction in memory 302, to execute provided by above-mentioned each method embodiment Method, for example,
The behavior layer data of target user is obtained, and obtains the content layer data of the target user, the behavior number of plies The time is executed according to comprising behavior, the content layer data include data generation time;
Time and the data generation time are executed according to the identity of the target user, the behavior, it will be described Behavior layer data and the content layer data fusion are at complete data.
The embodiment of the present invention provides a kind of computer program product, and the computer program product is non-transient including being stored in Computer program on computer readable storage medium, the computer program include program instruction, when described program instructs quilt When computer executes, computer is able to carry out method provided by above-mentioned each method embodiment, for example,
The behavior layer data of target user is obtained, and obtains the content layer data of the target user, the behavior number of plies The time is executed according to comprising behavior, the content layer data include data generation time;
Time and the data generation time are executed according to the identity of the target user, the behavior, it will be described Behavior layer data and the content layer data fusion are at complete data.
The embodiment of the present invention provides a kind of non-transient computer readable storage medium, the non-transient computer readable storage Medium storing computer instruction, the computer instruction make the computer execute side provided by above-mentioned each method embodiment Method, for example,
The behavior layer data of target user is obtained, and obtains the content layer data of the target user, the behavior number of plies The time is executed according to comprising behavior, the content layer data include data generation time;
Time and the data generation time are executed according to the identity of the target user, the behavior, it will be described Behavior layer data and the content layer data fusion are at complete data.
Those of ordinary skill in the art will appreciate that: realize that all or part of the steps of above method embodiment can pass through The relevant hardware of program instruction is completed, and program above-mentioned can be stored in a computer readable storage medium, the program When being executed, step including the steps of the foregoing method embodiments is executed;And storage medium above-mentioned includes: ROM, RAM, magnetic disk or light The various media that can store program code such as disk.
The embodiments such as device and equipment described above are only schematical, wherein described be used as separate part description Unit may or may not be physically separated, component shown as a unit may or may not be Physical unit, it can it is in one place, or may be distributed over multiple network units.It can be according to the actual needs Some or all of the modules therein is selected to achieve the purpose of the solution of this embodiment.Those of ordinary skill in the art are not paying In the case where creative labor, it can understand and implement.
Through the above description of the embodiments, those skilled in the art can be understood that each embodiment can It realizes by means of software and necessary general hardware platform, naturally it is also possible to pass through hardware.Based on this understanding, on Stating technical solution, substantially the part that contributes to existing technology can be embodied in the form of software products in other words, should Computer software product may be stored in a computer readable storage medium, such as ROM/RAM, magnetic disk, CD, including several fingers It enables and using so that a computer equipment (can be personal computer, server or the network equipment etc.) executes each implementation Method described in certain parts of example or embodiment.
Finally, it should be noted that the above embodiments are merely illustrative of the technical solutions of the present invention, rather than its limitations;Although Present invention has been described in detail with reference to the aforementioned embodiments, those skilled in the art should understand that: it still may be used To modify the technical solutions described in the foregoing embodiments or equivalent replacement of some of the technical features; And these are modified or replaceed, technical solution of various embodiments of the present invention that it does not separate the essence of the corresponding technical solution spirit and Range.

Claims (10)

1. a kind of internet data acquisition method based on the passive data fusion of master characterized by comprising
The behavior layer data of target user is obtained, and obtains the content layer data of the target user, the behavior layer data packet The time is executed containing behavior, the content layer data include data generation time;
Time and the data generation time are executed according to the identity of the target user, the behavior, by the behavior Layer data and the content layer data fusion are at complete data.
2. the method according to claim 1, wherein the behavior layer data for obtaining target user, specific to wrap It includes:
By being detected and being parsed to network packet, the behavior layer data of the target user is obtained.
3. according to the method described in claim 2, it is characterized in that, described by being detected and being parsed to network packet, The behavior layer data for obtaining the target user, specifically includes:
Obtain overall network data on flows;
The total data of targeted internet platform, the targeted internet platform are filtered out from the overall network data on flows It is that the target user issues the internet platform used when the content layer data;
The identity of the target user is extracted from the total data of the targeted internet platform, and obtains the target The behavior layer data of user.
4. the method according to claim 1, wherein the content layer data for obtaining the target user, tool Body includes:
The content layer data of the target user is obtained by web crawlers.
5. according to the method described in claim 4, it is characterized in that, described obtain the interior of the target user by web crawlers Hold layer data, specifically include:
According to the identity of the target user, the target user disclosed data page on targeted internet platform is locked Face, the targeted internet platform are that the target user issues the internet platform used when the content layer data;
The content layer data of the target user is parsed from the page of data.
6. a kind of internet data acquisition device based on the passive data fusion of master characterized by comprising
Module is obtained, for obtaining the behavior layer data of target user, and obtains the content layer data of the target user, it is described Behavior layer data include that behavior executes the time, and the content layer data include data generation time;
Fusion Module, when executing time and data generation for identity, the behavior according to the target user Between, by the behavior layer data and the content layer data fusion at complete data.
7. device according to claim 6, which is characterized in that the behavior layer data for obtaining target user is specific to wrap It includes:
By being detected and being parsed to network packet, the behavior layer data of the target user is obtained.
8. device according to claim 6, which is characterized in that the content layer data for obtaining the target user, tool Body includes:
The content layer data of the target user is obtained by web crawlers.
9. a kind of electronic equipment characterized by comprising
Memory and processor, the processor and the memory complete mutual communication by bus;The memory It is stored with the program instruction that can be executed by the processor, the processor calls described program instruction to be able to carry out right such as and wants Seek 1 to 5 any method.
10. a kind of non-transient computer readable storage medium, is stored thereon with computer program, which is characterized in that when the meter When calculation machine program is executed by processor, method as claimed in claim 1 to 5 is realized.
CN201811294367.3A 2018-11-01 2018-11-01 Internet data acquisition method and device based on the passive data fusion of master Pending CN109361564A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811294367.3A CN109361564A (en) 2018-11-01 2018-11-01 Internet data acquisition method and device based on the passive data fusion of master

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811294367.3A CN109361564A (en) 2018-11-01 2018-11-01 Internet data acquisition method and device based on the passive data fusion of master

Publications (1)

Publication Number Publication Date
CN109361564A true CN109361564A (en) 2019-02-19

Family

ID=65343825

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811294367.3A Pending CN109361564A (en) 2018-11-01 2018-11-01 Internet data acquisition method and device based on the passive data fusion of master

Country Status (1)

Country Link
CN (1) CN109361564A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111555988A (en) * 2020-04-26 2020-08-18 深圳供电局有限公司 Big data-based network asset mapping and discovering method and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103136253A (en) * 2011-11-30 2013-06-05 腾讯科技(深圳)有限公司 Method and device of acquiring information
CN103389999A (en) * 2012-05-11 2013-11-13 中国人民大学 Method for incrementally grabbing microblog information
CN106844588A (en) * 2017-01-11 2017-06-13 上海斐讯数据通信技术有限公司 A kind of analysis method and system of the user behavior data based on web crawlers
CN108173692A (en) * 2017-12-28 2018-06-15 山东华软金盾软件股份有限公司 It is a kind of based on the whole network equipment sensory perceptual system being actively and passively combined and cognitive method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103136253A (en) * 2011-11-30 2013-06-05 腾讯科技(深圳)有限公司 Method and device of acquiring information
CN103389999A (en) * 2012-05-11 2013-11-13 中国人民大学 Method for incrementally grabbing microblog information
CN106844588A (en) * 2017-01-11 2017-06-13 上海斐讯数据通信技术有限公司 A kind of analysis method and system of the user behavior data based on web crawlers
CN108173692A (en) * 2017-12-28 2018-06-15 山东华软金盾软件股份有限公司 It is a kind of based on the whole network equipment sensory perceptual system being actively and passively combined and cognitive method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
耿乐群: "基于主动搜索的论坛内容监管技术研究", 《中国优秀硕士学位论文全文数据库》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111555988A (en) * 2020-04-26 2020-08-18 深圳供电局有限公司 Big data-based network asset mapping and discovering method and device
CN111555988B (en) * 2020-04-26 2023-11-03 深圳供电局有限公司 Network asset mapping discovery method and device based on big data

Similar Documents

Publication Publication Date Title
Javed et al. A comprehensive survey on computer forensics: State-of-the-art, tools, techniques, challenges, and future directions
US10795992B2 (en) Self-adaptive application programming interface level security monitoring
Reedy Interpol review of digital evidence 2016-2019
van Baar et al. Digital forensics as a service: A game changer
Gupta et al. PHP-sensor: a prototype method to discover workflow violation and XSS vulnerabilities in PHP web applications
Inel et al. Crowdtruth: Machine-human computation framework for harnessing disagreement in gathering annotated data
CN104717185B (en) Displaying response method, device, server and the system of short uniform resource locator
CN104144142B (en) A kind of Web bug excavation methods and system
CN110119469A (en) A kind of data collection and transmission and method towards darknet
CN101345751B (en) Identifying application user as source of database activity
Khanafseh et al. A survey of various frameworks and solutions in all branches of digital forensics with a focus on cloud forensics
CN105095207B (en) Retrieval, the method and apparatus for obtaining application software content
CN112632135A (en) Big data platform
CN109587125A (en) Network security big data analysis method, system and related device
CN109710440A (en) Abnormality eliminating method, device, storage medium and the terminal device of webpage front-end
CN109756467A (en) A kind of recognition methods of fishing website and device
US20180316702A1 (en) Detecting and mitigating leaked cloud authorization keys
CN107784113A (en) Html web page collecting method, device and computer-readable recording medium
Faiella et al. Enriching Threat Intelligence Platforms Capabilities.
CN109710667A (en) A kind of shared realization method and system of the multisource data fusion based on big data platform
CN109361564A (en) Internet data acquisition method and device based on the passive data fusion of master
Jimenez et al. A framework for SDN forensic readiness and cybersecurity incident response
CN108540471B (en) Mobile application network traffic clustering method, computer readable storage medium and terminal
CN104462392A (en) Statistical method and statistical device for sharing return traffic
CN105763530A (en) Web-based threat information acquisition system and method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20190219