CN111460253A - Internet data capture method suitable for big data analysis - Google Patents
Internet data capture method suitable for big data analysis Download PDFInfo
- Publication number
- CN111460253A CN111460253A CN202010212831.0A CN202010212831A CN111460253A CN 111460253 A CN111460253 A CN 111460253A CN 202010212831 A CN202010212831 A CN 202010212831A CN 111460253 A CN111460253 A CN 111460253A
- Authority
- CN
- China
- Prior art keywords
- data
- information
- internet
- screening
- method suitable
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 26
- 238000007405 data analysis Methods 0.000 title claims abstract description 20
- 238000013481 data capture Methods 0.000 title claims description 16
- 238000012216 screening Methods 0.000 claims abstract description 20
- 238000004364 calculation method Methods 0.000 claims description 4
- 230000009193 crawling Effects 0.000 claims description 4
- 238000011161 development Methods 0.000 abstract description 4
- 230000009286 beneficial effect Effects 0.000 abstract 1
- 238000005516 engineering process Methods 0.000 description 26
- 238000013473 artificial intelligence Methods 0.000 description 1
- 238000013523 data management Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Description
Claims (7)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010212831.0A CN111460253A (en) | 2020-03-24 | 2020-03-24 | Internet data capture method suitable for big data analysis |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010212831.0A CN111460253A (en) | 2020-03-24 | 2020-03-24 | Internet data capture method suitable for big data analysis |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111460253A true CN111460253A (en) | 2020-07-28 |
Family
ID=71685700
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010212831.0A Pending CN111460253A (en) | 2020-03-24 | 2020-03-24 | Internet data capture method suitable for big data analysis |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111460253A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113064947A (en) * | 2021-04-08 | 2021-07-02 | 深圳石方数链科技有限公司 | Customer data protection system based on customer management system |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102208992A (en) * | 2010-06-13 | 2011-10-05 | 天津海量信息技术有限公司 | Internet-facing filtration system of unhealthy information and method thereof |
CN104063448A (en) * | 2014-06-18 | 2014-09-24 | 华东师范大学 | Distributed type microblog data capturing system related to field of videos |
GB201507530D0 (en) * | 2015-05-01 | 2015-06-17 | Salesoptimize Ltd | Computer-implemented methods of website analysis |
CN105117484A (en) * | 2015-09-17 | 2015-12-02 | 广州银讯信息科技有限公司 | Internet public opinion monitoring method and system |
CN105893368A (en) * | 2014-11-19 | 2016-08-24 | 北京航天长峰科技工业集团有限公司 | Multilingual online public opinion analysis method |
CN106960063A (en) * | 2017-04-20 | 2017-07-18 | 广州优亚信息技术有限公司 | A kind of internet information crawl and commending system for field of inviting outside investment |
CN109063054A (en) * | 2018-07-19 | 2018-12-21 | 天津迈基生物科技有限公司 | A kind of machine learning and big data processing system |
CN109255063A (en) * | 2018-08-01 | 2019-01-22 | 宜人恒业科技发展(北京)有限公司 | A kind of method and apparatus crawling web page contents |
CN110321471A (en) * | 2019-04-19 | 2019-10-11 | 四川政资汇智能科技有限公司 | A kind of internet techno-financial intelligent Matching method based on the convergence of policy resource |
-
2020
- 2020-03-24 CN CN202010212831.0A patent/CN111460253A/en active Pending
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102208992A (en) * | 2010-06-13 | 2011-10-05 | 天津海量信息技术有限公司 | Internet-facing filtration system of unhealthy information and method thereof |
CN104063448A (en) * | 2014-06-18 | 2014-09-24 | 华东师范大学 | Distributed type microblog data capturing system related to field of videos |
CN105893368A (en) * | 2014-11-19 | 2016-08-24 | 北京航天长峰科技工业集团有限公司 | Multilingual online public opinion analysis method |
GB201507530D0 (en) * | 2015-05-01 | 2015-06-17 | Salesoptimize Ltd | Computer-implemented methods of website analysis |
CN105117484A (en) * | 2015-09-17 | 2015-12-02 | 广州银讯信息科技有限公司 | Internet public opinion monitoring method and system |
CN106960063A (en) * | 2017-04-20 | 2017-07-18 | 广州优亚信息技术有限公司 | A kind of internet information crawl and commending system for field of inviting outside investment |
CN109063054A (en) * | 2018-07-19 | 2018-12-21 | 天津迈基生物科技有限公司 | A kind of machine learning and big data processing system |
CN109255063A (en) * | 2018-08-01 | 2019-01-22 | 宜人恒业科技发展(北京)有限公司 | A kind of method and apparatus crawling web page contents |
CN110321471A (en) * | 2019-04-19 | 2019-10-11 | 四川政资汇智能科技有限公司 | A kind of internet techno-financial intelligent Matching method based on the convergence of policy resource |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113064947A (en) * | 2021-04-08 | 2021-07-02 | 深圳石方数链科技有限公司 | Customer data protection system based on customer management system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107888574B (en) | Method, server and storage medium for detecting database risk | |
CN111245793A (en) | Method and device for analyzing abnormity of network data | |
CN113098870A (en) | Phishing detection method and device, electronic equipment and storage medium | |
CN108023868B (en) | Malicious resource address detection method and device | |
CN109347808B (en) | Safety analysis method based on user group behavior activity | |
CN115134099B (en) | Network attack behavior analysis method and device based on full flow | |
CN109756467B (en) | Phishing website identification method and device | |
CN108229170B (en) | Software analysis method and apparatus using big data and neural network | |
KR101692982B1 (en) | Automatic access control system of detecting threat using log analysis and automatic feature learning | |
CN113409555B (en) | Real-time alarm linkage method and system based on Internet of things | |
CN105516128A (en) | Detecting method and device of Web attack | |
CN108337269A (en) | A kind of WebShell detection methods | |
CN113572757B (en) | Server access risk monitoring method and device | |
CN109657119A (en) | A kind of web crawlers detection method based on access log IP analysis | |
CN112839014A (en) | Method, system, device and medium for establishing model for identifying abnormal visitor | |
CN115982762A (en) | Big data based data security leakage-proof management method, system and medium | |
CN113918938A (en) | User entity behavior analysis method and system of continuous immune safety system | |
CN111460253A (en) | Internet data capture method suitable for big data analysis | |
CN117609992A (en) | Data disclosure detection method, device and storage medium | |
CN112528325B (en) | Data information security processing method and system | |
CN113923037B (en) | Anomaly detection optimization device, method and system based on trusted computing | |
CN113688346A (en) | Illegal website identification method, device, equipment and storage medium | |
CN105205134B (en) | Identify that user clicks the method and device of access website behavior | |
CN114389875A (en) | Man-machine behavior detection method, system, equipment and medium | |
CN113132340B (en) | Phishing website identification method based on vision and host characteristics and electronic device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB03 | Change of inventor or designer information |
Inventor after: Xiang Hui Inventor after: Zhang Yongli Inventor after: Su Ruiqing Inventor after: Zhang Hongyuan Inventor after: Cai Pengfei Inventor after: Zhang Jing Inventor after: Lu Yan Inventor after: Yang Qingzhuo Inventor after: Li Haolan Inventor before: Xiang Hui Inventor before: Zhang Yongli Inventor before: Su Ruiqing Inventor before: Zhang Hongyuan Inventor before: Cai Pengfei Inventor before: Zhang Jing Inventor before: Lu Yan Inventor before: Yang Qingzhuo Inventor before: Li Haolan |
|
CB03 | Change of inventor or designer information | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20200728 |
|
RJ01 | Rejection of invention patent application after publication |