CN111090797B - Data acquisition method, apparatus, computer device, and storage medium - Google Patents
Data acquisition method, apparatus, computer device, and storage medium
- Publication number
- CN111090797B (application CN201911198993.7A)
- Authority
- CN
- China
- Prior art keywords
- webpage
- target
- path information
- elements
- acquiring
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Transfer Between Computers (AREA)
Abstract
Description
Claims (9)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911198993.7A CN111090797B (zh) | 2019-11-29 | 2019-11-29 | Data acquisition method, apparatus, computer device, and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911198993.7A CN111090797B (zh) | 2019-11-29 | 2019-11-29 | Data acquisition method, apparatus, computer device, and storage medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111090797A (zh) | 2020-05-01 |
CN111090797B (zh) | 2023-07-25 |
Family
ID=70393709
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201911198993.7A Active CN111090797B (zh) | Data acquisition method, apparatus, computer device, and storage medium | 2019-11-29 | 2019-11-29 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111090797B (zh) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
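CN111638879B (zh) * | 2020-05-15 | 2023-10-31 | 民生科技有限责任公司 | System, method, and apparatus for overcoming pixel-point positioning limitations, and readable storage medium |
CN112882625B (zh) * | 2021-02-10 | 2022-05-17 | 南京苏宁软件技术有限公司 | Element picking method, apparatus, computer device, and storage medium |
CN113918460A (zh) * | 2021-10-15 | 2022-01-11 | 京东科技信息技术有限公司 | Page testing method, apparatus, device, and medium |
CN114528005B (zh) * | 2021-11-29 | 2023-06-23 | 深圳市千源互联网科技服务有限公司 | Crawl tag update method, apparatus, device, and storage medium |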
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101464905A (zh) * | 2009-01-08 | 2009-06-24 | 中国科学院计算技术研究所 | System and method for web page information extraction |
CN102117289A (zh) * | 2009-12-30 | 2011-07-06 | 北京大学 | Method and apparatus for extracting comment content from web pages |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102831121B (zh) * | 2011-06-15 | 2015-07-08 | 阿里巴巴集团控股有限公司 | Method and system for web page information extraction |
Also Published As
Publication number | Publication date |
---|---|
CN111090797A (zh) | 2020-05-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111090797B (zh) | Data acquisition method, apparatus, computer device, and storage medium | |
US9529780B2 (en) | Displaying content on a mobile device | |
US9330179B2 (en) | Configuring web crawler to extract web page information | |
US7496847B2 (en) | Displaying a computer resource through a preferred browser | |
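CN107729475B (zh) | Web page element collection method, apparatus, terminal, and computer-readable storage medium | |
CN110069683B (zh) | Method and apparatus for crawling data based on a browser | |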
US9547717B2 (en) | Administration of search results | |
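CN104536973B (zh) | Picture recognition method and browser client | |
CN110209966B (zh) | Web page refreshing method, web page system, and electronic device | |
CN105868096B (zh) | Method, apparatus, and device for displaying web page test results in a browser | |
CN107644100B (zh) | Information processing method, apparatus, and system, and computer-readable storage medium | |
CN107679214B (zh) | Link locating method, apparatus, terminal, and computer-readable storage medium | |
CN114417197A (zh) | Access record processing method, apparatus, and storage medium | |
CN108595697B (zh) | Web page integration method, apparatus, and system | |
CN104866594A (zh) | Information push method and apparatus | |
CN104239298A (zh) | Text information recommendation method, server, browser, and system | |
CN103577595A (zh) | Keyword push method and apparatus based on the currently browsed page | |
CN110222251B (zh) | Service wrapping method based on web page segmentation and a search algorithm | |
CN103678511A (zh) | Method and apparatus for extracting web page content according to a visual template | |
CN103544272A (zh) | Method and apparatus for displaying pictures in a browser | |
KR20170073693A (ko) | Extraction of similar group elements | |
CN104809173A (zh) | Method and apparatus for processing search results | |
CN106649350B (zh) | Method and apparatus for acquiring position information of link elements | |
CN104281629A (zh) | Method, apparatus, and client device for extracting pictures from web pages | |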
US20170024472A1 (en) | Information retrieval method utilizing webpage visual and language features and system using thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
CP03 | Change of name, title or address | ||
Address after: No.1-1 Suning Avenue, Xuzhuang Software Park, Xuanwu District, Nanjing, Jiangsu Province, 210000 Patentee after: Jiangsu Suning cloud computing Co.,Ltd. Country or region after: China Address before: No.1-1 Suning Avenue, Xuzhuang Software Park, Xuanwu District, Nanjing, Jiangsu Province, 210000 Patentee before: Suning Cloud Computing Co.,Ltd. Country or region before: China |
TR01 | Transfer of patent right | ||
Effective date of registration: 20240603 Address after: Room 3104, Building A5, No. 3 Gutan Avenue, Economic Development Zone, Gaochun District, Nanjing City, Jiangsu Province, 210000 Patentee after: Jiangsu Biying Technology Co.,Ltd. Country or region after: China Address before: No.1-1 Suning Avenue, Xuzhuang Software Park, Xuanwu District, Nanjing, Jiangsu Province, 210000 Patentee before: Jiangsu Suning cloud computing Co.,Ltd. Country or region before: China |