CN106599022A - 基于用户访问数据的用户画像形成方法 - Google Patents
基于用户访问数据的用户画像形成方法 Download PDFInfo
- Publication number
- CN106599022A CN106599022A CN201610935388.3A CN201610935388A CN106599022A CN 106599022 A CN106599022 A CN 106599022A CN 201610935388 A CN201610935388 A CN 201610935388A CN 106599022 A CN106599022 A CN 106599022A
- Authority
- CN
- China
- Prior art keywords
- user
- label
- webpage
- forming method
- method based
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 29
- 230000011218 segmentation Effects 0.000 claims abstract description 10
- 238000000605 extraction Methods 0.000 claims abstract description 9
- 238000004422 calculation algorithm Methods 0.000 claims abstract description 8
- 239000013598 vector Substances 0.000 claims description 19
- 238000012549 training Methods 0.000 claims description 10
- 238000013527 convolutional neural network Methods 0.000 claims description 9
- 238000012545 processing Methods 0.000 claims description 8
- 241000270322 Lepidosauria Species 0.000 claims description 3
- 239000000284 extract Substances 0.000 claims description 3
- 238000013515 script Methods 0.000 claims description 2
- 230000006399 behavior Effects 0.000 abstract description 6
- 238000010801 machine learning Methods 0.000 abstract description 3
- 230000007547 defect Effects 0.000 abstract description 2
- 238000010586 diagram Methods 0.000 description 7
- 238000012360 testing method Methods 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 5
- 238000001914 filtration Methods 0.000 description 4
- 230000003796 beauty Effects 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000008451 emotion Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000011176 pooling Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 235000019640 taste Nutrition 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/284—Lexical analysis, e.g. tokenisation or collocates
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610935388.3A CN106599022B (zh) | 2016-11-01 | 2016-11-01 | 基于用户访问数据的用户画像形成方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610935388.3A CN106599022B (zh) | 2016-11-01 | 2016-11-01 | 基于用户访问数据的用户画像形成方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106599022A true CN106599022A (zh) | 2017-04-26 |
CN106599022B CN106599022B (zh) | 2019-12-10 |
Family
ID=58589465
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610935388.3A Active CN106599022B (zh) | 2016-11-01 | 2016-11-01 | 基于用户访问数据的用户画像形成方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106599022B (zh) |
Cited By (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107633036A (zh) * | 2017-09-08 | 2018-01-26 | 广州汪汪信息技术有限公司 | 一种微博用户画像方法、电子设备、存储介质、*** |
CN107818334A (zh) * | 2017-09-29 | 2018-03-20 | 北京邮电大学 | 一种移动互联网用户访问模式表征和聚类方法 |
CN107895024A (zh) * | 2017-09-13 | 2018-04-10 | 同济大学 | 用于网页新闻分类推荐的用户模型构建方法及推荐方法 |
CN108521435A (zh) * | 2018-07-06 | 2018-09-11 | 武汉思普崚技术有限公司 | 一种用户网络行为画像的方法及*** |
CN108769440A (zh) * | 2018-06-06 | 2018-11-06 | 北京京东尚科信息技术有限公司 | 前置分流方法和装置 |
CN108874941A (zh) * | 2018-06-04 | 2018-11-23 | 成都知道创宇信息技术有限公司 | 基于卷积特征和多重哈希映射的大数据url去重方法 |
CN108920717A (zh) * | 2018-07-27 | 2018-11-30 | 百度在线网络技术(北京)有限公司 | 用于显示信息的方法及装置 |
CN109002459A (zh) * | 2018-05-30 | 2018-12-14 | 珠海市君天电子科技有限公司 | 一种用户喜好的商品类型识别方法以及装置 |
CN109168044A (zh) * | 2018-10-11 | 2019-01-08 | 北京奇艺世纪科技有限公司 | 一种视频特征的确定方法及装置 |
CN109710836A (zh) * | 2018-11-29 | 2019-05-03 | 国政通科技有限公司 | 一种基于追星族公会的大数据智能推荐***及方法 |
CN109710890A (zh) * | 2018-12-20 | 2019-05-03 | 四川新网银行股份有限公司 | 基于构建的行为画像模型实时识别虚假材料的方法和*** |
CN109934629A (zh) * | 2019-03-12 | 2019-06-25 | 重庆金窝窝网络科技有限公司 | 一种信息推送方法及装置 |
CN110020113A (zh) * | 2017-09-28 | 2019-07-16 | 南京无界家居科技有限公司 | 一种基于特征匹配的家居产品预测方法及装置 |
CN110598016A (zh) * | 2019-09-11 | 2019-12-20 | 腾讯科技(深圳)有限公司 | 一种多媒体信息推荐的方法、装置、设备和介质 |
CN110717116A (zh) * | 2018-06-27 | 2020-01-21 | 北京京东尚科信息技术有限公司 | 关系网络的链接预测方法及***、设备、存储介质 |
CN111915366A (zh) * | 2020-07-20 | 2020-11-10 | 上海燕汐软件信息科技有限公司 | 一种用户画像构建方法、装置、计算机设备及存储介质 |
CN112380418A (zh) * | 2020-12-31 | 2021-02-19 | 广州智云尚大数据科技有限公司 | 一种基于网络爬虫的数据处理方法、***及云平台 |
CN112383545A (zh) * | 2020-11-13 | 2021-02-19 | 西安热工研究院有限公司 | 适用电力scada***的反爬虫***、装置及部署方法 |
CN112825076A (zh) * | 2019-11-20 | 2021-05-21 | 北京搜狗科技发展有限公司 | 一种信息推荐方法、装置和电子设备 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2945113A1 (en) * | 2014-05-14 | 2015-11-18 | Cisco Technology, Inc. | Audience segmentation using machine-learning |
CN105550269A (zh) * | 2015-12-10 | 2016-05-04 | 复旦大学 | 一种有监督学习的产品评论分析方法及*** |
CN105718579A (zh) * | 2016-01-22 | 2016-06-29 | 浙江大学 | 一种基于上网日志挖掘和用户活动识别的信息推送方法 |
-
2016
- 2016-11-01 CN CN201610935388.3A patent/CN106599022B/zh active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2945113A1 (en) * | 2014-05-14 | 2015-11-18 | Cisco Technology, Inc. | Audience segmentation using machine-learning |
CN105550269A (zh) * | 2015-12-10 | 2016-05-04 | 复旦大学 | 一种有监督学习的产品评论分析方法及*** |
CN105718579A (zh) * | 2016-01-22 | 2016-06-29 | 浙江大学 | 一种基于上网日志挖掘和用户活动识别的信息推送方法 |
Cited By (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107633036A (zh) * | 2017-09-08 | 2018-01-26 | 广州汪汪信息技术有限公司 | 一种微博用户画像方法、电子设备、存储介质、*** |
CN107895024A (zh) * | 2017-09-13 | 2018-04-10 | 同济大学 | 用于网页新闻分类推荐的用户模型构建方法及推荐方法 |
CN107895024B (zh) * | 2017-09-13 | 2021-10-08 | 同济大学 | 用于网页新闻分类推荐的用户模型构建方法及推荐方法 |
CN110020113A (zh) * | 2017-09-28 | 2019-07-16 | 南京无界家居科技有限公司 | 一种基于特征匹配的家居产品预测方法及装置 |
CN107818334A (zh) * | 2017-09-29 | 2018-03-20 | 北京邮电大学 | 一种移动互联网用户访问模式表征和聚类方法 |
CN109002459A (zh) * | 2018-05-30 | 2018-12-14 | 珠海市君天电子科技有限公司 | 一种用户喜好的商品类型识别方法以及装置 |
CN108874941A (zh) * | 2018-06-04 | 2018-11-23 | 成都知道创宇信息技术有限公司 | 基于卷积特征和多重哈希映射的大数据url去重方法 |
CN108874941B (zh) * | 2018-06-04 | 2021-09-21 | 成都知道创宇信息技术有限公司 | 基于卷积特征和多重哈希映射的大数据url去重方法 |
CN108769440A (zh) * | 2018-06-06 | 2018-11-06 | 北京京东尚科信息技术有限公司 | 前置分流方法和装置 |
CN110717116B (zh) * | 2018-06-27 | 2023-12-05 | 北京京东尚科信息技术有限公司 | 关系网络的链接预测方法及***、设备、存储介质 |
CN110717116A (zh) * | 2018-06-27 | 2020-01-21 | 北京京东尚科信息技术有限公司 | 关系网络的链接预测方法及***、设备、存储介质 |
CN108521435A (zh) * | 2018-07-06 | 2018-09-11 | 武汉思普崚技术有限公司 | 一种用户网络行为画像的方法及*** |
CN108920717A (zh) * | 2018-07-27 | 2018-11-30 | 百度在线网络技术(北京)有限公司 | 用于显示信息的方法及装置 |
CN109168044A (zh) * | 2018-10-11 | 2019-01-08 | 北京奇艺世纪科技有限公司 | 一种视频特征的确定方法及装置 |
CN109710836A (zh) * | 2018-11-29 | 2019-05-03 | 国政通科技有限公司 | 一种基于追星族公会的大数据智能推荐***及方法 |
CN109710890A (zh) * | 2018-12-20 | 2019-05-03 | 四川新网银行股份有限公司 | 基于构建的行为画像模型实时识别虚假材料的方法和*** |
CN109710890B (zh) * | 2018-12-20 | 2023-06-09 | 四川新网银行股份有限公司 | 基于构建的行为画像模型实时识别虚假材料的方法和*** |
CN109934629A (zh) * | 2019-03-12 | 2019-06-25 | 重庆金窝窝网络科技有限公司 | 一种信息推送方法及装置 |
CN110598016A (zh) * | 2019-09-11 | 2019-12-20 | 腾讯科技(深圳)有限公司 | 一种多媒体信息推荐的方法、装置、设备和介质 |
CN112825076A (zh) * | 2019-11-20 | 2021-05-21 | 北京搜狗科技发展有限公司 | 一种信息推荐方法、装置和电子设备 |
CN112825076B (zh) * | 2019-11-20 | 2024-03-01 | 北京搜狗科技发展有限公司 | 一种信息推荐方法、装置和电子设备 |
CN111915366A (zh) * | 2020-07-20 | 2020-11-10 | 上海燕汐软件信息科技有限公司 | 一种用户画像构建方法、装置、计算机设备及存储介质 |
CN111915366B (zh) * | 2020-07-20 | 2024-01-12 | 上海燕汐软件信息科技有限公司 | 一种用户画像构建方法、装置、计算机设备及存储介质 |
CN112383545A (zh) * | 2020-11-13 | 2021-02-19 | 西安热工研究院有限公司 | 适用电力scada***的反爬虫***、装置及部署方法 |
CN112380418A (zh) * | 2020-12-31 | 2021-02-19 | 广州智云尚大数据科技有限公司 | 一种基于网络爬虫的数据处理方法、***及云平台 |
Also Published As
Publication number | Publication date |
---|---|
CN106599022B (zh) | 2019-12-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106599022A (zh) | 基于用户访问数据的用户画像形成方法 | |
CN104077377B (zh) | 基于网络文章属性的网络舆情热点发现方法和装置 | |
CN111797898B (zh) | 一种基于深度语义匹配的在线评论自动回复方法 | |
CN110633373A (zh) | 一种基于知识图谱和深度学习的汽车舆情分析方法 | |
CN112307351A (zh) | 用户行为的模型训练、推荐方法、装置和设备 | |
CN109815386B (zh) | 一种基于用户画像的构建方法、装置及存储介质 | |
US20140229486A1 (en) | Method and apparatus for unsupervised learning of multi-resolution user profile from text analysis | |
Wu et al. | News filtering and summarization on the web | |
CN115329085A (zh) | 一种社交机器人分类方法及*** | |
Fiol-Roig et al. | Data mining techniques for web page classification | |
CN111680505B (zh) | 一种Markdown特征感知的无监督关键词提取方法 | |
CN116776889A (zh) | 一种基于图卷积网络和外部知识嵌入的粤语谣言检测方法 | |
Wasim et al. | Extracting and modeling user interests based on social media | |
Zhu | A book recommendation algorithm based on collaborative filtering | |
Patil et al. | Detecting and categorization of click baits | |
Liebeskind et al. | Text categorization from category name in an industry-motivated scenario | |
CN108205532A (zh) | 生成网页的方法和装置 | |
CN113761125A (zh) | 动态摘要确定方法和装置、计算设备以及计算机存储介质 | |
Tran et al. | User interest analysis with hidden topic in news recommendation system | |
CN112417858A (zh) | 一种实体权重评分方法、***、电子设备及存储介质 | |
JP2020113267A (ja) | リーディングリストを生成するシステム及び方法 | |
John et al. | Methods for removing noise from web pages: a review | |
Pan et al. | Automatically infer human traits and behavior from social media data | |
Panawong et al. | Tourism web filtering and analysis using Naïve bay with boundary values and text mining | |
Bhatia et al. | Opinion score mining system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB03 | Change of inventor or designer information |
Inventor after: Nie Lin Inventor after: Lin Jing Inventor after: Wang Qing Inventor after: Luo Siwei Inventor before: Luo Siwei Inventor before: Lin Jing Inventor before: Wang Qing Inventor before: Nie Lin |
|
CB03 | Change of inventor or designer information | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20170426 Assignee: GUANGDONG TECSUN TECHNOLOGY Co.,Ltd. Assignor: SUN YAT-SEN University Contract record no.: X2023980054810 Denomination of invention: A User Profile Formation Method Based on User Access Data Granted publication date: 20191210 License type: Common License Record date: 20240102 Application publication date: 20170426 Assignee: Guangzhou Quying Information Technology Co.,Ltd. Assignor: SUN YAT-SEN University Contract record no.: X2023980054796 Denomination of invention: A User Profile Formation Method Based on User Access Data Granted publication date: 20191210 License type: Common License Record date: 20240102 Application publication date: 20170426 Assignee: SHENDAYUN NETWORK (SHENZHEN) Co.,Ltd. Assignor: SUN YAT-SEN University Contract record no.: X2023980054646 Denomination of invention: A User Profile Formation Method Based on User Access Data Granted publication date: 20191210 License type: Common License Record date: 20231229 |
|
EE01 | Entry into force of recordation of patent licensing contract | ||
EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20170426 Assignee: Guangzhou Ainuo Technology Co.,Ltd. Assignor: SUN YAT-SEN University Contract record no.: X2024980001983 Denomination of invention: A User Profile Formation Method Based on User Access Data Granted publication date: 20191210 License type: Common License Record date: 20240205 Application publication date: 20170426 Assignee: Guangzhou Ruijinyuan Food Technology Co.,Ltd. Assignor: SUN YAT-SEN University Contract record no.: X2024980001982 Denomination of invention: A User Profile Formation Method Based on User Access Data Granted publication date: 20191210 License type: Common License Record date: 20240205 Application publication date: 20170426 Assignee: Guangzhou Liren Digital Technology Co.,Ltd. Assignor: SUN YAT-SEN University Contract record no.: X2024980001991 Denomination of invention: A User Profile Formation Method Based on User Access Data Granted publication date: 20191210 License type: Common License Record date: 20240205 Application publication date: 20170426 Assignee: Spectrum Blue Cloud (Guangzhou) Digital Technology Co.,Ltd. Assignor: SUN YAT-SEN University Contract record no.: X2024980001990 Denomination of invention: A User Profile Formation Method Based on User Access Data Granted publication date: 20191210 License type: Common License Record date: 20240205 Application publication date: 20170426 Assignee: Lingjing Information Technology (Guangzhou) Co.,Ltd. Assignor: SUN YAT-SEN University Contract record no.: X2024980001986 Denomination of invention: A User Profile Formation Method Based on User Access Data Granted publication date: 20191210 License type: Common License Record date: 20240205 |
|
EE01 | Entry into force of recordation of patent licensing contract | ||
EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20170426 Assignee: Guangzhou Love Time Information Technology Co.,Ltd. Assignor: SUN YAT-SEN University Contract record no.: X2024980002610 Denomination of invention: A User Profile Formation Method Based on User Access Data Granted publication date: 20191210 License type: Common License Record date: 20240307 Application publication date: 20170426 Assignee: Zhongyuan Technology (Guangdong) Co.,Ltd. Assignor: SUN YAT-SEN University Contract record no.: X2024980002582 Denomination of invention: A User Profile Formation Method Based on User Access Data Granted publication date: 20191210 License type: Common License Record date: 20240307 |
|
EE01 | Entry into force of recordation of patent licensing contract |