CN106202294B - Related news calculation method and device based on fusion of keywords and topic models
- Publication number
- CN106202294B
- Authority
- CN
- China
- Prior art keywords
- news
- keywords
- keyword
- candidate set
- topic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/955—Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Claims (17)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610509723.3A CN106202294B (zh) | 2016-07-01 | 2016-07-01 | Related news calculation method and device based on fusion of keywords and topic models |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610509723.3A CN106202294B (zh) | 2016-07-01 | 2016-07-01 | Related news calculation method and device based on fusion of keywords and topic models |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106202294A (zh) | 2016-12-07 |
CN106202294B (zh) | 2020-09-11 |
Family
ID=57464512
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610509723.3A Active CN106202294B (zh) | Related news calculation method and device based on fusion of keywords and topic models | 2016-07-01 | 2016-07-01 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106202294B (zh) |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106919682A (zh) * | 2017-03-01 | 2017-07-04 | 北京再塑宝科技有限公司 | Method for implementing search suggestion words based on Redis technology |
CN107423430B (zh) * | 2017-08-03 | 2020-03-03 | 北京京东尚科信息技术有限公司 | Data processing method, device, and computer-readable storage medium |
CN108052520A (zh) * | 2017-11-01 | 2018-05-18 | 平安科技(深圳)有限公司 | Topic-model-based associated word analysis method, electronic device, and storage medium |
CN108256096B (zh) * | 2018-01-30 | 2021-01-22 | 北京搜狐新媒体信息技术有限公司 | Data processing method and device |
CN108509630A (zh) * | 2018-04-09 | 2018-09-07 | 北京搜狐新媒体信息技术有限公司 | News recommendation method and device |
CN110737820B (zh) * | 2018-07-03 | 2022-05-31 | 百度在线网络技术(北京)有限公司 | Method and device for generating event information |
CN109408706B (zh) * | 2018-09-20 | 2022-05-03 | 上海掌门科技有限公司 | Image filtering method |
CN109508394A (zh) * | 2018-10-18 | 2019-03-22 | 青岛聚看云科技有限公司 | Training method and device for a multimedia file search ranking model |
CN112100500A (zh) * | 2020-09-23 | 2020-12-18 | 高小翎 | Example-learning-driven method for discovering content-associated websites |
CN112202889B (zh) * | 2020-09-30 | 2023-05-23 | 深圳前海微众银行股份有限公司 | Information push method, device, and storage medium |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103235824A (zh) * | 2013-05-06 | 2013-08-07 | 上海河广信息科技有限公司 | Method and system for determining web page text of interest to a user from browsed web pages |
CN103389975A (zh) * | 2012-05-07 | 2013-11-13 | 腾讯科技(深圳)有限公司 | News recommendation method and system |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6845374B1 (en) * | 2000-11-27 | 2005-01-18 | Mailfrontier, Inc | System and method for adaptive text recommendation |
US7689559B2 (en) * | 2006-02-08 | 2010-03-30 | Telenor Asa | Document similarity scoring and ranking method, device and computer program product |
US8095540B2 (en) * | 2008-04-16 | 2012-01-10 | Yahoo! Inc. | Identifying superphrases of text strings |
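CN105095162A (zh) * | 2014-05-19 | 2015-11-25 | 腾讯科技(深圳)有限公司 | Text similarity determination method, device, electronic equipment, and system |
CN104965889B (zh) * | 2015-06-17 | 2017-06-13 | 腾讯科技(深圳)有限公司 | Content recommendation method and device |
CN105183833B (zh) * | 2015-08-31 | 2020-05-19 | 天津大学 | Microblog text recommendation method based on a user model, and recommendation device thereof |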
- 2016-07-01: application CN201610509723.3A, patent CN106202294B (zh), status Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103389975A (zh) * | 2012-05-07 | 2013-11-13 | 腾讯科技(深圳)有限公司 | News recommendation method and system |
CN103235824A (zh) * | 2013-05-06 | 2013-08-07 | 上海河广信息科技有限公司 | Method and system for determining web page text of interest to a user from browsed web pages |
Also Published As
Publication number | Publication date |
---|---|
CN106202294A (zh) | 2016-12-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
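CN106202294B (zh) | Related news calculation method and device based on fusion of keywords and topic models | |
CN107133213B (zh) | Algorithm-based method and system for automatic text summary extraction | |
CN109815308B (zh) | Method and device for determining an intent recognition model and recognizing retrieval intent | |
CN104765769B (zh) | Short-text query expansion and retrieval method based on word vectors | |
CN105302810B (zh) | Information search method and device | |
CN108280114B (zh) | Deep-learning-based method for analyzing users' literature reading interests | |
CN105045875B (zh) | Personalized information retrieval method and device | |
CN103455487B (zh) | Method and device for extracting search terms | |
CN105243087B (zh) | Personalized recommendation method for aggregated IT news reading | |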
US20110145348A1 (en) | Systems and methods for identifying terms relevant to web pages using social network messages | |
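CN109388743B (zh) | Method and device for determining a language model | |
CN108154395A (zh) | Big-data-based method for profiling customer network behavior | |
CN106250513A (zh) | Event personalized classification method and system based on event modeling | |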
Patil et al. | Automatic text categorization: Marathi documents | |
CN104598607A (zh) | Method and system for recommending search phrases | |
Wu et al. | News filtering and summarization on the web | |
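CN107291895B (zh) | Fast hierarchical document query method | |
CN110717038B (zh) | Object classification method and device | |
CN111291177A (zh) | Information processing method, device, and computer storage medium | |
CN104915399A (zh) | News-headline-based recommendation data processing method and system | |
CN106649605B (zh) | Method and device for triggering promotional keywords | |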
Shawon et al. | Website classification using word based multiple n-gram models and random search oriented feature parameters | |
CN106294358A (zh) | Information retrieval method and system | |
CN113722478A (zh) | Multi-dimensional feature fusion similar-event calculation method, system, and electronic device | |
Zhu et al. | Real-time personalized twitter search based on semantic expansion and quality model |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | |
PB01 | Publication | |
C10 | Entry into substantive examination | |
SE01 | Entry into force of request for substantive examination | |
GR01 | Patent grant | |
CP03 | Change of name, title or address | Address after: 100088 room 112, block D, 28 new street, new street, Xicheng District, Beijing (Desheng Park). Patentee after: BEIJING QIHOO TECHNOLOGY Co.,Ltd.; Beijing Qizhi Business Consulting Co.,Ltd. Address before: 100088 room 112, block D, 28 new street, new street, Xicheng District, Beijing (Desheng Park). Patentee before: BEIJING QIHOO TECHNOLOGY Co.,Ltd.; Qizhi software (Beijing) Co.,Ltd. |
TR01 | Transfer of patent right | Effective date of registration: 20240119. Address after: Room 801, 8th floor, No. 104, floors 1-19, building 2, yard 6, Jiuxianqiao Road, Chaoyang District, Beijing 100015. Patentee after: BEIJING QIHOO TECHNOLOGY Co.,Ltd. Address before: 100088 room 112, block D, 28 new street, new street, Xicheng District, Beijing (Desheng Park). Patentee before: BEIJING QIHOO TECHNOLOGY Co.,Ltd.; Beijing Qizhi Business Consulting Co.,Ltd. |