CN107908749B - Person retrieval system and method based on a search engine - Google Patents
- Publication number
- CN107908749B (application CN201711147336.0A)
- Authority
- CN
- China
- Prior art keywords
- webpage
- name
- visual
- character
- visual block
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3347—Query execution using vector based model
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
- G06F16/355—Class or cluster creation or modification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
- G06F40/295—Named entity recognition
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Claims (7)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711147336.0A CN107908749B (zh) | 2017-11-17 | 2017-11-17 | Person retrieval system and method based on a search engine |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711147336.0A CN107908749B (zh) | 2017-11-17 | 2017-11-17 | Person retrieval system and method based on a search engine |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107908749A (zh) | 2018-04-13 |
CN107908749B (zh) | 2020-04-10 |
Family
ID=61846123
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711147336.0A Active CN107908749B (zh) | Person retrieval system and method based on a search engine | 2017-11-17 | 2017-11-17 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107908749B (zh) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
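CN109359301A (zh) * | 2018-10-19 | 2019-02-19 | National Computer Network and Information Security Management Center | Multi-dimensional annotation method and device for web page content |
CN109948154B (zh) * | 2019-03-12 | 2023-05-05 | Nanjing University of Posts and Telecommunications | Person acquisition and relationship recommendation system and method based on mailbox names |
CN111241283B (zh) * | 2020-01-15 | 2023-04-07 | University of Electronic Science and Technology of China | Rapid characterization method for research scholar profiles |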
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
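CN104182420A (zh) * | 2013-05-27 | 2014-12-03 | East China Normal University | Ontology-based Chinese person name disambiguation method |
CN106484675A (zh) * | 2016-09-29 | 2017-03-08 | Beijing Institute of Technology | Person relation extraction method fusing distributed semantics and sentence-meaning features |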
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
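CN1687924A (zh) * | 2005-04-28 | 2005-10-26 | Institute of Computing Technology, Chinese Academy of Sciences | Method for generating an Internet person information search engine |
CN102054029A (zh) * | 2010-12-17 | 2011-05-11 | Harbin Institute of Technology | Person information disambiguation method based on social networks and person name context |
CN102831128B (zh) * | 2011-06-15 | 2015-03-25 | Fujitsu Limited | Method and device for classifying information about same-name persons on the Internet |
CN102880623B (zh) * | 2011-07-13 | 2015-09-09 | Fujitsu Limited | Same-name person search method and system |
CN104376116A (zh) * | 2014-12-01 | 2015-02-25 | State Grid Corporation of China | Person information search method and device |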
US20160314130A1 (en) * | 2015-04-24 | 2016-10-27 | Tribune Broadcasting Company, Llc | Computing device with spell-check feature |
- 2017-11-17: CN application CN201711147336.0A, patent CN107908749B (zh), status Active
Non-Patent Citations (1)
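Title |
---|
"Chinese Person Name Disambiguation for Web People Search"; Shen Jianping; China Master's Theses Full-text Database, Information Science and Technology; 2012-02-15; I138-2615 * |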
Also Published As
Publication number | Publication date |
---|---|
CN107908749A (zh) | 2018-04-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
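US9514216B2 (en) | | Automatic classification of segmented portions of web pages |
CN107180045B (zh) | | Method for extracting geographic entity relations contained in Internet text |
CN103870973B (zh) | | Information push and search method and device based on keyword extraction from electronic information |
CN107784092A (zh) | | Method, server, and computer-readable medium for recommending hot words |
CN103455487B (zh) | | Method and device for extracting search terms |
CN106339502A (zh) | | Modeling and recommendation method based on sharded clustering of user behavior data |
CN107885793A (zh) | | Method and system for analyzing and predicting microblog hot topics |
CN106250513A (zh) | | Personalized event classification method and system based on event modeling |
Smith et al. | | Evaluating visual representations for topic understanding and their effects on manually generated topic labels |
CN105843796A (zh) | | Method and device for analyzing microblog sentiment tendency |
TW202001620A (zh) | | Automated website data collection method |
CN102955848A (zh) | | Semantics-based three-dimensional model retrieval system and method |
CN112559684A (zh) | | Keyword extraction and information retrieval method |
CN108021715B (zh) | | Heterogeneous tag fusion system based on semantic structure feature analysis |
CN108363748B (zh) | | Zhihu-based topic profiling system and topic profiling method |
Raghuvanshi et al. | | A brief review on sentiment analysis |
CN111680131B (zh) | | Semantics-based document clustering method, system, and computer device |
CN107908749B (zh) | | Person retrieval system and method based on a search engine |
Nualart et al. | | How we draw texts: a review of approaches to text visualization and exploration |
Nandi et al. | | Bangla news recommendation using doc2vec |
JP6621514B1 (ja) | | Summary creation device, summary creation method, and program |
CN116882414B (zh) | | Automatic comment generation method based on a large-scale language model, and related device |
CN110309355A (zh) | | Content tag generation method, apparatus, device, and storage medium |
CN116484079A (zh) | | Attribute word mining method and related products |
CN113934910A (zh) | | Method for constructing an automatically optimized and updated topic library, and method for real-time updating of hot events |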
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
CB03 | Change of inventor or designer information | Inventors after: Liu Yang, Wang Bailing, Zhou Qi, Xin Guodong, Sun Yunxiao, Wang Wei. Inventors before: Zhou Qi, Liu Yang, Wang Bailing, Xin Guodong, Sun Yunxiao, Wang Wei. |
GR01 | Patent grant | |
CB03 | Change of inventor or designer information | Inventors after: Sun Yunxiao, Liu Yang, Wang Bailing, Zhou Qi, Xin Guodong, Wang Wei. Inventors before: Liu Yang, Wang Bailing, Zhou Qi, Xin Guodong, Sun Yunxiao, Wang Wei. |