CN101802776A - 应用语义向量和关键字分析关联数据集的方法和装置 - Google Patents

应用语义向量和关键字分析关联数据集的方法和装置 Download PDF

Info

Publication number
CN101802776A
CN101802776A CN200880001312A CN200880001312A CN101802776A CN 101802776 A CN101802776 A CN 101802776A CN 200880001312 A CN200880001312 A CN 200880001312A CN 200880001312 A CN200880001312 A CN 200880001312A CN 101802776 A CN101802776 A CN 101802776A
Authority
CN
China
Prior art keywords
data set
data collection
key word
group
subject data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN200880001312A
Other languages
English (en)
Chinese (zh)
Inventor
文圆
克里特普瑞特斯·马
杰拉德弗朗斯·荷利三世
安德鲁劳伦斯·法瑞斯
咖贝尔·斯汀伯格
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
TEXTWISE LLC
Original Assignee
TEXTWISE LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by TEXTWISE LLC filed Critical TEXTWISE LLC
Publication of CN101802776A publication Critical patent/CN101802776A/zh
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
CN200880001312A 2008-07-29 2008-07-29 应用语义向量和关键字分析关联数据集的方法和装置 Pending CN101802776A (zh)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/US2008/071505 WO2010014082A1 (en) 2008-07-29 2008-07-29 Method and apparatus for relating datasets by using semantic vectors and keyword analyses

Publications (1)

Publication Number Publication Date
CN101802776A true CN101802776A (zh) 2010-08-11

Family

ID=41610613

Family Applications (1)

Application Number Title Priority Date Filing Date
CN200880001312A Pending CN101802776A (zh) 2008-07-29 2008-07-29 应用语义向量和关键字分析关联数据集的方法和装置

Country Status (4)

Country Link
EP (1) EP2307951A4 (ja)
JP (1) JP2011529600A (ja)
CN (1) CN101802776A (ja)
WO (1) WO2010014082A1 (ja)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103649905A (zh) * 2011-03-10 2014-03-19 特克斯特怀茨有限责任公司 用于统一信息表示的方法和***及其应用
CN106257440A (zh) * 2015-06-17 2016-12-28 松下知识产权经营株式会社 语义信息生成方法和语义信息生成装置
CN109558586A (zh) * 2018-11-02 2019-04-02 中国科学院自动化研究所 一种资讯的言据自证评分方法、设备和存储介质
CN110060255A (zh) * 2017-12-28 2019-07-26 达索***公司 利用逐像素分类器来对2d平面图进行语义分割
CN111199259A (zh) * 2018-11-19 2020-05-26 中国电信股份有限公司 标识转换方法、装置和计算机可读存储介质

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5388988B2 (ja) * 2010-10-26 2014-01-15 ヤフー株式会社 広告選択装置、方法及びプログラム
US9558185B2 (en) * 2012-01-10 2017-01-31 Ut-Battelle Llc Method and system to discover and recommend interesting documents
JP5701324B2 (ja) * 2013-01-15 2015-04-15 ヤフー株式会社 情報配信装置、及び、情報配信方法
US9195470B2 (en) 2013-07-22 2015-11-24 Globalfoundries Inc. Dynamic data dimensioning by partial reconfiguration of single or multiple field-programmable gate arrays using bootstraps
JP6228425B2 (ja) * 2013-10-25 2017-11-08 株式会社Nttドコモ 広告生成装置および広告生成方法
CN105022754B (zh) 2014-04-29 2020-05-12 腾讯科技(深圳)有限公司 基于社交网络的对象分类方法及装置
US10360520B2 (en) 2015-01-06 2019-07-23 International Business Machines Corporation Operational data rationalization
US10643031B2 (en) 2016-03-11 2020-05-05 Ut-Battelle, Llc System and method of content based recommendation using hypernym expansion
US20230122031A1 (en) * 2019-06-26 2023-04-20 Google Llc Systems and methods for providing content candidates
CN113609264B (zh) * 2021-06-28 2022-09-02 国网北京市电力公司 电力***节点的数据查询方法、装置
CN113449111B (zh) * 2021-08-31 2021-12-07 苏州工业园区测绘地理信息有限公司 基于时空语义知识迁移的社会治理热点话题自动识别方法
CN114187605B (zh) * 2021-12-13 2023-02-28 苏州方兴信息技术有限公司 一种数据集成方法、装置和可读存储介质
WO2024074760A1 (en) * 2022-10-04 2024-04-11 Thirdpresence Oy Content management arrangement

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6175828B1 (en) * 1997-02-28 2001-01-16 Sharp Kabushiki Kaisha Retrieval apparatus
US20050216516A1 (en) * 2000-05-02 2005-09-29 Textwise Llc Advertisement placement method and system using semantic analysis
CN101059806A (zh) * 2007-06-06 2007-10-24 华东师范大学 一种基于语义的本地文档检索方法

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6134532A (en) * 1997-11-14 2000-10-17 Aptex Software, Inc. System and method for optimal adaptive matching of users to most relevant entity and information in real-time
US7089194B1 (en) * 1999-06-17 2006-08-08 International Business Machines Corporation Method and apparatus for providing reduced cost online service and adaptive targeting of advertisements
JP2005173795A (ja) * 2003-12-09 2005-06-30 Canon Inc 文書検索装置、およびその検索方法、並びに記憶媒体
JP2005326970A (ja) * 2004-05-12 2005-11-24 Mitsubishi Electric Corp 構造化文書曖昧検索装置及びそのプログラム
JP4728125B2 (ja) * 2006-01-11 2011-07-20 ヤフー株式会社 索引ファイルを用いた文書検索の方法、索引ファイルを用いた文書検索サーバ、及び索引ファイルを用いた文書検索プログラム

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6175828B1 (en) * 1997-02-28 2001-01-16 Sharp Kabushiki Kaisha Retrieval apparatus
US20050216516A1 (en) * 2000-05-02 2005-09-29 Textwise Llc Advertisement placement method and system using semantic analysis
CN101059806A (zh) * 2007-06-06 2007-10-24 华东师范大学 一种基于语义的本地文档检索方法

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
KURT D. BOLLACKER等: "CiteSeer: An Autonomous Web Agent for Automatic Retrieval and Identification of Interesting Publications", 《2ND INTERNATIONAL ACM CONFERENCE ON AUTONOMOUS AGENTS》 *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103649905A (zh) * 2011-03-10 2014-03-19 特克斯特怀茨有限责任公司 用于统一信息表示的方法和***及其应用
CN103649905B (zh) * 2011-03-10 2015-08-05 特克斯特怀茨有限责任公司 用于统一信息表示的方法和***及其应用
CN106257440A (zh) * 2015-06-17 2016-12-28 松下知识产权经营株式会社 语义信息生成方法和语义信息生成装置
CN106257440B (zh) * 2015-06-17 2021-03-09 松下知识产权经营株式会社 语义信息生成方法和语义信息生成装置
CN110060255A (zh) * 2017-12-28 2019-07-26 达索***公司 利用逐像素分类器来对2d平面图进行语义分割
CN109558586A (zh) * 2018-11-02 2019-04-02 中国科学院自动化研究所 一种资讯的言据自证评分方法、设备和存储介质
CN109558586B (zh) * 2018-11-02 2023-04-18 中国科学院自动化研究所 一种资讯的言据自证评分方法、设备和存储介质
CN111199259A (zh) * 2018-11-19 2020-05-26 中国电信股份有限公司 标识转换方法、装置和计算机可读存储介质
CN111199259B (zh) * 2018-11-19 2023-06-20 中国电信股份有限公司 标识转换方法、装置和计算机可读存储介质

Also Published As

Publication number Publication date
EP2307951A4 (en) 2012-12-19
WO2010014082A1 (en) 2010-02-04
JP2011529600A (ja) 2011-12-08
EP2307951A1 (en) 2011-04-13

Similar Documents

Publication Publication Date Title
CN101802776A (zh) 应用语义向量和关键字分析关联数据集的方法和装置
CN101593200B (zh) 基于关键词频度分析的中文网页分类方法
AU2008307247B2 (en) System and method of inclusion of interactive elements on a search results page
US8312022B2 (en) Search engine optimization
US20080228720A1 (en) Implicit name searching
US7516397B2 (en) Methods, apparatus and computer programs for characterizing web resources
US20060129843A1 (en) Method and apparatus for electronically extracting application specific multidimensional information from documents selected from a set of documents electronically extracted from a library of electronically searchable documents
CN103443786A (zh) 识别网络浏览器中的并行布局的独立任务的机器学习方法
CN101546341A (zh) 信息推荐装置和信息推荐方法
CN102184262A (zh) 基于web的文本分类挖掘***及方法
CN104885081A (zh) 搜索***和相应方法
US20090119283A1 (en) System and Method of Improving and Enhancing Electronic File Searching
JP5313295B2 (ja) 文書探索サービス提供方法及びシステム
KR20080037413A (ko) 온라인 문맥기반 광고 장치 및 방법
CN104503988A (zh) 搜索方法及装置
Sivakumar Effectual web content mining using noise removal from web pages
CN116975340A (zh) 信息检索方法、装置、设备、程序产品及存储介质
US20100082594A1 (en) Building a topic based webpage based on algorithmic and community interactions
CN102567016A (zh) 应用程序编程接口使用示例提取方法及装置
Ahamed et al. Deduce user search progression with feedback session
Tsapatsoulis Web image indexing using WICE and a learning-free language model
Vagliano et al. Training researchers with the moving platform
JP7438272B2 (ja) 検索インテント単位のブロックを生成する方法、コンピュータ装置、およびコンピュータプログラム
KR101132393B1 (ko) 폭소노미와 링크 기반 랭킹 기법을 이용한 집단지성 기반 웹 페이지 검색 방법 및 이를 수행하기 위한 시스템
KR20240081523A (ko) 실감형 뉴스 콘텐츠를 위한 빅데이터 플랫폼 구축 방법

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 1147326

Country of ref document: HK

C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20100811

REG Reference to a national code

Ref country code: HK

Ref legal event code: WD

Ref document number: 1147326

Country of ref document: HK