HK1159815A1 - Method and apparatus for data categorizing - Google Patents
Method and apparatus for data categorizing
- Publication number
- HK1159815A1
- Authority
- HK
- Hong Kong
- Prior art keywords
- data categorizing
- categorizing
- data
- Prior art date
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
- G06F16/355—Class or cluster creation or modification
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2010101221412A CN102193936B (zh) | 2010-03-09 | 2010-03-09 | Method and apparatus for data classification |
Publications (1)
Publication Number | Publication Date |
---|---|
HK1159815A1 (en) | 2012-08-03 |
Family
ID=44560907
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
HK12100209.3A HK1159815A1 (en) | 2010-03-09 | 2012-01-09 | Method and apparatus for data categorizing |
Country Status (5)
Country | Link |
---|---|
US (1) | US20110225161A1 (en) |
EP (1) | EP2545511A4 (en) |
CN (1) | CN102193936B (zh) |
HK (1) | HK1159815A1 (en) |
WO (1) | WO2011112236A1 (en) |
Families Citing this family (38)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102332137A (zh) * | 2011-09-23 | 2012-01-25 | 纽海信息技术(上海)有限公司 | Commodity matching method and system |
US20130268328A1 (en) * | 2012-04-09 | 2013-10-10 | Yahoo! Inc. | Generating a deal score to indicate a relative value of an offer |
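CN103377216A (zh) * | 2012-04-24 | 2013-10-30 | 苏州引角信息科技有限公司 | Method and system for constructing a product information database |
CN103577989B (zh) * | 2012-07-30 | 2017-11-14 | 阿里巴巴集团控股有限公司 | Information classification method and information classification system based on product identification |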
US9110983B2 (en) * | 2012-08-17 | 2015-08-18 | Intel Corporation | Traversing data utilizing data relationships |
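CN103678335B (zh) * | 2012-09-05 | 2017-12-08 | 阿里巴巴集团控股有限公司 | Method and apparatus for commodity identification tags, and commodity navigation method |
CN103729365A (zh) * | 2012-10-12 | 2014-04-16 | 阿里巴巴集团控股有限公司 | Search method and system |
CN104008101B (zh) * | 2013-02-21 | 2019-02-12 | 北京京东尚科信息技术有限公司 | Goods classification inspection method and inspection apparatus |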
US9483741B2 (en) | 2013-03-28 | 2016-11-01 | Wal-Mart Stores, Inc. | Rule-based item classification |
US9436919B2 (en) | 2013-03-28 | 2016-09-06 | Wal-Mart Stores, Inc. | System and method of tuning item classification |
CN103235822B (zh) * | 2013-05-03 | 2016-05-25 | 富景天策(北京)气象科技有限公司 | Database generation and query method |
US10678878B2 (en) | 2013-05-20 | 2020-06-09 | Tencent Technology (Shenzhen) Company Limited | Method, device and storing medium for searching |
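CN104077337B (zh) * | 2013-05-20 | 2015-11-25 | 腾讯科技(深圳)有限公司 | Search method and apparatus |
CN103294798B (zh) * | 2013-05-27 | 2016-08-31 | 北京尚友通达信息技术有限公司 | Automatic commodity classification method based on bigram word segmentation and support vector machine |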
US10489842B2 (en) * | 2013-09-30 | 2019-11-26 | Ebay Inc. | Large-scale recommendations for a dynamic inventory |
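CN103544264A (zh) * | 2013-10-17 | 2014-01-29 | 常熟市华安电子工程有限公司 | Commodity title optimization tool |
CN103605815B (zh) * | 2013-12-11 | 2016-08-31 | 焦点科技股份有限公司 | Automatic commodity information classification and recommendation method suitable for B2B e-commerce platforms |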
US20150331936A1 (en) * | 2014-05-14 | 2015-11-19 | Faris ALQADAH | Method and system for extracting a product and classifying text-based electronic documents |
US9607098B2 (en) | 2014-06-02 | 2017-03-28 | Wal-Mart Stores, Inc. | Determination of product attributes and values using a product entity graph |
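CN104408635A (zh) * | 2014-12-01 | 2015-03-11 | 银联智惠信息服务(上海)有限公司 | Method and apparatus for identifying merchant category information |
CN106570573B (zh) * | 2015-10-13 | 2022-05-27 | 菜鸟智能物流控股有限公司 | Method and apparatus for predicting parcel attribute information |
CN105589847B (zh) * | 2015-12-22 | 2019-02-15 | 北京奇虎科技有限公司 | Weighted article identification method and apparatus |
CN106919543A (zh) * | 2015-12-24 | 2017-07-04 | 阿里巴巴集团控股有限公司 | Method and apparatus for determining the title text of a commodity object |
CN107203542A (zh) * | 2016-03-17 | 2017-09-26 | 阿里巴巴集团控股有限公司 | Phrase extraction method and apparatus |
CN107203507B (zh) * | 2016-03-17 | 2019-08-13 | 阿里巴巴集团控股有限公司 | Feature vocabulary extraction method and apparatus |
CN107766394B (zh) * | 2016-08-23 | 2021-12-21 | 阿里巴巴集团控股有限公司 | Business data processing method and system thereof |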
US10200759B1 (en) * | 2017-07-28 | 2019-02-05 | Rovi Guides, Inc. | Systems and methods for identifying and correlating an advertised object from a media asset with a demanded object from a group of interconnected computing devices embedded in a living environment of a user |
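CN110147483B (zh) * | 2017-09-12 | 2023-09-29 | 阿里巴巴集团控股有限公司 | Title reconstruction method and apparatus |
CN108171586A (zh) * | 2018-01-23 | 2018-06-15 | 北京值得买科技股份有限公司 | Commodity clustering method and apparatus |
CN108388555A (zh) * | 2018-02-01 | 2018-08-10 | 口碑(上海)信息技术有限公司 | Commodity deduplication method and apparatus based on industry category |
CN108491873B (zh) * | 2018-03-19 | 2019-05-14 | 广州蓝深科技有限公司 | Commodity classification method based on data analysis |
CN109543940B (zh) * | 2018-10-12 | 2024-04-09 | 中国平安人寿保险股份有限公司 | Activity evaluation method and apparatus, electronic device, and storage medium |
CN111625620A (zh) * | 2019-02-28 | 2020-09-04 | 北京京东尚科信息技术有限公司 | Information processing method and apparatus |
CN111723566B (zh) * | 2019-03-21 | 2024-01-23 | 阿里巴巴集团控股有限公司 | Method and apparatus for reconstructing product information |
CN110647630A (zh) * | 2019-09-30 | 2020-01-03 | 浙江执御信息技术有限公司 | Method and apparatus for detecting identical commodities |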
US20210304121A1 (en) * | 2020-03-30 | 2021-09-30 | Coupang, Corp. | Computerized systems and methods for product integration and deduplication using artificial intelligence |
CN112181968A (zh) * | 2020-09-29 | 2021-01-05 | 京东数字科技控股股份有限公司 | Method, apparatus, system, and storage medium for unifying commodity information |
US11829396B1 (en) * | 2022-01-25 | 2023-11-28 | Wizsoft Ltd. | Method and system for retrieval based on an inexact full-text search |
Family Cites Families (47)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
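JP2943447B2 (ja) * | 1991-01-30 | 1999-08-30 | 三菱電機株式会社 | Text information extraction device, text similarity matching device, text retrieval system, text information extraction method, text similarity matching method, and question analysis device |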
US5371807A (en) * | 1992-03-20 | 1994-12-06 | Digital Equipment Corporation | Method and apparatus for text classification |
US5331554A (en) * | 1992-12-10 | 1994-07-19 | Ricoh Corporation | Method and apparatus for semantic pattern matching for text retrieval |
US5438628A (en) * | 1993-04-19 | 1995-08-01 | Xerox Corporation | Method for matching text images and documents using character shape codes |
US6714933B2 (en) * | 2000-05-09 | 2004-03-30 | Cnet Networks, Inc. | Content aggregation method and apparatus for on-line purchasing system |
US7082426B2 (en) * | 1993-06-18 | 2006-07-25 | Cnet Networks, Inc. | Content aggregation method and apparatus for an on-line product catalog |
CN1158460A (zh) * | 1996-12-31 | 1997-09-03 | 复旦大学 | Method for automatic classification and retrieval of cross-language corpora |
US6742003B2 (en) * | 2001-04-30 | 2004-05-25 | Microsoft Corporation | Apparatus and accompanying methods for visualizing clusters of data and hierarchical cluster classifications |
US6751600B1 (en) * | 2000-05-30 | 2004-06-15 | Commerce One Operations, Inc. | Method for automatic categorization of items |
US7076485B2 (en) * | 2001-03-07 | 2006-07-11 | The Mitre Corporation | Method and system for finding similar records in mixed free-text and structured data |
US7716161B2 (en) * | 2002-09-24 | 2010-05-11 | Google, Inc. | Methods and apparatus for serving relevant advertisements |
US20040093200A1 (en) * | 2002-11-07 | 2004-05-13 | Island Data Corporation | Method of and system for recognizing concepts |
EP1588283A2 (en) * | 2002-11-22 | 2005-10-26 | Transclick, Inc. | System and method for language translation via remote devices |
CA2516941A1 (en) * | 2003-02-19 | 2004-09-02 | Custom Speech Usa, Inc. | A method for form completion using speech recognition and text comparison |
JP4466564B2 (ja) * | 2003-09-08 | 2010-05-26 | 日本電気株式会社 | Document creation and browsing device, document creation and browsing robot, and document creation and browsing program |
WO2005071665A1 (en) * | 2004-01-20 | 2005-08-04 | Koninklijke Philips Electronics, N.V. | Method and system for determining the topic of a conversation and obtaining and presenting related content |
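JP4366249B2 (ja) * | 2004-06-02 | 2009-11-18 | パイオニア株式会社 | Information processing device, method therefor, program therefor, recording medium on which the program is recorded, and information acquisition device |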
US8903827B2 (en) * | 2004-10-29 | 2014-12-02 | Ebay Inc. | Method and system for categorizing items automatically |
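JP4008954B2 (ja) * | 2004-10-29 | 2007-11-14 | 松下電器産業株式会社 | Information retrieval device |
CN101112078B (zh) * | 2005-02-08 | 2012-04-18 | 日本电信电话株式会社 | Information communication terminal, information communication system, information communication method, information communication program, and recording medium storing the program |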
US20070055526A1 (en) * | 2005-08-25 | 2007-03-08 | International Business Machines Corporation | Method, apparatus and computer program product providing prosodic-categorical enhancement to phrase-spliced text-to-speech synthesis |
US7574449B2 (en) * | 2005-12-02 | 2009-08-11 | Microsoft Corporation | Content matching |
JP4961755B2 (ja) * | 2006-01-23 | 2012-06-27 | 富士ゼロックス株式会社 | Word alignment device, word alignment method, and word alignment program |
US7698140B2 (en) * | 2006-03-06 | 2010-04-13 | Foneweb, Inc. | Message transcription, voice query and query delivery system |
US20100138451A1 (en) * | 2006-04-03 | 2010-06-03 | Assaf Henkin | Techniques for facilitating on-line contextual analysis and advertising |
US20070294610A1 (en) * | 2006-06-02 | 2007-12-20 | Ching Phillip W | System and method for identifying similar portions in documents |
JP5223673B2 (ja) * | 2006-06-29 | 2013-06-26 | 日本電気株式会社 | Speech processing device and program, and speech processing method |
WO2008056570A1 (fr) * | 2006-11-09 | 2008-05-15 | Panasonic Corporation | Content search device |
CN101004737A (zh) * | 2007-01-24 | 2007-07-25 | 贵阳易特软件有限公司 | Keyword-based personalized document processing system |
WO2008090609A1 (ja) * | 2007-01-25 | 2008-07-31 | Fujitsu Limited | Preferred program extraction device |
US8122032B2 (en) * | 2007-07-20 | 2012-02-21 | Google Inc. | Identifying and linking similar passages in a digital text corpus |
US7945525B2 (en) * | 2007-11-09 | 2011-05-17 | International Business Machines Corporation | Methods for obtaining improved text similarity measures which replace similar characters with a string pattern representation by using a semantic data tree |
US20090132385A1 (en) * | 2007-11-21 | 2009-05-21 | Techtain Inc. | Method and system for matching user-generated text content |
US8077984B2 (en) * | 2008-01-04 | 2011-12-13 | Xerox Corporation | Method for computing similarity between text spans using factored word sequence kernels |
US20090292677A1 (en) * | 2008-02-15 | 2009-11-26 | Wordstream, Inc. | Integrated web analytics and actionable workbench tools for search engine optimization and marketing |
US7958136B1 (en) * | 2008-03-18 | 2011-06-07 | Google Inc. | Systems and methods for identifying similar documents |
JP5224868B2 (ja) * | 2008-03-28 | 2013-07-03 | 株式会社東芝 | Information recommendation device and information recommendation method |
US8145482B2 (en) * | 2008-05-25 | 2012-03-27 | Ezra Daya | Enhancing analysis of test key phrases from acoustic sources with key phrase training models |
US8214346B2 (en) * | 2008-06-27 | 2012-07-03 | Cbs Interactive Inc. | Personalization engine for classifying unstructured documents |
US8060513B2 (en) * | 2008-07-01 | 2011-11-15 | Dossierview Inc. | Information processing with integrated semantic contexts |
US8577930B2 (en) * | 2008-08-20 | 2013-11-05 | Yahoo! Inc. | Measuring topical coherence of keyword sets |
US20100250526A1 (en) * | 2009-03-27 | 2010-09-30 | Prochazka Filip | Search System that Uses Semantic Constructs Defined by Your Social Network |
US8306807B2 (en) * | 2009-08-17 | 2012-11-06 | Ntrepid Corporation | Structured data translation apparatus, system and method |
US20110258054A1 (en) * | 2010-04-19 | 2011-10-20 | Sandeep Pandey | Automatic Generation of Bid Phrases for Online Advertising |
US9560206B2 (en) * | 2010-04-30 | 2017-01-31 | American Teleconferencing Services, Ltd. | Real-time speech-to-text conversion in an audio conference session |
KR101196935B1 (ko) * | 2010-07-05 | 2012-11-05 | 엔에이치엔(주) | Method and system for providing representative phrases for real-time popular keywords |
US8407215B2 (en) * | 2010-12-10 | 2013-03-26 | Sap Ag | Text analysis to identify relevant entities |
2010
- 2010-03-09 CN application CN2010101221412A, published as CN102193936B (zh), status: Active
2011
- 2011-03-01 US application US12/932,659, published as US20110225161A1 (en), status: Abandoned
- 2011-03-02 WO application PCT/US2011/000388, published as WO2011112236A1 (en), status: Application Filing
- 2011-03-02 EP application EP11753706.8A, published as EP2545511A4 (en), status: Withdrawn
2012
- 2012-01-09 HK application HK12100209.3A, published as HK1159815A1, status: unknown
Also Published As
Publication number | Publication date |
---|---|
US20110225161A1 (en) | 2011-09-15 |
CN102193936A (zh) | 2011-09-21 |
CN102193936B (zh) | 2013-09-18 |
EP2545511A1 (en) | 2013-01-16 |
EP2545511A4 (en) | 2016-03-16 |
WO2011112236A1 (en) | 2011-09-15 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
HK1159815A1 (en) | Method and apparatus for data categorizing | |
IL242091B (en) | Device, system and method | |
ZA201105101B (en) | Data processing apparatus and method | |
EP2609720A4 (en) | METHOD AND APPARATUS FOR FILTERING CONTINUOUS DIFFUSION DATA | |
GB2479922B (en) | Data transmission apparatus and method | |
EP2715550A4 (en) | APPARATUSES AND METHODS FOR ENSURING DATA INTEGRITY | |
EP2715549A4 (en) | APPARATUSES AND METHODS FOR ENSURING DATA INTEGRITY | |
GB201019798D0 (en) | Data processing apparatus and method | |
GB2470611B (en) | Apparatus and method for processing data | |
EP2613443A4 (en) | DATA PROCESSING DEVICE AND DATA PROCESSING METHOD | |
GB201103737D0 (en) | Method and apparatus for transferring data | |
EP2618491A4 (en) | DATA PROCESSING DEVICE AND DATA PROCESSING METHOD | |
EP2790434A4 (en) | DATA TRANSMISSION PROCESS AND DEVICE | |
EP2506522A4 (en) | METHOD AND DEVICE FOR PUSHING DATA | |
EP2549390A4 (en) | DATA PROCESSING DEVICE AND DATA PROCESSING METHOD | |
HK1183394A1 (zh) | Information processing apparatus and information processing method | |
PT2700234T (pt) | Method and device for lossy compression encoding of data | |
EP2761447A4 (en) | DEVICE AND METHOD FOR SYNCHRONIZING APPLICATION DATA | |
EP2645579A4 (en) | DATA PROCESSING DEVICE AND DATA PROCESSING METHOD | |
PT2793227T (pt) | Method, device and system for processing audio data | |
EP2512064A4 (en) | METHOD AND APPARATUS FOR CONFIGURING DATA | |
GB2490773B (en) | Method and apparatus for the classification of data | |
EP2616948A4 (en) | METHOD AND APPARATUS FOR MANAGING DATA | |
EP2622544A4 (en) | METHOD AND DEVICE FOR DATA PROCESSING | |
HK1181153A1 (zh) | Method and apparatus for data disaster recovery processing | |