CN101329680B - 句子层面的大规模快速匹配方法 - Google Patents
句子层面的大规模快速匹配方法 Download PDFInfo
- Publication number
- CN101329680B CN101329680B CN2008101071174A CN200810107117A CN101329680B CN 101329680 B CN101329680 B CN 101329680B CN 2008101071174 A CN2008101071174 A CN 2008101071174A CN 200810107117 A CN200810107117 A CN 200810107117A CN 101329680 B CN101329680 B CN 101329680B
- Authority
- CN
- China
- Prior art keywords
- sentence
- index
- character
- matching
- substring
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 19
- 238000004422 calculation algorithm Methods 0.000 claims abstract description 31
- 230000008878 coupling Effects 0.000 claims description 15
- 238000010168 coupling process Methods 0.000 claims description 15
- 238000005859 coupling reaction Methods 0.000 claims description 15
- 238000006243 chemical reaction Methods 0.000 abstract description 4
- 238000011524 similarity measure Methods 0.000 abstract 1
- 230000001186 cumulative effect Effects 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 230000013011 mating Effects 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Images
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (4)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2008101071174A CN101329680B (zh) | 2008-07-17 | 2008-07-17 | 句子层面的大规模快速匹配方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2008101071174A CN101329680B (zh) | 2008-07-17 | 2008-07-17 | 句子层面的大规模快速匹配方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101329680A CN101329680A (zh) | 2008-12-24 |
CN101329680B true CN101329680B (zh) | 2010-12-08 |
Family
ID=40205491
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2008101071174A Active CN101329680B (zh) | 2008-07-17 | 2008-07-17 | 句子层面的大规模快速匹配方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101329680B (zh) |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014152541A1 (en) * | 2013-03-15 | 2014-09-25 | Sherwin Han | Spatial arithmetic method of sequence alignment |
CN104298684B (zh) * | 2013-07-18 | 2018-04-06 | 深圳中兴网信科技有限公司 | 一种查询方法、装置及服务器 |
CN104008119B (zh) * | 2013-12-30 | 2017-09-26 | 西南交通大学 | 一种一对多的混合字符串融合比对方法 |
CN104750673B (zh) * | 2013-12-31 | 2018-02-23 | ***通信集团公司 | 文本匹配过滤方法及装置 |
CN105183732A (zh) * | 2014-06-04 | 2015-12-23 | 广州市动景计算机科技有限公司 | 网页的处理方法及装置 |
CN104063500B (zh) * | 2014-07-07 | 2019-03-29 | 联想(北京)有限公司 | 信息处理设备以及信息处理方法 |
CN106897258B (zh) * | 2017-02-27 | 2020-05-29 | 郑州云海信息技术有限公司 | 一种文本差异性的计算方法及装置 |
CN108363715A (zh) * | 2017-12-28 | 2018-08-03 | 中兴智能交通股份有限公司 | 一种车牌图片管理方法和装置 |
CN111797285A (zh) * | 2020-06-30 | 2020-10-20 | 深圳壹账通智能科技有限公司 | 字符串模糊匹配方法、装置、设备及可读存储介质 |
CN116029284B (zh) * | 2023-03-27 | 2023-07-21 | 上海蜜度信息技术有限公司 | 中文子串提取方法、***、存储介质及电子设备 |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1794236A (zh) * | 2004-12-21 | 2006-06-28 | 英特尔公司 | 高效的基于cam在分组有效载荷中进行串搜索的技术 |
CN101030221A (zh) * | 2007-04-13 | 2007-09-05 | 清华大学 | 一种用于文本或网络内容分析的大规模多关键词匹配方法 |
-
2008
- 2008-07-17 CN CN2008101071174A patent/CN101329680B/zh active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1794236A (zh) * | 2004-12-21 | 2006-06-28 | 英特尔公司 | 高效的基于cam在分组有效载荷中进行串搜索的技术 |
CN101030221A (zh) * | 2007-04-13 | 2007-09-05 | 清华大学 | 一种用于文本或网络内容分析的大规模多关键词匹配方法 |
Non-Patent Citations (1)
Title |
---|
龚才春、黄玉兰、许洪波、白硕.基于多重索引模型的大规模词典近似匹配算法.第三届全国信息检索与内容安全学术会议.2007,333-339. * |
Also Published As
Publication number | Publication date |
---|---|
CN101329680A (zh) | 2008-12-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101329680B (zh) | 句子层面的大规模快速匹配方法 | |
CN104199965B (zh) | 一种语义信息检索方法 | |
US8554561B2 (en) | Efficient indexing of documents with similar content | |
CN100527125C (zh) | 一种统计机器翻译中的在线翻译模型选择方法和*** | |
US7644069B2 (en) | Search ranking method for file system and related search engine | |
CN103345496B (zh) | 多媒体信息检索方法和*** | |
CN101794307A (zh) | 基于互联网分词思想的车载导航poi搜索引擎 | |
CN102043843A (zh) | 一种用于基于目标应用获取目标词条的方法与获取设备 | |
CN106557777B (zh) | 一种基于SimHash改进的Kmeans文档聚类方法 | |
CN102750379B (zh) | 一种基于过滤型的字符串快速匹配方法 | |
CN103020054B (zh) | 模糊查询方法及*** | |
CN102402537A (zh) | 中文网页文本除重***及方法 | |
CN102915381B (zh) | 基于多维语义的可视化网络检索呈现***及呈现控制方法 | |
CN101369278B (zh) | 一种近似匹配方法和装置 | |
CN103914570A (zh) | 基于字符串相似度算法的智能客服搜索方法与*** | |
Keivanloo et al. | Seclone-a hybrid approach to internet-scale real-time code clone search | |
CN115563313A (zh) | 基于知识图谱的文献书籍语义检索*** | |
CN110955806A (zh) | 一种针对中文文本的字符串匹配方法 | |
CN109446293B (zh) | 一种并行的高维近邻查询方法 | |
CN105515586B (zh) | 一种快速差量压缩方法 | |
CN112836008B (zh) | 基于去中心化存储数据的索引建立方法 | |
CN117235199A (zh) | 一种基于文档树的信息智能匹配检索的方法 | |
CN103064847A (zh) | 索引装置、索引方法、检索装置、检索方法和检索*** | |
CN110245275B (zh) | 一种大规模相似新闻标题快速归一化方法 | |
CN111538839A (zh) | 一种基于杰卡德距离的实时文本聚类方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C56 | Change in the name or address of the patentee |
Owner name: IFLYTEK CO., LTD. Free format text: FORMER NAME: ANHUI USTC IFLYTEK CO., LTD. |
|
CP03 | Change of name, title or address |
Address after: Wangjiang Road high tech Development Zone Hefei city Anhui province 230088 No. 666 Patentee after: IFLYTEK Co.,Ltd. Address before: 230088 information industry base, No. 616, Mount Huangshan Road, hi tech Zone, Anhui, Hefei Patentee before: ANHUI USTC IFLYTEK Co.,Ltd. |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20190325 Address after: 230088 18 Floor, A5 Building, 666 Wangjiangxi Road, Hefei High-tech Zone, Anhui Province Patentee after: ANHUI IFLYTEK MEDICAL INFORMATION TECHNOLOGY CO.,LTD. Address before: 230088 No. 666 Wangjiangxi Road, Hefei High-tech Development Zone, Anhui Province (230088) Patentee before: IFLYTEK Co.,Ltd. |
|
CP03 | Change of name, title or address | ||
CP03 | Change of name, title or address |
Address after: 230088 floor 23-24, building A5, No. 666, Wangjiang West Road, high tech Zone, Hefei, Anhui Province Patentee after: Anhui Xunfei Medical Co.,Ltd. Address before: 230088 18th floor, building A5, NO.666, Wangjiang West Road, high tech Zone, Hefei City, Anhui Province Patentee before: ANHUI IFLYTEK MEDICAL INFORMATION TECHNOLOGY CO.,LTD. |
|
CP01 | Change in the name or title of a patent holder | ||
CP01 | Change in the name or title of a patent holder |
Address after: 230088 floor 23-24, building A5, No. 666, Wangjiang West Road, high tech Zone, Hefei, Anhui Province Patentee after: IFLYTEK Medical Technology Co.,Ltd. Address before: 230088 floor 23-24, building A5, No. 666, Wangjiang West Road, high tech Zone, Hefei, Anhui Province Patentee before: Anhui Xunfei Medical Co.,Ltd. |