CN105893353B - 分词方法和分词*** - Google Patents
分词方法和分词*** Download PDFInfo
- Publication number
- CN105893353B CN105893353B CN201610251640.9A CN201610251640A CN105893353B CN 105893353 B CN105893353 B CN 105893353B CN 201610251640 A CN201610251640 A CN 201610251640A CN 105893353 B CN105893353 B CN 105893353B
- Authority
- CN
- China
- Prior art keywords
- word
- participle
- segmentation result
- new text
- word segmentation
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/253—Grammatical analysis; Style critique
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610251640.9A CN105893353B (zh) | 2016-04-20 | 2016-04-20 | 分词方法和分词*** |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610251640.9A CN105893353B (zh) | 2016-04-20 | 2016-04-20 | 分词方法和分词*** |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105893353A CN105893353A (zh) | 2016-08-24 |
CN105893353B true CN105893353B (zh) | 2018-10-26 |
Family
ID=56704298
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610251640.9A Active CN105893353B (zh) | 2016-04-20 | 2016-04-20 | 分词方法和分词*** |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105893353B (zh) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108197315A (zh) * | 2018-02-01 | 2018-06-22 | 中控技术(西安)有限公司 | 一种建立分词索引库的方法和装置 |
CN109033082B (zh) * | 2018-07-19 | 2022-06-10 | 深圳创维数字技术有限公司 | 语义模型的学习训练方法、装置及计算机可读存储介质 |
CN109918664B (zh) * | 2019-03-05 | 2023-04-18 | 北京声智科技有限公司 | 分词方法和装置 |
CN110222335A (zh) * | 2019-05-20 | 2019-09-10 | 平安科技(深圳)有限公司 | 一种文本分词方法及装置 |
CN111814477B (zh) * | 2020-07-06 | 2022-06-21 | 重庆邮电大学 | 一种基于争议焦点实体的争议焦点发现方法、装置及终端 |
CN111814470A (zh) * | 2020-07-14 | 2020-10-23 | 混沌时代(北京)教育科技有限公司 | 一种基于互联网昵称提取称呼方法及*** |
CN113870478A (zh) * | 2021-09-29 | 2021-12-31 | 平安银行股份有限公司 | 快速取号方法、装置、电子设备及存储介质 |
CN115840800B (zh) * | 2023-02-27 | 2023-05-12 | 江苏曼荼罗软件股份有限公司 | 患者信息匹配方法、***、计算机及可读存储介质 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101739393A (zh) * | 2008-11-20 | 2010-06-16 | 苗玉水 | 汉语文本智能分词法 |
CN102087642A (zh) * | 2009-11-04 | 2011-06-08 | 蒋贤春 | Wkr分词方法 |
CN103646018A (zh) * | 2013-12-20 | 2014-03-19 | 大连大学 | 一种基于hash散列表词典结构的中文分词方法 |
-
2016
- 2016-04-20 CN CN201610251640.9A patent/CN105893353B/zh active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101739393A (zh) * | 2008-11-20 | 2010-06-16 | 苗玉水 | 汉语文本智能分词法 |
CN102087642A (zh) * | 2009-11-04 | 2011-06-08 | 蒋贤春 | Wkr分词方法 |
CN103646018A (zh) * | 2013-12-20 | 2014-03-19 | 大连大学 | 一种基于hash散列表词典结构的中文分词方法 |
Non-Patent Citations (5)
Title |
---|
"基于Hash结构词典的双向最大匹配分词法";陈之彦等;《计算机科学》;20151130;第42卷(第11A期);论文第49-54页 * |
"基于双向最大匹配和HMM 的分词消歧模型";麦范金等;《知识组织与知识管理》;20081231(第8期);论文第38-40页 * |
"基于学生模型与AIML的智能教学***的研究";王晓敏;《中国优秀硕士学位论文全文数据库 信息科技辑》;20100715;论文第24、34-37页及图6.1 * |
"基于正反向最大匹配分词***的实现";陈明华等;《信息技术》;20091231(第6期);论文第124-127页 * |
"基于词典的中文分词技术研究";郭瞳康;《中国优秀硕士学位论文全文数据库 信息科技辑》;20110615;论文第2-40页 * |
Also Published As
Publication number | Publication date |
---|---|
CN105893353A (zh) | 2016-08-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105893353B (zh) | 分词方法和分词*** | |
EP0715756B1 (en) | Method and system for bootstrapping statistical processing into a rule-based natural language parser | |
CN110377724A (zh) | 一种基于数据挖掘的语料库关键词自动抽取算法 | |
KR102013230B1 (ko) | 구문 전처리 기반의 구문 분석 장치 및 그 방법 | |
Pettersson et al. | Normalisation of historical text using context-sensitive weighted Levenshtein distance and compound splitting | |
WO2014187096A1 (en) | Method and system for adding punctuation to voice files | |
CN104317846A (zh) | 一种语义分析与标注方法及*** | |
KR20140021838A (ko) | 문법 오류 검출 방법 및 이를 위한 오류검출장치 | |
CN107807910A (zh) | 一种基于hmm的词性标注方法 | |
CN110991180A (zh) | 一种基于关键词和Word2Vec的命令识别方法 | |
CN105912522A (zh) | 基于成分分析的英语语料自动提取方法和提取器 | |
Meteer et al. | Statistical language modeling combining n-gram and context-free grammars | |
Wu et al. | Efficient disfluency detection with transition-based parsing | |
CN109933781A (zh) | 基于sao结构的中文专利文本实体关系抽取方法 | |
CN110390022A (zh) | 一种自动化的专业知识图谱构建方法 | |
CN104391837A (zh) | 一种基于格语义的智能语法分析方法 | |
CN104572619A (zh) | 智能机器人交互***在投融资领域的应用 | |
CN108197104A (zh) | 文本分析方法、装置及云平台 | |
CN107480128A (zh) | 中文文本的分词方法及装置 | |
TWI764480B (zh) | 新詞識別方法和裝置 | |
CN110827807B (zh) | 一种语音识别的方法及其*** | |
Motlani et al. | Developing part-of-speech tagger for a resource poor language: Sindhi | |
CN104572628A (zh) | 一种基于句法特征的学术定义自动抽取***及方法 | |
Eidelman et al. | Lessons learned in part-of-speech tagging of conversational speech | |
CN109657202B (zh) | 文本处理的方法及装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20200119 Address after: 510665, room 906, ninth floor, 20 rhyme Road, Guangzhou, Guangdong, Tianhe District Patentee after: GUANGZHOU YAOLA NETWORK CO.,LTD. Address before: 510665, room 901, nine floor, 20 rhyme Road, Guangzhou, Guangdong, Tianhe District Patentee before: GUANGDONG INFINITE INFORMATION TECHNOLOGY Co.,Ltd. |
|
CP01 | Change in the name or title of a patent holder | ||
CP01 | Change in the name or title of a patent holder |
Address after: 510665 room 906, floor 9, No. 20, Keyun Road, Tianhe District, Guangzhou City, Guangdong Province Patentee after: Guangzhou Youla Network Technology Co.,Ltd. Address before: 510665 room 906, floor 9, No. 20, Keyun Road, Tianhe District, Guangzhou City, Guangdong Province Patentee before: GUANGZHOU YAOLA NETWORK CO.,LTD. |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20230727 Address after: 510000 room 901, floor 9, No. 20, Keyun Road, Tianhe District, Guangzhou City, Guangdong Province (office use only) Patentee after: GUANGDONG INFINITE INFORMATION TECHNOLOGY Co.,Ltd. Address before: 510665 room 906, floor 9, No. 20, Keyun Road, Tianhe District, Guangzhou City, Guangdong Province Patentee before: Guangzhou Youla Network Technology Co.,Ltd. |