CN103608805B - 辞典产生装置及方法 - Google Patents

辞典产生装置及方法 Download PDF

Info

Publication number
CN103608805B
CN103608805B CN201280030052.2A CN201280030052A CN103608805B CN 103608805 B CN103608805 B CN 103608805B CN 201280030052 A CN201280030052 A CN 201280030052A CN 103608805 B CN103608805 B CN 103608805B
Authority
CN
China
Prior art keywords
word
dictionary
text
selection portion
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201280030052.2A
Other languages
English (en)
Chinese (zh)
Other versions
CN103608805A (zh
Inventor
萩原正人
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Lotte Group Co.,Ltd.
Original Assignee
Rakuten Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Rakuten Inc filed Critical Rakuten Inc
Publication of CN103608805A publication Critical patent/CN103608805A/zh
Application granted granted Critical
Publication of CN103608805B publication Critical patent/CN103608805B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/242Dictionaries

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Machine Translation (AREA)
CN201280030052.2A 2012-02-28 2012-09-03 辞典产生装置及方法 Active CN103608805B (zh)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US201261604266P 2012-02-28 2012-02-28
US61/604266 2012-02-28
US61/604,266 2012-02-28
PCT/JP2012/072350 WO2013128684A1 (ja) 2012-02-28 2012-09-03 辞書生成装置、方法、及びプログラム

Publications (2)

Publication Number Publication Date
CN103608805A CN103608805A (zh) 2014-02-26
CN103608805B true CN103608805B (zh) 2016-09-07

Family

ID=49081915

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201280030052.2A Active CN103608805B (zh) 2012-02-28 2012-09-03 辞典产生装置及方法

Country Status (5)

Country Link
JP (1) JP5373998B1 (ko)
KR (1) KR101379128B1 (ko)
CN (1) CN103608805B (ko)
TW (1) TWI452475B (ko)
WO (1) WO2013128684A1 (ko)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105701133B (zh) * 2014-11-28 2021-03-30 方正国际软件(北京)有限公司 一种地址输入的方法和设备
JP6813776B2 (ja) * 2016-10-27 2021-01-13 キヤノンマーケティングジャパン株式会社 情報処理装置、その制御方法及びプログラム
JP6707483B2 (ja) * 2017-03-09 2020-06-10 株式会社東芝 情報処理装置、情報処理方法、および情報処理プログラム
WO2018232581A1 (en) * 2017-06-20 2018-12-27 Accenture Global Solutions Limited AUTOMATIC EXTRACTION OF A LEARNING CORPUS FOR A DATA CLASSIFIER BASED ON AUTOMATIC LEARNING ALGORITHMS
JP2019049873A (ja) * 2017-09-11 2019-03-28 株式会社Screenホールディングス 同義語辞書作成装置、同義語辞書作成プログラム及び同義語辞書作成方法
CN109033183B (zh) * 2018-06-27 2021-06-25 清远墨墨教育科技有限公司 一种可编辑的云词库的解析方法

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1204811A (zh) * 1998-08-13 1999-01-13 英业达股份有限公司 汉语语句切分的方法及其***

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3777456B2 (ja) * 1996-04-23 2006-05-24 日本電信電話株式会社 日本語形態素解析方法と装置及び辞書未登録語収集方法と装置
JP2002351870A (ja) * 2001-05-29 2002-12-06 Communication Research Laboratory 形態素の解析方法
CN100530171C (zh) * 2005-01-31 2009-08-19 日电(中国)有限公司 字典学习方法和字典学习装置
JP5073349B2 (ja) * 2007-04-05 2012-11-14 ヤフー株式会社 専門用語抽出装置、方法及びプログラム

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1204811A (zh) * 1998-08-13 1999-01-13 英业达股份有限公司 汉语语句切分的方法及其***

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
岑咏华.一种基于多重哈希词典和K-最短路径算法的中文粗分词方案研究.《情报理论与实践》.2009,第32卷(第3期),110-114. *

Also Published As

Publication number Publication date
TW201335776A (zh) 2013-09-01
KR101379128B1 (ko) 2014-03-27
JP5373998B1 (ja) 2013-12-18
CN103608805A (zh) 2014-02-26
WO2013128684A1 (ja) 2013-09-06
KR20130137048A (ko) 2013-12-13
JPWO2013128684A1 (ja) 2015-07-30
TWI452475B (zh) 2014-09-11

Similar Documents

Publication Publication Date Title
CN108287858B (zh) 自然语言的语义提取方法及装置
CN103608805B (zh) 辞典产生装置及方法
CN108363790A (zh) 用于对评论进行评估的方法、装置、设备和存储介质
CN107229610A (zh) 一种情感数据的分析方法及装置
CN102622338B (zh) 一种短文本间语义距离的计算机辅助计算方法
CN110851596A (zh) 文本分类方法、装置及计算机可读存储介质
CN106980609A (zh) 一种基于词向量表示的条件随机场的命名实体识别方法
CN107729309A (zh) 一种基于深度学习的中文语义分析的方法及装置
CN107122349A (zh) 一种基于word2vec‑LDA模型的文本主题词提取方法
CN109271493A (zh) 一种语言文本处理方法、装置和存储介质
CN104778256B (zh) 一种领域问答***咨询的快速可增量聚类方法
CN107480143A (zh) 基于上下文相关性的对话话题分割方法和***
CN101520802A (zh) 一种问答对的质量评价方法和***
CN105045777A (zh) 使用互联网语料库的自动的上下文相关的语言校正和增强
CN109949799B (zh) 一种语义解析方法及***
CN109815400A (zh) 基于长文本的人物兴趣提取方法
CN106997341A (zh) 一种创新方案匹配方法、装置、服务器及***
CN108108468A (zh) 一种基于概念和文本情感的短文本情感分析方法和装置
CN114492327A (zh) 一种公文智能写作方法
CN108319583A (zh) 从中文语料库提取知识的方法与***
CN101308512B (zh) 一种基于网页的互译翻译对抽取方法及装置
CN108345694B (zh) 一种基于主题数据库的文献检索方法及***
CN107870900B (zh) 提供翻译文的方法、装置以及记录介质
CN105045410B (zh) 一种形式化拼音和汉字对应识别的方法
JP2012146263A (ja) 言語モデル学習装置、言語モデル学習方法、言語解析装置、及びプログラム

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CP03 Change of name, title or address
CP03 Change of name, title or address

Address after: Tokyo, Japan

Patentee after: Lotte Group Co.,Ltd.

Address before: Tokyo, Japan

Patentee before: Rakuten, Inc.