CN106815605B - 一种基于机器学习的数据分类方法及设备 - Google Patents
一种基于机器学习的数据分类方法及设备 Download PDFInfo
- Publication number
- CN106815605B CN106815605B CN201710051325.6A CN201710051325A CN106815605B CN 106815605 B CN106815605 B CN 106815605B CN 201710051325 A CN201710051325 A CN 201710051325A CN 106815605 B CN106815605 B CN 106815605B
- Authority
- CN
- China
- Prior art keywords
- data
- word group
- classification
- feature word
- learning
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 41
- 238000010801 machine learning Methods 0.000 title claims abstract description 40
- 238000013145 classification model Methods 0.000 claims abstract description 17
- 230000011218 segmentation Effects 0.000 claims abstract description 4
- 238000012163 sequencing technique Methods 0.000 claims description 9
- 238000012216 screening Methods 0.000 claims description 6
- 238000012545 processing Methods 0.000 claims description 4
- 230000000694 effects Effects 0.000 description 5
- 238000004590 computer program Methods 0.000 description 4
- 238000000605 extraction Methods 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 238000010276 construction Methods 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 239000003208 petroleum Substances 0.000 description 2
- 238000009960 carding Methods 0.000 description 1
- 230000001427 coherent effect Effects 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 230000003631 expected effect Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000014509 gene expression Effects 0.000 description 1
- 230000003370 grooming effect Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 230000015654 memory Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000003936 working memory Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- General Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Computer Vision & Pattern Recognition (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Physics & Mathematics (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Medical Informatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710051325.6A CN106815605B (zh) | 2017-01-23 | 2017-01-23 | 一种基于机器学习的数据分类方法及设备 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710051325.6A CN106815605B (zh) | 2017-01-23 | 2017-01-23 | 一种基于机器学习的数据分类方法及设备 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106815605A CN106815605A (zh) | 2017-06-09 |
CN106815605B true CN106815605B (zh) | 2021-04-13 |
Family
ID=59112339
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710051325.6A Active CN106815605B (zh) | 2017-01-23 | 2017-01-23 | 一种基于机器学习的数据分类方法及设备 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106815605B (zh) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107273501B (zh) * | 2017-06-16 | 2020-06-26 | 合肥美的智能科技有限公司 | 语料生成方法及***、智能设备和计算机装置 |
US11609353B2 (en) * | 2017-09-26 | 2023-03-21 | Schlumberger Technology Corporation | Apparatus and methods for improved subsurface data processing systems |
CN109597892A (zh) * | 2018-12-25 | 2019-04-09 | 杭州数梦工场科技有限公司 | 一种数据库中数据的分类方法、装置、设备及存储介质 |
CN111339304A (zh) * | 2020-03-16 | 2020-06-26 | 闪捷信息科技有限公司 | 一种基于机器学习的文本数据自动分类方法 |
CN111917648B (zh) * | 2020-06-30 | 2021-10-26 | 华南理工大学 | 一种数据中心里分布式机器学习数据重排的传输优化方法 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1460947A (zh) * | 2003-06-13 | 2003-12-10 | 北京大学计算机科学技术研究所 | 融合关键词学习的支持向量机文本分类增量训练学习方法 |
CN103257957A (zh) * | 2012-02-15 | 2013-08-21 | 深圳市腾讯计算机***有限公司 | 一种基于中文分词的文本相似性识别方法及装置 |
CN104239436A (zh) * | 2014-08-27 | 2014-12-24 | 南京邮电大学 | 一种基于文本分类和聚类分析的网络热点事件发现方法 |
CN104866573A (zh) * | 2015-05-22 | 2015-08-26 | 齐鲁工业大学 | 一种文本分类的方法 |
CN106056098A (zh) * | 2016-06-23 | 2016-10-26 | 哈尔滨工业大学 | 一种基于类别合并的脉冲信号聚类分选方法 |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103646464B (zh) * | 2013-12-23 | 2016-01-20 | 尤新革 | 智能点验钞机识别***自动升级的方法 |
CN103886090B (zh) * | 2014-03-31 | 2018-01-02 | 北京搜狗科技发展有限公司 | 基于用户喜好的内容推荐方法及装置 |
CN104112026B (zh) * | 2014-08-01 | 2017-09-08 | 中国联合网络通信集团有限公司 | 一种短信文本分类方法及*** |
CN106294568A (zh) * | 2016-07-27 | 2017-01-04 | 北京明朝万达科技股份有限公司 | 一种基于bp网络的中文文本分类规则生成方法及*** |
-
2017
- 2017-01-23 CN CN201710051325.6A patent/CN106815605B/zh active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1460947A (zh) * | 2003-06-13 | 2003-12-10 | 北京大学计算机科学技术研究所 | 融合关键词学习的支持向量机文本分类增量训练学习方法 |
CN103257957A (zh) * | 2012-02-15 | 2013-08-21 | 深圳市腾讯计算机***有限公司 | 一种基于中文分词的文本相似性识别方法及装置 |
CN104239436A (zh) * | 2014-08-27 | 2014-12-24 | 南京邮电大学 | 一种基于文本分类和聚类分析的网络热点事件发现方法 |
CN104866573A (zh) * | 2015-05-22 | 2015-08-26 | 齐鲁工业大学 | 一种文本分类的方法 |
CN106056098A (zh) * | 2016-06-23 | 2016-10-26 | 哈尔滨工业大学 | 一种基于类别合并的脉冲信号聚类分选方法 |
Also Published As
Publication number | Publication date |
---|---|
CN106815605A (zh) | 2017-06-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106815605B (zh) | 一种基于机器学习的数据分类方法及设备 | |
CN109992645B (zh) | 一种基于文本数据的资料管理***及方法 | |
CN109101597B (zh) | 一种电力新闻数据采集*** | |
US9367581B2 (en) | System and method of quality assessment of a search index | |
US20060179051A1 (en) | Methods and apparatus for steering the analyses of collections of documents | |
KR20160149050A (ko) | 텍스트 마이닝을 활용한 순수 기업 선정 장치 및 방법 | |
CN110781333A (zh) | 一种基于机器学习的斜拉桥非结构化监测数据处理方法 | |
CN113190502A (zh) | 基于深度学习的档案管理方法 | |
Eykens et al. | Fine-grained classification of social science journal articles using textual data: A comparison of supervised machine learning approaches | |
CN114491034B (zh) | 一种文本分类方法及智能设备 | |
CN102591920A (zh) | 对文档管理***中的文档集合进行分类的方法以及*** | |
CN114117038A (zh) | 一种文档分类方法、装置、***及电子设备 | |
CN113515622A (zh) | 一种档案数据分类保存*** | |
CN114764463A (zh) | 基于事件传播特征的互联网舆情事件自动预警*** | |
CN117113973A (zh) | 一种信息处理方法及相关装置 | |
CN111859032A (zh) | 一种短信拆字敏感词的检测方法、装置及计算机存储介质 | |
CN100444194C (zh) | 文章标题及关联信息的自动抽取装置和抽取方法 | |
JP2004171316A (ja) | Ocr装置及び文書検索システム及び文書検索プログラム | |
CN110807099B (zh) | 一种基于模糊集的文本分析检索方法 | |
CN113722421A (zh) | 一种合同审计方法和***,及计算机可读存储介质 | |
CN110737749A (zh) | 创业计划评价方法、装置、计算机设备及存储介质 | |
CN117909440B (zh) | 智能档案索引与检索*** | |
KR102555711B1 (ko) | 지식재산권 데이터 플랫폼 및 그의 데이터 처리 방법 | |
CN115640758B (zh) | 一种基于知识构建的三维模型数模质检方法 | |
CN117252514B (zh) | 基于深度学习和模型训练的建筑物资库数据处理方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: A Data Classification Method and Equipment Based on Machine Learning Effective date of registration: 20221008 Granted publication date: 20210413 Pledgee: Industrial Bank Co.,Ltd. Shanghai Branch Pledgor: SHANGHAI SUNINFO TECHNOLOGY Co.,Ltd. Registration number: Y2022310000279 |
|
PC01 | Cancellation of the registration of the contract for pledge of patent right | ||
PC01 | Cancellation of the registration of the contract for pledge of patent right |
Date of cancellation: 20231017 Granted publication date: 20210413 Pledgee: Industrial Bank Co.,Ltd. Shanghai Branch Pledgor: SHANGHAI SUNINFO TECHNOLOGY Co.,Ltd. Registration number: Y2022310000279 |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: A Data Classification Method and Equipment Based on Machine Learning Effective date of registration: 20231025 Granted publication date: 20210413 Pledgee: Industrial Bank Co.,Ltd. Shanghai Jinshan Branch Pledgor: SHANGHAI SUNINFO TECHNOLOGY Co.,Ltd. Registration number: Y2023980062535 |