CN101710333B - Network text segmenting method based on genetic algorithm - Google Patents
Network text segmenting method based on genetic algorithm Download PDFInfo
- Publication number
- CN101710333B CN101710333B CN2009102191638A CN200910219163A CN101710333B CN 101710333 B CN101710333 B CN 101710333B CN 2009102191638 A CN2009102191638 A CN 2009102191638A CN 200910219163 A CN200910219163 A CN 200910219163A CN 101710333 B CN101710333 B CN 101710333B
- Authority
- CN
- China
- Prior art keywords
- text
- population
- vocabulary
- expansion
- formula
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (1)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2009102191638A CN101710333B (en) | 2009-11-26 | 2009-11-26 | Network text segmenting method based on genetic algorithm |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2009102191638A CN101710333B (en) | 2009-11-26 | 2009-11-26 | Network text segmenting method based on genetic algorithm |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101710333A CN101710333A (en) | 2010-05-19 |
CN101710333B true CN101710333B (en) | 2012-07-04 |
Family
ID=42403123
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2009102191638A Active CN101710333B (en) | 2009-11-26 | 2009-11-26 | Network text segmenting method based on genetic algorithm |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101710333B (en) |
Families Citing this family (37)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101968798A (en) * | 2010-09-10 | 2011-02-09 | 中国科学技术大学 | Community recommendation method based on on-line soft constraint LDA algorithm |
CN102024065B (en) * | 2011-01-18 | 2013-01-02 | 中南大学 | SIMD optimization-based webpage duplication elimination and concurrency method |
WO2012106885A1 (en) * | 2011-07-13 | 2012-08-16 | 华为技术有限公司 | Latent dirichlet allocation-based parameter inference method, calculation device and system |
CN102609407B (en) * | 2012-02-16 | 2014-10-29 | 复旦大学 | Fine-grained semantic detection method of harmful text contents in network |
CN102855312B (en) * | 2012-08-24 | 2013-08-14 | 武汉大学 | Domain-and-theme-oriented Web service clustering method |
CN102929937B (en) * | 2012-09-28 | 2015-09-16 | 福州博远无线网络科技有限公司 | Based on the data processing method of the commodity classification of text subject model |
CN103365978B (en) * | 2013-07-01 | 2017-03-29 | 浙江大学 | TCM data method for digging based on LDA topic models |
CN103914445A (en) * | 2014-03-05 | 2014-07-09 | 中国人民解放军装甲兵工程学院 | Data semantic processing method |
CN105095228A (en) * | 2014-04-28 | 2015-11-25 | 华为技术有限公司 | Method and apparatus for monitoring social information |
CN104281567A (en) * | 2014-10-13 | 2015-01-14 | 安徽华贞信息科技有限公司 | Latent semantic analysis method and system |
CN104281692A (en) * | 2014-10-13 | 2015-01-14 | 安徽华贞信息科技有限公司 | Method and system for realizing paragraph dimensionalized description |
CN104317579A (en) * | 2014-10-13 | 2015-01-28 | 安徽华贞信息科技有限公司 | Method and system for business performance of text document |
CN104317785A (en) * | 2014-10-13 | 2015-01-28 | 安徽华贞信息科技有限公司 | Internet paragraph level topic identifying system |
CN106355628B (en) * | 2015-07-16 | 2019-07-05 | 中国石油化工股份有限公司 | The modification method and system of picture and text knowledge point mask method and device, picture and text mark |
CN105138665B (en) * | 2015-09-02 | 2017-06-20 | 东南大学 | A kind of internet topic online mining method based on improvement LDA models |
CN105136714B (en) * | 2015-09-06 | 2017-10-10 | 河南工业大学 | A kind of tera-hertz spectra Wavelength selecting method based on genetic algorithm |
CN105389306A (en) * | 2015-11-02 | 2016-03-09 | 国网福建省电力有限公司 | Latent semantic analysis based intelligent parsing method for application form |
CN105787088B (en) * | 2016-03-14 | 2018-12-07 | 南京理工大学 | A kind of text information classification method based on segment encoding genetic algorithm |
CN107239438B (en) * | 2016-03-28 | 2020-07-28 | 阿里巴巴集团控股有限公司 | Document analysis method and device |
CN106502983B (en) * | 2016-10-17 | 2019-05-10 | 清华大学 | The event driven collapse Gibbs sampling method of implicit Di Li Cray model |
CN106815310B (en) * | 2016-12-20 | 2020-04-21 | 华南师范大学 | Hierarchical clustering method and system for massive document sets |
CN106709011B (en) * | 2016-12-26 | 2019-07-23 | 武汉大学 | A kind of position concept level resolution calculation method based on space orientation cluster |
CN108009151B (en) * | 2017-11-29 | 2021-04-16 | 深圳中泓在线股份有限公司 | News text automatic segmentation method and device, server and readable storage medium |
CN108038173B (en) * | 2017-12-07 | 2021-11-26 | 广东工业大学 | Webpage classification method and system and webpage classification equipment |
CN109299239B (en) * | 2018-09-29 | 2021-11-23 | 福建弘扬软件股份有限公司 | ES-based electronic medical record retrieval method |
CN109325092A (en) * | 2018-11-27 | 2019-02-12 | 中山大学 | Merge the nonparametric parallelization level Di Li Cray process topic model system of phrase information |
CN109829151B (en) * | 2018-11-27 | 2023-04-21 | 国网浙江省电力有限公司 | Text segmentation method based on hierarchical dirichlet model |
CN109918659B (en) * | 2019-02-28 | 2023-06-20 | 华南理工大学 | Method for optimizing word vector based on unreserved optimal individual genetic algorithm |
CN109977227B (en) * | 2019-03-19 | 2021-06-22 | 中国科学院自动化研究所 | Text feature extraction method, system and device based on feature coding |
CN110110326B (en) * | 2019-04-25 | 2020-10-27 | 西安交通大学 | Text cutting method based on subject information |
CN110222654A (en) * | 2019-06-10 | 2019-09-10 | 北京百度网讯科技有限公司 | Text segmenting method, device, equipment and storage medium |
CN113366511B (en) * | 2020-01-07 | 2022-03-25 | 支付宝(杭州)信息技术有限公司 | Named entity identification and extraction using genetic programming |
CN111797634B (en) * | 2020-06-04 | 2023-09-08 | 语联网(武汉)信息技术有限公司 | Document segmentation method and device |
CN112667817B (en) * | 2020-12-31 | 2022-05-31 | 杭州电子科技大学 | Text emotion classification integration system based on roulette attribute selection |
CN113191133B (en) * | 2021-04-21 | 2021-12-21 | 北京邮电大学 | Audio text alignment method and system based on Doc2Vec |
CN112988981B (en) * | 2021-05-14 | 2021-10-15 | 哈尔滨工业大学(深圳)(哈尔滨工业大学深圳科技创新研究院) | Automatic labeling method based on genetic algorithm |
CN113673255B (en) * | 2021-08-25 | 2023-06-30 | 北京市律典通科技有限公司 | Text function area splitting method and device, computer equipment and storage medium |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101287229A (en) * | 2008-05-26 | 2008-10-15 | 北京捷讯畅达科技发展有限公司 | Natural language processing technique and device applying to query by short message service of mobile phone |
-
2009
- 2009-11-26 CN CN2009102191638A patent/CN101710333B/en active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101287229A (en) * | 2008-05-26 | 2008-10-15 | 北京捷讯畅达科技发展有限公司 | Natural language processing technique and device applying to query by short message service of mobile phone |
Non-Patent Citations (2)
Title |
---|
刘娜 等.文本线性分割方法的研究.《计算机工程与应用》.2008,(第21期),212-216. * |
石晶 等.基于LDA模型的文本分割.《计算机学报》.2008,第31卷(第10期),1865-1873. * |
Also Published As
Publication number | Publication date |
---|---|
CN101710333A (en) | 2010-05-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101710333B (en) | Network text segmenting method based on genetic algorithm | |
CN100353361C (en) | New method of characteristic vector weighting for text classification and its device | |
Zamani et al. | Neural query performance prediction using weak supervision from multiple signals | |
CN106844424B (en) | LDA-based text classification method | |
CN104268197B (en) | A kind of industry comment data fine granularity sentiment analysis method | |
Misra et al. | Text segmentation via topic modeling: an analytical study | |
Das et al. | A heuristic-driven uncertainty based ensemble framework for fake news detection in tweets and news articles | |
CN105468713A (en) | Multi-model fused short text classification method | |
CN103207913B (en) | The acquisition methods of commercial fine granularity semantic relation and system | |
CN108763484A (en) | A kind of law article recommendation method based on LDA topic models | |
CN107944014A (en) | A kind of Chinese text sentiment analysis method based on deep learning | |
CN105045812A (en) | Text topic classification method and system | |
CN105760493A (en) | Automatic work order classification method for electricity marketing service hot spot 95598 | |
García-Hernández et al. | Single extractive text summarization based on a genetic algorithm | |
CN105912576A (en) | Emotion classification method and emotion classification system | |
CN105095183A (en) | Text emotional tendency determination method and system | |
CN101714135A (en) | Emotional orientation analytical method of cross-domain texts | |
Foong et al. | Text summarization using latent semantic analysis model in mobile android platform | |
CN106202530A (en) | Data processing method and device | |
Sun et al. | Twitter part-of-speech tagging using pre-classification Hidden Markov model | |
Kang et al. | Utilization strategy of user engagements in korean fake news detection | |
CN110851733A (en) | Community discovery and emotion interpretation method based on network topology and document content | |
CN117474126A (en) | LLaMa2 big data model design method for initial examination and evaluation of manuscript | |
Medagoda et al. | Keywords based temporal sentiment analysis | |
Xu et al. | KDSTM: Neural Semi-supervised Topic Modeling with Knowledge Distillation |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
ASS | Succession or assignment of patent right |
Owner name: NANTONG LONGXIANG ELECTRICAL EQUIPMENT CO., LTD. Free format text: FORMER OWNER: NORTHWESTERN POLYTECHNICAL UNIVERSITY Effective date: 20140814 Owner name: NORTHWESTERN POLYTECHNICAL UNIVERSITY Effective date: 20140814 |
|
C41 | Transfer of patent application or patent right or utility model | ||
COR | Change of bibliographic data |
Free format text: CORRECT: ADDRESS; FROM: 710072 XI AN, SHAANXI PROVINCE TO: 226600 NANTONG, JIANGSU PROVINCE |
|
TR01 | Transfer of patent right |
Effective date of registration: 20140814 Address after: 226600 No. 69 Donghai Road, Haian Development Zone, Nantong, Jiangsu Patentee after: NANTONG LONGXIANG ELECTRIC EQUIPMENT CO., LTD. Patentee after: Northwestern Polytechnical University Address before: 710072 Xi'an friendship West Road, Shaanxi, No. 127 Patentee before: Northwestern Polytechnical University |