CN106202372A - A kind of method of network text information emotional semantic classification - Google Patents
A kind of method of network text information emotional semantic classification Download PDFInfo
- Publication number
- CN106202372A CN106202372A CN201610534277.1A CN201610534277A CN106202372A CN 106202372 A CN106202372 A CN 106202372A CN 201610534277 A CN201610534277 A CN 201610534277A CN 106202372 A CN106202372 A CN 106202372A
- Authority
- CN
- China
- Prior art keywords
- document
- emotional semantic
- semantic classification
- classification
- emotion
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
A kind of method that the invention discloses network text information emotional semantic classification, comprises the steps: step one, first determines whether whether document belongs to news, if belonging to news, only extracting title and carrying out emotional semantic classification, the most whole document being carried out emotional semantic classification;Step 2, the document classifying needs carry out pretreatment;Step 3, according to text length, document is classified: use TF IDF to calculate feature weight more than the document of 140 characters length and then utilize the logistic regression grader trained to classify;Artificial emotion classifying rules is used to classify less than the document of 140 characters length.Compared with prior art, the positive effect of the present invention is: the inventive method is for long text, the different characteristics of short text, use machine learning algorithm to build grader and formulate the technology path that characteristic of division combines with domain expert, it is possible to accurately and timely find reaction information, sensitive information and the negative report related in network public-opinion.
Description
Technical field
The invention belongs to natural language processing field, a kind of method relating to network text information emotional semantic classification.
Background technology
Network, as a kind of New Media, has been given play to the unimpeded will of the people more and more, has been expressed demand, supervises by public opinionus, joins
The effect that political affairs are discussed political affairs and other major and important matters, increasing people utilizes the Internet to express oneself demand at the aspect such as interests, politics, expresses oneself
Attitude or view to hot spot of societys such as the people's livelihood, the administration of justice, anti-corruption.Especially when groupment, unexpected incidents occur,
People are often transmitted by the Internet in the very first time or obtain information.But some hostile forces utilize the hidden of network simultaneously
Cover environment, manufacture the network public opinion deviating from main flow political culture, destroy steady politics and social harmony;The net that minority is unique
The people utilize the facility of network to have a mind to spread dummy message even rumour, to reaching the individual purpose of oneself;Some netizens give vent to feelings
Thread, delivers vulgar, the speech of Lycoperdon polymorphum Vitt on network.And tradition relies on artificial method to be difficult to tackle the collection of online magnanimity information
With study and judge, it is therefore desirable to network public sentiment information is judged by the sensibility classification method of automatization, it is achieved to reaction, sensitivity, negative
The emphasis of the public feelings informations such as face finds.
Summary of the invention
In order to overcome the shortcoming of prior art, a kind of method that the invention provides network text information emotional semantic classification.
The technical solution adopted in the present invention is: a kind of method of network text information emotional semantic classification, comprises the steps:
Step one, first determine whether whether document belongs to news, if belonging to news, only extracting title and carrying out emotional semantic classification,
The most whole document is carried out emotional semantic classification;
Step 2, the document classifying needs carry out pretreatment;
Step 3, according to text length, document is classified:
(1) then use TF-IDF to calculate feature weight more than the document of 140 characters length utilizes train to patrol
Collect recurrence grader to classify;
(2) artificial emotion classifying rules is used to classify less than the document of 140 characters length.
Compared with prior art, the positive effect of the present invention is: the inventive method is for long text, the different spies of short text
Point, uses machine learning algorithm to build grader and formulates the technology path that characteristic of division combines with domain expert, it is possible to accurately
Find reaction information, sensitive information and the negative report related in network public-opinion in time.
Accompanying drawing explanation
Examples of the present invention will be described by way of reference to the accompanying drawings, wherein:
Fig. 1 is network text information classification schematic diagram;
Fig. 2 is logistic regression classification based training process schematic;
Fig. 3 is artificial formulation emotional semantic classification rule schematic diagram.
Detailed description of the invention
A kind of method of network text information emotional semantic classification, as depicted in figs. 1 and 2, comprises the steps:
Step one, first determine whether whether document belongs to news, if belonging to news, only extracting title and carrying out emotional semantic classification,
The most whole document is carried out emotional semantic classification;
Step 2, the document needing classification is carried out pretreatment:
Pretreatment refers to use Chinese lexical analysis system ICTCLAS that text is carried out participle, then filters stop words:
Destination document carried out emotional semantic classification and to first have to carry out data prediction, mainly include participle and remove stop words two
Part.The ICTCLAS Chinese word segmentation system of Inst. of Computing Techn. Academia Sinica's exploitation is based on stacking hidden Markov model
(Hierarchical Hidden Markov Model), it is possible to effectively by the text success participle of input, and follow part of speech closely
Export, and support to import the dictionary that user provides.By ICTCLAS, input text is carried out participle and part-of-speech tagging,
Then filter and disable the word in vocabulary, thus remove the word without practical significance, it is achieved data prediction.
Step 3, according to text length, document is classified:
(1) then use TF-IDF to calculate feature weight more than the document of 140 characters length utilizes train to patrol
Collect recurrence grader to classify:
(1) feature extraction: by calculating χ2Statistical value selection sort feature, TF-IDF calculates feature weight:
χ2Statistics also referred to as CHI statistics, is the statistics of the dependence for two variablees of tolerance conventional in statistics
Amount, characteristic item t and the χ of classification c2Statistical value is calculated as follows:
Wherein N represents that total number of documents, A represent and comprises characteristic item t and belong to the number of files of classification c, and B represents and comprises feature
T and be not belonging to the number of files of classification c, C represents and does not comprises characteristic item t and belong to the number of files of classification c, and D represents and do not wraps
Containing characteristic item t and the number of files that is not belonging to classification c.
Obtaining t χ in each category2The t χ to all categories can be calculated according to following equation after statistical value2System
Meter meansigma methods.
Wherein, P (t, ci) represent that characteristic item t belongs to classification ciProbability.
When carrying out feature selection, set threshold values, only retain χ2Statistical value is more than the lexical item of threshold values as characteristic item.
To give weights, the spy high to discrimination to different characteristic item by TF-IDF method after screening obtains characteristic item
Levy item and give bigger weights.TF-IDF weight computing formula is as follows, and wherein tf refers to characteristic item word frequency, and what n represented is bag
Containing the number of files of current signature item, N is total number of documents.
(2) training LR grader: use logistic regression algorithm to build grader
Can be the N-dimensional vector in vector space model by text representation by data prediction and feature extraction, will
This N-dimensional vector differentiates that result exports as grader as the input of logistic regression grader, document emotion, carries out grader
Training.In logistic regression sorting algorithm, using cross entropy cost function to carry out the degree of predictive metrics mistake, computing formula is such as
Under, wherein y is desired output, and a is the output that grader is actual, and for avoiding training over-fitting, arranging parameter c is 200, arranges
It is 0.01 that gradient descent algorithm learns the step-length δ parameter of J (x) minima.
Weight vectors w and amount of bias b can be obtained through training, can be according to given input by formula below
Example x is calculated its conditional probability distribution P belonging to a certain classification (Y | X), and the maximum classification of probit is then belonging to it
Classification.
(2) artificial emotion classifying rules is used to classify less than the document of 140 characters length:
Domain expert builds seed dictionary of all categories respectively, then extends seed by Chinese thesaurus with word2vector
Dictionary, forms feature database of all categories;The impact on word emotion value of COMPREHENSIVE CALCULATING negative word, sentence pattern, it is judged that document emotion value with
The artificial size setting emotion threshold values of all categories.
As it is shown on figure 3, use the artificial process formulating text message emotional semantic classification rule to specifically include that
(1) seed words obtains
Weight vectors w can be obtained by machine learning algorithm training text this process of information emotion classifiers, pass through
The Feature Words that screening weighted value is bigger can get part classifying seed words, the most comprehensive Northeastern University, Taiwan Univ., China
The emotional semantic classification dictionary of Zhi Wangdeng mechanism, obtains final emotional semantic classification seed words.
(2) synonym extension
" Chinese thesaurus extended edition " that Harbin Institute of Technology's social computing is issued with Research into information retrieval center comprises altogether 77,
Article 343, word, organizes all entries included together according to tree-shaped hierarchical structure, and provides Pyatyi to encode.According to " with
Justice word word woods extended edition " the synonym phrase that provides carries out synonym extension to seed words.
(3) Word2Vector semantic extension
Word2Vector is a efficient work that word is characterized as real number value vector that Google increased income in year in 2013
Tool, utilizes the thought that the degree of depth learns, and the contextual information of bluebeard compound solves the similarity of text semantic.Word2Vector uses
The term vector representation of DistributedRepresentation, uses the neutral net of three layers to build language model
Mould, obtains word expression in vector space simultaneously.Utilize Word2Vector that existing emotional semantic classification word is carried out semantic similitude
Degree calculates, and extracts the higher word of similarity and adds emotional semantic classification dictionary.
(4) according to setting classifying rules, document is classified:
Emotion word in emotional semantic classification dictionary is given different weights, by the relative position information of negative word Yu emotion word
Calculate the emotion value of content of the sentence, then in conjunction with the different sentence pattern such as exclamative sentence, confirmative question, interrogative sentence to whole word emotion value shadow
Ring, calculate the overall emotion value of sentence, the emotion value of all categories of all for document sentences is added the document emotion value that obtains and sets
The threshold values of fixed each emotional category compares, thus judges document emotion.
The inventive method is carried out following accuracy test: obtained by web crawlers and include Sina's microblogging, forum, news
Websites etc. 200,000 data is tested, and has been randomly chosen the observation of corresponding number in the result that all algorithms judge
(random JAVA code is (int) (Math.random () * n) to sample, and during wherein n is 200,000 test data, algorithm judges
Number of samples for respective type).5 emotions of record are studied and judged personnel and are judged the result of same observation sample simultaneously, finally take people
The legitimate reading that result is sample that number is most.Data in table 1 are test result, it can be seen that the present invention is for reaction, quick
The accuracy that sense, front, negative text emotion are classified is the highest.
Table 1 test of heuristics result table
Emotional category | Algorithm marks | Algorithm mark pair | Accuracy |
Negatively | 502 | 385 | 0.766932271 |
Front | 505 | 364 | 0.720792079 |
Sensitive | 715 | 540 | 0.755244755 |
Reaction | 39 | 33 | 0.846153846 |
Claims (9)
1. the method for a network text information emotional semantic classification, it is characterised in that: comprise the steps:
Step one, first determine whether whether document belongs to news, if belonging to news, only extracting title and carrying out emotional semantic classification, otherwise
Then whole document is carried out emotional semantic classification;
Step 2, the document classifying needs carry out pretreatment;
Step 3, according to text length, document is classified:
(1) then use TF-IDF to calculate feature weight more than the document of 140 characters length utilizes the logic trained to return
Grader is returned to classify;
(2) artificial emotion classifying rules is used to classify less than the document of 140 characters length.
The method of a kind of network text information emotional semantic classification the most according to claim 1, it is characterised in that: described in step 2
Pretreatment refer to input text carry out participle and part-of-speech tagging, be then filtered off stop words.
The method of a kind of network text information emotional semantic classification the most according to claim 2, it is characterised in that: use Chinese word
Method is analyzed system ICTCLAS and text is carried out pretreatment.
The method of a kind of network text information emotional semantic classification the most according to claim 1, it is characterised in that: described in step 3
The method using TF IDF calculating feature weight then to utilize the logistic regression grader trained to carry out classifying is:
(1) χ of characteristic item t and classification c it is calculated as follows2Statistical value:
Wherein N represents that total number of documents, A represent and comprises characteristic item t and belong to the number of files of classification c, and B represents and comprises characteristic item t
And being not belonging to the number of files of classification c, C represents and do not comprises characteristic item t and belong to the number of files of classification c, D represents and does not comprises spy
Levy a t and be not belonging to the number of files of classification c;
(2) t χ to all categories it is calculated as follows2Assembly average:
(3) filter out more than the χ setting threshold values2The lexical item of statistical value is as characteristic item;
(4) use TF IDF method to come different characteristic item to give weights, give bigger weights to the characteristic item that discrimination is high;
(5) training LR grader:
It is the N-dimensional vector in vector space model by text representation, using vectorial for this N-dimensional input as logistic regression grader,
Document emotion differentiates that result exports as grader, is trained grader obtaining weight vectors w and amount of bias b;
(6) it is calculated its conditional probability distribution belonging to a certain classification according to given input example x
The classification of probit maximum is then its generic.
The method of a kind of network text information emotional semantic classification the most according to claim 4, it is characterised in that: (4th) step institute
Stating weights uses equation below to calculate:
Wherein tf is characterized a word frequency, and n is the number of files comprising current signature item, and N is total number of documents.
The method of a kind of network text information emotional semantic classification the most according to claim 4, it is characterised in that: at logistic regression
In sorting algorithm, following cross entropy cost function is used to carry out the degree of predictive metrics mistake:
Wherein y is desired output, and a is the output that grader is actual, and c is 200.
The method of a kind of network text information emotional semantic classification the most according to claim 1, it is characterised in that: described in step 3
The method using artificial emotion classifying rules to carry out classifying is:
(1) seed words obtains;
(2) seed words is carried out synonym extension;
(3) utilize Word2Vector that existing emotional semantic classification word is carried out semantic extension;
(4) according to setting classifying rules, document is classified.
The method of a kind of network text information emotional semantic classification the most according to claim 7, it is characterised in that: described seed words
Acquisition methods be: obtain weight vectors by machine learning algorithm training text information emotion classifiers, then by screening
The Feature Words that weighted value is bigger gets part classifying seed words;The machines such as the most comprehensive Northeastern University, Taiwan Univ., middle National IP Network
The emotional semantic classification dictionary of structure, obtains final emotional semantic classification seed words.
The method of a kind of network text information emotional semantic classification the most according to claim 7, it is characterised in that: divide according to setting
The method that document is classified by rule-like is: the emotion word in emotional semantic classification dictionary gives different weights, passes through negative word
Relative position information with emotion word calculates the emotion value of content of the sentence, then in conjunction with differences such as exclamative sentence, confirmative question, interrogative sentences
Whole word emotion value is affected by sentence pattern, calculates the overall emotion value of sentence, the emotion value of all categories of all for document sentences is added
The document emotion value obtained compares with the threshold values of each emotional category of setting, thus judges document emotion.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610534277.1A CN106202372A (en) | 2016-07-08 | 2016-07-08 | A kind of method of network text information emotional semantic classification |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610534277.1A CN106202372A (en) | 2016-07-08 | 2016-07-08 | A kind of method of network text information emotional semantic classification |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106202372A true CN106202372A (en) | 2016-12-07 |
Family
ID=57473503
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610534277.1A Pending CN106202372A (en) | 2016-07-08 | 2016-07-08 | A kind of method of network text information emotional semantic classification |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106202372A (en) |
Cited By (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107066446A (en) * | 2017-04-13 | 2017-08-18 | 广东工业大学 | A kind of Recognition with Recurrent Neural Network text emotion analysis method of embedded logic rules |
CN107229731A (en) * | 2017-06-08 | 2017-10-03 | 百度在线网络技术(北京)有限公司 | Method and apparatus for grouped data |
CN107562814A (en) * | 2017-08-14 | 2018-01-09 | 中国农业大学 | A kind of earthquake emergency and the condition of a disaster acquisition of information sorting technique and system |
CN108228587A (en) * | 2016-12-13 | 2018-06-29 | 北大方正集团有限公司 | Stock discrimination method and Stock discrimination device |
CN108628873A (en) * | 2017-03-17 | 2018-10-09 | 腾讯科技(北京)有限公司 | A kind of file classification method, device and equipment |
CN108647335A (en) * | 2018-05-12 | 2018-10-12 | 苏州华必讯信息科技有限公司 | Internet public opinion analysis method and apparatus |
CN109002473A (en) * | 2018-06-13 | 2018-12-14 | 天津大学 | A kind of sentiment analysis method based on term vector and part of speech |
CN109388749A (en) * | 2018-09-29 | 2019-02-26 | 武汉烽火普天信息技术有限公司 | The detection of accurate high-efficiency network public sentiment and method for early warning based on multi-layer geography |
WO2019041528A1 (en) * | 2017-08-31 | 2019-03-07 | 平安科技(深圳)有限公司 | Method, electronic apparatus, and computer readable storage medium for determining polarity of news sentiment |
CN109634833A (en) * | 2017-10-09 | 2019-04-16 | 北京京东尚科信息技术有限公司 | A kind of Software Defects Predict Methods and device |
CN109710732A (en) * | 2018-11-19 | 2019-05-03 | 东软集团股份有限公司 | Information query method, device, storage medium and electronic equipment |
CN109918550A (en) * | 2019-01-22 | 2019-06-21 | 招银云创(深圳)信息技术有限公司 | Information monitoring method, device, computer equipment and readable storage medium storing program for executing |
CN110334182A (en) * | 2019-06-24 | 2019-10-15 | 中国南方电网有限责任公司 | Online service method with speech emotion recognition |
CN110968691A (en) * | 2018-09-30 | 2020-04-07 | 北京国双科技有限公司 | Judicial hotspot determination method and device |
CN111046171A (en) * | 2019-08-29 | 2020-04-21 | 成都信息工程大学 | Emotion discrimination method based on fine-grained labeled data |
CN111126373A (en) * | 2019-12-23 | 2020-05-08 | 北京中科神探科技有限公司 | Internet short video violation judgment device and method based on cross-modal identification technology |
CN111159342A (en) * | 2019-12-26 | 2020-05-15 | 北京大学 | Park text comment emotion scoring method based on machine learning |
CN111414520A (en) * | 2020-03-19 | 2020-07-14 | 南京莱斯网信技术研究院有限公司 | Intelligent mining system for sensitive information in public opinion information |
CN112307771A (en) * | 2020-10-29 | 2021-02-02 | 平安科技(深圳)有限公司 | Course analysis method, device, equipment and medium based on emotion analysis |
CN112685558A (en) * | 2019-10-18 | 2021-04-20 | 普天信息技术有限公司 | Emotion classification model training method and device |
CN112836049A (en) * | 2021-01-28 | 2021-05-25 | 网易(杭州)网络有限公司 | Text classification method, device, medium and computing equipment |
CN114201959A (en) * | 2021-11-16 | 2022-03-18 | 湖南长泰工业科技有限公司 | Mobile emergency command method |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102023967A (en) * | 2010-11-11 | 2011-04-20 | 清华大学 | Text emotion classifying method in stock field |
CN104899335A (en) * | 2015-06-25 | 2015-09-09 | 四川友联信息技术有限公司 | Method for performing sentiment classification on network public sentiment of information |
-
2016
- 2016-07-08 CN CN201610534277.1A patent/CN106202372A/en active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102023967A (en) * | 2010-11-11 | 2011-04-20 | 清华大学 | Text emotion classifying method in stock field |
CN104899335A (en) * | 2015-06-25 | 2015-09-09 | 四川友联信息技术有限公司 | Method for performing sentiment classification on network public sentiment of information |
Non-Patent Citations (3)
Title |
---|
张莹: "在线新闻评论的情感分析研究", 《中国博士学位论文全文数据库信息科技辑》 * |
李其达: "基于词典的文本情感计算***的设计与实现", 《万方全文数据库》 * |
王振浩: "基于情感字典与机器学习相结合的文本情感分类", 《中国优秀硕士学位论文全文数据库信息科技辑》 * |
Cited By (29)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108228587A (en) * | 2016-12-13 | 2018-06-29 | 北大方正集团有限公司 | Stock discrimination method and Stock discrimination device |
CN108628873A (en) * | 2017-03-17 | 2018-10-09 | 腾讯科技(北京)有限公司 | A kind of file classification method, device and equipment |
CN108628873B (en) * | 2017-03-17 | 2022-09-27 | 腾讯科技(北京)有限公司 | Text classification method, device and equipment |
CN107066446A (en) * | 2017-04-13 | 2017-08-18 | 广东工业大学 | A kind of Recognition with Recurrent Neural Network text emotion analysis method of embedded logic rules |
CN107229731A (en) * | 2017-06-08 | 2017-10-03 | 百度在线网络技术(北京)有限公司 | Method and apparatus for grouped data |
CN107229731B (en) * | 2017-06-08 | 2021-05-25 | 百度在线网络技术(北京)有限公司 | Method and apparatus for classifying data |
CN107562814A (en) * | 2017-08-14 | 2018-01-09 | 中国农业大学 | A kind of earthquake emergency and the condition of a disaster acquisition of information sorting technique and system |
WO2019041528A1 (en) * | 2017-08-31 | 2019-03-07 | 平安科技(深圳)有限公司 | Method, electronic apparatus, and computer readable storage medium for determining polarity of news sentiment |
CN109634833A (en) * | 2017-10-09 | 2019-04-16 | 北京京东尚科信息技术有限公司 | A kind of Software Defects Predict Methods and device |
CN108647335A (en) * | 2018-05-12 | 2018-10-12 | 苏州华必讯信息科技有限公司 | Internet public opinion analysis method and apparatus |
CN109002473A (en) * | 2018-06-13 | 2018-12-14 | 天津大学 | A kind of sentiment analysis method based on term vector and part of speech |
CN109002473B (en) * | 2018-06-13 | 2022-02-11 | 天津大学 | Emotion analysis method based on word vectors and parts of speech |
CN109388749A (en) * | 2018-09-29 | 2019-02-26 | 武汉烽火普天信息技术有限公司 | The detection of accurate high-efficiency network public sentiment and method for early warning based on multi-layer geography |
CN110968691A (en) * | 2018-09-30 | 2020-04-07 | 北京国双科技有限公司 | Judicial hotspot determination method and device |
CN110968691B (en) * | 2018-09-30 | 2023-07-04 | 北京国双科技有限公司 | Judicial hotspot determination method and device |
CN109710732B (en) * | 2018-11-19 | 2021-03-05 | 东软集团股份有限公司 | Information query method, device, storage medium and electronic equipment |
CN109710732A (en) * | 2018-11-19 | 2019-05-03 | 东软集团股份有限公司 | Information query method, device, storage medium and electronic equipment |
CN109918550A (en) * | 2019-01-22 | 2019-06-21 | 招银云创(深圳)信息技术有限公司 | Information monitoring method, device, computer equipment and readable storage medium storing program for executing |
CN110334182A (en) * | 2019-06-24 | 2019-10-15 | 中国南方电网有限责任公司 | Online service method with speech emotion recognition |
CN111046171A (en) * | 2019-08-29 | 2020-04-21 | 成都信息工程大学 | Emotion discrimination method based on fine-grained labeled data |
CN112685558A (en) * | 2019-10-18 | 2021-04-20 | 普天信息技术有限公司 | Emotion classification model training method and device |
CN112685558B (en) * | 2019-10-18 | 2024-05-17 | 普天信息技术有限公司 | Training method and device for emotion classification model |
CN111126373A (en) * | 2019-12-23 | 2020-05-08 | 北京中科神探科技有限公司 | Internet short video violation judgment device and method based on cross-modal identification technology |
CN111159342A (en) * | 2019-12-26 | 2020-05-15 | 北京大学 | Park text comment emotion scoring method based on machine learning |
CN111414520B (en) * | 2020-03-19 | 2021-03-19 | 南京莱斯网信技术研究院有限公司 | Intelligent mining system for sensitive information in public opinion information |
CN111414520A (en) * | 2020-03-19 | 2020-07-14 | 南京莱斯网信技术研究院有限公司 | Intelligent mining system for sensitive information in public opinion information |
CN112307771A (en) * | 2020-10-29 | 2021-02-02 | 平安科技(深圳)有限公司 | Course analysis method, device, equipment and medium based on emotion analysis |
CN112836049A (en) * | 2021-01-28 | 2021-05-25 | 网易(杭州)网络有限公司 | Text classification method, device, medium and computing equipment |
CN114201959A (en) * | 2021-11-16 | 2022-03-18 | 湖南长泰工业科技有限公司 | Mobile emergency command method |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106202372A (en) | A kind of method of network text information emotional semantic classification | |
Mahtab et al. | Sentiment analysis on bangladesh cricket with support vector machine | |
CN109829166B (en) | People and host customer opinion mining method based on character-level convolutional neural network | |
CN104778209B (en) | A kind of opining mining method for millions scale news analysis | |
CN107239439A (en) | Public sentiment sentiment classification method based on word2vec | |
CN107193801A (en) | A kind of short text characteristic optimization and sentiment analysis method based on depth belief network | |
CN105843897A (en) | Vertical domain-oriented intelligent question and answer system | |
CN108536801A (en) | A kind of civil aviaton's microblogging security public sentiment sentiment analysis method based on deep learning | |
CN110209818B (en) | Semantic sensitive word and sentence oriented analysis method | |
CN107102976A (en) | Entertainment newses autocreating technology and system based on microblogging | |
Pariyani et al. | Hate speech detection in twitter using natural language processing | |
CN107463703A (en) | English social media account number classification method based on information gain | |
CN112115712B (en) | Topic-based group emotion analysis method | |
Haque et al. | Opinion mining from bangla and phonetic bangla reviews using vectorization methods | |
CN104794209B (en) | Chinese microblogging mood sorting technique based on Markov logical network and system | |
CN115329085A (en) | Social robot classification method and system | |
Chader et al. | Sentiment Analysis for Arabizi: Application to Algerian Dialect. | |
Jagadeesan et al. | Twitter Sentiment Analysis with Machine Learning | |
Kanev et al. | Sentiment analysis of multilingual texts using machine learning methods | |
Campbell et al. | Content+ context networks for user classification in twitter | |
Tohabar et al. | Bengali fake news detection using machine learning and effectiveness of sentiment as a feature | |
Devi et al. | Racist tweets-based sentiment analysis using individual and ensemble classifiers | |
Mohsen et al. | A performance comparison of machine learning classifiers for Covid-19 Arabic Quarantine tweets sentiment analysis | |
Munarko et al. | Named entity recognition model for Indonesian tweet using CRF classifier | |
Alhammadi | Using machine learning in disaster tweets classification |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20161207 |