CN106202372A - A kind of method of network text information emotional semantic classification - Google Patents

A kind of method of network text information emotional semantic classification Download PDF

Info

Publication number
CN106202372A
CN106202372A CN201610534277.1A CN201610534277A CN106202372A CN 106202372 A CN106202372 A CN 106202372A CN 201610534277 A CN201610534277 A CN 201610534277A CN 106202372 A CN106202372 A CN 106202372A
Authority
CN
China
Prior art keywords
document
emotional semantic
semantic classification
classification
emotion
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610534277.1A
Other languages
Chinese (zh)
Inventor
姚春华
杨颖�
唐明芳
陈小玉
鄢秋霞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Electronic Technology Cyber Security Co Ltd
Original Assignee
China Electronic Technology Cyber Security Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Electronic Technology Cyber Security Co Ltd filed Critical China Electronic Technology Cyber Security Co Ltd
Priority to CN201610534277.1A priority Critical patent/CN106202372A/en
Publication of CN106202372A publication Critical patent/CN106202372A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A kind of method that the invention discloses network text information emotional semantic classification, comprises the steps: step one, first determines whether whether document belongs to news, if belonging to news, only extracting title and carrying out emotional semantic classification, the most whole document being carried out emotional semantic classification;Step 2, the document classifying needs carry out pretreatment;Step 3, according to text length, document is classified: use TF IDF to calculate feature weight more than the document of 140 characters length and then utilize the logistic regression grader trained to classify;Artificial emotion classifying rules is used to classify less than the document of 140 characters length.Compared with prior art, the positive effect of the present invention is: the inventive method is for long text, the different characteristics of short text, use machine learning algorithm to build grader and formulate the technology path that characteristic of division combines with domain expert, it is possible to accurately and timely find reaction information, sensitive information and the negative report related in network public-opinion.

Description

A kind of method of network text information emotional semantic classification
Technical field
The invention belongs to natural language processing field, a kind of method relating to network text information emotional semantic classification.
Background technology
Network, as a kind of New Media, has been given play to the unimpeded will of the people more and more, has been expressed demand, supervises by public opinionus, joins The effect that political affairs are discussed political affairs and other major and important matters, increasing people utilizes the Internet to express oneself demand at the aspect such as interests, politics, expresses oneself Attitude or view to hot spot of societys such as the people's livelihood, the administration of justice, anti-corruption.Especially when groupment, unexpected incidents occur, People are often transmitted by the Internet in the very first time or obtain information.But some hostile forces utilize the hidden of network simultaneously Cover environment, manufacture the network public opinion deviating from main flow political culture, destroy steady politics and social harmony;The net that minority is unique The people utilize the facility of network to have a mind to spread dummy message even rumour, to reaching the individual purpose of oneself;Some netizens give vent to feelings Thread, delivers vulgar, the speech of Lycoperdon polymorphum Vitt on network.And tradition relies on artificial method to be difficult to tackle the collection of online magnanimity information With study and judge, it is therefore desirable to network public sentiment information is judged by the sensibility classification method of automatization, it is achieved to reaction, sensitivity, negative The emphasis of the public feelings informations such as face finds.
Summary of the invention
In order to overcome the shortcoming of prior art, a kind of method that the invention provides network text information emotional semantic classification.
The technical solution adopted in the present invention is: a kind of method of network text information emotional semantic classification, comprises the steps:
Step one, first determine whether whether document belongs to news, if belonging to news, only extracting title and carrying out emotional semantic classification, The most whole document is carried out emotional semantic classification;
Step 2, the document classifying needs carry out pretreatment;
Step 3, according to text length, document is classified:
(1) then use TF-IDF to calculate feature weight more than the document of 140 characters length utilizes train to patrol Collect recurrence grader to classify;
(2) artificial emotion classifying rules is used to classify less than the document of 140 characters length.
Compared with prior art, the positive effect of the present invention is: the inventive method is for long text, the different spies of short text Point, uses machine learning algorithm to build grader and formulates the technology path that characteristic of division combines with domain expert, it is possible to accurately Find reaction information, sensitive information and the negative report related in network public-opinion in time.
Accompanying drawing explanation
Examples of the present invention will be described by way of reference to the accompanying drawings, wherein:
Fig. 1 is network text information classification schematic diagram;
Fig. 2 is logistic regression classification based training process schematic;
Fig. 3 is artificial formulation emotional semantic classification rule schematic diagram.
Detailed description of the invention
A kind of method of network text information emotional semantic classification, as depicted in figs. 1 and 2, comprises the steps:
Step one, first determine whether whether document belongs to news, if belonging to news, only extracting title and carrying out emotional semantic classification, The most whole document is carried out emotional semantic classification;
Step 2, the document needing classification is carried out pretreatment:
Pretreatment refers to use Chinese lexical analysis system ICTCLAS that text is carried out participle, then filters stop words:
Destination document carried out emotional semantic classification and to first have to carry out data prediction, mainly include participle and remove stop words two Part.The ICTCLAS Chinese word segmentation system of Inst. of Computing Techn. Academia Sinica's exploitation is based on stacking hidden Markov model (Hierarchical Hidden Markov Model), it is possible to effectively by the text success participle of input, and follow part of speech closely Export, and support to import the dictionary that user provides.By ICTCLAS, input text is carried out participle and part-of-speech tagging, Then filter and disable the word in vocabulary, thus remove the word without practical significance, it is achieved data prediction.
Step 3, according to text length, document is classified:
(1) then use TF-IDF to calculate feature weight more than the document of 140 characters length utilizes train to patrol Collect recurrence grader to classify:
(1) feature extraction: by calculating χ2Statistical value selection sort feature, TF-IDF calculates feature weight:
χ2Statistics also referred to as CHI statistics, is the statistics of the dependence for two variablees of tolerance conventional in statistics Amount, characteristic item t and the χ of classification c2Statistical value is calculated as follows:
χ 2 ( t , c ) = N ( A D - B C ) 2 ( A + C ) ( B + D ) ( A + B ) ( C + D )
Wherein N represents that total number of documents, A represent and comprises characteristic item t and belong to the number of files of classification c, and B represents and comprises feature T and be not belonging to the number of files of classification c, C represents and does not comprises characteristic item t and belong to the number of files of classification c, and D represents and do not wraps Containing characteristic item t and the number of files that is not belonging to classification c.
Obtaining t χ in each category2The t χ to all categories can be calculated according to following equation after statistical value2System Meter meansigma methods.
χ 2 A V G ( t ) = Σ i = 1 M P ( t , c i ) χ 2 ( t , c i )
Wherein, P (t, ci) represent that characteristic item t belongs to classification ciProbability.
When carrying out feature selection, set threshold values, only retain χ2Statistical value is more than the lexical item of threshold values as characteristic item.
To give weights, the spy high to discrimination to different characteristic item by TF-IDF method after screening obtains characteristic item Levy item and give bigger weights.TF-IDF weight computing formula is as follows, and wherein tf refers to characteristic item word frequency, and what n represented is bag Containing the number of files of current signature item, N is total number of documents.
W = t f × i d f = t f × l o g N n
(2) training LR grader: use logistic regression algorithm to build grader
Can be the N-dimensional vector in vector space model by text representation by data prediction and feature extraction, will This N-dimensional vector differentiates that result exports as grader as the input of logistic regression grader, document emotion, carries out grader Training.In logistic regression sorting algorithm, using cross entropy cost function to carry out the degree of predictive metrics mistake, computing formula is such as Under, wherein y is desired output, and a is the output that grader is actual, and for avoiding training over-fitting, arranging parameter c is 200, arranges It is 0.01 that gradient descent algorithm learns the step-length δ parameter of J (x) minima.
J ( x ) = - 1 n Σ x [ y log a + ( 1 - y ) l o g ( 1 - a ) ] + c ( w ) 2
Weight vectors w and amount of bias b can be obtained through training, can be according to given input by formula below Example x is calculated its conditional probability distribution P belonging to a certain classification (Y | X), and the maximum classification of probit is then belonging to it Classification.
P ( Y | X ) = e ( w · x + b ) 1 + e ( w · x + b )
(2) artificial emotion classifying rules is used to classify less than the document of 140 characters length:
Domain expert builds seed dictionary of all categories respectively, then extends seed by Chinese thesaurus with word2vector Dictionary, forms feature database of all categories;The impact on word emotion value of COMPREHENSIVE CALCULATING negative word, sentence pattern, it is judged that document emotion value with The artificial size setting emotion threshold values of all categories.
As it is shown on figure 3, use the artificial process formulating text message emotional semantic classification rule to specifically include that
(1) seed words obtains
Weight vectors w can be obtained by machine learning algorithm training text this process of information emotion classifiers, pass through The Feature Words that screening weighted value is bigger can get part classifying seed words, the most comprehensive Northeastern University, Taiwan Univ., China The emotional semantic classification dictionary of Zhi Wangdeng mechanism, obtains final emotional semantic classification seed words.
(2) synonym extension
" Chinese thesaurus extended edition " that Harbin Institute of Technology's social computing is issued with Research into information retrieval center comprises altogether 77, Article 343, word, organizes all entries included together according to tree-shaped hierarchical structure, and provides Pyatyi to encode.According to " with Justice word word woods extended edition " the synonym phrase that provides carries out synonym extension to seed words.
(3) Word2Vector semantic extension
Word2Vector is a efficient work that word is characterized as real number value vector that Google increased income in year in 2013 Tool, utilizes the thought that the degree of depth learns, and the contextual information of bluebeard compound solves the similarity of text semantic.Word2Vector uses The term vector representation of DistributedRepresentation, uses the neutral net of three layers to build language model Mould, obtains word expression in vector space simultaneously.Utilize Word2Vector that existing emotional semantic classification word is carried out semantic similitude Degree calculates, and extracts the higher word of similarity and adds emotional semantic classification dictionary.
(4) according to setting classifying rules, document is classified:
Emotion word in emotional semantic classification dictionary is given different weights, by the relative position information of negative word Yu emotion word Calculate the emotion value of content of the sentence, then in conjunction with the different sentence pattern such as exclamative sentence, confirmative question, interrogative sentence to whole word emotion value shadow Ring, calculate the overall emotion value of sentence, the emotion value of all categories of all for document sentences is added the document emotion value that obtains and sets The threshold values of fixed each emotional category compares, thus judges document emotion.
The inventive method is carried out following accuracy test: obtained by web crawlers and include Sina's microblogging, forum, news Websites etc. 200,000 data is tested, and has been randomly chosen the observation of corresponding number in the result that all algorithms judge (random JAVA code is (int) (Math.random () * n) to sample, and during wherein n is 200,000 test data, algorithm judges Number of samples for respective type).5 emotions of record are studied and judged personnel and are judged the result of same observation sample simultaneously, finally take people The legitimate reading that result is sample that number is most.Data in table 1 are test result, it can be seen that the present invention is for reaction, quick The accuracy that sense, front, negative text emotion are classified is the highest.
Table 1 test of heuristics result table
Emotional category Algorithm marks Algorithm mark pair Accuracy
Negatively 502 385 0.766932271
Front 505 364 0.720792079
Sensitive 715 540 0.755244755
Reaction 39 33 0.846153846

Claims (9)

1. the method for a network text information emotional semantic classification, it is characterised in that: comprise the steps:
Step one, first determine whether whether document belongs to news, if belonging to news, only extracting title and carrying out emotional semantic classification, otherwise Then whole document is carried out emotional semantic classification;
Step 2, the document classifying needs carry out pretreatment;
Step 3, according to text length, document is classified:
(1) then use TF-IDF to calculate feature weight more than the document of 140 characters length utilizes the logic trained to return Grader is returned to classify;
(2) artificial emotion classifying rules is used to classify less than the document of 140 characters length.
The method of a kind of network text information emotional semantic classification the most according to claim 1, it is characterised in that: described in step 2 Pretreatment refer to input text carry out participle and part-of-speech tagging, be then filtered off stop words.
The method of a kind of network text information emotional semantic classification the most according to claim 2, it is characterised in that: use Chinese word Method is analyzed system ICTCLAS and text is carried out pretreatment.
The method of a kind of network text information emotional semantic classification the most according to claim 1, it is characterised in that: described in step 3 The method using TF IDF calculating feature weight then to utilize the logistic regression grader trained to carry out classifying is:
(1) χ of characteristic item t and classification c it is calculated as follows2Statistical value:
χ 2 ( t , c ) = N ( A D - B C ) 2 ( A + C ) ( B + D ) ( A + B ) ( C + D )
Wherein N represents that total number of documents, A represent and comprises characteristic item t and belong to the number of files of classification c, and B represents and comprises characteristic item t And being not belonging to the number of files of classification c, C represents and do not comprises characteristic item t and belong to the number of files of classification c, D represents and does not comprises spy Levy a t and be not belonging to the number of files of classification c;
(2) t χ to all categories it is calculated as follows2Assembly average:
χ 2 A V G ( t ) = Σ i = 1 M P ( t , c i ) χ 2 ( t , c i ) ;
(3) filter out more than the χ setting threshold values2The lexical item of statistical value is as characteristic item;
(4) use TF IDF method to come different characteristic item to give weights, give bigger weights to the characteristic item that discrimination is high;
(5) training LR grader:
It is the N-dimensional vector in vector space model by text representation, using vectorial for this N-dimensional input as logistic regression grader, Document emotion differentiates that result exports as grader, is trained grader obtaining weight vectors w and amount of bias b;
(6) it is calculated its conditional probability distribution belonging to a certain classification according to given input example x The classification of probit maximum is then its generic.
The method of a kind of network text information emotional semantic classification the most according to claim 4, it is characterised in that: (4th) step institute Stating weights uses equation below to calculate:
W = t f × i d f = t f × l o g N n
Wherein tf is characterized a word frequency, and n is the number of files comprising current signature item, and N is total number of documents.
The method of a kind of network text information emotional semantic classification the most according to claim 4, it is characterised in that: at logistic regression In sorting algorithm, following cross entropy cost function is used to carry out the degree of predictive metrics mistake:
J ( x ) = - 1 n Σ x [ y log a + ( 1 - y ) l o g ( 1 - a ) ] + c ( w ) 2
Wherein y is desired output, and a is the output that grader is actual, and c is 200.
The method of a kind of network text information emotional semantic classification the most according to claim 1, it is characterised in that: described in step 3 The method using artificial emotion classifying rules to carry out classifying is:
(1) seed words obtains;
(2) seed words is carried out synonym extension;
(3) utilize Word2Vector that existing emotional semantic classification word is carried out semantic extension;
(4) according to setting classifying rules, document is classified.
The method of a kind of network text information emotional semantic classification the most according to claim 7, it is characterised in that: described seed words Acquisition methods be: obtain weight vectors by machine learning algorithm training text information emotion classifiers, then by screening The Feature Words that weighted value is bigger gets part classifying seed words;The machines such as the most comprehensive Northeastern University, Taiwan Univ., middle National IP Network The emotional semantic classification dictionary of structure, obtains final emotional semantic classification seed words.
The method of a kind of network text information emotional semantic classification the most according to claim 7, it is characterised in that: divide according to setting The method that document is classified by rule-like is: the emotion word in emotional semantic classification dictionary gives different weights, passes through negative word Relative position information with emotion word calculates the emotion value of content of the sentence, then in conjunction with differences such as exclamative sentence, confirmative question, interrogative sentences Whole word emotion value is affected by sentence pattern, calculates the overall emotion value of sentence, the emotion value of all categories of all for document sentences is added The document emotion value obtained compares with the threshold values of each emotional category of setting, thus judges document emotion.
CN201610534277.1A 2016-07-08 2016-07-08 A kind of method of network text information emotional semantic classification Pending CN106202372A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610534277.1A CN106202372A (en) 2016-07-08 2016-07-08 A kind of method of network text information emotional semantic classification

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610534277.1A CN106202372A (en) 2016-07-08 2016-07-08 A kind of method of network text information emotional semantic classification

Publications (1)

Publication Number Publication Date
CN106202372A true CN106202372A (en) 2016-12-07

Family

ID=57473503

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610534277.1A Pending CN106202372A (en) 2016-07-08 2016-07-08 A kind of method of network text information emotional semantic classification

Country Status (1)

Country Link
CN (1) CN106202372A (en)

Cited By (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107066446A (en) * 2017-04-13 2017-08-18 广东工业大学 A kind of Recognition with Recurrent Neural Network text emotion analysis method of embedded logic rules
CN107229731A (en) * 2017-06-08 2017-10-03 百度在线网络技术(北京)有限公司 Method and apparatus for grouped data
CN107562814A (en) * 2017-08-14 2018-01-09 中国农业大学 A kind of earthquake emergency and the condition of a disaster acquisition of information sorting technique and system
CN108228587A (en) * 2016-12-13 2018-06-29 北大方正集团有限公司 Stock discrimination method and Stock discrimination device
CN108628873A (en) * 2017-03-17 2018-10-09 腾讯科技(北京)有限公司 A kind of file classification method, device and equipment
CN108647335A (en) * 2018-05-12 2018-10-12 苏州华必讯信息科技有限公司 Internet public opinion analysis method and apparatus
CN109002473A (en) * 2018-06-13 2018-12-14 天津大学 A kind of sentiment analysis method based on term vector and part of speech
CN109388749A (en) * 2018-09-29 2019-02-26 武汉烽火普天信息技术有限公司 The detection of accurate high-efficiency network public sentiment and method for early warning based on multi-layer geography
WO2019041528A1 (en) * 2017-08-31 2019-03-07 平安科技(深圳)有限公司 Method, electronic apparatus, and computer readable storage medium for determining polarity of news sentiment
CN109634833A (en) * 2017-10-09 2019-04-16 北京京东尚科信息技术有限公司 A kind of Software Defects Predict Methods and device
CN109710732A (en) * 2018-11-19 2019-05-03 东软集团股份有限公司 Information query method, device, storage medium and electronic equipment
CN109918550A (en) * 2019-01-22 2019-06-21 招银云创(深圳)信息技术有限公司 Information monitoring method, device, computer equipment and readable storage medium storing program for executing
CN110334182A (en) * 2019-06-24 2019-10-15 中国南方电网有限责任公司 Online service method with speech emotion recognition
CN110968691A (en) * 2018-09-30 2020-04-07 北京国双科技有限公司 Judicial hotspot determination method and device
CN111046171A (en) * 2019-08-29 2020-04-21 成都信息工程大学 Emotion discrimination method based on fine-grained labeled data
CN111126373A (en) * 2019-12-23 2020-05-08 北京中科神探科技有限公司 Internet short video violation judgment device and method based on cross-modal identification technology
CN111159342A (en) * 2019-12-26 2020-05-15 北京大学 Park text comment emotion scoring method based on machine learning
CN111414520A (en) * 2020-03-19 2020-07-14 南京莱斯网信技术研究院有限公司 Intelligent mining system for sensitive information in public opinion information
CN112307771A (en) * 2020-10-29 2021-02-02 平安科技(深圳)有限公司 Course analysis method, device, equipment and medium based on emotion analysis
CN112685558A (en) * 2019-10-18 2021-04-20 普天信息技术有限公司 Emotion classification model training method and device
CN112836049A (en) * 2021-01-28 2021-05-25 网易(杭州)网络有限公司 Text classification method, device, medium and computing equipment
CN114201959A (en) * 2021-11-16 2022-03-18 湖南长泰工业科技有限公司 Mobile emergency command method

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102023967A (en) * 2010-11-11 2011-04-20 清华大学 Text emotion classifying method in stock field
CN104899335A (en) * 2015-06-25 2015-09-09 四川友联信息技术有限公司 Method for performing sentiment classification on network public sentiment of information

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102023967A (en) * 2010-11-11 2011-04-20 清华大学 Text emotion classifying method in stock field
CN104899335A (en) * 2015-06-25 2015-09-09 四川友联信息技术有限公司 Method for performing sentiment classification on network public sentiment of information

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
张莹: "在线新闻评论的情感分析研究", 《中国博士学位论文全文数据库信息科技辑》 *
李其达: "基于词典的文本情感计算***的设计与实现", 《万方全文数据库》 *
王振浩: "基于情感字典与机器学习相结合的文本情感分类", 《中国优秀硕士学位论文全文数据库信息科技辑》 *

Cited By (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108228587A (en) * 2016-12-13 2018-06-29 北大方正集团有限公司 Stock discrimination method and Stock discrimination device
CN108628873A (en) * 2017-03-17 2018-10-09 腾讯科技(北京)有限公司 A kind of file classification method, device and equipment
CN108628873B (en) * 2017-03-17 2022-09-27 腾讯科技(北京)有限公司 Text classification method, device and equipment
CN107066446A (en) * 2017-04-13 2017-08-18 广东工业大学 A kind of Recognition with Recurrent Neural Network text emotion analysis method of embedded logic rules
CN107229731A (en) * 2017-06-08 2017-10-03 百度在线网络技术(北京)有限公司 Method and apparatus for grouped data
CN107229731B (en) * 2017-06-08 2021-05-25 百度在线网络技术(北京)有限公司 Method and apparatus for classifying data
CN107562814A (en) * 2017-08-14 2018-01-09 中国农业大学 A kind of earthquake emergency and the condition of a disaster acquisition of information sorting technique and system
WO2019041528A1 (en) * 2017-08-31 2019-03-07 平安科技(深圳)有限公司 Method, electronic apparatus, and computer readable storage medium for determining polarity of news sentiment
CN109634833A (en) * 2017-10-09 2019-04-16 北京京东尚科信息技术有限公司 A kind of Software Defects Predict Methods and device
CN108647335A (en) * 2018-05-12 2018-10-12 苏州华必讯信息科技有限公司 Internet public opinion analysis method and apparatus
CN109002473A (en) * 2018-06-13 2018-12-14 天津大学 A kind of sentiment analysis method based on term vector and part of speech
CN109002473B (en) * 2018-06-13 2022-02-11 天津大学 Emotion analysis method based on word vectors and parts of speech
CN109388749A (en) * 2018-09-29 2019-02-26 武汉烽火普天信息技术有限公司 The detection of accurate high-efficiency network public sentiment and method for early warning based on multi-layer geography
CN110968691A (en) * 2018-09-30 2020-04-07 北京国双科技有限公司 Judicial hotspot determination method and device
CN110968691B (en) * 2018-09-30 2023-07-04 北京国双科技有限公司 Judicial hotspot determination method and device
CN109710732B (en) * 2018-11-19 2021-03-05 东软集团股份有限公司 Information query method, device, storage medium and electronic equipment
CN109710732A (en) * 2018-11-19 2019-05-03 东软集团股份有限公司 Information query method, device, storage medium and electronic equipment
CN109918550A (en) * 2019-01-22 2019-06-21 招银云创(深圳)信息技术有限公司 Information monitoring method, device, computer equipment and readable storage medium storing program for executing
CN110334182A (en) * 2019-06-24 2019-10-15 中国南方电网有限责任公司 Online service method with speech emotion recognition
CN111046171A (en) * 2019-08-29 2020-04-21 成都信息工程大学 Emotion discrimination method based on fine-grained labeled data
CN112685558A (en) * 2019-10-18 2021-04-20 普天信息技术有限公司 Emotion classification model training method and device
CN112685558B (en) * 2019-10-18 2024-05-17 普天信息技术有限公司 Training method and device for emotion classification model
CN111126373A (en) * 2019-12-23 2020-05-08 北京中科神探科技有限公司 Internet short video violation judgment device and method based on cross-modal identification technology
CN111159342A (en) * 2019-12-26 2020-05-15 北京大学 Park text comment emotion scoring method based on machine learning
CN111414520B (en) * 2020-03-19 2021-03-19 南京莱斯网信技术研究院有限公司 Intelligent mining system for sensitive information in public opinion information
CN111414520A (en) * 2020-03-19 2020-07-14 南京莱斯网信技术研究院有限公司 Intelligent mining system for sensitive information in public opinion information
CN112307771A (en) * 2020-10-29 2021-02-02 平安科技(深圳)有限公司 Course analysis method, device, equipment and medium based on emotion analysis
CN112836049A (en) * 2021-01-28 2021-05-25 网易(杭州)网络有限公司 Text classification method, device, medium and computing equipment
CN114201959A (en) * 2021-11-16 2022-03-18 湖南长泰工业科技有限公司 Mobile emergency command method

Similar Documents

Publication Publication Date Title
CN106202372A (en) A kind of method of network text information emotional semantic classification
Mahtab et al. Sentiment analysis on bangladesh cricket with support vector machine
CN109829166B (en) People and host customer opinion mining method based on character-level convolutional neural network
CN104778209B (en) A kind of opining mining method for millions scale news analysis
CN107239439A (en) Public sentiment sentiment classification method based on word2vec
CN107193801A (en) A kind of short text characteristic optimization and sentiment analysis method based on depth belief network
CN105843897A (en) Vertical domain-oriented intelligent question and answer system
CN108536801A (en) A kind of civil aviaton's microblogging security public sentiment sentiment analysis method based on deep learning
CN110209818B (en) Semantic sensitive word and sentence oriented analysis method
CN107102976A (en) Entertainment newses autocreating technology and system based on microblogging
Pariyani et al. Hate speech detection in twitter using natural language processing
CN107463703A (en) English social media account number classification method based on information gain
CN112115712B (en) Topic-based group emotion analysis method
Haque et al. Opinion mining from bangla and phonetic bangla reviews using vectorization methods
CN104794209B (en) Chinese microblogging mood sorting technique based on Markov logical network and system
CN115329085A (en) Social robot classification method and system
Chader et al. Sentiment Analysis for Arabizi: Application to Algerian Dialect.
Jagadeesan et al. Twitter Sentiment Analysis with Machine Learning
Kanev et al. Sentiment analysis of multilingual texts using machine learning methods
Campbell et al. Content+ context networks for user classification in twitter
Tohabar et al. Bengali fake news detection using machine learning and effectiveness of sentiment as a feature
Devi et al. Racist tweets-based sentiment analysis using individual and ensemble classifiers
Mohsen et al. A performance comparison of machine learning classifiers for Covid-19 Arabic Quarantine tweets sentiment analysis
Munarko et al. Named entity recognition model for Indonesian tweet using CRF classifier
Alhammadi Using machine learning in disaster tweets classification

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20161207