CN107203641A - A kind of method of the collection of Internet traffic public feelings information and processing - Google Patents

A kind of method of the collection of Internet traffic public feelings information and processing Download PDF

Info

Publication number
CN107203641A
CN107203641A CN201710461333.8A CN201710461333A CN107203641A CN 107203641 A CN107203641 A CN 107203641A CN 201710461333 A CN201710461333 A CN 201710461333A CN 107203641 A CN107203641 A CN 107203641A
Authority
CN
China
Prior art keywords
traffic
public
processing
feelings information
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710461333.8A
Other languages
Chinese (zh)
Inventor
常思阳
刘瑞伟
王亚利
张奕
赵新勇
王锐锋
孙建宏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing E Hualu Information Technology Co Ltd
Original Assignee
Beijing E Hualu Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing E Hualu Information Technology Co Ltd filed Critical Beijing E Hualu Information Technology Co Ltd
Priority to CN201710461333.8A priority Critical patent/CN107203641A/en
Publication of CN107203641A publication Critical patent/CN107203641A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/097Supervising of traffic control systems, e.g. by giving an alarm if two crossing streets have green light simultaneously
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2216/00Indexing scheme relating to additional aspects of information retrieval not explicitly covered by G06F16/00 and subgroups
    • G06F2216/03Data mining

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Tourism & Hospitality (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • Human Resources & Organizations (AREA)
  • General Business, Economics & Management (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Educational Administration (AREA)
  • Development Economics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses the method for a kind of collection of Internet traffic public feelings information and processing, including traffic public feelings information in the Internet media is scanned for, monitors and downloaded according to default focus dictionary by data acquisition platform, it is stored in public feelings information storehouse and is delivered to data processing platform (DPP);The data processing platform (DPP) filtered by the traffic public feelings information to download according to pre-provisioning request, analyze and processing forms the traffic public sentiment content for meeting manager's business needs, is stored in public feelings information storehouse and is delivered to business processing and data study and judge platform;The business processing and data study and judge platform according to the traffic public sentiment content that meets manager's business needs being handled and studied and judged the need for manager, and result is stored in the public feelings information storehouse and business handling module and monitoring, early warning and reporting modules are delivered to, finally realize alert public sentiment examination & verification, public sentiment group, public sentiment processing and big data analytic function.

Description

A kind of method of the collection of Internet traffic public feelings information and processing
Technical field
The present invention relates to field of information acquisition, the side of the more particularly to a kind of collection of Internet traffic public feelings information and processing Method.
Background technology
With the rapid development of economy, Traffic Development speedup, traffic news information content is increasing, traffic administration institute Door is increasingly strong to the demand for controlling current traffic information and public sentiment.Be in the prior art by following two technologies to website, The internet news such as microblogging, wechat are collected.
Web crawlers is a kind of program or script according to certain rule, automatically crawl internet information, they It is widely used in internet search engine or other similar websites, can be with all pages that it is able to access that of automatic data collection Hold, to obtain or update the content and retrieval mode of these websites.Functionally, reptile is generally divided into data acquisition, place Reason, stores three parts.
Traditional reptile obtains the URL on Initial page since the URL of one or several Initial pages, in crawl webpage During, new URL is constantly extracted from current page and is put into queue, certain stop condition until meeting system.Focus on The workflow of reptile is complex, it is necessary to linked according to certain web page analysis algorithm filtering is unrelated with theme, remains with Link simultaneously puts it into the URL queues for waiting crawl.Then, under it will be selected according to certain search strategy from queue The one step webpage URL to be captured, and said process is repeated, stop when reaching a certain condition of system.In addition, all climbed The webpage of worm crawl will be stored by system, carry out certain analysis, filtering, and set up index, inquiry and inspection so as to after Rope;For focused crawler, the analysis result obtained by this process be also possible to later crawl process provide feedback and Instruct.
User's obtainable information from internet is contained from technical data, business information to news report, amusement money The document of plurality of classes and the forms such as news, constitute an exception it is huge there is isomerism, the distributed number of open characteristics According to storehouse, and deposited in this database is non-structured text data.With reference to the natural language in field of artificial intelligence research Speech understands and Computational Linguistics that data mining needs two key technologies:Web Mining and text mining.
Web Mining lays particular emphasis on the analysis data related to webpage is excavated, including text, link structure and acess control are (most End form navigates into user network).A variety of different data types are contained in one webpage, therefore Web Mining just contains text Data mining, image mining etc. in this excavation, database.
Text mining is to extract valuable knowledge that is effective, novel, useful, intelligible, being dispersed in text, and And utilize the process of these knowledge preferably organizational information.Text mining includes text collection, text analyzing, feature trimming, text The key technologies such as shelves cluster, document classification.
But web crawlers technology of the prior art, specific aim is strong, lacks the thematic dictionary branch of accurate industrial hot spot Support, obtains content various, it is impossible to effectively gathered for traffic category information.Network and text data digging are used as emerging skill Art, lacks the customization model based on traffic service, it is impossible in time, effectively handle and analyze effective traffic category information.
The traffic public feelings information in internet is gathered in the prior art and the method for processing is by website, microblogging, micro- The internet news channels such as letter, go to obtain the related public sentiment news content of specific public safety traffic management department and to news public sentiment class The monitoring and management of content, the process mainly studied and judged from information monitoring, data acquisition, content analysis to business processing, statistics are adopted Taking or semi-artificial, the processing of half system mode, it is impossible to realize automation comprehensively.
Therefore, how to quick comprehensive grasp, monitoring and management traffic news public sentiment, there is provided more preferable modernization branch Hold, the problem of just turning into those skilled in the art's urgent need to resolve.
The content of the invention
It is existing to overcome it is an object of the invention to provide the method for a kind of collection of Internet traffic public feelings information and processing Drawbacks described above present in technology.
A kind of collection of Internet traffic public feelings information and the method for processing, seek to solve from information monitoring, data acquisition, The full-automatic process that content analysis is studied and judged to business processing, statistics.By building public sentiment monitoring and management platform, to realize carriage The automation process of feelings information, most timely, maximally effective traffic public sentiment content and analysis result are provided for traffic administration person. It is that public safety traffic management layer grasps society comprehensively by finding, obtaining the public feelings information of correlation, and quick personnel assigned processing in time The feelings will of the people, public sentiment dynamic makes right opinion guiding.By analyzing mass data, weak link is found, is provided for decision-making level Specific aim is helped.
To achieve the above object, a kind of method that the present invention provides Internet traffic public feelings information collection and processing, it is wrapped Data acquisition platform is included, data processing platform (DPP), business processing and data study and judge platform, and monitoring, early warning and reporting modules, business are done Manage module and public feelings information library module;The data acquisition platform, data processing platform (DPP), business processing and data study and judge platform according to Secondary electrical connection, and above three platform electrically connects with the public feelings information storehouse, the business processing and data study and judge platform electricity Connection monitoring, early warning and reporting modules and business handling module;The side of the collection of Internet traffic public feelings information and processing Method includes scanning for traffic public feelings information in the Internet media according to default focus dictionary by data acquisition platform, supervising Control and download, are stored in the public feelings information storehouse and are delivered to data processing platform (DPP);It is right that the data processing platform (DPP) passes through Download the traffic public feelings information filtered according to pre-provisioning request, analyze and processing formation meet manager's business needs Traffic public sentiment content, is stored in the public feelings information storehouse and is delivered to business processing and data study and judge platform;The business Processing and data study and judge platform according to the need for manager to meeting the traffic public sentiment content progress of manager's business needs Handle and study and judge, and result is stored in the public feelings information storehouse and business handling module and monitoring, early warning and report is delivered to Module is accused, alert public sentiment examination & verification, public sentiment group, public sentiment processing and big data analytic function is finally realized.
Preferably, the data acquisition platform includes network search module, information monitoring module and data download module;Institute Stating data processing platform (DPP) includes data filtering module, semantic module and data processing module;The business processing and data Platform is studied and judged including business handling module, Terminal-decision module, data statistics module and analysis module is studied and judged.
Preferably, the data acquisition platform is based on web crawlers technology according to default focus dictionary to the Internet media On the traffic public feelings information data source specified carry out web search, analyze the traffic public feelings information searched in real time, judge whether symbol The need for the traffic public feelings information capturing service for closing traffic administration person, the traffic carriage of traffic administration person's capturing service needs will be met The related web page information resource of feelings information is downloaded, and the traffic public feelings information storage of download is transmitted simultaneously to public feelings information storehouse To data processing platform (DPP);The data processing platform (DPP) is dug based on data mining technology according to pre-provisioning request by the network of data Pick, text mining and semantic analysis, which are realized, handles being customized of traffic public feelings information of download, and traffic public feelings information is entered Row basis filtration, removes and downloads repetition, downloads the information that resource is imperfect, the time is expired, carry out after semantic analysis, formed The traffic public sentiment content of traffic administration person's business needs is met, traffic public sentiment content is advised according to the coded format of regulation and storage Then storage is delivered to business processing simultaneously to public feelings information storehouse and data study and judge platform;The business processing and data study and judge platform Opened, realized according to the need for public safety traffic management personnel to described towards public safety traffic management personnel by human-computer interaction device The business processing and data of traffic public sentiment content study and judge function, are stored to public feelings information storehouse while being delivered to business handling mould Block and monitoring, early warning and reporting modules, finally realize alert public sentiment examination & verification, public sentiment group, public sentiment processing and big data analytic function.
Preferably, the focus dictionary includes the corresponding some passes of the thematic and each traffic administration special topic of 7 traffic administrations Keyword.
Preferably, 7 traffic administrations special topic includes traffic events, much-talked-about topic, traffic administration, traffic organization and pipe Control, malice speech, traffic suggestion and problem report and policies and regulations.
Preferably, the corresponding keyword of the traffic events include to lose control of one's vehicle, congestion, traffic accident, construction, road occupying, Bump against, scratch, female driver, traffic accident, escape, traffic accident, overturn, knock into the back, high speed, highway, blow out, spontaneous combustion, road conditions and Truck;The corresponding keyword of much-talked-about topic include net about car, shared bicycle, share-car, hire a car, Green Travel and electric automobile; The corresponding keyword of traffic administration include road in violating the regulations, violation, break laws and violate discipline, penalty note, overload, retrograde, hypervelocity, illegal parking, Deck, electronic police, capture, make a dash across the red light, going through stop light, drunk driving, drive without a license, drive when intoxicated, parking offense, false number plate, forging Car plate, false car plate, deck, escape vehicle, assault police, joyride and rush card;Traffic organization and the corresponding keyword of management and control include great Activity, security, traffic police, traffic-police, Traffic Police Headquarters, Traffic Warden Subteam, traffic police group, traffic police squadron, vehicle administration office, association police, Dan Shuan Number, restricted driving, speed limit, it is forbidden, raid and deploy to ensure effective monitoring and control of illegal activities, close a road to traffic, driving license, driving school, wagon flow, charge station and detouring;Malice speech is corresponding to close Keyword include accept bribes, corrupt, corruption, hit the person, swear at people and give a present;Traffic suggestion and problem report that corresponding keyword includes disorderly receiving Expense, receive ill-gotten money, illegal vehicle, laws are not fully observed, enforce the law impartially, traffic lights, in violation of rules and regulations charge and entrapment;The corresponding key of policies and regulations Word includes encouraging share-car, shared trip, traffic marking, speed(-)limit sign, traffic sign, traffic publicity and traffic safety.
Preferably, the important content of the Web Mining has 18 aspects, and 18 aspects include public sentiment news mark Topic, public sentiment news content, author, issuing time, data source, starting source, data acquisition time, public sentiment property, original text chain It is grounded location, keyword, clip Text, thematic type, topic information, visit capacity, transfer amount, comment number, follow-up amount and affiliated area Domain.
Preferably, the semantic analysis includes 7 functions, including basic handling function, syntactic analysis function, text mining Function, text cluster function, sentiment analysis function, knowledge abstraction function and temperature analytic function.
Preferably, the service processing function, including public sentiment examination & verification, the police of public sentiment group and public sentiment processing;The public sentiment examination & verification Whether be traffic administration person belong to the traffic public feelings information that gathers and handle effective, true news content and base categories are No rational examination & verification;The public sentiment group police is the area under one's jurisdiction according to belonging to effective traffic public feelings information after examination & verification, system default It is assigned to the traffic police personnel of responsible area under one's jurisdiction business;The public sentiment processing is that traffic administration personnel are believed a certain specific traffic public sentiment Knot and operation is handled at breath carry out.
Preferably, it is the result to traffic public feelings information that the data, which study and judge function, carries out big data analysis, is carried out Directly perceived and digitized display.
Beneficial effects of the present invention:
The method of the collection of Internet traffic public feelings information and processing proposed by the present invention so that traffic administration person can be timely It was found that and oneself compass of competency, the related traffic public feelings information of business;It was found that after effective public feelings information, alert place can be sent in time Reason, it is to avoid public sentiment is further fermented, and causes bad social influence;, can be with by substantial amounts of traffic public sentiment historical data analysis Summarize, summarize the content foundation for being conducive to traffic management policy.
Brief description of the drawings
Fig. 1 is Organization Chart of the invention;
Fig. 2 audits schematic diagram for the public sentiment of the present invention;
Fig. 3 is intended to for the public sentiment group warning of the present invention;
Fig. 4 is the alert policeman's situation schematic diagram on duty of the public sentiment group of the present invention;
Fig. 5 handles schematic diagram for the public sentiment of the present invention;
Fig. 6 studies and judges schematic diagram for the data of the present invention.
Embodiment
To make the purpose, technical scheme and advantage of the invention implemented clearer, below in conjunction with the embodiment of the present invention Accompanying drawing, the technical scheme in the embodiment of the present invention is further described in more detail.In the accompanying drawings, identical from beginning to end or class As label represent same or similar element or the element with same or like function.Described embodiment is the present invention A part of embodiment, rather than whole embodiments.The embodiments described below with reference to the accompanying drawings are exemplary, it is intended to uses It is of the invention in explaining, and be not considered as limiting the invention.Based on the embodiment in the present invention, ordinary skill people The every other embodiment that member is obtained under the premise of creative work is not made, belongs to the scope of protection of the invention.Under Embodiments of the invention are described in detail with reference to accompanying drawing for face.
A kind of method of the collection of Internet traffic public feelings information and processing in a broad embodiment of the invention, it includes number According to acquisition platform, data processing platform (DPP), business processing and data study and judge platform, monitoring, early warning and reporting modules, business handling mould Block and public feelings information library module;It is electric successively that the data acquisition platform, data processing platform (DPP), business processing and data study and judge platform Connect, and above three platform is electrically connected with the public feelings information storehouse, the business processing and data study and judge platform electrical connection Monitoring, early warning and reporting modules and business handling module;The method bag of the collection of Internet traffic public feelings information and processing Include traffic public feelings information in the Internet media is scanned for according to default focus dictionary by data acquisition platform, monitor and Download, be stored in the public feelings information storehouse and be delivered to data processing platform (DPP);The data processing platform (DPP) passes through to downloading The traffic public feelings information filtered, analyzed and processing forms and meets the traffic of manager's business needs according to pre-provisioning request Public sentiment content, is stored in the public feelings information storehouse and is delivered to business processing and data study and judge platform;The business processing Platform is studied and judged according to handling the traffic public sentiment content that meets manager's business needs the need for manager with data With study and judge, and result is stored in the public feelings information storehouse and business handling module and monitoring, early warning and report mould is delivered to Block, finally realizes alert public sentiment examination & verification, public sentiment group, public sentiment processing and big data analytic function.
A kind of collection of Internet traffic public feelings information and the method for processing that the present invention is provided, have compared with prior art Advantages below:
Traffic administration person is had found and oneself compass of competency, the related traffic public feelings information of business in time;It was found that After effective public feelings information, alert processing can be sent in time, it is to avoid public sentiment is further fermented, cause bad social influence;By big The traffic public sentiment historical data analysis of amount, can summarize, summarize the content foundation for being conducive to traffic management policy.
The embodiment of the present invention is described as follows:
A kind of method of the collection of Internet traffic public feelings information and processing, it includes data acquisition platform, and data processing is put down Platform, business processing and data study and judge platform, monitoring, early warning and reporting modules, business handling module and public feelings information library module;Institute State data acquisition platform, data processing platform (DPP), business processing and data and study and judge platform and be sequentially connected electrically, and above three platform is equal Electrically connected with the public feelings information storehouse, the business processing and data study and judge platform electrical connection monitoring, early warning and reporting modules with And business handling module;The method of Internet traffic public feelings information collection and processing include by data acquisition platform according to Default focus dictionary is scanned for, monitors and downloaded to traffic public feelings information in the Internet media, is stored in the carriage Feelings information bank is simultaneously delivered to data processing platform (DPP);The data processing platform (DPP) by the traffic public feelings information to download according to Pre-provisioning request filtered, analyze and processing forms the traffic public sentiment content for meeting manager's business needs, is stored in institute State public feelings information storehouse and be delivered to business processing and data study and judge platform;The business processing and data study and judge platform according to management The traffic public sentiment content for meeting manager's business needs is handled and studied and judged the need for person, and result is stored in institute State public feelings information storehouse and be delivered to business handling module and monitoring, early warning and reporting modules, finally realize public sentiment examination & verification, public sentiment Group is alert, public sentiment is handled and big data analytic function.
The data acquisition platform includes network search module, information monitoring module and data download module;The data Processing platform includes data filtering module, semantic module and data processing module;The business processing and data are studied and judged flat Platform includes business handling module, Terminal-decision module, data statistics module and studies and judges analysis module.
The data acquisition platform is based on web crawlers technology according to default focus dictionary to being specified in the Internet media Traffic public feelings information data source carry out web search, analyze the traffic public feelings information searched in real time, judge whether to meet traffic The need for the traffic public feelings information capturing service of manager, the traffic public feelings information of traffic administration person's capturing service needs will be met Related web page information resource be downloaded, by the storage of the traffic public feelings information of download to public feelings information storehouse while being delivered to data Processing platform;Web Mining of the data processing platform (DPP) based on data mining technology according to pre-provisioning request by data, text Excavate and semantic analysis is realized and handles being customized of traffic public feelings information of download, basic mistake is carried out to traffic public feelings information Work is filtered, removes and downloads repetition, downloads the information that resource is imperfect, the time is expired, carry out after semantic analysis, formation meets traffic The traffic public sentiment content that manager's business needs, traffic public sentiment content is arrived according to the coded format of regulation and storage rule storage Public feelings information storehouse is delivered to business processing simultaneously and data study and judge platform;The business processing and data study and judge platform pass through it is man-machine Interactive device is opened towards public safety traffic management personnel, is realized according to the need for public safety traffic management personnel to the traffic public sentiment The business processing and data of content study and judge function, are stored to public feelings information storehouse while being delivered to business handling module and prison Control, early warning and reporting modules, finally realize alert public sentiment examination & verification, public sentiment group, public sentiment processing and big data analytic function.
The focus dictionary includes the corresponding some keywords of the thematic and each traffic administration special topic of 7 traffic administrations.
7 traffic administrations special topic includes traffic events, much-talked-about topic, traffic administration, traffic organization and management and control, malice Speech, traffic suggestion and problem report and policies and regulations.
The corresponding keyword of the traffic events include to lose control of one's vehicle, congestion, traffic accident, construction, road occupying, bump against, cut to pieces Rub, female driver, traffic accident, escape, traffic accident, overturn, knock into the back, high speed, highway, blow out, spontaneous combustion, road conditions and truck; The corresponding keyword of much-talked-about topic include net about car, shared bicycle, share-car, hire a car, Green Travel and electric automobile;Traffic administration Corresponding keyword include road in violating the regulations, violation, break laws and violate discipline, penalty note, overload, retrograde, hypervelocity, illegal parking, deck, electronics Police, capture, make a dash across the red light, going through stop light, drunk driving, drive without a license, drive when intoxicated, parking offense, false number plate, false license plates, false car Board, deck, escape vehicle, assault police, joyride and rush card;Traffic organization and the corresponding keyword of management and control include occasion, security, Traffic police, traffic-police, Traffic Police Headquarters, Traffic Warden Subteam, traffic police group, traffic police squadron, vehicle administration office, association police, odd or even number, restricted driving, limit It is fast, forbidden, raid and deploy to ensure effective monitoring and control of illegal activities, close a road to traffic, driving license, driving school, wagon flow, charge station and detouring;The corresponding keyword of malice speech include by Bribe, corruption, corruption, hit the person, swear at people and give a present;Traffic suggestion and problem report corresponding keyword include arbitrary imposition of fees, receive ill-gotten money, Illegal vehicle, laws are not fully observed, enforce the law impartially, traffic lights, in violation of rules and regulations charge and entrapment;The corresponding keyword of policies and regulations includes encouraging Share-car, shared trip, traffic marking, speed(-)limit sign, traffic sign, traffic publicity and traffic safety.
The important content of the Web Mining has 18 aspects, and 18 aspects include public sentiment headline, public sentiment News content, author, issuing time, data source, starting source, data acquisition time, public sentiment property, original text chained address, Keyword, clip Text, thematic type, topic information, visit capacity, transfer amount, comment number, follow-up amount and affiliated area.
The semantic analysis includes 7 functions, including basic handling function, syntactic analysis function, text mining function, text This function of convergence, sentiment analysis function, knowledge abstraction function and temperature analytic function.
The service processing function, including public sentiment examination & verification, the police of public sentiment group and public sentiment processing;The public sentiment examination & verification is traffic pipe Reason person whether belongs to effective, true news content to the traffic public feelings information for gathering and handling and whether base categories are rational Examination & verification;The public sentiment group police is the area under one's jurisdiction according to belonging to effective traffic public feelings information after examination & verification, and system default is assigned to negative The traffic police personnel for blaming area under one's jurisdiction business;Public sentiment processing be traffic administration personnel to a certain specific traffic public feelings information at Tie and handle operation.
It is the result to traffic public feelings information that the data, which study and judge function, carries out big data analysis, carry out it is directly perceived and Digitized display.
The present invention will be described in detail by 1-6 with reference to the accompanying drawings.
A kind of collection of Internet traffic public feelings information and the method for processing, seek to solve from information monitoring, data acquisition, The full-automatic process that content analysis is studied and judged to business processing, statistics.By building public sentiment monitoring and management platform, to realize carriage The automation process of feelings information, most timely, maximally effective traffic public sentiment content and analysis result are provided for traffic administration person.
1 understand with reference to the accompanying drawings,
1. data acquisition platform
In a kind of method of Internet traffic public feelings information processing, data acquisition work(is realized based on web crawlers technology Energy.
Including following three part:
(1) web search, by specified traffic information data source, search in real time.
(2) information monitoring, analyzes the traffic public feelings information searched in real time, judges whether that the information for meeting traffic administration person is adopted Collect business demand.
(3) data are downloaded, and the related info web resource of traffic news public sentiment are downloaded, public feelings information is arrived in storage Storehouse.
Data acquisition platform realizes data acquisition function by three parts.Web crawlers, will be according to friendship in gathered data Siphunculus reason focus dictionary carries out the whole network search, in a kind of Internet traffic public feelings information processing method, the traffic administration of definition Focus dictionary is as follows:
2. data processing platform (DPP)
In a kind of method of Internet traffic public feelings information processing, data processing platform (DPP) is dug by the network of data Pick, text mining, semantic analysis etc., realize and handle being customized of traffic public feelings information of download, formation meets traffic pipe The content that reason business needs.These contents include:
Public sentiment headline
Public sentiment news content
Author
Issuing time
Data source
Starting source
Data acquisition time
Public sentiment property (front, negative, neutrality)
Original text chained address
Keyword
Clip Text
Thematic type
Topic information
Visit capacity
Transfer amount
Comment on number
Follow-up amount
Affiliated area
(1) data filtering
Data filtering, realizes the filtration that the data content of download is done to basis, including remove download repetition, download money The news data such as source is imperfect, the time is expired.
(2) semantic analysis
Including following functions:
Basic handling
Language identification, Chinese word segmentation, part-of-speech tagging, name Entity recognition
Syntactic analysis
Text punctuate, syntactic analysis, SVO are extracted
Text mining
Keyword extraction, text classification, text snippet
Text cluster
Text similarity, term vector, text cluster
Sentiment analysis
Front, negative, neutrality
Knowledge is extracted
Entity extraction, relation are extracted
Temperature is analyzed
Click volume, transfer amount, follow-up amount
(3) data processing
Data processing function, is the result for analyzing data semantic, according to certain coded format and storage rule, enters line number According to storage, basic data is provided for business platform.
3. business processing and data study and judge platform
Service process platform and data study and judge platform, are the operation systems towards public safety traffic management librarian use.
Business processing includes following sections:
(1) public sentiment is audited
For the traffic public feelings information for gathering and handling, whether traffic administration person under confirming, it is necessary to belong to effective, true Whether news content, and some base categories of the news are reasonable.Referring to accompanying drawing 2.
(2) public sentiment group police
Effective public feelings information after examination & verification, by the area under one's jurisdiction according to belonging to the public sentiment content, system can give tacit consent to be assigned to it is negative The traffic police personnel for blaming area under one's jurisdiction business, are handled.Referring to accompanying drawing 3.
Arrangement on duty is carried out daily, to distribute specific area under one's jurisdiction, office, in public sentiment processing links, the traffic police people of required appointment Member, referring to accompanying drawing 4.
(3) public sentiment is handled
Public sentiment disposal is traffic administration personnel to knot at a certain specific public feelings information carry out, handled.Referring to accompanying drawing 5.
Data study and judge platform
By the result to traffic public feelings information, to carry out big data analysis, formed intuitively, digitized knot Really.Referring to accompanying drawing 6.
It is last it is to be noted that:The above embodiments are merely illustrative of the technical solutions of the present invention, rather than its limitations.To the greatest extent The present invention is described in detail with reference to the foregoing embodiments for pipe, it will be understood by those within the art that:It is still Technical scheme described in foregoing embodiments can be modified, or which part technical characteristic is equally replaced Change;And these modifications or replacement, the essence of appropriate technical solution is departed from the essence of various embodiments of the present invention technical scheme God and scope.

Claims (10)

1. a kind of Internet traffic public feelings information collection and the method for processing, it is characterised in that:Including data acquisition platform, data Processing platform, business processing and data study and judge platform, monitoring, early warning and reporting modules, business handling module and public feelings information storehouse Module;The data acquisition platform, data processing platform (DPP), business processing and data are studied and judged platform and are sequentially connected electrically, and above-mentioned three Individual platform is electrically connected with the public feelings information storehouse, and the business processing and data study and judge platform electrical connection monitoring, early warning and report Accuse module and business handling module;The method of the collection of Internet traffic public feelings information and processing includes passing through data acquisition Platform is scanned for, monitors and downloaded to traffic public feelings information in the Internet media according to default focus dictionary, is stored In the public feelings information storehouse and it is delivered to data processing platform (DPP);The data processing platform (DPP) passes through the traffic public sentiment to download Information filtered according to pre-provisioning request, analyze and processing forms the traffic public sentiment content for meeting manager's business needs, by it It is stored in the public feelings information storehouse and is delivered to business processing and data study and judge platform;The business processing and data study and judge platform According to the traffic public sentiment content that meets manager's business needs being handled and studied and judged the need for manager, and by result It is stored in the public feelings information storehouse and is delivered to business handling module and monitoring, early warning and reporting modules, finally realizes public sentiment Examination & verification, alert public sentiment group, public sentiment processing and big data analytic function.
2. Internet traffic public feelings information collection according to claim 1 and the method for processing, it is characterised in that:The number Include network search module, information monitoring module and data download module according to acquisition platform;The data processing platform (DPP) includes number According to filtering module, semantic module and data processing module;The business processing and data, which study and judge platform, includes business handling Module, Terminal-decision module, data statistics module and study and judge analysis module.
3. Internet traffic public feelings information collection according to claim 1 and the method for processing, it is characterised in that:The number According to acquisition platform based on web crawlers technology according to default focus dictionary to the traffic public feelings information specified in the Internet media Data source carries out web search, analyzes the traffic public feelings information searched in real time, judges whether to meet the traffic carriage of traffic administration person The need for feelings information gathering business, the related web page information of the traffic public feelings information of traffic administration person's capturing service needs will be met Resource is downloaded, and the traffic public feelings information storage of download is delivered into data processing platform (DPP) simultaneously to public feelings information storehouse;It is described Data processing platform (DPP) passes through the Web Mining of data, text mining and semantic analysis based on data mining technology according to pre-provisioning request Realize and handle being customized of traffic public feelings information of download, basic filtration is carried out to traffic public feelings information, under removal Load-carrying is multiple, download the information that resource is imperfect, the time is expired, carries out after semantic analysis, formation meets traffic administration person's business need The traffic public sentiment content wanted, traffic public sentiment content is same to public feelings information storehouse according to the coded format of regulation and storage rule storage When be delivered to business processing and data study and judge platform;The business processing and data study and judge platform by human-computer interaction device towards Public safety traffic management personnel open, and are realized according to the need for public safety traffic management personnel at the business to the traffic public sentiment content Reason and data study and judge function, are stored to public feelings information storehouse while being delivered to business handling module and monitoring, early warning and report Module is accused, alert public sentiment examination & verification, public sentiment group, public sentiment processing and big data analytic function is finally realized.
4. Internet traffic public feelings information collection according to claim 3 and the method for processing, it is characterised in that:The heat Point dictionary includes the corresponding some keywords of the thematic and each traffic administration special topic of 7 traffic administrations.
5. Internet traffic public feelings information collection according to claim 4 and the method for processing, it is characterised in that:Described 7 Individual traffic administration special topic includes traffic events, much-talked-about topic, traffic administration, traffic organization and management and control, malice speech, traffic suggestion With problem report and policies and regulations.
6. Internet traffic public feelings information collection according to claim 5 and the method for processing, it is characterised in that:It is described to hand over The corresponding keyword of interpreter's part include to lose control of one's vehicle, congestion, traffic accident, construction, road occupying, bump against, scratch, female driver, traffic Accident, escape, traffic accident, overturn, knock into the back, high speed, highway, blow out, spontaneous combustion, road conditions and truck;Much-talked-about topic is corresponding Keyword include net about car, shared bicycle, share-car, hire a car, Green Travel and electric automobile;The corresponding keyword bag of traffic administration Include road in violating the regulations, violation, break laws and violate discipline, penalty note, overload, drive in the wrong direction, hypervelocity, illegal parking, deck, electronic police, capture, rush it is red Lamp, go through stop light, drunk driving, drive without a license, drive when intoxicated, parking offense, false number plate, false license plates, false car plate, deck, escape car , assault police, joyride and rush card;Traffic organization and the corresponding keyword of management and control include occasion, security, traffic police, traffic-police, Traffic Police Headquarters, Traffic Warden Subteam, traffic police group, traffic police squadron, vehicle administration office, association police, odd or even number, restricted driving, speed limit, it is forbidden, raid cloth Control, road closure, driving license, driving school, wagon flow, charge station and detour;The corresponding keyword of malice speech include accept bribes, corrupt, corruption, beat People, swear at people and give a present;Traffic suggestion and problem report corresponding keyword include arbitrary imposition of fees, receive ill-gotten money, illegal vehicle, laws are not fully observed, Enforce the law impartially, traffic lights, in violation of rules and regulations charge and entrapment;The corresponding keyword of policies and regulations include encouraging share-car, shared trip, Traffic marking, speed(-)limit sign, traffic sign, traffic publicity and traffic safety.
7. Internet traffic public feelings information collection according to claim 3 and the method for processing, it is characterised in that:The net The important content that network is excavated has 18 aspects, 18 aspects include public sentiment headline, public sentiment news content, author, Issuing time, data source, starting source, data acquisition time, public sentiment property, original text chained address, keyword, summary are interior Appearance, thematic type, topic information, visit capacity, transfer amount, comment number, follow-up amount and affiliated area.
8. Internet traffic public feelings information collection according to claim 3 and the method for processing, it is characterised in that:Institute's predicate Justice analysis includes 7 functions, including basic handling function, syntactic analysis function, text mining function, text cluster function, feelings Feel analytic function, knowledge abstraction function and temperature analytic function.
9. Internet traffic public feelings information collection according to claim 3 and the method for processing, it is characterised in that:The industry Business processing function, including public sentiment examination & verification, the police of public sentiment group and public sentiment processing;The public sentiment examination & verification is traffic administration person to gathering and locating Whether the traffic public feelings information of reason belongs to effective, true news content and whether base categories are reasonably audited;The public sentiment Group police is the area under one's jurisdiction according to belonging to effective traffic public feelings information after examination & verification, and system default is assigned to the friendship of responsible area under one's jurisdiction business Alert personnel;The public sentiment processing is traffic administration personnel to knot at a certain specific traffic public feelings information carry out and handles operation.
10. Internet traffic public feelings information collection according to claim 3 and the method for processing, it is characterised in that:It is described It is the result to traffic public feelings information that data, which study and judge function, carries out big data analysis, carries out directly perceived and digitized display.
CN201710461333.8A 2017-06-19 2017-06-19 A kind of method of the collection of Internet traffic public feelings information and processing Pending CN107203641A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710461333.8A CN107203641A (en) 2017-06-19 2017-06-19 A kind of method of the collection of Internet traffic public feelings information and processing

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710461333.8A CN107203641A (en) 2017-06-19 2017-06-19 A kind of method of the collection of Internet traffic public feelings information and processing

Publications (1)

Publication Number Publication Date
CN107203641A true CN107203641A (en) 2017-09-26

Family

ID=59907469

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710461333.8A Pending CN107203641A (en) 2017-06-19 2017-06-19 A kind of method of the collection of Internet traffic public feelings information and processing

Country Status (1)

Country Link
CN (1) CN107203641A (en)

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108764832A (en) * 2018-05-18 2018-11-06 广东电网有限责任公司 Municipal administration and public sentiment demand approaches to IM, system, device and equipment
CN108959368A (en) * 2018-05-22 2018-12-07 深圳壹账通智能科技有限公司 A kind of information monitoring method, storage medium and server
CN110321472A (en) * 2019-06-12 2019-10-11 中国电子科技集团公司第二十八研究所 Public sentiment based on intelligent answer technology monitors system
CN110866185A (en) * 2019-11-11 2020-03-06 维沃移动通信有限公司 Information pushing method and electronic equipment
CN110910637A (en) * 2019-11-19 2020-03-24 上海易点时空网络有限公司 Content evaluation method, device and equipment based on traffic violation
CN110990748A (en) * 2019-12-18 2020-04-10 成都迪普曼林信息技术有限公司 National public opinion data acquisition and publishing system
CN111523856A (en) * 2020-04-16 2020-08-11 山东贝赛信息科技有限公司 Public opinion comprehensive supervision system
CN111696347A (en) * 2020-06-02 2020-09-22 安徽宇呈数据技术有限公司 Method and device for automatically analyzing traffic incident information
CN112650947A (en) * 2020-12-31 2021-04-13 安徽不如信息科技有限公司 Public opinion collection processing system convenient to carry
CN113392185A (en) * 2021-06-10 2021-09-14 中国联合网络通信集团有限公司 Public opinion early warning method, device, equipment and storage medium
CN113704636A (en) * 2021-08-23 2021-11-26 福建亿榕信息技术有限公司 Fused media public opinion analysis method based on information dissemination
CN115050187A (en) * 2022-08-12 2022-09-13 杭州城市大脑有限公司 Public opinion knowledge graph-based digital urban traffic management method
CN115862333A (en) * 2022-12-07 2023-03-28 东南大学 Expressway vehicle-road cooperative scene and function division method considering information flow characteristics
CN117312634A (en) * 2023-11-29 2023-12-29 大文传媒集团(山东)有限公司 Artificial intelligence data integration and propagation processing system

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103268350A (en) * 2013-05-29 2013-08-28 安徽雷越网络科技有限公司 Internet public opinion information monitoring system and monitoring method
US20130290232A1 (en) * 2012-04-30 2013-10-31 Mikalai Tsytsarau Identifying news events that cause a shift in sentiment
CN103841216A (en) * 2014-04-01 2014-06-04 深圳市科盾科技有限公司 Network public opinion monitoring system based on cloud platform
CN104933093A (en) * 2015-05-19 2015-09-23 武汉泰迪智慧科技有限公司 Regional public opinion monitoring and decision-making auxiliary system and method based on big data
CN106294619A (en) * 2016-08-01 2017-01-04 上海交通大学 Public sentiment intelligent supervision method

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130290232A1 (en) * 2012-04-30 2013-10-31 Mikalai Tsytsarau Identifying news events that cause a shift in sentiment
CN103268350A (en) * 2013-05-29 2013-08-28 安徽雷越网络科技有限公司 Internet public opinion information monitoring system and monitoring method
CN103841216A (en) * 2014-04-01 2014-06-04 深圳市科盾科技有限公司 Network public opinion monitoring system based on cloud platform
CN104933093A (en) * 2015-05-19 2015-09-23 武汉泰迪智慧科技有限公司 Regional public opinion monitoring and decision-making auxiliary system and method based on big data
CN106294619A (en) * 2016-08-01 2017-01-04 上海交通大学 Public sentiment intelligent supervision method

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108764832A (en) * 2018-05-18 2018-11-06 广东电网有限责任公司 Municipal administration and public sentiment demand approaches to IM, system, device and equipment
CN108959368A (en) * 2018-05-22 2018-12-07 深圳壹账通智能科技有限公司 A kind of information monitoring method, storage medium and server
CN110321472A (en) * 2019-06-12 2019-10-11 中国电子科技集团公司第二十八研究所 Public sentiment based on intelligent answer technology monitors system
CN110866185A (en) * 2019-11-11 2020-03-06 维沃移动通信有限公司 Information pushing method and electronic equipment
CN110910637A (en) * 2019-11-19 2020-03-24 上海易点时空网络有限公司 Content evaluation method, device and equipment based on traffic violation
CN110990748B (en) * 2019-12-18 2023-06-27 成都迪普曼林信息技术有限公司 Public opinion data collection and release system
CN110990748A (en) * 2019-12-18 2020-04-10 成都迪普曼林信息技术有限公司 National public opinion data acquisition and publishing system
CN111523856A (en) * 2020-04-16 2020-08-11 山东贝赛信息科技有限公司 Public opinion comprehensive supervision system
CN111696347A (en) * 2020-06-02 2020-09-22 安徽宇呈数据技术有限公司 Method and device for automatically analyzing traffic incident information
CN112650947A (en) * 2020-12-31 2021-04-13 安徽不如信息科技有限公司 Public opinion collection processing system convenient to carry
CN113392185B (en) * 2021-06-10 2023-06-23 中国联合网络通信集团有限公司 Public opinion early warning method, device, equipment and storage medium
CN113392185A (en) * 2021-06-10 2021-09-14 中国联合网络通信集团有限公司 Public opinion early warning method, device, equipment and storage medium
CN113704636A (en) * 2021-08-23 2021-11-26 福建亿榕信息技术有限公司 Fused media public opinion analysis method based on information dissemination
CN115050187A (en) * 2022-08-12 2022-09-13 杭州城市大脑有限公司 Public opinion knowledge graph-based digital urban traffic management method
CN115050187B (en) * 2022-08-12 2022-11-01 杭州城市大脑有限公司 Public opinion knowledge graph-based digital urban traffic management method
CN115862333A (en) * 2022-12-07 2023-03-28 东南大学 Expressway vehicle-road cooperative scene and function division method considering information flow characteristics
CN115862333B (en) * 2022-12-07 2023-11-21 东南大学 Expressway vehicle-road cooperative scene and function division method considering information flow characteristics
CN117312634A (en) * 2023-11-29 2023-12-29 大文传媒集团(山东)有限公司 Artificial intelligence data integration and propagation processing system
CN117312634B (en) * 2023-11-29 2024-02-20 大文传媒集团(山东)有限公司 Artificial intelligence data integration and propagation processing system

Similar Documents

Publication Publication Date Title
CN107203641A (en) A kind of method of the collection of Internet traffic public feelings information and processing
CN110378824B (en) Brain for public security traffic management data and construction method
CN102880692B (en) A kind of monitor video semantic description towards retrieval and detection modeling method
Sujon et al. Social media mining for understanding traffic safety culture in washington state using twitter data
CN106846801A (en) A kind of region based on track of vehicle is hovered anomaly detection method
Ferguson Structural sensor surveillance
CN111427968A (en) Key person holographic archive construction method and device based on knowledge graph
CN104572615A (en) Method and system for on-line case investigation processing
CN105959621A (en) Quasi-real time control distribution system and method based on multi-source video structural data
CN109597889A (en) A kind of method and system of determining a crime based on text classification and deep neural network
Hanifah et al. Twitter information extraction for smart city
Lei Legal control over Big Data criminal investigation
CN110414007A (en) A kind of legal concept recognition methods based on legal principle rule map engine
CN115080709A (en) Text recognition method and device, nonvolatile storage medium and computer equipment
CN117370539A (en) Legal provision information recommendation system based on knowledge base and large model
CN109325755A (en) Electronics charge system based on automotive hub
Neuhold et al. Driver's dashboard–using social media data as additional information for motorway operators
Waduge et al. Machine learning approaches for detect crime patterns
Zhu et al. Construction and application of knowledge-base in telecom fraud domain
Sobhani et al. An ontology framework for automated visual surveillance system
Huan et al. A Reliability‐Based Analysis of Bicyclist Red‐Light Running Behavior at Urban Intersections
Jose et al. Artificial Intelligence Software Application for Contactless Traffic Violation Apprehension in the Philippines
Zhang et al. Expressway vehicle management system based on vehicle face recognition
CN111368550A (en) Public opinion information management system
Mishra et al. Analyzing traffic violations through e-challan system in metropolitan cities (workshop paper)

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20170926

RJ01 Rejection of invention patent application after publication