CN107203641A - Method for collecting and processing Internet traffic public opinion information
- Publication number
- CN107203641A CN201710461333.8A
- Authority
- CN
- China
- Prior art keywords
- traffic
- public
- processing
- public opinion information
- data
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/26—Government or public services
-
- G—PHYSICS
- G08—SIGNALLING
- G08G—TRAFFIC CONTROL SYSTEMS
- G08G1/00—Traffic control systems for road vehicles
- G08G1/097—Supervising of traffic control systems, e.g. by giving an alarm if two crossing streets have green light simultaneously
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2216/00—Indexing scheme relating to additional aspects of information retrieval not explicitly covered by G06F16/00 and subgroups
- G06F2216/03—Data mining
Abstract
The invention discloses a method for collecting and processing Internet traffic public opinion information. A data acquisition platform searches, monitors, and downloads traffic public opinion information from Internet media according to a preset hot-topic lexicon, stores it in a public opinion information database, and passes it to a data processing platform. The data processing platform filters, analyzes, and processes the downloaded traffic public opinion information according to predetermined requirements to form traffic public opinion content that meets the managers' business needs, stores that content in the public opinion information database, and passes it to a business processing and data assessment platform. The business processing and data assessment platform processes and assesses this content according to the managers' needs, stores the results in the public opinion information database, and passes them to a business handling module and a monitoring, early-warning, and reporting module, ultimately providing public opinion review, public opinion police dispatch, public opinion handling, and big data analysis functions.
Description
Technical field
The present invention relates to the field of information acquisition, and more particularly to a method for collecting and processing Internet traffic public opinion information.
Background art
With rapid economic development, transportation is developing ever faster, the volume of traffic news is growing, and traffic management departments have an increasingly strong need to keep abreast of current traffic information and public opinion. In the prior art, Internet news from websites, Weibo, WeChat, and similar channels is collected by means of the following two technologies.
A web crawler is a program or script that automatically crawls Internet information according to certain rules. Crawlers are widely used by Internet search engines and similar websites; they can automatically collect all the page content they are able to access in order to obtain or update the content and retrieval methods of those sites. Functionally, a crawler is generally divided into three parts: data acquisition, processing, and storage.
A traditional crawler starts from the URLs of one or several initial pages and obtains the URLs found on those pages. While crawling, it continually extracts new URLs from the current page and puts them into a queue until a stop condition of the system is met. The workflow of a focused crawler is more complex: it uses a web page analysis algorithm to filter out links unrelated to the topic, keeps the useful links, and puts them into a queue of URLs waiting to be crawled. It then selects the next page URL to crawl from the queue according to a search strategy and repeats the process until a stop condition of the system is reached. In addition, every page the crawler fetches is stored by the system, analyzed, filtered, and indexed for later query and retrieval. For a focused crawler, the analysis results obtained in this process may also provide feedback and guidance for subsequent crawling.
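The focused-crawl loop described above can be illustrated with a minimal Python sketch. It is only an illustration under stated assumptions: the in-memory `PAGES` table stands in for real HTTP fetching, the keyword test stands in for the unspecified web page analysis algorithm, and breadth-first order stands in for the unspecified search strategy. All names are hypothetical.

```python
from collections import deque

# Hypothetical in-memory "web": URL -> (page text, outgoing links).
# A real crawler would fetch pages over HTTP instead.
PAGES = {
    "a": ("traffic accident on the ring road", ["b", "c"]),
    "b": ("celebrity gossip", ["d"]),
    "c": ("congestion after a rear-end collision", ["d"]),
    "d": ("road conditions and traffic police notice", []),
}

KEYWORDS = ("traffic", "congestion", "road")  # illustrative topic terms


def is_relevant(text):
    """Topic filter: keep a page only if it mentions a topic keyword."""
    return any(k in text for k in KEYWORDS)


def focused_crawl(seeds, max_pages=10):
    """Breadth-first crawl that only expands links found on relevant pages."""
    queue = deque(seeds)      # URLs waiting to be crawled
    seen = set(seeds)         # URLs already queued, to avoid revisits
    stored = []               # relevant pages kept for later indexing
    while queue and len(stored) < max_pages:  # stop condition
        url = queue.popleft()
        text, links = PAGES.get(url, ("", []))
        if not is_relevant(text):
            continue  # drop off-topic pages and do not expand their links
        stored.append(url)
        for link in links:
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return stored
```

In this sketch the off-topic page `b` is fetched but neither stored nor expanded, while page `d` is still reached through the relevant page `c`.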
The information users can obtain from the Internet spans documents of many categories and forms, from technical material and business information to news reports and entertainment news. Together these form an exceptionally large distributed database that is heterogeneous and open, and what it stores is unstructured text data. Drawing on natural language understanding and computational linguistics from artificial intelligence research, mining this data requires two key technologies: web mining and text mining.
Web mining focuses on analyzing and mining data related to web pages, including text, link structure, and access statistics (which ultimately form the user's web navigation). A web page contains many different data types, so web mining encompasses text mining, data mining in databases, image mining, and so on.
Text mining is the process of extracting valid, novel, useful, and understandable knowledge that is scattered across text, and of using that knowledge to organize information better. Its key techniques include text collection, text analysis, feature pruning, document clustering, and document classification.
However, web crawler technology in the prior art is poorly targeted: it lacks the support of an accurate, industry-specific hot-topic lexicon, retrieves overly varied content, and cannot collect traffic-related information effectively. Web and text data mining, as emerging technologies, lack customized models based on traffic services and cannot process and analyze valid traffic information promptly and effectively.
In the prior art, traffic public opinion information on the Internet is collected and processed by going through Internet news channels such as websites, Weibo, and WeChat to obtain the public opinion news related to a specific public security traffic management department, and to monitor and manage that news content. The process, from information monitoring, data acquisition, and content analysis to business processing and statistical assessment, relies mainly on manual or semi-manual, semi-automated handling and cannot be fully automated.
Therefore, how to grasp, monitor, and manage traffic news public opinion quickly and comprehensively, and thereby provide better modern support, has become a problem that those skilled in the art urgently need to solve.
Summary of the invention
An object of the present invention is to provide a method for collecting and processing Internet traffic public opinion information that overcomes the above drawbacks of the prior art.
The method for collecting and processing Internet traffic public opinion information aims to fully automate the process from information monitoring, data acquisition, and content analysis through to business processing and statistical assessment. By building a public opinion monitoring and management platform, it automates the handling of public opinion information and provides traffic managers with the most timely and effective traffic public opinion content and analysis results. By discovering and obtaining relevant public opinion information in time and quickly dispatching personnel to handle it, public security traffic management can fully grasp social conditions, public sentiment, and opinion trends, and guide public opinion correctly. By analyzing massive data, weak links are identified, giving decision makers targeted help.
To achieve the above object, the present invention provides a method for collecting and processing Internet traffic public opinion information. The method uses a data acquisition platform, a data processing platform, a business processing and data assessment platform, a monitoring, early-warning, and reporting module, a business handling module, and a public opinion information database module. The data acquisition platform, the data processing platform, and the business processing and data assessment platform are electrically connected in sequence, and all three platforms are electrically connected to the public opinion information database; the business processing and data assessment platform is also electrically connected to the monitoring, early-warning, and reporting module and to the business handling module. In the method, the data acquisition platform searches, monitors, and downloads traffic public opinion information from Internet media according to a preset hot-topic lexicon, stores it in the public opinion information database, and passes it to the data processing platform. The data processing platform filters, analyzes, and processes the downloaded traffic public opinion information according to predetermined requirements to form traffic public opinion content that meets the managers' business needs, stores that content in the public opinion information database, and passes it to the business processing and data assessment platform. The business processing and data assessment platform processes and assesses this content according to the managers' needs, stores the results in the public opinion information database, and passes them to the business handling module and the monitoring, early-warning, and reporting module, ultimately providing public opinion review, public opinion police dispatch, public opinion handling, and big data analysis functions.
Preferably, the data acquisition platform includes a web search module, an information monitoring module, and a data download module; the data processing platform includes a data filtering module, a semantic analysis module, and a data processing module; and the business processing and data assessment platform includes a business handling module, a terminal decision module, a data statistics module, and an assessment analysis module.
Preferably, the data acquisition platform uses web crawler technology to search the designated traffic public opinion data sources on Internet media according to the preset hot-topic lexicon, analyzes the retrieved traffic public opinion information in real time to determine whether it meets the traffic managers' collection needs, downloads the web page resources of the information that does, stores the downloaded traffic public opinion information in the public opinion information database, and at the same time passes it to the data processing platform. The data processing platform uses data mining technology to apply customized processing to the downloaded information, according to predetermined requirements, through web mining, text mining, and semantic analysis: it first performs basic filtering to remove duplicate downloads, incompletely downloaded resources, and expired items, then performs semantic analysis to form traffic public opinion content that meets the traffic managers' business needs, stores that content in the public opinion information database according to the prescribed encoding format and storage rules, and at the same time passes it to the business processing and data assessment platform. The business processing and data assessment platform is made available to public security traffic management personnel through a human-computer interaction device and provides business processing and data assessment of the traffic public opinion content according to their needs; the results are stored in the public opinion information database and passed to the business handling module and the monitoring, early-warning, and reporting module, ultimately providing public opinion review, public opinion police dispatch, public opinion handling, and big data analysis functions.
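The basic filtering step described above (removing duplicate downloads, incompletely downloaded resources, and expired items) might be sketched in Python as follows. The field names, the use of the URL as the duplicate key, and the 30-day expiry window are assumptions made for illustration; the patent does not specify them.

```python
from datetime import datetime, timedelta

# Assumed required fields; an item missing any of them counts as incomplete.
REQUIRED_FIELDS = ("url", "title", "content", "published")


def basic_filter(items, now, max_age_days=30):
    """Drop duplicate, incomplete, and expired items, keeping input order."""
    seen_urls = set()
    kept = []
    for item in items:
        # Incomplete download: a required field is missing or empty.
        if not all(item.get(f) for f in REQUIRED_FIELDS):
            continue
        # Duplicate download: the same URL was already kept.
        if item["url"] in seen_urls:
            continue
        # Expired: published longer ago than the allowed age.
        if now - item["published"] > timedelta(days=max_age_days):
            continue
        seen_urls.add(item["url"])
        kept.append(item)
    return kept
```

Passing `now` explicitly, rather than reading the clock inside the function, keeps the filter deterministic and easy to check against stored data.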
Preferably, the hot-topic lexicon includes 7 traffic management topics and a number of keywords corresponding to each topic.
Preferably, the 7 traffic management topics are traffic events, hot topics, traffic management, traffic organization and control, malicious speech, traffic suggestions and problem reports, and policies and regulations.
Preferably, the keywords for traffic events include loss of vehicle control, congestion, traffic accident, construction, lane occupation, collision, scrape, female driver, traffic accident, hit-and-run, traffic accident, rollover, rear-end collision, expressway, highway, tire blowout, spontaneous combustion, road conditions, and truck. The keywords for hot topics include ride-hailing, shared bicycles, carpooling, car rental, green travel, and electric vehicles. The keywords for traffic management include traffic violation, rule violation, breach of law and discipline, penalty ticket, overloading, wrong-way driving, speeding, illegal parking, cloned license plate, electronic police, camera capture, running a red light, running a red light, drunk driving, unlicensed driving, driving while intoxicated, parking violation, fake number plate, forged license plate, fake license plate, cloned license plate, fleeing vehicle, assaulting police, street racing, and checkpoint running. The keywords for traffic organization and control include major events, security, traffic police, traffic police officer, traffic police headquarters, traffic police detachment, traffic police brigade, traffic police squadron, vehicle administration office, auxiliary police, odd-even license plate rule, driving restriction, speed limit, no entry, deployment and control, road closure, driver's license, driving school, traffic flow, toll station, and detour. The keywords for malicious speech include accepting bribes, embezzlement, corruption, hitting people, verbal abuse, and gift-giving. The keywords for traffic suggestions and problem reports include arbitrary charges, taking illicit money, illegal vehicles, failure to observe the law, unfair law enforcement, traffic lights, illegal charges, and entrapment. The keywords for policies and regulations include encouraging carpooling, shared travel, road markings, speed limit signs, traffic signs, traffic publicity, and traffic safety. (Where an English term repeats within a list, the original lists distinct Chinese near-synonyms.)
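To make the role of the lexicon concrete, the following Python sketch shows one simple way a topic-to-keyword table could be matched against a piece of text. It is an illustration only: the patent does not describe the matching algorithm, the table holds just a small English subset standing in for the Chinese keywords, and `match_topics` is a hypothetical name.

```python
# Illustrative subset of the lexicon; the English keywords stand in for
# the Chinese terms a real deployment would use.
HOT_TOPIC_LEXICON = {
    "traffic events": ["congestion", "traffic accident", "rear-end collision"],
    "hot topics": ["ride-hailing", "shared bicycle", "carpooling"],
    "traffic management": ["drunk driving", "speeding", "running a red light"],
    "policies and regulations": ["traffic sign", "traffic safety"],
}


def match_topics(text, lexicon=HOT_TOPIC_LEXICON):
    """Return {topic: [matched keywords]} for every topic with a hit."""
    lowered = text.lower()  # case-insensitive substring matching
    hits = {}
    for topic, keywords in lexicon.items():
        matched = [k for k in keywords if k in lowered]
        if matched:
            hits[topic] = matched
    return hits
```

Plain substring matching is the simplest possible choice; Chinese text would normally need word segmentation first, which this sketch leaves out.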
Preferably, web mining extracts 18 key items: public opinion news title, public opinion news content, author, publication time, data source, original source, data acquisition time, public opinion nature, original link address, keywords, summary, topic type, topic information, visit count, repost count, comment count, follow-up count, and region.
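The 18 items listed above could be held in a single record type. The Python sketch below is one possible layout; the field names and types are assumptions for illustration, since the patent names the items but not their encoding.

```python
from dataclasses import dataclass, fields
from typing import List


@dataclass
class OpinionRecord:
    """One mined public opinion item; all field names are hypothetical."""
    title: str
    content: str
    author: str
    publish_time: str
    data_source: str
    original_source: str
    acquisition_time: str
    nature: str              # public opinion nature
    original_url: str        # original link address
    keywords: List[str]
    summary: str
    topic_type: str
    topic_info: str
    visit_count: int
    repost_count: int
    comment_count: int
    follow_up_count: int
    region: str


# Field names in declaration order, e.g. for building a storage schema.
FIELD_NAMES = [f.name for f in fields(OpinionRecord)]
```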
Preferably, the semantic analysis includes 7 functions: basic processing, syntactic analysis, text mining, text clustering, sentiment analysis, knowledge extraction, and popularity analysis.
Preferably, the business processing function includes public opinion review, public opinion police dispatch, and public opinion handling. In public opinion review, the traffic manager checks whether the collected and processed traffic public opinion information is valid, genuine news content and whether its basic classification is reasonable. In public opinion police dispatch, the system by default assigns each piece of valid, reviewed traffic public opinion information to the traffic police officer responsible for the jurisdiction to which it belongs. In public opinion handling, traffic management personnel handle and close out a specific piece of traffic public opinion information.
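The default dispatch rule described above (a reviewed, valid item goes to the officer responsible for its jurisdiction) can be sketched in a few lines of Python. The jurisdiction names, officer identifiers, and the `dispatch` function are hypothetical; the patent does not describe how the assignment is stored or looked up.

```python
# Hypothetical mapping from jurisdiction to the responsible officer.
OFFICER_BY_JURISDICTION = {
    "north district": "officer_zhang",
    "south district": "officer_li",
}


def dispatch(record, approved, officers=OFFICER_BY_JURISDICTION):
    """Assign a reviewed record to the officer for its jurisdiction.

    Returns the officer identifier, or None when the record failed review
    or no officer is registered for the jurisdiction.
    """
    if not approved:
        return None  # only items that passed review are dispatched
    return officers.get(record.get("jurisdiction"))
```

Returning `None` for an unknown jurisdiction leaves room for a manual assignment step, which is an assumption of this sketch rather than something the source states.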
Preferably, the data assessment function performs big data analysis on the results of processing the traffic public opinion information and presents them in a visual, digitized display.
Beneficial effects of the present invention:
The method for collecting and processing Internet traffic public opinion information proposed by the present invention enables traffic managers to promptly discover traffic public opinion information related to their own jurisdiction and business. Once valid public opinion information is found, officers can be dispatched in time to handle it, preventing the public opinion from escalating further and causing adverse social impact. Analysis of large volumes of historical traffic public opinion data yields summarized findings that provide a basis for traffic management policy.
Brief description of the drawings
Fig. 1 is an architecture diagram of the present invention;
Fig. 2 is a schematic diagram of public opinion review according to the present invention;
Fig. 3 is a schematic diagram of public opinion police dispatch according to the present invention;
Fig. 4 is a schematic diagram of officers' on-duty status for public opinion police dispatch according to the present invention;
Fig. 5 is a schematic diagram of public opinion handling according to the present invention;
Fig. 6 is a schematic diagram of data assessment according to the present invention.
Detailed description of the embodiments
To make the objects, technical solutions, and advantages of the present invention clearer, the technical solutions in the embodiments are described in more detail below with reference to the accompanying drawings. Throughout the drawings, identical or similar reference numerals denote identical or similar elements, or elements with identical or similar functions. The described embodiments are some, but not all, of the embodiments of the present invention. The embodiments described below with reference to the drawings are exemplary; they are intended to explain the invention and should not be construed as limiting it. All other embodiments obtained by a person of ordinary skill in the art based on these embodiments without creative effort fall within the scope of protection of the present invention. Embodiments of the invention are described in detail below with reference to the drawings.
In a broad embodiment of the invention, a method for collecting and processing Internet traffic public opinion information uses a data acquisition platform, a data processing platform, a business processing and data assessment platform, a monitoring, early-warning, and reporting module, a business handling module, and a public opinion information database module. The data acquisition platform, the data processing platform, and the business processing and data assessment platform are electrically connected in sequence, and all three platforms are electrically connected to the public opinion information database; the business processing and data assessment platform is also electrically connected to the monitoring, early-warning, and reporting module and to the business handling module. In the method, the data acquisition platform searches, monitors, and downloads traffic public opinion information from Internet media according to a preset hot-topic lexicon, stores it in the public opinion information database, and passes it to the data processing platform. The data processing platform filters, analyzes, and processes the downloaded traffic public opinion information according to predetermined requirements to form traffic public opinion content that meets the managers' business needs, stores that content in the public opinion information database, and passes it to the business processing and data assessment platform. The business processing and data assessment platform processes and assesses this content according to the managers' needs, stores the results in the public opinion information database, and passes them to the business handling module and the monitoring, early-warning, and reporting module, ultimately providing public opinion review, public opinion police dispatch, public opinion handling, and big data analysis functions.
Compared with the prior art, the method for collecting and processing Internet traffic public opinion information provided by the present invention has the following advantages:
Traffic managers can promptly discover traffic public opinion information related to their own jurisdiction and business. Once valid public opinion information is found, officers can be dispatched in time to handle it, preventing the public opinion from escalating further and causing adverse social impact. Analysis of large volumes of historical traffic public opinion data yields summarized findings that provide a basis for traffic management policy.
Specific embodiments of the present invention are described as follows:
A method for collecting and processing Internet traffic public opinion information uses a data acquisition platform, a data processing platform, a business processing and data assessment platform, a monitoring, early-warning, and reporting module, a business handling module, and a public opinion information database module. The data acquisition platform, the data processing platform, and the business processing and data assessment platform are electrically connected in sequence, and all three platforms are electrically connected to the public opinion information database; the business processing and data assessment platform is also electrically connected to the monitoring, early-warning, and reporting module and to the business handling module. In the method, the data acquisition platform searches, monitors, and downloads traffic public opinion information from Internet media according to a preset hot-topic lexicon, stores it in the public opinion information database, and passes it to the data processing platform. The data processing platform filters, analyzes, and processes the downloaded traffic public opinion information according to predetermined requirements to form traffic public opinion content that meets the managers' business needs, stores that content in the public opinion information database, and passes it to the business processing and data assessment platform. The business processing and data assessment platform processes and assesses this content according to the managers' needs, stores the results in the public opinion information database, and passes them to the business handling module and the monitoring, early-warning, and reporting module, ultimately providing public opinion review, public opinion police dispatch, public opinion handling, and big data analysis functions.
The data acquisition platform includes a web search module, an information monitoring module, and a data download module. The data processing platform includes a data filtering module, a semantic analysis module, and a data processing module. The business processing and data assessment platform includes a business handling module, a terminal decision module, a data statistics module, and an assessment analysis module.
The data acquisition platform uses web crawler technology to search the designated traffic public opinion data sources on Internet media according to the preset hot-topic lexicon, analyzes the retrieved traffic public opinion information in real time to determine whether it meets the traffic managers' collection needs, downloads the web page resources of the information that does, stores the downloaded traffic public opinion information in the public opinion information database, and at the same time passes it to the data processing platform. The data processing platform uses data mining technology to apply customized processing to the downloaded information, according to predetermined requirements, through web mining, text mining, and semantic analysis: it first performs basic filtering to remove duplicate downloads, incompletely downloaded resources, and expired items, then performs semantic analysis to form traffic public opinion content that meets the traffic managers' business needs, stores that content in the public opinion information database according to the prescribed encoding format and storage rules, and at the same time passes it to the business processing and data assessment platform. The business processing and data assessment platform is made available to public security traffic management personnel through a human-computer interaction device and provides business processing and data assessment of the traffic public opinion content according to their needs; the results are stored in the public opinion information database and passed to the business handling module and the monitoring, early-warning, and reporting module, ultimately providing public opinion review, public opinion police dispatch, public opinion handling, and big data analysis functions.
The hot-topic dictionary includes 7 traffic management topics and a number of keywords for each topic.
The 7 traffic management topics are traffic incidents, hot topics, traffic administration, traffic organization and control, malicious speech, traffic suggestions and problem reports, and policies and regulations.
The keywords for traffic incidents include loss of vehicle control, congestion, car crash, construction, lane occupation, collision, scrape, female driver, traffic accident, hit-and-run, accident, rollover, rear-end collision, expressway, highway, tire blowout, spontaneous combustion, road conditions, and truck. The keywords for hot topics include online ride-hailing, shared bicycle, carpooling, car rental, green travel, and electric vehicle. The keywords for traffic administration include traffic violation, rule violation, violation of law and discipline, traffic ticket, overloading, wrong-way driving, speeding, illegal parking, cloned license plate, electronic police, camera capture, running a red light, rushing a red light, drunk driving, driving without a license, driving while intoxicated, parking violation, fake number plate, fake license plate, fake car plate, cloned plate, fleeing vehicle, assaulting police, joyriding, and running a checkpoint. The keywords for traffic organization and control include events, security, traffic police, traffic policeman, traffic police headquarters, traffic police detachment, traffic police brigade, traffic police squadron, vehicle administration office, auxiliary police, odd-even license plate rule, driving restriction, speed limit, no entry, surprise checkpoint deployment, road closure, driving license, driving school, traffic flow, toll station, and detour. The keywords for malicious speech include bribery, embezzlement, corruption, hitting people, verbal abuse, and gift-giving. The keywords for traffic suggestions and problem reports include arbitrary charges, taking illicit money, illegal vehicle, failure to follow the law, impartial law enforcement, traffic lights, illegal charging, and entrapment. The keywords for policies and regulations include encouraging carpooling, shared travel, road markings, speed limit signs, traffic signs, traffic publicity, and traffic safety.
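The hot-topic dictionary is a mapping from each topic to its keywords, so checking a page against it amounts to a lookup. The following is a minimal sketch in Python, assuming plain substring matching and an abbreviated English rendering of the keyword lists. The names `HOT_TOPIC_DICTIONARY` and `match_topics` are illustrative and do not come from the patent.

```python
# Illustrative subset of the hot-topic dictionary: topic -> keywords.
# The keyword lists are abbreviated; the full lists are given in the text.
HOT_TOPIC_DICTIONARY = {
    "traffic incidents": ["congestion", "traffic accident",
                          "rear-end collision", "tire blowout"],
    "hot topics": ["online ride-hailing", "shared bicycle", "carpooling"],
    "traffic administration": ["speeding", "drunk driving",
                               "running a red light"],
    "traffic organization and control": ["traffic police", "speed limit",
                                         "road closure"],
    "malicious speech": ["bribery", "corruption"],
    "traffic suggestions and problem reports": ["arbitrary charges",
                                                "entrapment"],
    "policies and regulations": ["traffic sign", "traffic safety"],
}


def match_topics(text, dictionary=HOT_TOPIC_DICTIONARY):
    """Return {topic: [matched keywords]} for topics with at least one hit."""
    lowered = text.lower()
    hits = {}
    for topic, keywords in dictionary.items():
        # Assumption: a keyword "hits" when it occurs as a substring.
        matched = [kw for kw in keywords if kw in lowered]
        if matched:
            hits[topic] = matched
    return hits
```

A page that matches no topic returns an empty dictionary, which a crawler could treat as "not relevant". For Chinese source text the same structure would apply with the original keywords.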
Web mining extracts 18 key items: public opinion news headline, public opinion news content, author, publication time, data source, original source, data acquisition time, sentiment polarity, original article link, keywords, summary, topic type, topic information, view count, repost count, comment count, follow-up post count, and region.
The semantic analysis includes 7 functions: basic processing, syntactic analysis, text mining, text clustering, sentiment analysis, knowledge extraction, and popularity analysis.
The business processing function includes public opinion review, public opinion dispatch, and public opinion handling. In public opinion review, the traffic manager checks whether the collected and processed traffic public opinion information is valid, genuine news content and whether its basic classification is reasonable. In public opinion dispatch, the system assigns each valid item, by default and according to the jurisdiction it belongs to, to the traffic police personnel responsible for that jurisdiction. In public opinion handling, traffic management personnel handle and close a specific item of traffic public opinion information.
The data analysis and judgment function performs big data analysis on the handling results of traffic public opinion information and displays them in an intuitive, digital form.
The present invention is described in detail below with reference to accompanying drawings 1 to 6.
The method for collecting and processing Internet traffic public opinion information aims to provide a fully automated process covering information monitoring, data acquisition, content analysis, business processing, and statistical analysis and judgment. A public opinion monitoring and management platform is built to automate the handling of public opinion information and to give traffic managers the most timely and effective traffic public opinion content and analysis results.
As can be seen from accompanying drawing 1:
1. Data acquisition platform
In this method for processing Internet traffic public opinion information, the data acquisition function is implemented with web crawler technology.
It includes the following three parts:
(1) Web search: search the designated traffic information data sources in real time.
(2) Information monitoring: analyze the traffic public opinion information found in real time and judge whether it meets the traffic managers' information collection requirements.
(3) Data download: download the web page resources related to the traffic news and public opinion and store them in the public opinion information database.
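The three parts above can be sketched as three small Python functions. This is only an illustration under stated assumptions: `fetch` is an injected callable that returns page text for a URL (no real crawler or network access is shown), relevance is judged by keyword hits, and a plain dictionary stands in for the public opinion information database. All names are hypothetical.

```python
def web_search(sources, fetch):
    """Part 1: visit each designated data source and yield (url, text)."""
    for url in sources:
        yield url, fetch(url)


def is_relevant(text, keywords):
    """Part 2: information monitoring, here a simple keyword-hit test."""
    lowered = text.lower()
    return any(kw in lowered for kw in keywords)


def acquire(sources, fetch, keywords, store):
    """Part 3: 'download' relevant pages into the store and return their URLs."""
    collected = []
    for url, text in web_search(sources, fetch):
        if is_relevant(text, keywords):
            store[url] = text       # stand-in for the information database
            collected.append(url)   # would also be passed to data processing
    return collected


# Usage with canned pages instead of real network access.
pages = {
    "http://example.com/a": "Congestion reported after a rear-end collision",
    "http://example.com/b": "Local bakery wins award",
}
database = {}
found = acquire(pages, pages.get, ["congestion", "speeding"], database)
```

Injecting `fetch` keeps the search, monitoring, and download steps testable without a network; a real deployment would replace it with an HTTP client and respect each site's access rules.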
The data acquisition platform realizes the data acquisition function through these three parts. When collecting data, the web crawler searches the whole network according to the traffic management hot-topic dictionary. In this method, the traffic management hot-topic dictionary is the one defined above: seven traffic management topics, each with its own keywords.
2. Data processing platform
In this method for processing Internet traffic public opinion information, the data processing platform applies web mining, text mining, semantic analysis, and similar techniques to the downloaded traffic public opinion information, so that the processing is customized and yields content that meets traffic management business needs. This content includes:
Public opinion news headline
Public opinion news content
Author
Publication time
Data source
Original source
Data acquisition time
Sentiment polarity (positive, negative, neutral)
Original article link
Keywords
Summary
Topic type
Topic information
View count
Repost count
Comment count
Follow-up post count
Region
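The 18 items listed above map naturally onto a record type. The sketch below uses a Python dataclass. The field names, types, and default values are assumptions chosen for illustration, since the patent only names the items.

```python
from dataclasses import dataclass, fields
from datetime import datetime
from typing import List


@dataclass
class OpinionRecord:
    """One processed item of traffic public opinion content (18 fields)."""
    title: str
    content: str
    author: str
    published_at: datetime
    data_source: str
    original_source: str
    collected_at: datetime
    sentiment: str            # "positive", "negative" or "neutral"
    url: str
    keywords: List[str]
    summary: str
    topic_type: str
    topic_info: str
    views: int = 0
    reposts: int = 0
    comments: int = 0
    follow_ups: int = 0
    region: str = ""

    def __post_init__(self):
        # Reject polarity values outside the three listed in the text.
        if self.sentiment not in {"positive", "negative", "neutral"}:
            raise ValueError(f"unknown sentiment: {self.sentiment}")


FIELD_NAMES = [f.name for f in fields(OpinionRecord)]
```

The four counters default to zero because a freshly collected page may not expose them; that default is a design choice of this sketch, not a requirement from the source.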
(1) Data filtering
Data filtering performs basic filtering of the downloaded data content, removing news data that is duplicated, incompletely downloaded, or out of date.
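The three filtering rules can be sketched as a single pass over the downloaded items. The following Python example assumes that each item is a dictionary with `title`, `content`, and `published_at` keys, that duplicates are detected by a hash of the content, and that "out of date" means older than a 30-day window. None of these specifics are stated in the patent.

```python
import hashlib
from datetime import datetime, timedelta


def filter_records(records, now, max_age=timedelta(days=30)):
    """Drop incomplete, expired and duplicate items; keep input order."""
    seen = set()
    kept = []
    for rec in records:
        # Incomplete: a required field is missing or empty.
        if not rec.get("title") or not rec.get("content") \
                or not rec.get("published_at"):
            continue
        # Expired: older than the assumed retention window.
        if now - rec["published_at"] > max_age:
            continue
        # Duplicate: the same content fingerprint was already kept.
        digest = hashlib.sha256(
            rec["content"].strip().encode("utf-8")).hexdigest()
        if digest in seen:
            continue
        seen.add(digest)
        kept.append(rec)
    return kept
```

Exact-hash matching only catches verbatim copies. Reposts with small edits would need a similarity measure instead.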
(2) Semantic analysis
Semantic analysis includes the following functions:
Basic processing: language identification, Chinese word segmentation, part-of-speech tagging, named entity recognition
Syntactic analysis: sentence segmentation, syntactic parsing, subject-verb-object extraction
Text mining: keyword extraction, text classification, text summarization
Text clustering: text similarity, word vectors, text clustering
Sentiment analysis: positive, negative, neutral
Knowledge extraction: entity extraction, relation extraction
Popularity analysis: click volume, repost count, follow-up post count
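As one concrete instance of the text similarity function listed above, the sketch below computes cosine similarity over bag-of-words counts. The patent does not say which similarity measure is used, so this choice is an assumption. The inputs are assumed to be token lists that have already been segmented, for example by a Chinese word segmenter.

```python
import math
from collections import Counter


def cosine_similarity(tokens_a, tokens_b):
    """Cosine similarity between two token lists using raw term counts."""
    a, b = Counter(tokens_a), Counter(tokens_b)
    # Dot product over the terms the two texts share.
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty text is treated as similar to nothing
    return dot / (norm_a * norm_b)
```

Scores near 1 indicate near-identical wording and scores near 0 indicate no shared terms. A clustering step could group items whose pairwise score exceeds a chosen threshold.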
(3) Data processing
The data processing function stores the results of semantic analysis according to a defined encoding format and storage rules, providing the basic data for the business platform.
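The storage step can be sketched as follows. The encoding format and storage rules are not specified in the patent, so this example assumes JSON text as the encoding and one row per article URL in SQLite as the rule. The table name and function names are hypothetical.

```python
import json
import sqlite3


def open_store(path=":memory:"):
    """Open (or create) the store; an in-memory database by default."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS opinion ("
        "url TEXT PRIMARY KEY, payload TEXT NOT NULL)"
    )
    return conn


def save_record(conn, record):
    """Encode the record as JSON and upsert it under its URL."""
    payload = json.dumps(record, ensure_ascii=False, sort_keys=True)
    conn.execute(
        "INSERT OR REPLACE INTO opinion (url, payload) VALUES (?, ?)",
        (record["url"], payload),
    )
    conn.commit()


def load_record(conn, url):
    """Return the decoded record for a URL, or None if it is not stored."""
    row = conn.execute(
        "SELECT payload FROM opinion WHERE url = ?", (url,)
    ).fetchone()
    return json.loads(row[0]) if row else None
```

Keying on the URL means a page that is processed again overwrites its earlier row rather than creating a second copy.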
3. Business processing and data analysis and judgment platform
The business processing platform and the data analysis and judgment platform are business systems for use by public security traffic management personnel.
Business processing includes the following parts:
(1) Public opinion review
For traffic public opinion information that has been collected and processed, the traffic manager needs to confirm whether it is valid, genuine news content and whether the basic classification of the news is reasonable. See accompanying drawing 2.
(2) Public opinion dispatch
After review, valid public opinion information is assigned by default, according to the jurisdiction to which the content belongs, to the traffic police personnel responsible for that jurisdiction, who then handle it. See accompanying drawing 3.
A duty roster is arranged every day to specify, for each jurisdiction and office, the traffic police personnel to be assigned in the public opinion handling stage. See accompanying drawing 4.
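Default assignment from a daily duty roster can be sketched as a lookup keyed by date and jurisdiction. The roster layout, the officer identifiers, and the fallback value below are assumptions made for illustration.

```python
from datetime import date


def dispatch(record, roster, day, fallback="unassigned"):
    """Return the officer on duty for the record's jurisdiction on `day`."""
    return roster.get((day, record["region"]), fallback)


# Hypothetical roster: (date, jurisdiction) -> officer on duty.
duty_roster = {
    (date(2017, 6, 19), "District A"): "officer-001",
    (date(2017, 6, 19), "District B"): "officer-002",
    (date(2017, 6, 20), "District A"): "officer-003",
}

assignee = dispatch({"region": "District A"}, duty_roster, date(2017, 6, 19))
```

Returning a fallback value instead of raising an error lets an operator pick up items whose jurisdiction has no roster entry for that day.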
(3) Public opinion handling
Public opinion handling is the operation by which traffic management personnel handle and close a specific item of public opinion information. See accompanying drawing 5.
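Review, dispatch, and handling form a simple workflow, which can be modeled as a small state machine. The status names and the allowed transitions below are assumptions inferred from the three steps. The patent does not define explicit states.

```python
from enum import Enum


class Status(Enum):
    COLLECTED = "collected"    # processed, waiting for review
    REVIEWED = "reviewed"      # confirmed valid by a traffic manager
    REJECTED = "rejected"      # found invalid during review
    DISPATCHED = "dispatched"  # assigned to the responsible officer
    CLOSED = "closed"          # handled and closed


# Each status maps to the statuses it may move to next.
ALLOWED = {
    Status.COLLECTED: {Status.REVIEWED, Status.REJECTED},
    Status.REVIEWED: {Status.DISPATCHED},
    Status.DISPATCHED: {Status.CLOSED},
    Status.REJECTED: set(),
    Status.CLOSED: set(),
}


def advance(current, target):
    """Return `target` if the transition is allowed, else raise ValueError."""
    if target not in ALLOWED[current]:
        raise ValueError(
            f"cannot move from {current.value} to {target.value}")
    return target
```

Making the transitions explicit prevents, for example, closing an item that was never reviewed.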
Data analysis and judgment platform
Big data analysis is performed on the handling results of traffic public opinion information to produce intuitive, quantified results. See accompanying drawing 6.
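A small aggregation illustrates the kind of quantified result such a platform could display. The metrics chosen here (counts by topic, counts by sentiment polarity, and the share of closed items) are assumptions. The patent does not list specific statistics.

```python
from collections import Counter


def summarize(records):
    """Aggregate handled records into simple counts and a closure rate."""
    by_topic = Counter(r["topic_type"] for r in records)
    by_sentiment = Counter(r["sentiment"] for r in records)
    closed = sum(1 for r in records if r.get("status") == "closed")
    closure_rate = closed / len(records) if records else 0.0
    return {
        "by_topic": dict(by_topic),
        "by_sentiment": dict(by_sentiment),
        "closure_rate": closure_rate,
    }
```

The resulting dictionary could feed a chart or dashboard directly, and further breakdowns such as by region or by day follow the same pattern.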
Finally, it should be noted that the above embodiments merely illustrate the technical solutions of the present invention and do not limit them. Although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that the technical solutions described in the foregoing embodiments may still be modified, or some of their technical features may be replaced by equivalents, and that such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the spirit and scope of the technical solutions of the embodiments of the present invention.
Claims (10)
1. A method for collecting and processing Internet traffic public opinion information, characterized in that: it involves a data acquisition platform, a data processing platform, a business processing and data analysis and judgment platform, a monitoring, early warning and reporting module, a business handling module, and a public opinion information database module; the data acquisition platform, the data processing platform, and the business processing and data analysis and judgment platform are electrically connected in sequence, all three platforms are electrically connected to the public opinion information database, and the business processing and data analysis and judgment platform is electrically connected to the monitoring, early warning and reporting module and the business handling module; the method comprises searching, monitoring, and downloading traffic public opinion information in Internet media through the data acquisition platform according to a preset hot-topic dictionary, storing it in the public opinion information database, and passing it to the data processing platform; the data processing platform filters, analyzes, and processes the downloaded traffic public opinion information according to predetermined requirements to form traffic public opinion content that meets the managers' business needs, stores that content in the public opinion information database, and passes it to the business processing and data analysis and judgment platform; the business processing and data analysis and judgment platform handles and analyzes the traffic public opinion content according to the managers' needs, stores the results in the public opinion information database, and passes them to the business handling module and the monitoring, early warning and reporting module, finally realizing the public opinion review, public opinion dispatch, public opinion handling, and big data analysis functions.
2. The method for collecting and processing Internet traffic public opinion information according to claim 1, characterized in that: the data acquisition platform includes a web search module, an information monitoring module, and a data download module; the data processing platform includes a data filtering module, a semantic analysis module, and a data processing module; the business processing and data analysis and judgment platform includes a business handling module, a terminal decision module, a data statistics module, and an analysis and judgment module.
3. The method for collecting and processing Internet traffic public opinion information according to claim 1, characterized in that: based on web crawler technology and a preset hot-topic dictionary, the data acquisition platform searches the designated traffic public opinion data sources in Internet media, analyzes the traffic public opinion information found in real time, judges whether it meets the traffic managers' collection requirements, downloads the web page resources related to the information that meets those requirements, stores the downloaded traffic public opinion information in the public opinion information database, and at the same time passes it to the data processing platform; based on data mining technology, the data processing platform applies web mining, text mining, and semantic analysis to the downloaded traffic public opinion information according to predetermined requirements so that the processing is customized, performs basic filtering to remove items that are duplicated, incompletely downloaded, or out of date, and after semantic analysis forms traffic public opinion content that meets the traffic managers' business needs, which is stored in the public opinion information database according to the prescribed encoding format and storage rules and at the same time passed to the business processing and data analysis and judgment platform; the business processing and data analysis and judgment platform is open to public security traffic management personnel through a human-computer interaction device and provides the business processing and data analysis and judgment functions for the traffic public opinion content according to their needs, and the results are stored in the public opinion information database and at the same time passed to the business handling module and the monitoring, early warning and reporting module, finally realizing the public opinion review, public opinion dispatch, public opinion handling, and big data analysis functions.
4. The method for collecting and processing Internet traffic public opinion information according to claim 3, characterized in that: the hot-topic dictionary includes 7 traffic management topics and a number of keywords for each topic.
5. The method for collecting and processing Internet traffic public opinion information according to claim 4, characterized in that: the 7 traffic management topics are traffic incidents, hot topics, traffic administration, traffic organization and control, malicious speech, traffic suggestions and problem reports, and policies and regulations.
6. The method for collecting and processing Internet traffic public opinion information according to claim 5, characterized in that: the keywords for traffic incidents include loss of vehicle control, congestion, car crash, construction, lane occupation, collision, scrape, female driver, traffic accident, hit-and-run, accident, rollover, rear-end collision, expressway, highway, tire blowout, spontaneous combustion, road conditions, and truck; the keywords for hot topics include online ride-hailing, shared bicycle, carpooling, car rental, green travel, and electric vehicle; the keywords for traffic administration include traffic violation, rule violation, violation of law and discipline, traffic ticket, overloading, wrong-way driving, speeding, illegal parking, cloned license plate, electronic police, camera capture, running a red light, rushing a red light, drunk driving, driving without a license, driving while intoxicated, parking violation, fake number plate, fake license plate, fake car plate, cloned plate, fleeing vehicle, assaulting police, joyriding, and running a checkpoint; the keywords for traffic organization and control include events, security, traffic police, traffic policeman, traffic police headquarters, traffic police detachment, traffic police brigade, traffic police squadron, vehicle administration office, auxiliary police, odd-even license plate rule, driving restriction, speed limit, no entry, surprise checkpoint deployment, road closure, driving license, driving school, traffic flow, toll station, and detour; the keywords for malicious speech include bribery, embezzlement, corruption, hitting people, verbal abuse, and gift-giving; the keywords for traffic suggestions and problem reports include arbitrary charges, taking illicit money, illegal vehicle, failure to follow the law, impartial law enforcement, traffic lights, illegal charging, and entrapment; the keywords for policies and regulations include encouraging carpooling, shared travel, road markings, speed limit signs, traffic signs, traffic publicity, and traffic safety.
7. The method for collecting and processing Internet traffic public opinion information according to claim 3, characterized in that: web mining extracts 18 key items, namely public opinion news headline, public opinion news content, author, publication time, data source, original source, data acquisition time, sentiment polarity, original article link, keywords, summary, topic type, topic information, view count, repost count, comment count, follow-up post count, and region.
8. The method for collecting and processing Internet traffic public opinion information according to claim 3, characterized in that: the semantic analysis includes 7 functions, namely basic processing, syntactic analysis, text mining, text clustering, sentiment analysis, knowledge extraction, and popularity analysis.
9. The method for collecting and processing Internet traffic public opinion information according to claim 3, characterized in that: the business processing function includes public opinion review, public opinion dispatch, and public opinion handling; in public opinion review, the traffic manager checks whether the collected and processed traffic public opinion information is valid, genuine news content and whether its basic classification is reasonable; in public opinion dispatch, the system assigns each valid item, by default and according to the jurisdiction it belongs to, to the traffic police personnel responsible for that jurisdiction; in public opinion handling, traffic management personnel handle and close a specific item of traffic public opinion information.
10. The method for collecting and processing Internet traffic public opinion information according to claim 3, characterized in that: the data analysis and judgment function performs big data analysis on the handling results of traffic public opinion information and displays them in an intuitive, digital form.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710461333.8A CN107203641A (en) | 2017-06-19 | 2017-06-19 | A kind of method of the collection of Internet traffic public feelings information and processing |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710461333.8A CN107203641A (en) | 2017-06-19 | 2017-06-19 | A kind of method of the collection of Internet traffic public feelings information and processing |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107203641A true CN107203641A (en) | 2017-09-26 |
Family
ID=59907469
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710461333.8A Pending CN107203641A (en) | 2017-06-19 | 2017-06-19 | A kind of method of the collection of Internet traffic public feelings information and processing |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107203641A (en) |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | Application publication date: 20170926 |