CN110489400A - A kind of realization people's vehicle acquisition data quasi real time associated algorithm model - Google Patents

A kind of realization people's vehicle acquisition data quasi real time associated algorithm model Download PDF

Info

Publication number
CN110489400A
CN110489400A CN201910784988.8A CN201910784988A CN110489400A CN 110489400 A CN110489400 A CN 110489400A CN 201910784988 A CN201910784988 A CN 201910784988A CN 110489400 A CN110489400 A CN 110489400A
Authority
CN
China
Prior art keywords
occurrence
matching degree
people
algorithm model
vehicle
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910784988.8A
Other languages
Chinese (zh)
Inventor
严俊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhan Bai Hong Software Technology Co Ltd
Original Assignee
Wuhan Bai Hong Software Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan Bai Hong Software Technology Co Ltd filed Critical Wuhan Bai Hong Software Technology Co Ltd
Priority to CN201910784988.8A priority Critical patent/CN110489400A/en
Publication of CN110489400A publication Critical patent/CN110489400A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/211Schema design and management
    • G06F16/212Schema design and management with details for data modelling support
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Tourism & Hospitality (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Educational Administration (AREA)
  • Health & Medical Sciences (AREA)
  • Economics (AREA)
  • Development Economics (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • Traffic Control Systems (AREA)

Abstract

The present invention proposes that a kind of realization people's vehicle acquires data quasi real time associated algorithm model.It includes several front-end collection equipment, server, disk array, big data component, database etc..Wherein front-end collection equipment is mainly used for acquiring the information of license plate, face and associated electronic device and records collecting location and acquisition time formation track data and report, server combination big data component cleans the track data reported, be put in storage and algorithm model calculates, disk array is used to store the track data of storage, big data component includes: Hadoop, Flume, Kafka, ElasticSearch etc., and database MySQL is used to store the relevant information of front-end collection equipment.Algorithm model various dimensions of the invention consider that reality factor calculates the associated matching degree of people's vehicle, adapt to reality scene and environment, and the accurate people's vehicle of finding out of energy is associated with and provides with reference to matching degree, provide important support for case investigation.

Description

A kind of realization people's vehicle acquisition data quasi real time associated algorithm model
Technical field
The present invention relates to terminal acquisition fields more particularly to a kind of realization people's vehicle to acquire data quasi real time associated calculation Method model.
Technical background
The association of people's vehicle, which investigates and prosecute to case, provides important technical support.People in the present invention refers mainly to hand-hold electronic equipments Identification code, is also possible to facial image, and vehicle refers to license plate number.Equipment, which is acquired, by various terminals acquires space-time trajectory data, Carrying out space-time analysis excavation to these space-time trajectory datas accurate can obtain people Che Guanlian and association matching degree.
Because of various practical reasons, electronic device information acquire equipment acquisition rate it is limited, improve each electronic equipment with The quasi real time association accuracy of license plate needs various dimensions to consider to calculate.
The dimension of consideration mainly has:
(1) significance level in time and space is different.It is different for the closeness of different time personnel, when low-density People's vehicle co-occurrence is more credible, and spatially there is also the differences of density of personnel and flow of personnel speed, and the lower flowing of density of personnel is more Slow place, people's vehicle co-occurrence is more credible, and corresponding weight is also higher.
(2) external concern is different with ownership place weight.Certain electronic equipments have ground domain identifier, if the vehicle of identical region Board and electronic equipment co-occurrence, confidence level is higher, can increase its weight.
(3) it is limited by acquisition rate, there may be individual acquisition floor drains to adopt in the period of concern, at this moment considers that history is closed Connection supplements real time correlation.Historical context is by an accumulative co-occurrence number of program record and according to the same related compounds The co-occurrence number for being associated object with other calculates confidence level.
(4) consider reality, co-occurrence number reaches certain value, that is, has higher confidence level, avoids people and Che same Time locus points difference is larger and the lesser situation of co-occurrence accounting causes matching degree lower;
(5) consider that there are the unreachable situations in track further to screen to result for people's wheel paths.
Summary of the invention
To achieve the goals above, technical solution provided by the present application is as follows:
A variety of front end data acquisition equipments are mainly used for acquiring the information and record of license plate, face and associated electronic device Collecting location and acquisition time form track data and report.
Server and big data component clean the track data reported, be put in storage and algorithm model calculates.Big data Component includes: Hadoop, Flume, Kafka, ElasticSearch etc..
MySQL database is used to store the relevant information of front-end collection equipment.
Disk array is used to store the track data of storage.
Compared with prior art, the invention has the following beneficial effects: algorithm model various dimensions to consider that reality factor calculates The associated matching degree of people's vehicle adapts to reality scene and environment, and the accurate people's vehicle of finding out of energy is associated with and provides with reference to matching degree, is Case investigation provides important support.
Detailed description of the invention
Fig. 1 is system diagram according to the present invention;
Fig. 2 is algorithm model flow chart of the invention.
Specific embodiment
With reference to the accompanying drawings and detailed description, technical solution of the present invention is described in detail.
Technical solution of the present invention mainly consists of two parts: front end data acquisition and algorithm model.
Front end data acquisition is to acquire equipment and electronic information by being mounted on the camera image at each main traffic crossing Acquire devices collect data and reported data center.When data information includes collecting location (place) and the acquisition of collected target Between and correlation attribute information.Data flow: front-end collection equipment -- > Flume-- > Kafka-- > ElasticSearch.
The emphasis of the invention is in algorithm model, below with (other similar) association of certain electron-like acquisition keyword keyword For license plate keyword, detailed algorithm model flow is introduced.
(1) ElasticSearch is passed through according to keyword and time started stamp (sTime) and ending time stamp (eTime) Track (removal place and time all identical data) are inquired, for the place of these tracing points, if set without license plate acquisition It is standby then remove corresponding tracing point, our available effective track points (trajecotryLen), in effective tracing point Place, which is extracted, obtains a digit (sitecodeCount) with duplicate removal;
(2) vehicle that the tracing point (sitecode#time) at the appointed time poor (timeDiff) inquired in (1) respectively occurs Board, when such as inquiring license plate and its acquisition that place sitecode occurs between time-timeDiff to time+timeDiff Between;
(3) data obtained in (2) are converted, the corresponding co-occurrence tracing point of license plate of each potential colleague, co-occurrence rail are obtained Mark points are co-occurrence number (sharetotal), extract available co-occurrence after duplicate removal to the place in co-occurrence tracing point Point digit (sharespacesize).
It is as follows to be associated with matching degree benchmark:
Based on sitecodeCount, one matching degree loss factor facterParameter (being defaulted as 0.7) is set, such as:
Facter=1- (facterParameter/Math.pow (2, sitecodeCount-1))
Wherein sitecodeCount -1 power that Math.pow (2, sitecodeCount-1) is 2, when SitecodeCount is smaller, and matching degree loss is bigger.
The calculation of comprehensive matching degree is as follows:
Samerate=(w1*sharetotal/trajecotryLen+w2*sharespacesize/ sitecodeCount)*facter
Wherein w1 and w2 is respectively co-occurrence number weight and co-occurrence place number weight, and default assigns w1=0.4, w2=0.6, It can be adjusted according to real data situation.
The general frame of the invention algorithm model above, introduce one by one below several dimensions for considering in algorithm model because Element:
1) consider that the significance level of time and place are different (weight coefficient is configurable).
Time co-occurrence weight is embodied in sharetotal, divides time into four periods: 7-9,10-16,17-21, 22-6, for the time of each tracing point of keyword, the corresponding co-occurrence weight of different periods is different, wherein 7-9 and 17-21 co-occurrence weight is 1;10-16 co-occurrence weight is 2;22-6 co-occurrence weight is 3, to all co-occurrence situation and different co-occurrences Weight sums to obtain sharetotal.
Place co-occurrence weight is embodied in sharespacesize, for each tracing point of keyword to be associated Place significance level is also classified into three grades by place, general/important/extremely important, corresponding co-occurrence place weight difference It is 1/2/3, all co-occurrence situations and different places co-occurrence weight is summed to obtain sharespacesize.
2) increase matching degree when considering two class keywords with ownership place for keyword and license plate:
The ownership place paid close attention to by setting keyword, is divided into three grades here, general/important/extremely important, different Grade controls different facterParameter loss factors, its more important corresponding loss factor is smaller, as: FacterParameter=0.7;Important facterParameter=0.5;Extremely important facterParameter=0.3.
3) consider that (historical context needs another program record co-occurrence number and calculates confidence the case where historical context Degree), if including the biggish license plate of historical context confidence level in the license plate that association comes out, by the historical context confidence level RelSamerate (if there is no historical context, then the value is 0) to be added in above-mentioned matching degree calculating, specific as follows:
SamerateR=samerate* (1-relSamerate)+relSamerate*Math.max (samerate, relSamerate)
4) consider co-occurrence situation, i.e., if when co-occurrence number/co-occurrence place number reaches certain numerical value, association has been compared It is accurate, at this moment wish that whole matching degree is larger, the specific co-occurrence factor (shareFacter) calculation method is as follows:
ShareFacterT=1- (facterParameter/Math.pow (1.3, shareTotal-1))
ShareFacterS=1- (facterParameter/Math.pow (1.3, shareSpacesize-1))
ShareFacter=w1*shareFacterT+w2*shareFacterS
Wherein facterParameter, w1 are with w2 as above-mentioned value.
Then co-occurrence factor ladder is given according to co-occurrence number (sharetotal) and co-occurrence place number (sharespacesize) Degree assigns weight (meanFacter):
Mean=(shareTotal+shareSpacesize)/2
If mean < 3, meanFacter=0.5;If 3≤mean < 5, meanFacter=0.6;If 5≤ Mean < 8, meanFacter=0.7;If 8≤mean < 12, meanFacter=0.8;If mean > 12, meanFacter =0.9
Final matching degree are as follows: (1-meanFacter) * samerateR+meanFacter*shareFacter
5) postsearch screening interface is provided for association results, if imsi1 is with plate1 is obtained, interface can detect imsi1 It whether there is the inaccessible situation in tracing point path between [sTime, eTime] with plate1.
The above, the only specific embodiment of the embodiment of the present application, but the protection scope of the embodiment of the present application is not It is confined to this, anyone skilled in the art can think easily in the technical scope that the embodiment of the present application discloses To change or replacement, should all cover within the protection scope of the embodiment of the present application.Therefore, the protection scope of the embodiment of the present application It should be based on the protection scope of the described claims.

Claims (2)

1. a kind of realization people's vehicle acquires data quasi real time associated algorithm model system characterized by comprising several front end numbers According to acquisition equipment, server, disk array, big data component, MySQL database;Wherein front-end collection equipment is mainly used for adopting Collect the information of license plate, face and associated electronic device and records collecting location and acquisition time formation track data and report;Clothes Business device and big data component the track data reported is cleaned, be put in storage and algorithm model calculate;Disk array is for storing The track data of storage;MySQL database is used to store the relevant information of front-end collection equipment.
2. a kind of real time correlation algorithm model, which is characterized in that dug using different type track data source by multi dimensional analysis The incidence relation and calculating matching degree between it are dug, here either " people " association " vehicle ", is also possible to " vehicle " association " people ", Except this be also applied to it is interrelated between other different acquisition features.
For track trajecotry of some electronic equipment within sTime to the eTime time, tracing point length is TrajecotryLen is sitecodeCount to number after the duplicate removal of tracing point place, when looking for its front and back by each tracing point Between the license plate that occurs of difference timeDiff, it is total to calculate license plate that each co-occurrence is crossed the license plate as crossed with this electronic equipment co-occurrence The place duplicate removal number sharespacesize of existing number sharetotal and co-occurrence, is arranged a matching degree loss factor FacterParameter is calculated match degree factor (facterParameter is defaulted as 0.7): facter=1- (facterParameter/Math.pow(2,sitecodeCount-1))
Wherein sitecodeCount -1 power that Math.pow (2, sitecodeCount-1) is 2, when sitecodeCount is got over Small, matching degree loss is bigger.
The calculation of comprehensive matching degree is as follows:
Samerate=(w1*sharetotal/trajecotryLen+w2*sharespacesize/sitecodeCo unt) * facter
Wherein w1 and w2 is respectively co-occurrence number weight and co-occurrence place number weight, and default assigns w1=0.4, w2=0.6, can be with It is adjusted according to real data situation.
Furthermore consider the time and space significance level it is different, by control reset section sharetotal and The size of sharespacesize achievees the purpose that adjust matching degree samerate;External concern ownership place weight is different, leads to It crosses control and resets section facterParameter size, achieve the purpose that adjust matching degree samerate;Consider that historical context is made For supplement, historical context confidence level is assigned to weight and the final matching degree of samerate fusion calculation;Consider real co-occurrence number Reaching certain value just has higher confidence level, and gradient weight is arranged by control co-occurrence number and co-occurrence place number and controls newest matching Degree;Consider the unreachable further screening of people's wheel paths, compares whether two tracks of people's vehicle within a specified time whether there is space Unreachable situation, if it does, it may be considered that the association is insincere.
CN201910784988.8A 2019-08-23 2019-08-23 A kind of realization people's vehicle acquisition data quasi real time associated algorithm model Pending CN110489400A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910784988.8A CN110489400A (en) 2019-08-23 2019-08-23 A kind of realization people's vehicle acquisition data quasi real time associated algorithm model

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910784988.8A CN110489400A (en) 2019-08-23 2019-08-23 A kind of realization people's vehicle acquisition data quasi real time associated algorithm model

Publications (1)

Publication Number Publication Date
CN110489400A true CN110489400A (en) 2019-11-22

Family

ID=68551739

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910784988.8A Pending CN110489400A (en) 2019-08-23 2019-08-23 A kind of realization people's vehicle acquisition data quasi real time associated algorithm model

Country Status (1)

Country Link
CN (1) CN110489400A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110909262A (en) * 2019-11-29 2020-03-24 北京明略软件***有限公司 Method and device for determining companion relationship of identity information
CN111260140A (en) * 2020-01-19 2020-06-09 武汉中科通达高新技术股份有限公司 Method for predicting instantaneous return large passenger flow in subway station
CN112654035A (en) * 2020-11-20 2021-04-13 深圳市先创数字技术有限公司 Graph code association method, system and storage medium based on mobile terminal feature code
CN113327336A (en) * 2021-06-03 2021-08-31 厦门科拓通讯技术股份有限公司 Method and device for identifying man-vehicle relationship and electronic equipment

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014034310A1 (en) * 2012-08-30 2014-03-06 株式会社日立製作所 Information analysis system and information analysis method
US20150363508A1 (en) * 2014-06-17 2015-12-17 Naveen NANDAN Grid-based analysis of geospatial trajectories
CN105575118A (en) * 2015-12-25 2016-05-11 银江股份有限公司 Screening method of personnel without driving qualification
CN108170834A (en) * 2018-01-12 2018-06-15 南京理工大学 A kind of determining method of mobile target association co-occurrence pattern
CN108765018A (en) * 2018-05-31 2018-11-06 重庆市城投金卡信息产业股份有限公司 Based on the associated adaptive advertisement pushing method and system of people's vehicle
CN109165237A (en) * 2018-08-28 2019-01-08 新华三大数据技术有限公司 Method, apparatus and electronic equipment are determined with object
CN109543312A (en) * 2018-11-27 2019-03-29 珠海市新德汇信息技术有限公司 A kind of space-time investigation analysis method and system
CN109918368A (en) * 2019-03-27 2019-06-21 成都市公安科学技术研究所 A kind of system and method that vehicle driver is identified by Track association degree
CN109947758A (en) * 2019-04-03 2019-06-28 深圳市甲易科技有限公司 A kind of route crash analysis method in Behavior-based control track library

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014034310A1 (en) * 2012-08-30 2014-03-06 株式会社日立製作所 Information analysis system and information analysis method
US20150363508A1 (en) * 2014-06-17 2015-12-17 Naveen NANDAN Grid-based analysis of geospatial trajectories
CN105575118A (en) * 2015-12-25 2016-05-11 银江股份有限公司 Screening method of personnel without driving qualification
CN108170834A (en) * 2018-01-12 2018-06-15 南京理工大学 A kind of determining method of mobile target association co-occurrence pattern
CN108765018A (en) * 2018-05-31 2018-11-06 重庆市城投金卡信息产业股份有限公司 Based on the associated adaptive advertisement pushing method and system of people's vehicle
CN109165237A (en) * 2018-08-28 2019-01-08 新华三大数据技术有限公司 Method, apparatus and electronic equipment are determined with object
CN109543312A (en) * 2018-11-27 2019-03-29 珠海市新德汇信息技术有限公司 A kind of space-time investigation analysis method and system
CN109918368A (en) * 2019-03-27 2019-06-21 成都市公安科学技术研究所 A kind of system and method that vehicle driver is identified by Track association degree
CN109947758A (en) * 2019-04-03 2019-06-28 深圳市甲易科技有限公司 A kind of route crash analysis method in Behavior-based control track library

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
刘伟昆等: "基于图码联侦全息档案的重点人员管控***", 警察技术, pages 169 - 171 *
周川等: "视频大数据分析及其在公安合成作战中的应用", 警察技术, pages 11 - 14 *
樊志英;: "一种卡口车辆轨迹相似度算法的研究和实现", 现代电子技术, no. 23, 1 December 2016 (2016-12-01), pages 133 - 135 *
陈天钟等: "浅论现代公安社区管理下的平安社区建设", 警察技术, pages 69 - 71 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110909262A (en) * 2019-11-29 2020-03-24 北京明略软件***有限公司 Method and device for determining companion relationship of identity information
CN110909262B (en) * 2019-11-29 2022-10-25 北京明略软件***有限公司 Method and device for determining companion relationship of identity information
CN111260140A (en) * 2020-01-19 2020-06-09 武汉中科通达高新技术股份有限公司 Method for predicting instantaneous return large passenger flow in subway station
CN112654035A (en) * 2020-11-20 2021-04-13 深圳市先创数字技术有限公司 Graph code association method, system and storage medium based on mobile terminal feature code
CN112654035B (en) * 2020-11-20 2023-12-05 深圳市先创数字技术有限公司 Picture code association method, system and storage medium based on mobile terminal feature code
CN113327336A (en) * 2021-06-03 2021-08-31 厦门科拓通讯技术股份有限公司 Method and device for identifying man-vehicle relationship and electronic equipment
CN113327336B (en) * 2021-06-03 2023-02-28 厦门科拓通讯技术股份有限公司 Method and device for identifying people-vehicle relationship and electronic equipment

Similar Documents

Publication Publication Date Title
CN110489400A (en) A kind of realization people&#39;s vehicle acquisition data quasi real time associated algorithm model
Fan et al. Social sensing in disaster city digital twin: Integrated textual–visual–geo framework for situational awareness during built environment disruptions
Holm et al. The development of a system for monitoring trend in range condition in the arid shrublands of Western Australia.
CN106707099A (en) Monitoring and locating method based on abnormal electricity consumption detection module
EP2834606A1 (en) A method and system for source selective real-time monitoring and mapping of environmental noise
Webber et al. Rapid assessment of surface-water flood-management options in urban catchments
CN106254137A (en) The alarm root-cause analysis system and method for supervisory systems
CN104182466A (en) House information base network system
CN105976446A (en) Intelligent chest card-based convention and exhibition method and system
CN110322688A (en) A kind of method of data processing, the method for data query and relevant device
CN109885601A (en) A kind of power distribution network basic data collection and analysis system
Ou et al. A data‐driven approach to determining freeway incident impact areas with fuzzy and graph theory‐based clustering
CN113627678B (en) Method and system for measuring and calculating power distribution station area flooded by rainstorm induced flood
Boyce et al. Negative binomial models for abundance estimation of multiple closed populations
CN113887895A (en) Flexible city intelligent planning system, method and storage medium
CN106412507A (en) Intelligent monitoring method and system of personnel flow
CN109344190A (en) A kind of police service data processing method and device
CN110751092B (en) Agricultural monitoring method and device based on Internet of things, storage medium and electronic equipment
CN113779136B (en) Knowledge-graph-based debt collection object determining method and device and electronic equipment
Pace et al. Locating Domestic Well Communities in California: A Methodological Overview
CN114519267A (en) Data updating method of underground cable model
Feltynowski Unsustainable spatial planning–the example of communities of the central region
CN109300024A (en) A kind of real-estate market monitoring and analysis system and its application method based on big data
CN104866482A (en) Wide area dynamic personnel real-time trace positioning information overlaying fuzzy quick inquiry system
Hayashi et al. Composition of simulation data for large-scale disaster estimation

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination