CN106203756A - A kind of trade effluent pollutant Source Tracing method of feature based data base - Google Patents

A kind of trade effluent pollutant Source Tracing method of feature based data base Download PDF

Info

Publication number
CN106203756A
CN106203756A CN201510224532.8A CN201510224532A CN106203756A CN 106203756 A CN106203756 A CN 106203756A CN 201510224532 A CN201510224532 A CN 201510224532A CN 106203756 A CN106203756 A CN 106203756A
Authority
CN
China
Prior art keywords
factory
pollutant
data base
sample
source
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510224532.8A
Other languages
Chinese (zh)
Inventor
王德庆
张伟宁
刘增武
刘庆文
吴�琳
时晓梅
张文吉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
760 RESEARCH INSTITUTE OF CSIC
Original Assignee
760 RESEARCH INSTITUTE OF CSIC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 760 RESEARCH INSTITUTE OF CSIC filed Critical 760 RESEARCH INSTITUTE OF CSIC
Priority to CN201510224532.8A priority Critical patent/CN106203756A/en
Publication of CN106203756A publication Critical patent/CN106203756A/en
Pending legal-status Critical Current

Links

Landscapes

  • Removal Of Specific Substances (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

A kind of trade effluent pollutant Source Tracing method of feature based data base, belongs to trade effluent pollutant regulation technique field.Its feature includes: set up the feature weight data base M of all types of factories pollutant parameter, gather the pollutant data that each factory in target area drains the sewage away, build the characteristic Sample Storehouse N of target area factory blowdown, described characteristic Sample Storehouse N is utilized to build the class variable of KD-tree, each pollutant measurement value vector to be identified z of input mixing industrial wastewater, use described kNN grader to carry out match cognization with the data in characteristic Sample Storehouse N, complete pollutant and trace to the source.The present invention can complete the pollutant of various region mixing industrial wastewater and trace to the source, provide the sequence of target discharge factory fast and effectively, the suitability and universality are strong, investigate pollution source factory in order provide technical support for the functional department that is correlated with of government, substantially increase investigation efficiency and success rate.

Description

A kind of trade effluent pollutant Source Tracing method of feature based data base
Technical field
The invention belongs to trade effluent pollutant regulation technique field, relate to a kind of trade effluent pollutant and trace to the source Analysis method, a kind of trade effluent pollutant Source Tracing method of feature based data base.
Background technology
The water pollution problems of China is quite serious at present, and this has remained incessant after repeated prohibition directly with the row's phenomenon steathily that illegally exceeds standard The relation connect.The supervision strengthening all types of industries sewage effluent has become as the important content of governments at all levels' work, Effectively supervision to illegal super row still lacks effective technological means with tracing to the source to call to account.Traditional skill of tracing to the source Art is typically all after contamination accident occurs, and is sampled the plant effuent near polluter upstream and downstream analyzing, Workload longer, poor in timeliness big, time-consuming, is often difficult to find polluter in time.
The most also emerge some new tracing technologies, but be typically all for a certain class industrial wastewater or Some anion and carry out, or in terms of regulatory management propose method, such as China Environmental Science The source tracing method of risk source " a kind of water environment pollution accident that happens suddenly be accused of " disclosed in institute 2012, publication number For CN102622670A, for another example disclosed in South China Institute of Environmental Sciences. MEP 2014 " one Plant the source tracing method of sudden water environment event polluter under limited conditions ", Publication No. CN104103005A, The method mainly gives a system, and its Matching Model is suitable for the most under the limited conditions.
Still lack a kind of source tracing method being generally suitable for various industry composite waste, meet variety classes industrial occupancy The demand of tracing to the source of territory combined sewage.
Summary of the invention
The invention aims to overcome the deficiency of above-mentioned background technology, it is provided that a kind of feature based data base Trade effluent pollutant Source Tracing method, it is possible to be applicable to various industry composite wastes trace to the source.
The trade effluent pollutant Source Tracing method such as Fig. 1 of a kind of feature based data base that the present invention provides Shown in, comprise the following steps that:
(1) collect all types of factory owner and want the pollutant parameter of sewage effluent, set up all types of factory and pollute The feature weight data base M of thing parameter;
(2) gather the pollutant data that each factory in target area drains the sewage away, build the blowdown of target area factory Characteristic Sample Storehouse N;
(3) utilize described characteristic Sample Storehouse N to build the class variable of KD-tree, classify as kNN The identification model of device;
(4) gather industrial computer by integrated form pollutant obtain or be directly manually entered mixing industrial wastewater Each pollutant measurement value, as to be identified vector z;
(5) described kNN grader is used to carry out match cognization with the data in characteristic Sample Storehouse N, In the matching process, described feature weight data base M is utilized according to factory types, target factory to be weighted, The sequence of target discharge factory is given according to matching degree;
(6) target discharge factory title is shown in order.
The core identification algorithm of described kNN grader is:
Input: the training sample set D of described characteristic Sample Storehouse N, described vector z to be identified, sample Class label collection L.
Output:czThe classification of ∈ L, i.e. sample vector.
For each sample y ∈ D do
Calculate z to y distance d (z, y)
end
K the training sample closest with z is selected to constitute set N,
c z = arg max v ∈ L Σ y ∈ N I ( v = class ( c y ) )
Wherein, k is the number of the predetermined target discharge factory identified, I () is indicator function, if condition becomes Vertical value is 1, if condition is false, value is 0.
Treatment of details principle in kNN grader includes:
(z is y) sample to be identified and the measuring similarity of sample in training sample database, in representing feature space to A.d The distance of two points.Other mathematical distance alternative have multiple, such as Euler's distance, mahalanobis distance, Manhatton distance, Chebyshev's distance, Min Shi distance etc.;
B. use weighted voting algorithm carry out kind judging, take the principle that the minority is subordinate to the majority, in neighbour which Which class the point of classification is the most just divided into, and weight is chosen by factory types according to feature weight data base M;
C. characteristic vector is normalized, reduces the impact that variable codomain is adjusted the distance;
D. the method using principal component analysis, reduces the high-dimensional impact adjusted the distance and weigh.
The invention has the beneficial effects as follows, it is possible to the pollutant completing various region mixing industrial wastewater are traced to the source, soon Speed effectively provides the sequence of target discharge factory, and the suitability is strong with universality, is correlated with functional department for government Investigation pollution source factory provides technical support in order, substantially increases investigation efficiency and success rate.
Accompanying drawing explanation
Fig. 1 trade effluent pollutant Source Tracing method flow diagram.
Fig. 2 target area factory title diagram.
Detailed description of the invention:
The present invention is described in further detail with embodiment below in conjunction with the accompanying drawings, but this embodiment should not be managed Solve as limitation of the present invention.
In the present embodiment, industrial park, selection Daliang City, target area, in region, factory includes 12, As shown in Figure 2, each factory drains the sewage away pollutant kind 28 kinds specific name, including benzene, pH value, ammonia Nitrogen, colourity, toluene, total nitrogen, total cadmium, total chromium, total nickel, total arsenic, total copper, total zinc, dimethylbenzene, fluorine Compound, volatile phenol etc..
Trace to the source according to following operating procedure:
(1) collect all types of factory owner and want the pollutant parameter of sewage effluent, set up all types of factory and pollute The feature weight data base M1 of thing parameter;
(2) gather 28 kinds of pollutant data that each factory shown in Fig. 2 drains the sewage away, build target area work The characteristic Sample Storehouse N1 of factory's blowdown, and according to type, factory is classified;
(3) utilize described characteristic Sample Storehouse N1 to build the class variable of KD-tree, divide as kNN The identification model of class device;
(4) manually input mixes each pollutant value of industrial wastewater, as vector z1 to be identified;
(5) described kNN grader is used to carry out match cognization with the data in characteristic Sample Storehouse N1, In the matching process, described feature weight data base M1 is utilized according to factory types, target factory to be added Power, provides the sequence of target discharge factory according to matching degree;
(6) described target discharge factory title is shown in order.
In described kNN classifier algorithm, described distance d (z1, y) selects Euclidean distance, and formula is as follows:
d euc ( x , y ) = [ Σ j = 1 d ( x j - y j ) 2 ] 1 2 = [ ( x - y ) ( x - y ) T ] 1 2
In described kNN classifier algorithm, k value is chosen as 3, and franchise repeated root is according to feature weight data base M1 Choose by factory types, characteristic vector z1 has been carried out normalization, the method that have employed principal component analysis.

Claims (1)

1. the trade effluent pollutant Source Tracing method of a feature based data base, it is characterised in that
(1) collect all types of factory owner and want the pollutant parameter of sewage effluent, set up all types of factory and pollute The feature weight data base M of thing parameter;
(2) gather the pollutant data that each factory in target area drains the sewage away, build the blowdown of target area factory Characteristic Sample Storehouse N;
(3) utilize described characteristic Sample Storehouse N to build the class variable of KD-tree, classify as kNN The identification model of device;
(4) gather industrial computer by integrated form pollutant obtain or be directly manually entered mixing industrial wastewater Each pollutant measurement value, as to be identified vector z;
(5) described kNN grader is used to carry out match cognization with the data in characteristic Sample Storehouse N, In the matching process, described feature weight data base M is utilized according to factory types, target factory to be weighted, The sequence of target discharge factory is given according to matching degree;
Use weighted voting algorithm to carry out kind judging, take the principle that the minority is subordinate to the majority, which class in neighbour Which class other point is the most just divided into, and weight is chosen by factory types according to feature weight data base M;
Characteristic vector is normalized, reduces the impact that variable codomain is adjusted the distance;
The method using principal component analysis, reduces the high-dimensional impact adjusted the distance and weigh;
The core identification algorithm of described kNN grader is:
Input: the training sample set D of described characteristic Sample Storehouse N, described vector z to be identified, sample Class label collection L;
Output:czThe classification of ∈ L, i.e. sample vector;
For each sample y ∈ D do
Calculate z to y distance d (z, y)
end
K the training sample closest with z is selected to constitute set N,
c z = arg max v ∈ L Σ y ∈ N I ( v = class ( c y ) )
Wherein, k is the number of the predetermined target discharge factory identified, I () is indicator function, if condition becomes Vertical value is 1, if condition is false, value is 0;
(z, is y) sample to be identified and the measuring similarity of sample in training sample database to d, represents in feature space two The distance of individual point;
(6) target discharge factory title is shown in order.
CN201510224532.8A 2015-05-05 2015-05-05 A kind of trade effluent pollutant Source Tracing method of feature based data base Pending CN106203756A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510224532.8A CN106203756A (en) 2015-05-05 2015-05-05 A kind of trade effluent pollutant Source Tracing method of feature based data base

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510224532.8A CN106203756A (en) 2015-05-05 2015-05-05 A kind of trade effluent pollutant Source Tracing method of feature based data base

Publications (1)

Publication Number Publication Date
CN106203756A true CN106203756A (en) 2016-12-07

Family

ID=57458976

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510224532.8A Pending CN106203756A (en) 2015-05-05 2015-05-05 A kind of trade effluent pollutant Source Tracing method of feature based data base

Country Status (1)

Country Link
CN (1) CN106203756A (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108596473A (en) * 2018-04-20 2018-09-28 上海海洋大学 A kind of basin pollution management system and new accounting method
CN109709293A (en) * 2018-11-27 2019-05-03 聚光科技(杭州)股份有限公司 The recognition methods of sewerage steathily
CN109886830A (en) * 2019-01-02 2019-06-14 同济大学 A kind of water supply network pollution sources tracking positioning method based on customer complaint information
CN110335047A (en) * 2018-03-30 2019-10-15 中国石油化工股份有限公司 A kind of oil product Source Tracing system
CN110531043A (en) * 2019-08-29 2019-12-03 中水三立数据技术股份有限公司 A kind of basin water environment pollution intelligent management system and its method
CN110729026A (en) * 2019-10-28 2020-01-24 中国科学院生态环境研究中心 Polychlorinated biphenyl space-time quantitative tracing method based on combination of mixed list construction and space multi-medium model simulation
CN111882473A (en) * 2020-07-23 2020-11-03 南京财经大学 Zero-direct-emission tracing method for rain and sewage pipe network
CN112184090A (en) * 2020-11-30 2021-01-05 广东浩迪创新科技有限公司 Standard electricity utilization feature library establishing method, environment-friendly monitoring method, system and monitor
CN112903660A (en) * 2021-03-11 2021-06-04 广西大学 Method for judging current situation and source of pollution of watershed water body
CN117373556A (en) * 2023-12-04 2024-01-09 山东清控生态环境产业发展有限公司 Tracing instrument and system based on multidimensional data

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102622670A (en) * 2012-02-20 2012-08-01 中国环境科学研究院 Tracing method for suspected risk sources of sudden water environment pollution accidents
US20140088883A1 (en) * 2007-12-03 2014-03-27 Charles C. Zhou Method and System for Knowledge Pattern Search and Analysis for Selecting Microorganisms Based on Desired Metabolic Property or Biological Behavior

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140088883A1 (en) * 2007-12-03 2014-03-27 Charles C. Zhou Method and System for Knowledge Pattern Search and Analysis for Selecting Microorganisms Based on Desired Metabolic Property or Biological Behavior
CN102622670A (en) * 2012-02-20 2012-08-01 中国环境科学研究院 Tracing method for suspected risk sources of sudden water environment pollution accidents

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
崔妍妍: "数据挖掘分类***的研究与实现", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *
李彩鹦: "污水化学指纹数据库的构建及其在水污染溯源中的应用", 《中国优秀硕士学位论文全文数据库 工程科技Ⅰ辑》 *

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110335047A (en) * 2018-03-30 2019-10-15 中国石油化工股份有限公司 A kind of oil product Source Tracing system
CN108596473B (en) * 2018-04-20 2021-07-02 上海海洋大学 Drainage basin pollution management system and new accounting method
CN108596473A (en) * 2018-04-20 2018-09-28 上海海洋大学 A kind of basin pollution management system and new accounting method
CN109709293A (en) * 2018-11-27 2019-05-03 聚光科技(杭州)股份有限公司 The recognition methods of sewerage steathily
CN109886830A (en) * 2019-01-02 2019-06-14 同济大学 A kind of water supply network pollution sources tracking positioning method based on customer complaint information
CN109886830B (en) * 2019-01-02 2023-07-04 同济大学 Water supply network pollution source tracking and positioning method based on user complaint information
CN110531043A (en) * 2019-08-29 2019-12-03 中水三立数据技术股份有限公司 A kind of basin water environment pollution intelligent management system and its method
CN110729026A (en) * 2019-10-28 2020-01-24 中国科学院生态环境研究中心 Polychlorinated biphenyl space-time quantitative tracing method based on combination of mixed list construction and space multi-medium model simulation
CN111882473A (en) * 2020-07-23 2020-11-03 南京财经大学 Zero-direct-emission tracing method for rain and sewage pipe network
CN112184090A (en) * 2020-11-30 2021-01-05 广东浩迪创新科技有限公司 Standard electricity utilization feature library establishing method, environment-friendly monitoring method, system and monitor
CN112903660A (en) * 2021-03-11 2021-06-04 广西大学 Method for judging current situation and source of pollution of watershed water body
CN117373556A (en) * 2023-12-04 2024-01-09 山东清控生态环境产业发展有限公司 Tracing instrument and system based on multidimensional data
CN117373556B (en) * 2023-12-04 2024-02-13 山东清控生态环境产业发展有限公司 Tracing instrument and system based on multidimensional data

Similar Documents

Publication Publication Date Title
CN106203756A (en) A kind of trade effluent pollutant Source Tracing method of feature based data base
Xu et al. Assessment and spatiotemporal variation analysis of water quality in the Zhangweinan River Basin, China
CN103903441B (en) Road traffic state distinguishing method based on semi-supervised learning
Liu et al. Concentrations, distribution, sources, and ecological risk assessment of heavy metals in agricultural topsoil of the Three Gorges Dam region, China
CN106600509B (en) Method for analyzing and judging water exchange and pollution discharge behaviors of enterprise based on basic data
CN111861274A (en) Water environment risk prediction and early warning method
CN102999927B (en) Fine partition method of soil pollutant content spatial distribution
Batbayar et al. Chemical water quality gradients in the Mongolian sub-catchments of the Selenga River basin
CN110489785A (en) A kind of online Source Apportionment of atmosphere pollution and system
CN103530688B (en) A kind of grinding technique detection system and method
CN101759236B (en) Regulating and controlling method of point source pollutant of industrial park
CN104217040A (en) Rapid pollution incident detection method based on traditional online monitor
CN109613197B (en) Water quality monitoring early warning feedback response method based on river course water network
CN112198144A (en) Method and system for rapid sewage tracing
Xiao et al. Water quality assessment and pollution source apportionment using multivariate statistical techniques: a case study of the Laixi River Basin, China
CN114210718A (en) Agricultural product pollution treatment and restoration method
Chen et al. Source apportionment of trace element pollution in surface sediments using positive matrix factorization combined support vector machines: application to the Jinjiang River, China
CN118039021A (en) Sewage pollutant detection and alarm method and system
CN103377316A (en) Penicillin production process monitoring method based on statistical analysis and Bayesian ensemble
CN110781225A (en) Method for diagnosing concentration level of environmental medium pollutants
Lin et al. Environmental DNA metabarcoding revealed the impacts of anthropogenic activities on phytoplankton diversity in Dianchi Lake and its three inflow rivers
CN102789598B (en) Operation workshop bottleneck identification method taking machine attribute as interval number
Li et al. Waste landfill plant and waste disposal plant efficiencies in China
CN201330211Y (en) Working parameter self-optimizing simulation system for sewage treatment plant
CN104569340A (en) Underground environment quality determination method and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20161207

WD01 Invention patent application deemed withdrawn after publication