CN113256101B - Key driving force analysis method for water storage capacity change of urban and rural lakes - Google Patents
Key driving force analysis method for water storage capacity change of urban and rural lakes Download PDFInfo
- Publication number
- CN113256101B CN113256101B CN202110549208.9A CN202110549208A CN113256101B CN 113256101 B CN113256101 B CN 113256101B CN 202110549208 A CN202110549208 A CN 202110549208A CN 113256101 B CN113256101 B CN 113256101B
- Authority
- CN
- China
- Prior art keywords
- water storage
- storage capacity
- item set
- change
- driving force
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 title claims abstract description 107
- 238000003860 storage Methods 0.000 title claims abstract description 96
- 230000008859 change Effects 0.000 title claims abstract description 87
- 238000004458 analytical method Methods 0.000 title claims abstract description 19
- 238000000034 method Methods 0.000 claims abstract description 46
- 238000004422 calculation algorithm Methods 0.000 claims abstract description 32
- 238000005070 sampling Methods 0.000 claims abstract description 25
- 238000011160 research Methods 0.000 claims abstract description 21
- 238000012216 screening Methods 0.000 claims abstract description 11
- 238000004364 calculation method Methods 0.000 claims abstract description 10
- 238000007781 pre-processing Methods 0.000 claims abstract description 9
- 230000008569 process Effects 0.000 claims description 14
- 230000001965 increasing effect Effects 0.000 claims description 13
- 239000002689 soil Substances 0.000 claims description 12
- 230000002829 reductive effect Effects 0.000 claims description 10
- 238000007619 statistical method Methods 0.000 claims description 10
- 238000013138 pruning Methods 0.000 claims description 9
- 230000003247 decreasing effect Effects 0.000 claims description 8
- 238000001704 evaporation Methods 0.000 claims description 7
- 230000008020 evaporation Effects 0.000 claims description 7
- 230000002093 peripheral effect Effects 0.000 claims description 7
- 238000012271 agricultural production Methods 0.000 claims description 6
- 238000013501 data transformation Methods 0.000 claims description 4
- 238000007689 inspection Methods 0.000 claims description 4
- 238000004140 cleaning Methods 0.000 claims description 3
- 238000013480 data collection Methods 0.000 claims description 3
- 238000005516 engineering process Methods 0.000 claims description 3
- 238000002474 experimental method Methods 0.000 claims description 3
- 230000010354 integration Effects 0.000 claims description 3
- 230000009467 reduction Effects 0.000 description 9
- 238000001556 precipitation Methods 0.000 description 8
- 230000006870 function Effects 0.000 description 5
- 238000012545 processing Methods 0.000 description 4
- 238000004445 quantitative analysis Methods 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- 238000011161 development Methods 0.000 description 3
- 238000009826 distribution Methods 0.000 description 3
- 238000007726 management method Methods 0.000 description 3
- 241000282414 Homo sapiens Species 0.000 description 2
- 230000002159 abnormal effect Effects 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 239000002352 surface water Substances 0.000 description 2
- 235000017166 Bambusa arundinacea Nutrition 0.000 description 1
- 235000017491 Bambusa tulda Nutrition 0.000 description 1
- 244000025254 Cannabis sativa Species 0.000 description 1
- 235000009024 Ceanothus sanguineus Nutrition 0.000 description 1
- 240000003553 Leptospermum scoparium Species 0.000 description 1
- 235000015459 Lycium barbarum Nutrition 0.000 description 1
- 244000082204 Phyllostachys viridis Species 0.000 description 1
- 235000015334 Phyllostachys viridis Nutrition 0.000 description 1
- 241000124033 Salix Species 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 239000011425 bamboo Substances 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 238000007418 data mining Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000006866 deterioration Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000013100 final test Methods 0.000 description 1
- 230000014509 gene expression Effects 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 239000005416 organic matter Substances 0.000 description 1
- 238000005192 partition Methods 0.000 description 1
- 238000003672 processing method Methods 0.000 description 1
- 230000005855 radiation Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000004088 simulation Methods 0.000 description 1
- 238000004162 soil erosion Methods 0.000 description 1
- 230000007480 spreading Effects 0.000 description 1
- 238000003892 spreading Methods 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 238000010998 test method Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
- G06Q10/063—Operations research, analysis or management
- G06Q10/0639—Performance analysis of employees; Performance analysis of enterprise or organisation operations
- G06Q10/06393—Score-carding, benchmarking or key performance indicator [KPI] analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2458—Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
- G06F16/2462—Approximate or statistical queries
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/26—Government or public services
Landscapes
- Business, Economics & Management (AREA)
- Engineering & Computer Science (AREA)
- Human Resources & Organizations (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Tourism & Hospitality (AREA)
- Development Economics (AREA)
- Economics (AREA)
- Educational Administration (AREA)
- Strategic Management (AREA)
- General Physics & Mathematics (AREA)
- Marketing (AREA)
- Entrepreneurship & Innovation (AREA)
- General Business, Economics & Management (AREA)
- Probability & Statistics with Applications (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Databases & Information Systems (AREA)
- Fuzzy Systems (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Primary Health Care (AREA)
- Game Theory and Decision Science (AREA)
- Operations Research (AREA)
- Quality & Reliability (AREA)
- General Health & Medical Sciences (AREA)
Abstract
The invention discloses a key driving force analysis method for water storage capacity change of urban and rural lakes, which comprises the following steps: carrying out equidistant grid division on a research area, and selecting a plurality of typical sampling points of unit area water storage capacity according to a screening standard; establishing a database for bearing driving force factors, and performing data preprocessing; scanning a lake basin characteristic trend database to form a candidate item set C1; judging the candidate item set C1, generating a frequent item set L1, and drawing a histogram of L1 items; combining each item of data in the frequent item set L1 by adopting an Apriori-Gen algorithm to form a candidate item set C2; judging the candidate item set C2 to form a frequent item set L2; and (4) generating a maximum frequent item set in an iteration mode, and determining key driving force of the change of the water storage capacity of a plurality of lakes. The method utilizes the correlation algorithm to analyze the factors influencing the change of the water storage capacity of the urban and rural lakes one by one, and obtains the key driving force influencing the change of the water storage capacity of the urban and rural lakes through iterative calculation and judgment of the correlation algorithm.
Description
Technical Field
The invention relates to the field of landscape hydrology, in particular to a key driving force analysis method for water storage capacity change of urban and rural lakes.
Background
The lake water body is an important component of urban and rural water resources, bears multiple functions of preventing and draining flood, adjusting climate, improving urban and rural ecological environment, providing leisure and entertainment places and the like, and is closely related to the survival and development of human beings; however, lake resources often face severe problems such as excessive development, ecological function degradation, water environment deterioration and the like. The method is used for researching the water quantity and water ecology of the urban and rural lake ecosystem, analyzing the relative importance of the influence factors of the ecological system based on the evolution process of the service function of the ecological system, and providing a basis for the sustainable utilization and ecological restoration of lake resources.
The hydrological process is closely related and interactive with the formation and evolution of the ecological landscape pattern, and the runoff volume of the lake, the precipitation and evaporation on the surface of the lake, the vegetation environment, the dominant soil type, the climate type, the terrain condition, the hydrological type and the like are considered as main influencing factors of the variation of the urban and rural lakes, but controversy exists about the explanation of the dominant factors of the variation of the water storage capacity of the lakes. The landscape hydrology quantitative analysis methods widely applied at present comprise a pairing basin test method, an ecological hydrology model method and a non-model method, and all the methods have certain limitations. The paired watershed test is only suitable for watersheds with small areas and needs long-term positioning observation; the requirement of the ecological hydrological model on data is high, the quality of the data can influence the simulation quality to a large extent, and the ecological hydrological model for a large watershed is more time-consuming and labor-consuming; the ecological hydrological non-model method mainly utilizes a statistical method and a hydrological graph to carry out rapid evaluation, has low requirement on data, but is difficult to reveal specific hydrological responses and mechanisms inside a drainage basin.
Disclosure of Invention
In order to solve the above mentioned deficiencies in the background art, the present invention provides a method for analyzing key driving force of water storage change in urban and rural lakes, which comprises analyzing the factors affecting the water storage change in urban and rural lakes one by using an association algorithm (Apriori), determining the key driving force affecting the water storage change in urban and rural lakes by iterative calculation of the association algorithm, and realizing quantitative analysis of each influence factor of landscape hydrology, and performing sustainable management, utilization and ecological restoration on the urban and rural lake basins according to the analysis.
The purpose of the invention can be realized by the following technical scheme:
a key driving force analysis method for water storage capacity change of urban and rural lakes comprises the following steps:
step A: carrying out equidistant grid division on a research area, and selecting a plurality of typical sampling points of the water storage capacity per unit area according to a screening standard;
and B: defining 60 attribute data change trends of 10 types of influence factors including hydrology, runoff, climate, evaporation, terrain, soil, vegetation environment, agricultural production, peripheral land type and urbanization degree as items, establishing a database for bearing driving force factors, and performing data preprocessing;
and C: scanning a lake basin characteristic trend database to form a candidate item set C1, wherein each group in the C1 comprises an attribute data change trend influencing lake water storage capacity, and the total number of the groups is 60;
step D: judging the candidate item set C1, comparing whether the support degree of the change trend of each item of attribute data is greater than the minimum support degree, generating a frequent item set L1 by all items greater than the minimum support degree, and drawing various item histograms of the L1;
step E: combining the data in the frequent item set L1 by adopting an Apriori-Gen algorithm to form a candidate item set C2, so that each group in the C2 contains 2 attribute data change trends influencing the lake water storage capacity;
step F: judging the candidate item set C2, comparing whether the support degree of the change trend of each group of attribute data is greater than the minimum support degree, and forming a frequent item set L2 by all items greater than the minimum support degree, wherein all items are candidate sets bearing the key driving force of the change of the lake water storage capacity, and generating an analysis chart describing the correlation of the watershed attributes;
step G: iteratively generating a maximum frequent item set, and determining key driving forces of the water storage capacity changes of several lakes;
step H: and (5) performing further inspection by combining the professional related knowledge of the landscape architecture with a statistical analysis method to verify the accuracy of the result.
Further preferably, step a specifically is: equidistant grid division is carried out on a research area, lakes are named as A, B, C, & gt, N, after research objects and ranges are determined, grids of 100m x 100m are selected and are equidistantly distributed in all parts of the research area, the grids are numbered, the first grid in the northwest corner is taken as a number 1, and the grids are numbered sequentially from left to right and from top to bottom, namely 1, 2, 3, & gt, N; selecting a typical sampling point of the water storage capacity per unit area according to a screening standard; after the sampling points are determined, the sampling points with increased water storage capacity and reduced water storage capacity are distinguished, the areas with increased water storage capacity are marked by circular points, the areas with reduced water storage capacity are marked by square points, and the overall marking on the base map is completed;
the screening standard of a typical sampling point of the water storage capacity per unit area simultaneously meets the following conditions:
(1) The water storage capacity per unit area of the ecological unit in unit time has obvious change. Considering the feasibility of data collection, determining the unit time to be 1 year, and collecting data of nearly ten years for statistical analysis;
(2) The ecological unit has one or more driving force data with obvious change;
(3) The ecological unit sampling points finally appear in groups, and the condition that only one sampling point exists in a certain area is avoided.
Further preferably, step B specifically is: defining each attribute change trend influencing the change of the lake water storage capacity as one item, establishing a database, and performing data preprocessing; taking 60 attribute data in 10 types of influence factors including hydrology, runoff, climate, evaporation, terrain, soil, vegetation environment, agricultural production, peripheral land type and urbanization degree as the driving force alternative factors influencing the change of the water storage capacity of urban and rural lakes, taking each change trend of each attribute, namely an increasing trend or a decreasing trend as one item, importing all related data in all sampling points, establishing a database, namely the database bearing the driving force factors, and then preprocessing the data of the database, wherein the method mainly comprises the following steps: data cleaning, data integration, data transformation and data specification.
Further preferably, step C specifically is: scanning the database to form candidate item sets C1 containing the variation trend of one item of attribute data, wherein each item in the candidate item sets C1 corresponds to one variation trend of one attribute, namely the driving force influencing the variation of the lake water storage capacity.
Further preferably, step D specifically includes: judging the candidate item set C1, comparing whether the support degree of each item of attribute data change trend is greater than the minimum support degree, and generating a frequent item set L by comparing all items greater than the minimum support degree, wherein the value of the minimum support degree determines the generated result, the generation of an effective frequent item set is influenced by an excessively high value, and a meaningless result is caused by an excessively low value, through repeated experiments, the value of the minimum support degree can be defined between 0.5 and 0.6, namely, the trend change of the related attribute data influencing the lake water storage capacity reaches at least 50%, the frequent item set L1 is entered, the Apriori algorithm is realized at an interval of 0.02, a corresponding graph is drawn, the horizontal axis is the value of the minimum support degree, and the vertical axis corresponds to the number of the maximum frequent item set.
Further preferably, step E specifically is: combining various data in the frequent item set L1 by using an Apriori-Gen algorithm to form a candidate item set C2 containing 2 attribute data change trends in each group, wherein an Apriori-Gen function is a combination of a connecting step and a pruning step of the candidate item set, and the connecting process connects the two subsets to generate a potential candidate item set, so that the calculation amount is reduced; in the pruning process, if a certain K-1 item set in the candidate K item sets is infrequent, the K-1 item sets are removed according to properties, the calculation efficiency of the algorithm is also improved, each candidate is formed by combining a pair of frequent K-1 item sets, so that the rest K-2 subsets of the candidate are ensured to be frequent only by an additional candidate pruning step, and in this link, the candidate 2 item sets formed by the research object are a combination of attribute data change trends or driving forces, wherein each group of the candidate 2 item sets comprises 2 items, namely 2 items influence the lake water storage capacity.
Further preferably, step F is specifically: and judging the candidate item set C2, comparing the support degree of the change trend of each group of attribute data with the minimum support degree, forming a frequent 2 item set L2 by all items greater than the minimum support degree, generating an analysis chart for describing the relevance of the attributes of the basin, and analyzing the relevance.
Further preferably, step G specifically is: continuously recombining the frequent item set L2 by using an Apriori-Gen algorithm, and combining each item of data in the frequent item set L2 to form a candidate item set C3 containing 3 item attribute data change trends in each group; judging the candidate item set C3, comparing the support degree of the change trend of each group of attribute data with the minimum support degree, and forming a frequent 3 item set L3 by all items greater than the minimum support degree; the process is repeated until the maximum frequent K item set LK with all items being key driving forces is formed in an iteration mode, and therefore the key driving forces of which lake water storage amount changes at most are determined.
Further preferably, step H is specifically: after key driving force factors influencing the lake water storage capacity of the research region are obtained by running the association algorithm for multiple times, further checking by using landscape and garden professional related knowledge and combining a statistical analysis method, and verifying the accuracy of a result; the remote sensing image is used as a main information source in the testing process, the geographical information system technology and the mathematical statistics method are utilized to analyze the landscape pattern change of the researched region, and the final testing result is consistent with the result generated by the correlation algorithm.
The invention has the beneficial effects that:
the invention utilizes the association algorithm (Apriori) to carry out one-by-one analysis on the factors influencing the variation of the water storage capacity of the urban and rural lakes, obtains the key driving force influencing the variation of the water storage capacity of the urban and rural lakes through iterative calculation and judgment of the association algorithm, realizes quantitative analysis on each influencing factor of landscape hydrology, and carries out sustainable management, utilization and ecological restoration on the urban and rural lake watershed on the basis of the analysis.
Drawings
The invention will be further described with reference to the accompanying drawings.
FIG. 1 is a schematic flow chart of a key driving force analysis method for water storage capacity variation of urban and rural lakes according to the present invention;
FIG. 2 is an exemplary illustration of a lake partition according to an embodiment of the present invention;
FIG. 3 is an exemplary diagram of a grid division of a lake research area according to an embodiment of the present invention;
FIG. 4 is an exemplary graph of sample points after screening in accordance with an embodiment of the present invention;
FIG. 5 is a variation graph of the minimum support value of lake A according to an embodiment of the present invention;
FIG. 6 is a bar graph of an embodiment L1 of the present invention;
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
In the description of the present invention, it is to be understood that the terms "opening," "upper," "lower," "thickness," "top," "middle," "length," "inner," "peripheral," and the like are used in an orientation or positional relationship that is merely for convenience in describing and simplifying the description, and do not indicate or imply that the referenced component or element must have a particular orientation, be constructed and operated in a particular orientation, and thus should not be considered as limiting the present invention.
The invention aims to analyze the factors influencing the change of the water storage capacity of the urban and rural lakes one by utilizing an association algorithm (Apriori), judge and obtain the key driving force influencing the change of the water storage capacity of the urban and rural lakes through iterative calculation of the association algorithm, realize quantitative analysis of each influencing factor of landscape hydrology, and carry out sustainable management, utilization and ecological restoration on the urban and rural lake watershed according to the key driving force.
A key driving force analysis method for the water storage capacity change of urban and rural lakes based on a correlation algorithm utilizes an artificial intelligence correlation algorithm to quantitatively explore the key driving force and the mechanism thereof which affect the back of the water storage capacity change of the urban and rural lakes, and comprises the following steps:
step 1: and carrying out equidistant grid division on the research area, and selecting a plurality of typical sampling points of the water storage capacity per unit area according to the screening standard.
Taking the area of Huanglong of Jiangning Nanjing as an example, the study objects are determined by numbering the larger lakes from A to E. (see fig. 2)
The mesh division operation method comprises the following steps:
and (2) carrying out grid division on the research area, selecting grids of 100m x 100m, equidistantly distributing the grids in each part of the research area, numbering the grids, sequentially numbering from left to right and from top to bottom by taking the first grid in the northwest corner as a number 1, and carrying out 1, 2 and 3. (see fig. 3)
The screening standard of a typical sampling point of the water storage capacity per unit area simultaneously meets the following conditions:
(1) The water storage capacity per unit area of the ecological unit in unit time has obvious change. Considering the feasibility of data collection, determining the unit time to be 1 year, and collecting data of nearly ten years for statistical analysis;
(2) The ecological unit has one or more driving force data with obvious change;
(3) The ecological unit sampling points are finally grouped, and the condition that only one sampling point exists in a certain area is avoided.
After the sampling points are determined, the sampling points with increased water storage capacity and the sampling points with decreased water storage capacity are distinguished, the areas with increased water storage capacity are marked by circular points, the areas with decreased water storage capacity are marked by square points, and the overall marking on the base map is completed. (see fig. 4)
Step 2: defining each attribute change trend influencing the change of the lake water storage capacity as one item, establishing a database and carrying out data preprocessing.
The Huang Longxian area natural geographic environment and social human environment contain various different attributes, including 60 attribute data in 10 types of influence factors including hydrology, runoff, climate, evaporation, terrain, soil, vegetation environment, agricultural production, peripheral land type and urbanization degree, as driving force alternative factors influencing the water storage capacity change of the area. And (3) importing all relevant data in all sampling points by taking each change trend, namely an increasing trend or a decreasing trend, of each attribute as one item, and establishing a database. The database is a database for bearing driving force factors.
When the database is established, two groups are needed to be distinguished, namely a driving force factor causing the water storage capacity of the lake to be increased and a driving force factor causing the water storage capacity of the lake to be reduced.
The factors influencing the change of the water storage capacity of the urban and rural lakes comprise the following 10 types of factors and 60 attributes:
(1) Hydrologic S factors, relevant attributes include: s1, lake water quality, S2 flood season water level, S3 normal water level, S4 dry water level, S5 lake surface area, S6 lake catchment area, S7 surface water use and S8 underground water use;
(2) Runoff J factor, relevant attributes include: j1 surface runoff lake inflow amount, J2 surface runoff lake outflow amount, J3 underground runoff lake inflow amount and J4 underground runoff lake outflow amount;
(3) Climate Q factor, relevant attributes include: the average precipitation per year on the surface of a Q1 lake, the precipitation distribution per year on the surface of a Q2 lake, the average temperature per day of a Q3, the average relative humidity per day of a Q4, the sunshine hours of a Q5, the average wind speed of a Q6 and the solar radiation daily average value of a Q7;
(4) Evaporation Z factor, relevant attributes include: potential evapotranspiration quantity of the Z1 lake surface and actual evapotranspiration quantity of the Z2 lake surface;
(5) Terrain D factors, relevant attributes include: d1 elevation, D2 gradient, D3 slope and D4 undulation degree;
(6) Soil T factor, data types include: the soil type of T1, the soil type distribution of T2, the soil humidity of T3, the soil texture of T4, the soil thickness of T5, the organic matter content of T6 and the soil erosion strength of T7;
(7) Vegetation environment H factors, and the relevant attributes include: h1 vegetation type, H2 vegetation distribution, H3 growth status, H4 vegetation coverage, H5 three-dimensional green volume, H6 vegetation season change, H7 net primary productivity;
(8) Agricultural production N factor, and related attributes comprise: n1 planting scale, N2 planting structure and N3 water-saving engineering;
(9) Peripheral land type Y factor, and the related attributes comprise: y1 arable area, Y2 forest land area, Y3 grassland area, Y4 wetland area, Y5 garden area, Y6 fishpond area, Y7 construction land area, and Y8 unused land area;
(10) The degree of urbanization C factor, and the related attributes comprise: the system comprises a C1 road grade, a C2 road network density, a C3 building density, a C4 building height, a C5 town population density, a C6 agricultural population density, a C7 crowd activity period, a C8 town industrialization rate, a C9 industrial development degree and a C10 rural industrialization rate.
When attribute data are summarized, areas with increased or decreased water amount are separated, and then follow-up analysis of each attribute element is carried out.
The method for preprocessing the collected data comprises the following steps:
data cleaning: and deleting irrelevant data in the original data and the repeated data, and processing missing values, abnormal values and the like. The processing content is geographic information data acquired after the field exploration, and comprises all data in driving force factor reference attributes influencing the water storage capacity of the lake. For example, if the value of precipitation in the lake A in a certain year is obviously higher than the value in the adjacent two years, the abnormal value is regarded as a missing value, and a missing value processing method, namely a plug-in method, is used for processing.
(2) Data integration: data from multiple data sources is consolidated and stored in a consistent data warehouse. Entity identification and attribute redundancy issues need to be considered because real-world entity expressions from multiple data sources differ, potentially resulting in mismatches between data. For example, agr _ area in data source a and agr _ ar in data source B both describe the farmland area in this study, i.e., agr _ area = agr _ ar, and the farmland area data in the two data sources should be merged and stored in the data warehouse.
(3) Data transformation: and carrying out normalization processing on the data, and converting the data into a proper form so as to be suitable for the requirement of an algorithm. Taking an attribute construction method in data transformation as an example, the method needs to construct a new attribute by using an existing attribute set and add the new attribute into the existing attribute set. For example, in the present study, the existing attribute of land utilization type is annual cultivated land area, and in order to more clearly explore the driving force factors affecting the change of lake water storage, a new attribute, namely the change trend of cultivated land area, needs to be constructed, and the attribute corresponds to two items, namely, "increase of cultivated land area" and "decrease of cultivated land area", namely, agr _ area _ INC and agr _ area _ DEC. Wherein, if the number of the cultivated land area in the previous year is more than that in the next year, the increase of the cultivated land area is determined, otherwise, the decrease of the cultivated land area is determined.
(4) Data specification: complex data analysis and mining on large data sets takes a long time, data conventions result in new data sets that are smaller but maintain the integrity of the original data, thereby reducing the impact of invalid, erroneous data on modeling, reducing the cost of storing data, and significantly reducing computation time with small and representative amounts of data. For example, in the present study, when dealing with attributes related to land use types, a merge attribute method is adopted, where the original attributes include "tea tree area", "willow area", "bamboo forest area", and the like, and after stipulation, these attributes are merged into a new attribute "forest area", that is, for _ area.
And step 3: and scanning the database to form a candidate item set C1 containing the variation trend of one item of attribute data in each group. Each item in C1 corresponds to a variation trend of one attribute, namely, a driving force influencing the change of the lake water storage capacity. For example: increased grass area, decreased annual average precipitation, etc.
And 4, step 4: and judging the candidate item set C1, and comparing whether the support degree of the change trend of each item of attribute data is greater than the minimum support degree or not, wherein all items greater than the minimum support degree generate a frequent item set L1.
The value of the minimum support degree determines the result, the generation of an effective frequent item set is influenced by an excessively high value, and a meaningless result is caused by an excessively low value. Through repeated experiments, the research defines the minimum support reference value between 0.5 and 0.6, namely the trend change of the relevant attribute data influencing the lake water storage capacity reaches at least 50 percent, and then the frequent item set L1 is entered. The interval is 0.02, an Apriori algorithm is realized, a corresponding chart is drawn, the horizontal axis is a value of the minimum support degree, and the vertical axis corresponds to the number of the frequent items in the maximum frequent item set. With the increase of the minimum support degree, the number of the frequent items is reduced, namely the key driving force factor is reduced. (see fig. 5)
And drawing a histogram according to the support degree of the change trend of each item of attribute data in the L1. The drawing mode is as follows: the method comprises the steps of distinguishing data of different regions and increasing and decreasing of water storage capacity, drawing bar charts of the different water storage capacity of the different regions respectively, arranging all items in the L1 in a horizontal direction according to the sequence of the support degree from large to small, comparing the support degree of all items of the L1, namely the change trend of all attributes, and primarily judging the possible attributes of the key driving force. In the figure, the lake A is taken as an example, and the key driving force factors are preliminarily considered to comprise three items which are arranged according to the size of the support degree and are rainfall reduction, forest area reduction and surface water reduction in sequence. (see fig. 6)
And 5: and combining the data in the frequent item set L1 by using an Apriori-Gen algorithm to form a candidate 2 item set C2 containing 2 item attribute data change trends in each group.
The Apriori-Gen function is a collective term of a connecting step and a pruning step of the candidate item set, and the connecting process connects the two subsets to generate a potential candidate item set, so that the calculation amount is reduced; in the pruning process, if a certain K-1 item set in the candidate K item sets is infrequent, the K-1 item sets are removed according to the properties, and the calculation efficiency of the algorithm is also improved. Since each candidate is merged by a pair of frequent K-1 sets of entries, only additional candidate pruning steps are needed to ensure that the remaining K-2 subsets of the candidate are frequent.
In this link, the candidate 2 sets formed by the research objects are each set including 2 items, that is, 2 items are the combination of the attribute data change trend, or the driving force, affecting the lake water storage capacity. Taking lake A as an example, the frequent 2 items include items such as [ increase of urbanization rate, decrease of wetland ], [ decrease of forest, decrease of agricultural land ], [ decrease of forest, decrease of precipitation ], [ increase of urbanization rate, decrease of agricultural land ] and the like.
Step 6: and judging the candidate item set C2, comparing the support degree of the change trend of each group of attribute data with the minimum support degree, forming a frequent 2 item set L2 by all items greater than the minimum support degree, generating an analysis chart for describing the relevance of the attributes of the basin, and analyzing the relevance.
The chart is prepared in the following way:
first, data of water storage amount increase and water storage amount decrease are distinguished, and increase graphs and decrease graphs are drawn respectively. Taking the drawing of the water storage capacity increase chart as an example, the table is transversely and sequentially arranged with different value intervals of the association degree, and is longitudinally and sequentially arranged in different regions of research. Wherein the degree of association represents a correlation between the attributes of the watershed.
For example, the township rate increase and the agricultural land area decrease have the highest correlation among the driving force factors of the change of the water storage capacity of the lake A, namely the highest correlation.
And 7: and continuously utilizing the Apriori-Gen algorithm to recombine the frequent item set L2, and combining all data in the frequent item set L2 to form a candidate 3 item set C3 containing 3 item attribute data change trends in each group. And judging the candidate item set C3, and comparing the support degree of the change trend of each group of attribute data with the minimum support degree, wherein all items greater than the minimum support degree form a frequent 3 item set L3. The process is repeated until the maximum frequent K item set LK with all items being key driving forces is formed in an iteration mode, and therefore the key driving forces of which lake water storage amount changes at most are determined.
For example, the key driving factors influencing the reduction of the water storage capacity of the lake A are three through multiple analyses, including the reduction of the annual average precipitation, the reduction of forests and the increase of the urbanization rate, as shown in the following table 1:
TABLE 1 confidence analysis
And 8: and (6) checking the result. After key driving force factors influencing the water storage capacity of the lake in the Huanglong region are obtained by running the association algorithm for multiple times, the accuracy of the result is verified by further testing by combining the professional related knowledge of landscape architecture with a statistical analysis method.
The testing process takes remote sensing images as a main information source, and analyzes the landscape pattern change of Huanglong in 2000-2020 by utilizing a geographic information system technology and a mathematical statistics method. Selecting 6 landscape indexes for research, namely landscape patch Number (NP), landscape Patch Density (PD), landscape Shape Index (LSI), shannon landscape diversity index (SHDI), shannon landscape dominance index (SHEI) and landscape spreading index (CONTAG), analyzing the influence of each factor on lake water storage capacity, screening driving force factors by using a statistical method, and inducing key factors influencing lake water storage capacity change.
The final result is checked to be consistent with the result generated by the correlation algorithm, taking A lake as an example, the annual average precipitation reduction, forest reduction and urbanization rate increase are key driving force factors influencing the reduction of the water storage capacity of the A lake
In the description herein, references to the description of "one embodiment," "an example," "a specific example" or the like are intended to mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.
The foregoing shows and describes the general principles, principal features, and advantages of the invention. It will be understood by those skilled in the art that the present invention is not limited to the embodiments described above, which are given by way of illustration of the principles of the present invention, but that various changes and modifications may be made without departing from the spirit and scope of the invention, and such changes and modifications are within the scope of the invention as claimed.
Claims (9)
1. A key driving force analysis method for water storage capacity change of urban and rural lakes is characterized by comprising the following steps:
step A: carrying out equidistant grid division on a research area, and selecting a plurality of typical sampling points of unit area water storage capacity according to a screening standard;
and B: defining 60 attribute data change trends of 10 types of influence factors including hydrology, runoff, climate, evaporation, terrain, soil, vegetation environment, agricultural production, peripheral land type and urbanization degree as items, establishing a database for bearing driving force factors, and performing data preprocessing;
step C: scanning a lake basin characteristic trend database to form a candidate item set C1, wherein each group in the C1 comprises an attribute data change trend influencing lake water storage capacity, and the total number of the groups is 60;
step D: judging the candidate item set C1, comparing whether the support degree of the change trend of each item of attribute data is greater than the minimum support degree, generating a frequent item set L1 by all items greater than the minimum support degree, and drawing various item histograms of the L1;
step E: combining the data in the frequent item set L1 by adopting an Apriori-Gen algorithm to form a candidate item set C2, so that each group in the C2 contains 2 attribute data change trends influencing the lake water storage capacity;
step F: judging the candidate item set C2, comparing whether the support degree of the change trend of each group of attribute data is greater than the minimum support degree, and forming a frequent item set L2 by all items greater than the minimum support degree, wherein all items are candidate sets bearing the key driving force of the change of the lake water storage capacity, and generating an analysis chart describing the correlation of the watershed attributes;
step G: iteratively generating a maximum frequent item set, and determining key driving force of the change of the water storage capacity of a plurality of lakes;
step H: and (5) performing further inspection by combining the professional related knowledge of the landscape architecture with a statistical analysis method to verify the accuracy of the result.
2. The method for analyzing key driving force of water storage capacity change of urban and rural lakes according to claim 1, wherein the step A is specifically as follows: equidistant grid division is carried out on a research area, lakes are named as A, B, C, & gt, N, after research objects and ranges are determined, grids of 100m x 100m are selected and are equidistantly distributed in all parts of the research area, the grids are numbered, the first grid in the northwest corner is taken as a number 1, and the grids are numbered sequentially from left to right and from top to bottom, namely 1, 2, 3, & gt, N; selecting a typical sampling point of the water storage capacity per unit area according to a screening standard; after the sampling points are determined, the sampling points with increased water storage capacity and reduced water storage capacity are distinguished, the areas with increased water storage capacity are marked by circular points, the areas with reduced water storage capacity are marked by square points, and the overall marking on the base map is completed;
the screening standard of a typical sampling point of the water storage capacity per unit area simultaneously meets the following conditions:
(1) The water storage capacity of the ecological unit in unit time per unit area has obvious change, the feasibility of data collection is considered, the unit time is determined to be 1 year, and data of nearly ten years are collected for statistical analysis;
(2) The ecological unit has one or more driving force data with obvious change;
(3) The ecological unit sampling points are finally grouped, and the condition that only one sampling point exists in a certain area is avoided.
3. The method for analyzing key driving force of water storage capacity change of urban and rural lakes according to claim 1, wherein the step B specifically comprises the following steps: defining each attribute change trend influencing the lake water storage capacity change as one item, establishing a database, and performing data preprocessing; taking 60 attribute data in 10 types of influence factors including hydrology, runoff, climate, evaporation, terrain, soil, vegetation environment, agricultural production, peripheral land type and urbanization degree as the driving force alternative factors influencing the change of the water storage capacity of urban and rural lakes, taking each change trend of each attribute, namely an increasing trend or a decreasing trend as one item, importing all related data in all sampling points, establishing a database, namely the database bearing the driving force factors, and then preprocessing the data of the database, wherein the method mainly comprises the following steps: data cleaning, data integration, data transformation and data specification.
4. The method for analyzing the key driving force of the variation of the water storage capacity of urban and rural lakes according to claim 1, wherein the step C is specifically as follows: scanning the database to form candidate item sets C1 containing the variation trend of one item of attribute data, wherein each item in the candidate item sets C1 corresponds to one variation trend of one attribute, namely the driving force influencing the variation of the lake water storage capacity.
5. The method for analyzing key driving force of water storage capacity change of urban and rural lakes according to claim 1, wherein the step D is specifically as follows: judging the candidate item set C1, comparing whether the support degree of each item of attribute data change trend is greater than the minimum support degree, and generating a frequent item set L by comparing all items greater than the minimum support degree, wherein the value of the minimum support degree determines the generated result, the generation of an effective frequent item set is influenced by an excessively high value, and a meaningless result is caused by an excessively low value, through repeated experiments, the value of the minimum support degree can be defined between 0.5 and 0.6, namely, the trend change of the related attribute data influencing the lake water storage capacity reaches at least 50%, the frequent item set L1 is entered, the Apriori algorithm is realized at an interval of 0.02, a corresponding graph is drawn, the horizontal axis is the value of the minimum support degree, and the vertical axis corresponds to the number of the maximum frequent item set.
6. The method for analyzing key driving force of water storage capacity change of urban and rural lakes according to claim 1, wherein the step E is specifically as follows: combining various data in the frequent item set L1 by using an Apriori-Gen algorithm to form a candidate item set C2 containing 2 attribute data change trends in each group, wherein an Apriori-Gen function is a combination of a connecting step and a pruning step of the candidate item set, and the connecting process connects the two subsets to generate a potential candidate item set, so that the calculation amount is reduced; in the pruning process, if a certain K-1 item set in the candidate K item sets is infrequent, the K-1 item sets are removed according to properties, the calculation efficiency of the algorithm is also improved, each candidate is formed by combining a pair of frequent K-1 item sets, so that the rest K-2 subsets of the candidate are ensured to be frequent only by an additional candidate pruning step, and in this link, the candidate 2 item sets formed by the research object are a combination of attribute data change trends or driving forces, wherein each group of the candidate 2 item sets comprises 2 items, namely 2 items influence the lake water storage capacity.
7. The method for analyzing key driving force of water storage capacity change of urban and rural lakes according to claim 1, wherein the step F is specifically as follows: and judging the candidate item set C2, comparing the support degree of the change trend of each group of attribute data with the minimum support degree, forming a frequent 2 item set L2 by all items greater than the minimum support degree, generating an analysis chart for describing the relevance of the attributes of the basin, and analyzing the relevance.
8. The method for analyzing key driving force of water storage capacity change of urban and rural lakes according to claim 1, wherein the step G is specifically as follows: continuously recombining the frequent item set L2 by using an Apriori-Gen algorithm, and combining each item of data in the frequent item set L2 to form a candidate item set C3 containing 3 item attribute data change trends in each group; judging the candidate item set C3, comparing the support degree of the change trend of each group of attribute data with the minimum support degree, and forming a frequent 3 item set L3 by all items greater than the minimum support degree; the process is repeated until the maximum frequent K item set LK with all items being key driving forces is formed in an iteration mode, and therefore the key driving forces of which lake water storage amount changes at most are determined.
9. The method for analyzing key driving force of water storage capacity change of urban and rural lakes according to claim 1, wherein the step H is specifically as follows: after key driving force factors influencing the lake water storage capacity of the research region are obtained by running the association algorithm for multiple times, further checking by using landscape and garden professional related knowledge and combining a statistical analysis method, and verifying the accuracy of a result; the remote sensing image is used as a main information source in the inspection process, the geographical information system technology and the mathematical statistics method are utilized to analyze the landscape pattern change of the researched area, and the final inspection result is consistent with the result generated by the correlation algorithm.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110549208.9A CN113256101B (en) | 2021-05-20 | 2021-05-20 | Key driving force analysis method for water storage capacity change of urban and rural lakes |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110549208.9A CN113256101B (en) | 2021-05-20 | 2021-05-20 | Key driving force analysis method for water storage capacity change of urban and rural lakes |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113256101A CN113256101A (en) | 2021-08-13 |
CN113256101B true CN113256101B (en) | 2022-10-28 |
Family
ID=77182898
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110549208.9A Active CN113256101B (en) | 2021-05-20 | 2021-05-20 | Key driving force analysis method for water storage capacity change of urban and rural lakes |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113256101B (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115994639B (en) * | 2023-02-06 | 2023-09-26 | 天津大学 | Water grid pattern and topography association evaluation method based on redundancy analysis |
CN117149935B (en) * | 2023-10-30 | 2024-01-19 | 北京江云智能科技有限公司 | Water network data supervision system and method based on digital twinning |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109598082A (en) * | 2018-12-17 | 2019-04-09 | 河海大学 | The calculation method of lake evaporation capacity and crucial hydrographic information based on Hydrogen-oxygen Isotope |
CN111504424A (en) * | 2020-06-17 | 2020-08-07 | 水利部交通运输部国家能源局南京水利科学研究院 | Lake water storage variable quantity monitoring method based on remote sensing |
-
2021
- 2021-05-20 CN CN202110549208.9A patent/CN113256101B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109598082A (en) * | 2018-12-17 | 2019-04-09 | 河海大学 | The calculation method of lake evaporation capacity and crucial hydrographic information based on Hydrogen-oxygen Isotope |
CN111504424A (en) * | 2020-06-17 | 2020-08-07 | 水利部交通运输部国家能源局南京水利科学研究院 | Lake water storage variable quantity monitoring method based on remote sensing |
Non-Patent Citations (1)
Title |
---|
近40年来纳木错水面面积及蓄水量变化特征;李蒙等;《水电能源科学》;20170225(第02期);全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN113256101A (en) | 2021-08-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109002621B (en) | Forest stand average height and breast diameter calculation method considering neighborhood and geographic difference | |
CN104486435B (en) | Low energy consumption ecological environmental monitoring node deployment method based on sensor network | |
CN108984803B (en) | Method and system for spatializing crop yield | |
CN101916337B (en) | Method for dynamically predicting potential productivity of paddy rice based on geographical information system | |
CN107256451A (en) | Agriculture suitability evaluation analysis method of the agriculture big data based on GIS | |
CN113256101B (en) | Key driving force analysis method for water storage capacity change of urban and rural lakes | |
CN109582755A (en) | A kind of precision agriculture database implementation method based on grid | |
CN111445116A (en) | Auxiliary compiling system for territorial space planning | |
CN110222656A (en) | Aboveground vegetation ecological water quantitative inversion method based on remote sensing technology | |
Sui et al. | Spatiotemporal variation of cultivated land ecosystem stability in typical regions of Lower Liaohe Plain China based on stress-buffer-response | |
Gonfa et al. | Land suitability and surface water resources potential for irrigation in Becho Plain, upper Awash basin, Ethiopia | |
CN114792274A (en) | Layout method for artificial vegetation construction in arid region and mountain land | |
CN117494419A (en) | Multi-model coupling drainage basin soil erosion remote sensing monitoring method | |
Kimaro et al. | Distribution of traditional irrigation canals and their discharge dynamics at the southern slopes of Mount Kilimanjaro | |
CN113570273A (en) | Spatial method and system for irrigation farmland statistical data | |
CN116757357A (en) | Land ecological condition assessment method coupled with multisource remote sensing information | |
CN106980701A (en) | Statistical system and statistical method for geography influence on residential building form | |
Miao et al. | Measurement of cultivated land ecosystem resilience in black soil region of Northeast China under the background of cultivated land protection policy in China: Case study of Qiqihar City | |
CN113935658A (en) | Mountain land utilization change evaluation method based on terrain gradient | |
CN114416692A (en) | Method for constructing watershed water environment management data resource system | |
Zuo et al. | Analysis of the gains and losses of ecosystem service value under land use change and zoning in Qiqihar | |
Meirelles et al. | Data mining in the reduction of the number of places of experiments for plant cultivates | |
Cong et al. | A Rapid Integration Method of Wild Ornamental Plant Resources Based on Improved Clustering Algorithm | |
Santos | Synergy between digital soil mapping and crop modeling: influence of soil data on sugarcane attainable yield | |
Paiboonvorachat | Using remote sensing and GIS techniques to assess land use/land cover changes in the Nan Watershed, Thailand |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |