WO2021218207A1 - 城市内部登革热时空预测方法、***及电子设备 - Google Patents
城市内部登革热时空预测方法、***及电子设备 Download PDFInfo
- Publication number
- WO2021218207A1 WO2021218207A1 PCT/CN2020/139657 CN2020139657W WO2021218207A1 WO 2021218207 A1 WO2021218207 A1 WO 2021218207A1 CN 2020139657 W CN2020139657 W CN 2020139657W WO 2021218207 A1 WO2021218207 A1 WO 2021218207A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- dengue fever
- city
- data
- prediction
- township
- Prior art date
Links
- 208000001490 Dengue Diseases 0.000 title claims abstract description 113
- 206010012310 Dengue fever Diseases 0.000 title claims abstract description 113
- 208000025729 dengue disease Diseases 0.000 title claims abstract description 113
- 238000013277 forecasting method Methods 0.000 title abstract 2
- 238000000034 method Methods 0.000 claims abstract description 41
- 238000012549 training Methods 0.000 claims abstract description 24
- 238000007781 pre-processing Methods 0.000 claims abstract description 21
- 239000013598 vector Substances 0.000 claims description 24
- 238000010276 construction Methods 0.000 claims description 19
- 230000015654 memory Effects 0.000 claims description 15
- 239000011159 matrix material Substances 0.000 claims description 14
- 230000006870 function Effects 0.000 claims description 11
- 230000001186 cumulative effect Effects 0.000 claims description 10
- 230000003442 weekly effect Effects 0.000 claims description 7
- 208000035473 Communicable disease Diseases 0.000 claims description 6
- 238000011156 evaluation Methods 0.000 claims description 6
- 238000012795 verification Methods 0.000 claims description 6
- 238000010200 validation analysis Methods 0.000 claims description 5
- 230000004913 activation Effects 0.000 claims description 4
- 238000012886 linear function Methods 0.000 claims description 4
- 230000004931 aggregating effect Effects 0.000 claims description 2
- 230000002265 prevention Effects 0.000 abstract description 7
- 238000010586 diagram Methods 0.000 description 10
- 241000255925 Diptera Species 0.000 description 4
- 238000004590 computer program Methods 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 208000015181 infectious disease Diseases 0.000 description 4
- 238000013527 convolutional neural network Methods 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000012706 support-vector machine Methods 0.000 description 3
- 230000002776 aggregation Effects 0.000 description 2
- 238000004220 aggregation Methods 0.000 description 2
- 238000013500 data storage Methods 0.000 description 2
- 238000013135 deep learning Methods 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 230000006806 disease prevention Effects 0.000 description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 2
- 230000036541 health Effects 0.000 description 2
- 230000008676 import Effects 0.000 description 2
- 238000012544 monitoring process Methods 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- 230000004083 survival effect Effects 0.000 description 2
- 241000256111 Aedes <genus> Species 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000009395 breeding Methods 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- 238000013136 deep learning model Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000003672 processing method Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000004088 simulation Methods 0.000 description 1
- 238000005507 spraying Methods 0.000 description 1
- 238000013179 statistical model Methods 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 229960005486 vaccine Drugs 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/80—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for detecting, monitoring or modelling epidemics or pandemics, e.g. flu
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A90/00—Technologies having an indirect contribution to adaptation to climate change
- Y02A90/10—Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation
Definitions
- the invention relates to a method, a system and electronic equipment for predicting the time and space of dengue fever in a city.
- vector control such as spraying mosquitoes to eliminate adult mosquitoes, removing breeding grounds for Aedes mosquitoes, etc.
- vector control is still the main method of dengue fever prevention and control.
- accurate prediction and early warning of the number and location of dengue fever in the future has become the key to prevention and control.
- the present invention provides a method for temporal and spatial prediction of dengue fever in a city.
- the method includes the following steps: a. Collect and preprocess related data on dengue fever in the city.
- the data on dengue fever in the city includes: dengue fever case data and meteorological data in the studied city , Population distribution data, township vector files; b. Construct a map structure that reflects the spatial relationship of the inner city; c. Select the input features for dengue spatiotemporal prediction; d. Construct a map based on the preprocessed dengue-related data within the city
- the structure and selected input features are used to construct and train the GCN model.
- the method further includes step e: evaluating the prediction performance of the GCN model.
- the step a specifically includes:
- Preprocess the collected data of dengue fever cases convert the home address of the case to latitude and longitude coordinates; determine the township where each case is located; count the number of cases in each township in each week according to the onset date of each case, constituting W*N
- the number matrix of cases, W is the number of weeks, and N is the number of towns;
- Preprocessing the collected meteorological data Obtain the daily average temperature and rainfall recorded by all meteorological observation stations in the city, and use the kriging method to interpolate them separately; aggregate the interpolated data to the township level on a weekly basis, Calculate the average temperature and cumulative rainfall of each town in each week to form a W*N average temperature matrix and cumulative rainfall matrix;
- the preprocessing of the collected population distribution data includes: aggregating the population distribution data to the township level to obtain the total population of each township.
- the step b specifically includes the following steps:
- a graph structure is constructed.
- Said step c specifically includes:
- the GCN model is composed of one input layer, at least two hidden layers and one output layer; after the at least two hidden layers, the rectified linear function ReLU and the hyperbolic tangent function tanh are respectively used as activation functions.
- the training of the GCN model in step d includes:
- the step e specifically includes:
- the hit rate of the prediction result in week t is defined as follows:
- N m,t means that the number of cases in all towns within the city predicted in week t is ranked from highest to lowest, and the sum of the actual number of cases in the top m% of high-risk streets and towns; N t means the number of cases in week t The total number of actual cases in the city.
- the present invention provides a space-time prediction system for dengue fever in a city.
- the system includes a preprocessing unit, a graph structure building unit, a selection unit, and a model building unit.
- the preprocessing unit is used to collect data related to dengue fever in the city and perform preprocessing.
- the data related to dengue fever in the city includes: dengue fever case data, meteorological data, population distribution data, and township vector files of the studied city; the graph structure construction unit is used to construct a graph structure reflecting the spatial relationship of the inner city; The selection unit is used to select input features for the spatiotemporal prediction of dengue fever; the model construction unit is used to construct and train the GCN model based on the preprocessed intra-city dengue related data, the constructed graph structure, and the selected input features.
- system further includes: an evaluation unit for evaluating the prediction performance of the GCN model.
- the present invention also provides an electronic device, including:
- At least one processor At least one processor
- a memory communicatively connected with the at least one processor; wherein,
- the memory stores instructions that can be executed by the one processor, and the instructions are executed by the at least one processor, so that the at least one processor can execute the urban interior described in any one of 1 to 8 above.
- Step a Collect and preprocess the dengue-related data in the city.
- the dengue-related data in the city includes: dengue case data, meteorological data, population distribution data, and township vector files in the studied city;
- Step b Construct a map structure that reflects the spatial relationship of the city's internal regions
- Step c Select the input features for the spatiotemporal prediction of dengue fever
- Step d Constructing and training a GCN model according to the preprocessed data related to dengue fever in the city, the constructed graph structure, and the selected input features, so as to use the GCN model to perform dengue fever spatiotemporal prediction.
- the present invention is oriented to each area within the city, and realizes prediction on a finer spatial scale.
- the spatial relationship between various regions help to capture the characteristics of dengue fever in the city, effectively improve the prediction performance, and enhance the level of precision prevention and control of dengue fever.
- Figure 1 is a flow chart of the method for predicting dengue fever within a city according to the present invention
- FIG. 2 is a schematic diagram of a process of constructing a spatial relationship within a city according to an embodiment of the present invention
- FIG. 3 is a schematic structural diagram of a graph convolutional neural network model provided by an embodiment of the present invention.
- FIG. 4 is a hardware architecture diagram of the dengue fever spatiotemporal prediction system in the city of the present invention.
- FIG. 5 is a schematic diagram of the hardware device structure of a method for predicting dengue fever in a city according to an embodiment of the present invention
- Fig. 6 is a schematic diagram of comparison of dengue fever prediction effects on a township scale in Guangzhou City in Example 1 of the present invention.
- This embodiment is explained with the prediction of a township scale.
- the present invention is also applicable to urban internal spatial units divided in other ways, such as administrative districts, traffic analysis districts, grids, and the like.
- FIG. 1 it is a flowchart of a preferred embodiment of the method for predicting dengue fever within a city according to the present invention.
- Step S1 collecting data related to dengue fever in the city and preprocessing it. in particular:
- the data related to dengue fever in the city includes: dengue fever case data, meteorological data, population distribution data, and township vector files (shapefile) of the studied city.
- the meteorological data includes daily average temperature and rainfall collected by meteorological monitoring stations in the city.
- the dengue fever case data is obtained from the disease prevention and control center of the country/province/city, and the dengue fever case data includes: the onset date and home address of each case; the meteorological data is obtained from the national/provincial/city meteorological
- the said population distribution data is obtained from the open source global population data project WorldPop website (https://www.worldpop.org/).
- the preprocessing of the collected dengue case data includes: first use geocoding to convert the home address of the case into latitude and longitude coordinates, and import all case points into ArcGIS according to their latitude and longitude coordinates to obtain a vector file of point type; then use Spatial in ArcGIS software
- the Join tool associates cases (point-type vector files) with townships (face-type vector files, that is, township vector files) to determine the township where each case is located; finally, count each week according to the date of onset of each case
- the number of cases in each township constitutes a W*N matrix of the number of cases, where W is the number of weeks and N is the number of townships.
- the preprocessing of the collected meteorological data includes: obtaining the daily average temperature and rainfall recorded by all meteorological observatories in the city, firstly using the kriging method to interpolate them separately; then, the interpolated data is aggregated into weekly At the township level, the average temperature and cumulative rainfall of each township in each week are counted to form the W*N average temperature matrix and cumulative rainfall matrix.
- the spatial interpolation and data aggregation are processed in batches using the ArcPy toolkit of the Python language.
- the preprocessing of the collected population distribution data includes: this embodiment uses ArcGIS software to download the population distribution data with a resolution of 100 meters from the WorldPop website in 2015 and aggregate it to the township level to obtain the total population of each township.
- Step S2 constructing a graph structure reflecting the spatial relationship of the inner regions of the city according to the neighboring relationship between the regions.
- the step S2 includes:
- Step 201 Use the Spatial Join function of ArcGIS software to obtain the neighboring relationship between the township and the township from the township vector file.
- Step 202 Regard the township as a point, and the adjacency relationship between the townships as an edge, and construct a graph structure. Please refer to FIG. 2 for a schematic diagram of the construction process of structures A and B in this embodiment.
- Step S3 selecting input features for dengue fever prediction. in particular:
- This embodiment selects four types of features commonly used in the literature that are closely related to the spread and outbreak of dengue fever, including the number of cases in the current week and the past week, the average temperature, the cumulative rainfall, and the number of populations. As shown in Table 1, there are 13 features in total. Among them, the average moderate and the accumulated rainfall are related to the survival suitability of mosquito vectors; because dengue fever is an infectious disease, the number of future cases is also closely related to the number of past cases and the number of population.
- step S4 the GCN model is constructed and trained according to the preprocessed data related to dengue fever in the city, the constructed graph structure, and the selected input features.
- the step S4 includes:
- Step 401 Model construction.
- the graph convolutional neural network model used in this embodiment was proposed by Kipf Thomas N and Max Welling in 2016, and its basic structure is shown in FIG. 3.
- the model consists of an input layer, two hidden layers (more hidden layers can also be set), and an output layer; after the two hidden layers, the rectified linear function ReLU and the hyperbolic tangent function tanh are used as the activation functions.
- Step 402 Model training. According to the input and output requirements of the GCN model, and the different prediction windows, organize K sets of data sets; each set of data sets are divided into training sets and validation sets in a certain proportion: in this embodiment, the top 75% of all weeks in the data set The weekly data is used for training, and the last 25% of the weekly data is used for verification; the training set under each prediction window is used to train the constructed GCN model.
- Step S5 Evaluate the prediction performance of the GCN model. in particular:
- the hit rate of the forecast results in week t is defined as follows:
- N m,t means that the number of cases in all towns within the city predicted in week t is ranked from highest to lowest, and the sum of the actual number of cases in the top m% of high-risk streets and towns; N t means the number of cases in week t The total number of actual cases in the city.
- FIG. 4 is a hardware architecture diagram of the dengue fever spatiotemporal prediction system 10 in a city of the present invention.
- the system includes: a preprocessing unit 101, a graph structure construction unit 102, a selection unit 103, a model construction unit 104, and an evaluation unit 105.
- the preprocessing unit 101 is used to collect and preprocess the data related to dengue fever in the city. in particular:
- the data related to dengue fever in the city includes: dengue fever case data, meteorological data, population distribution data, and township vector files (shapefile) of the studied city.
- the meteorological data includes daily average temperature and rainfall collected by meteorological monitoring stations in the city.
- the dengue fever case data is obtained from the disease prevention and control center of the country/province/city, and the dengue fever case data includes: the onset date and home address of each case; the meteorological data is obtained from the national/provincial/city meteorological
- the said population distribution data is obtained from the open source global population data project WorldPop website (https://www.worldpop.org/).
- the preprocessing of the collected dengue case data by the preprocessing unit 101 includes: firstly using a geocoding method to convert the home address of the case into latitude and longitude coordinates, and import all case points into ArcGIS according to their latitude and longitude coordinates to obtain a vector file of point type; Use the SpatialJoin tool in ArcGIS software to associate the case (point-type vector file) with the township (face-type vector file, that is, the township vector file) to determine the township where each case is located; finally, according to the date of onset of each case , Count the number of cases in each town in each week to form a matrix of the number of cases in W*N, where W is the number of weeks and N is the number of towns.
- the pre-processing of the collected meteorological data by the pre-processing unit 101 includes: obtaining the daily average temperature and rainfall recorded by all meteorological observatories in the city, firstly using the kriging method to perform spatial interpolation on them; The data is aggregated to the township level by week, and the average temperature and cumulative rainfall of each township in each week are counted to form the average temperature matrix and cumulative rainfall matrix of W*N.
- the spatial interpolation and data aggregation are processed in batches using the ArcPy toolkit of the Python language.
- the preprocessing unit 101 preprocessing the collected population distribution data includes: using ArcGIS software in this embodiment to download the population distribution data with a resolution of 100 meters in 2015 from the WorldPop website to the township level to obtain the total population of each township.
- the graph structure construction unit 102 is used for constructing a graph structure reflecting the spatial relationship of the inner region of the city according to the neighboring relationship between the regions. in particular:
- the graph structure construction unit 102 uses the Spatial Join function of ArcGIS software to obtain the neighboring relationship between the township and the township from the township vector file.
- FIG. 2 a schematic diagram of the construction process of structures A and B in this embodiment.
- the selection unit 103 is used to select input features for dengue fever prediction. in particular:
- the selection unit 103 selects four types of features commonly used in the literature that are closely related to the spread and outbreak of dengue fever, including the number of cases in the current week and the past week, the average temperature, the cumulative rainfall, and the number of population. As shown in Table 1, there are 13 features in total. Among them, the average moderate and the accumulated rainfall are related to the survival suitability of mosquito vectors; because dengue fever is an infectious disease, the number of future cases is also closely related to the number of past cases and the number of population.
- the model construction unit 104 is used for constructing and training the GCN model according to the preprocessed intra-city dengue fever related data, the constructed graph structure, and the selected input features. in particular:
- the model construction unit 104 performs model construction.
- the graph convolutional neural network model used in this embodiment was proposed by Kipf Thomas N and Max Welling in 2016, and its basic structure is shown in FIG. 3.
- the model consists of an input layer, two hidden layers (more hidden layers can also be set), and an output layer; after the two hidden layers, the rectified linear function ReLU and the hyperbolic tangent function tanh are used as the activation functions.
- the model construction unit 104 performs model training. According to the input and output requirements of the GCN model, and the different prediction windows, organize K sets of data sets; each set of data sets are divided into training sets and validation sets in a certain proportion: in this embodiment, the top 75% of all weeks in the data set The weekly data is used for training, and the last 25% of the weekly data is used for verification; the training set under each prediction window is used to train the constructed GCN model.
- the evaluation unit 105 is used to evaluate the prediction performance of the GCN model. in particular:
- the evaluation unit 105 inputs the verification set under each prediction window into the corresponding trained GCN model, and accordingly obtains the prediction value of the kth week in the future (that is, the number of cases in each town). Since the main purpose of prediction is to identify high-risk streets and towns among multiple streets and towns in the city, and to deploy prevention and control measures in a targeted manner, the hit rate is used in this embodiment to evaluate the prediction performance.
- the hit rate of the forecast results in week t is defined as follows:
- N m,t means that the number of cases in all towns within the city predicted in week t is ranked from highest to lowest, and the sum of the actual number of cases in the top m% of high-risk streets and towns; N t means the number of cases in week t The total number of actual cases in the city.
- FIG. 5 is a schematic diagram of the hardware device structure of the method for simulating the spread of infectious diseases in a city provided by an embodiment of the present application.
- the device includes one or more processors and memory. Taking a processor as an example, the device may also include: an input system and an output system.
- the processor, the memory, the input system, and the output system may be connected by a bus or other methods.
- the connection by a bus is taken as an example.
- the memory can be used to store non-transitory software programs, non-transitory computer executable programs, and modules.
- the processor executes various functional applications and data processing of the electronic device by running non-transitory software programs, instructions, and modules stored in the memory, that is, realizing the processing methods of the foregoing method embodiments.
- the memory may include a program storage area and a data storage area, where the program storage area can store an operating system and an application program required by at least one function; the data storage area can store data and the like.
- the memory may include a high-speed random access memory, and may also include a non-transitory memory, such as at least one magnetic disk storage device, a flash memory device, or other non-transitory solid-state storage devices.
- the memory may optionally include a memory remotely provided with respect to the processor, and these remote memories may be connected to the processing system through a network. Examples of the aforementioned networks include, but are not limited to, the Internet, corporate intranets, local area networks, mobile communication networks, and combinations thereof.
- the input system can receive input digital or character information, and generate signal input.
- the output system may include display devices such as a display screen.
- the one or more modules are stored in the memory, and when executed by the one or more processors, the following operations of any of the foregoing method embodiments are performed:
- Step a Collect and preprocess the dengue fever-related data in the city.
- the dengue fever-related data in the city includes: dengue fever case data, meteorological data, population distribution data, and township vector files of the studied city;
- Step b Construct a map structure that reflects the spatial relationship of the city's internal regions
- Step c Select the input features for the spatiotemporal prediction of dengue fever
- Step d Constructing and training a GCN model according to the preprocessed data related to dengue fever in the city, the constructed graph structure, and the selected input features, so as to use the GCN model to perform dengue fever spatiotemporal prediction.
- the embodiments of the present application provide a non-transitory (non-volatile) computer electronic device, the computer electronic device stores computer-executable instructions, and the computer-executable instructions can perform the following operations:
- Step a Collect and preprocess the dengue fever-related data in the city.
- the dengue fever-related data in the city includes: dengue fever case data, meteorological data, population distribution data, and township vector files of the studied city;
- Step b Construct a map structure that reflects the spatial relationship of the city's internal regions
- Step c Select the input features for the spatiotemporal prediction of dengue fever
- Step d Constructing and training a GCN model according to the preprocessed data related to dengue fever in the city, the constructed graph structure, and the selected input features, so as to use the GCN model to perform dengue fever spatiotemporal prediction.
- the embodiment of the present application provides a computer program product, the computer program product includes a computer program stored on a non-transitory computer-readable electronic device, the computer program includes program instructions, when the program instructions are executed by a computer To make the computer do the following:
- Step a Collect and preprocess the dengue fever-related data in the city.
- the dengue fever-related data in the city include: dengue fever case data, meteorological data, population distribution data, and township vector files of the studied city;
- Step b Construct a map structure that reflects the spatial relationship of the city's internal regions
- Step c Select the input features for the spatiotemporal prediction of dengue fever
- Step d Constructing and training a GCN model according to the preprocessed data related to dengue fever in the city, the constructed graph structure, and the selected input features, so as to use the GCN model to perform dengue fever spatiotemporal prediction.
- Example 1 of this application took 167 villages and towns in Guangdongzhou as an example to conduct experiments.
- the study period is from January 1, 2015 to September 22, 2019, with a total of 247 weeks.
- the data from week 5 to week 195 is used for model training, and the data from week 196 to week 247 is used for model verification.
- the prediction window k is 1, 2, ..., 8.
- the comparison methods are LASSO (least absolute shrinkage and selection operator) and SVM (support vector machine) regression models that are commonly used in current dengue fever prediction research and have been proven to be relatively effective. Use the above two models to make individual predictions for each township.
- Figure 6 is a comparison diagram of model effects with hit rate as an evaluation index. It can be seen that, compared with the dengue fever prediction method based on the LASSO and SVM regression model, the dengue fever prediction method using GCN provided in the present invention has better overall prediction performance, which fully demonstrates the effectiveness of the present invention.
- the present invention introduces the deep learning model Graph Convolutional Network (GCN) for the first time, which fully considers the spatial relationship between the inner regions of the city to capture the spread of diseases in space, and conducts joint prediction of each region, and achieves better results. Accurate prediction effect. In order to provide decision support for relevant prevention and control departments, avoid wasting manpower and material resources, and reduce the loss of life, health and property.
- GCN Graph Convolutional Network
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Public Health (AREA)
- General Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Biomedical Technology (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Biophysics (AREA)
- Software Systems (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Evolutionary Computation (AREA)
- Computational Linguistics (AREA)
- Artificial Intelligence (AREA)
- Databases & Information Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- Pathology (AREA)
- Epidemiology (AREA)
- Primary Health Care (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Description
Claims (11)
- 一种城市内部登革热时空预测方法,其特征在于,该方法包括如下步骤:a.采集城市内部登革热相关数据并进行预处理,所述城市内部登革热相关数据包括:所研究城市的登革热病例数据、气象数据、人口分布数据、乡镇矢量文件;b.构建反映城市内部区域空间关系的图结构;c.选择用于登革热时空预测的输入特征;d.根据预处理后的城市内部登革热相关数据、构建的图结构、选择的输入特征,对GCN模型进行构建与训练,以使用所述GCN模型进行登革热时空预测。
- 如权利要求1所述的方法,其特征在于,该方法还包括步骤e:对所述GCN模型的预测性能进行评估。
- 如权利要求1或2所述的方法,其特征在于,所述的步骤a具体包括:对采集的登革热病例数据预处理:将病例家庭住址转换为经纬度坐标;确定每个病例所在乡镇;根据每个病例的发病日期,统计每个周次每个乡镇的发病病例数量,构成W*N的病例数量矩阵,W为周次数量,N为乡镇数量;对采集的气象数据预处理:获取城市内所有气象观测站所记录的每日平均温和降雨量,使用克里金法分别对其进行空间插值;将插值后的数据分周次聚合至乡镇级别,统计每个周次每个乡镇的平 均温和累积降雨量,构成W*N的平均温矩阵和累积降雨量矩阵;对采集的人口分布数据预处理包括:将人口分布数据聚合至乡镇级别,获取每个乡镇的总人口。
- 如权利要求3所述的方法,其特征在于,所述的步骤b具体包括如下步骤:获取乡镇与乡镇之间的邻接关系;将乡镇视为点,乡镇之间的邻接关系视为边,构建图结构。
- 如权利要求4所述的方法,其特征在于,所述的步骤c具体包括:选择文献中常用的、与登革热传播和爆发有密切关系的特征作为输入特征。
- 如权利要求5所述的方法,其特征在于,所述的GCN模型由一层输入层、至少两层隐藏层及一层输出层构成;所述至少两层隐藏层后分别使用整流线性函数ReLU和双曲正切函数tanh作为激活函数。
- 如权利要求6所述的方法,其特征在于,步骤d中所述对GCN模型进行训练包括:根据所述GCN模型的输入、输出需求及不同预测窗口,整理K套数据集,每套所述数据集均划分为训练集和验证集;使用每个预测窗口下的训练集分别对构建的GCN模型进行训练。
- 一种城市内部登革热时空预测***,其特征在于,该***包括预处理单元、图结构构建单元、选择单元、模型构建单元,其中:所述预处理单元用于采集城市内部登革热相关数据并进行预处理,所述城市内部登革热相关数据包括:所研究城市的登革热病例数据、气象数据、人口分布数据、乡镇矢量文件;所述图结构构建单元用于构建反映城市内部区域空间关系的图结构;所述选择单元用于选择用于登革热时空预测的输入特征;所述模型构建单元用于根据预处理后的城市内部登革热相关数据、构建的图结构、选择的输入特征,对GCN模型进行构建与训练,以使用所述GCN模型进行登革热时空预测。
- 如权利要求9所述的***,其特征在于,所述***还包括:评估单元,用于对所述GCN模型的预测性能进行评估。
- 一种电子设备,包括:至少一个处理器;以及与所述至少一个处理器通信连接的存储器;其中,所述存储器存储有可被所述一个处理器执行的指令,所述指令被所述至少一个处理器执行,以使所述至少一个处理器能够执行上述权利要求1至8任一项所述的城市内部传染病扩散模拟方法的以下操作:步骤a:采集城市内部登革热相关数据并进行预处理,所述城市内部登革热相关数据包括:所研究城市的登革热病例数据、气象数据、人口分布数据、乡镇矢量文件;步骤b:构建反映城市内部区域空间关系的图结构;步骤c:选择用于登革热时空预测的输入特征;步骤d:根据预处理后的城市内部登革热相关数据、构建的图结构、选择的输入特征,对GCN模型进行构建与训练,以使用所述GCN模型进行登革热时空预测。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010346736.XA CN111554408B (zh) | 2020-04-27 | 2020-04-27 | 城市内部登革热时空预测方法、***及电子设备 |
CN202010346736.X | 2020-04-27 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2021218207A1 true WO2021218207A1 (zh) | 2021-11-04 |
Family
ID=72004089
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/139657 WO2021218207A1 (zh) | 2020-04-27 | 2020-12-25 | 城市内部登革热时空预测方法、***及电子设备 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN111554408B (zh) |
WO (1) | WO2021218207A1 (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115036040A (zh) * | 2022-06-28 | 2022-09-09 | 福州大学 | 融合发热人数和人口背景数据的流行病疫情时空预警方法 |
CN117973184A (zh) * | 2024-01-04 | 2024-05-03 | 南京中禹智慧水利研究院有限公司 | 考虑时空特征的城市内涝智能预报模型构建方法 |
CN118016318A (zh) * | 2024-04-08 | 2024-05-10 | 中国科学院地理科学与资源研究所 | 基于图神经网络的***共患病风险预测模型的构建方法 |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111554408B (zh) * | 2020-04-27 | 2024-04-19 | 中国科学院深圳先进技术研究院 | 城市内部登革热时空预测方法、***及电子设备 |
CN112185566B (zh) * | 2020-10-14 | 2021-08-13 | 上海玺翎智能科技有限公司 | 一种基于机器学习预测预警感染性疾病就医人数突增的方法 |
CN112397205A (zh) * | 2020-12-08 | 2021-02-23 | 中国气象局广州热带海洋气象研究所 | 一种基于气象学模型的登革热传染病预测方法 |
CN114464329A (zh) * | 2021-12-31 | 2022-05-10 | 中国科学院深圳先进技术研究院 | 一种城市疫情时空预测方法、***、终端以及存储介质 |
CN114360739B (zh) * | 2022-01-05 | 2023-07-21 | 中国科学院地理科学与资源研究所 | 一种基于遥感云计算与深度学习的登革热风险预测方法 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102682188A (zh) * | 2011-03-15 | 2012-09-19 | 中国科学院遥感应用研究所 | 一种城市级传染病模拟的方法及装置 |
CN108172301A (zh) * | 2018-01-31 | 2018-06-15 | 中国科学院软件研究所 | 一种基于梯度提升树的蚊媒传染病疫情预测方法及*** |
CN109859854A (zh) * | 2018-12-17 | 2019-06-07 | 中国科学院深圳先进技术研究院 | 传染病预测方法、装置、电子设备及计算机可读介质 |
CN111554408A (zh) * | 2020-04-27 | 2020-08-18 | 中国科学院深圳先进技术研究院 | 城市内部登革热时空预测方法、***及电子设备 |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3995424B2 (ja) * | 2001-03-16 | 2007-10-24 | 株式会社パスコ | 感染症伝播解析システム及びその伝播シミュレーションシステム |
US20170103172A1 (en) * | 2015-10-07 | 2017-04-13 | The Arizona Board Of Regents On Behalf Of The University Of Arizona | System And Method To Geospatially And Temporally Predict A Propagation Event |
CN109545386B (zh) * | 2018-11-02 | 2021-07-20 | 深圳先进技术研究院 | 一种基于深度学习的流感时空预测方法及装置 |
CN110136842A (zh) * | 2019-04-04 | 2019-08-16 | 平安科技(深圳)有限公司 | 急性传染病的发病预测方法、装置及计算机可读存储介质 |
CN110459329B (zh) * | 2019-07-11 | 2022-11-18 | 广东省公共卫生研究院 | 一种登革热风险综合评估方法 |
CN110610767B (zh) * | 2019-08-01 | 2023-06-02 | 平安科技(深圳)有限公司 | 发病率监测方法、装置、设备及存储介质 |
CN110993119B (zh) * | 2020-03-04 | 2020-07-07 | 同盾控股有限公司 | 基于人口迁移的疫情预测方法、装置、电子设备及介质 |
-
2020
- 2020-04-27 CN CN202010346736.XA patent/CN111554408B/zh active Active
- 2020-12-25 WO PCT/CN2020/139657 patent/WO2021218207A1/zh active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102682188A (zh) * | 2011-03-15 | 2012-09-19 | 中国科学院遥感应用研究所 | 一种城市级传染病模拟的方法及装置 |
CN108172301A (zh) * | 2018-01-31 | 2018-06-15 | 中国科学院软件研究所 | 一种基于梯度提升树的蚊媒传染病疫情预测方法及*** |
CN109859854A (zh) * | 2018-12-17 | 2019-06-07 | 中国科学院深圳先进技术研究院 | 传染病预测方法、装置、电子设备及计算机可读介质 |
CN111554408A (zh) * | 2020-04-27 | 2020-08-18 | 中国科学院深圳先进技术研究院 | 城市内部登革热时空预测方法、***及电子设备 |
Non-Patent Citations (2)
Title |
---|
ANONYMOUS: "Incomplete summary of existing variants of GCN (application in spatio-temporal data mining)", JIANSHU, 4 March 2020 (2020-03-04), XP055865300, Retrieved from the Internet <URL:https://www.jianshu.com/p/da48a2fb4265> [retrieved on 20211124] * |
STORM: "Application of GCN in the forecast of shared bicycle traffic", ZHIHU (GRAPH ALGORITHM AND APPLICATION), 9 August 2019 (2019-08-09), XP055865294, Retrieved from the Internet <URL:https://zhuanlan.zhihu.com/p/77209846?from_voters_page=true> [retrieved on 20211124] * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115036040A (zh) * | 2022-06-28 | 2022-09-09 | 福州大学 | 融合发热人数和人口背景数据的流行病疫情时空预警方法 |
CN117973184A (zh) * | 2024-01-04 | 2024-05-03 | 南京中禹智慧水利研究院有限公司 | 考虑时空特征的城市内涝智能预报模型构建方法 |
CN118016318A (zh) * | 2024-04-08 | 2024-05-10 | 中国科学院地理科学与资源研究所 | 基于图神经网络的***共患病风险预测模型的构建方法 |
Also Published As
Publication number | Publication date |
---|---|
CN111554408B (zh) | 2024-04-19 |
CN111554408A (zh) | 2020-08-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2021218207A1 (zh) | 城市内部登革热时空预测方法、***及电子设备 | |
Meyer et al. | Power-law models for infectious disease spread | |
Kraemer et al. | Past and future spread of the arbovirus vectors Aedes aegypti and Aedes albopictus | |
JP2019108798A (ja) | 建築物改装に伴う損害防止の推定 | |
CN106021508A (zh) | 基于社交媒体的突发事件应急信息挖掘方法 | |
US12009106B2 (en) | Emergency demand prediction device, emergency demand prediction method, and program | |
CN108650614A (zh) | 一种自动推断社会关系的移动用户位置预测方法与装置 | |
WO2023123624A1 (zh) | 城市流感发病趋势预测方法、***、终端以及存储介质 | |
CN113496781A (zh) | 城市内部传染病扩散模拟方法、***及电子设备 | |
CN109242170A (zh) | 一种基于数据挖掘技术的城市道路管理***及方法 | |
Jiang et al. | Unraveling the dynamic impacts of COVID-19 on metro ridership: An empirical analysis of Beijing and Shanghai, China | |
Ponce-de-Leon et al. | COVID-19 Flow-Maps an open geographic information system on COVID-19 and human mobility for Spain | |
CN115730763A (zh) | 基于终端信令数据的工作日设施可达性计算方法及装置 | |
US20230056370A1 (en) | Construction method of fine-grained infectious disease simulation model | |
Núñez-Corrales et al. | The epidemiology workbench: a tool for communities to strategize in response to covid-19 and other infectious diseases | |
Almquist et al. | Point process models for household distributions within small areal units | |
CN116703132B (zh) | 共享车辆动态调度的管理方法、装置及计算机设备 | |
CN112651574A (zh) | 基于p中位遗传算法的选址方法、装置及电子设备 | |
CN113408867B (zh) | 基于手机用户和poi数据的城市入室盗窃犯罪风险评估方法 | |
Kim et al. | Examining the socio-spatial patterns of bus shelters with deep learning analysis of street-view images: A case study of 20 cities in the US | |
Ganti et al. | Spatio-temporal spread of events in social networks: A gas shortage case study | |
Hu et al. | A simplified deep residual network for citywide crowd flows prediction | |
CN110428627A (zh) | 一种公交出行潜力区域识别方法及识别*** | |
CN117521908B (zh) | 一种城市空间区域适配性评估方法、***及终端 | |
Behrens et al. | Using geographic information systems to analyze the distribution and abundance of Aedes aegypti in Africa: the potential role of human travel in determining the intensity of mosquito infestation |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20933882 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20933882 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 20/04/2023) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20933882 Country of ref document: EP Kind code of ref document: A1 |