KR20190115330A

KR20190115330A - An application for predicting an acute exacerbation of chronic respiratory disease

Info

Publication number: KR20190115330A
Application number: KR1020180038236A
Authority: KR
Inventors: 김숙경
Original assignee: 주식회사 씨씨앤아이리서치
Priority date: 2018-04-02
Filing date: 2018-04-02
Publication date: 2019-10-11

Abstract

Provided is an application for predicting acute exacerbation of chronic respiratory disease. The application for predicting acute exacerbation of chronic respiratory disease comprises: a step of establishing a first model set to an equation comprising variables selected from variables from an integrated server for medical records collecting patient medical information from all hospital servers storing history data, clinical picture data, and genetic data of patients with chronic respiratory disease, a virus information server, a weather data server, a social data server, a life pattern provision server, and a location information server, and parameter coefficients coupled to the selected variables, and a second model set with a machine learning algorithm that includes at least some of the variables, as values used in the predictive model of acute exacerbation of chronic respiratory disease; a step of setting a model with high accuracy of prediction results among the first and second models as a predictive model; a step of predicting the occurrence of acute exacerbation of a chronic respiratory disease patient by inputting the variables into the predictive model every time the variables of the patient determined in the predictive model occur. According to the present invention, the patient can handle and prevent exacerbations before an acute exacerbation occurs.

Description

만성 호흡기 질환의 급성 악화 예측용 애플리케이션{An application for predicting an acute exacerbation of chronic respiratory disease}Application for predicting an acute exacerbation of chronic respiratory disease

본 발명은 만성 호흡기 질환(chronic respiratory disease)의 급성 악화 예측용 애플리케이션에 관한 것으로서, 보다 상세하게는 병원에 저장된 만성 호흡기 질환의 환자의 정보에 제한되지 않고, 모든 병원의 정보를 통합적으로 관리하는 국가 기관 또는 보험 회사 등의 모든 환자의 방대한 데이터에 기초하여 개별 환자의 만성 호흡기 질환의 급성 악화를 예측함으로써, 환자 스스로가 급성 악화의 발생 전에 악화를 미리 예방하는 애플리케이션에 관한 것이다. The present invention relates to an application for predicting acute exacerbation of chronic respiratory disease, and more particularly, not limited to the information of patients with chronic respiratory disease stored in the hospital, the country of the integrated management of all hospital information By predicting the acute exacerbation of chronic respiratory disease in an individual patient based on the vast data of all patients, such as agencies or insurance companies, the patient is concerned with an application that prevents exacerbation before the onset of acute exacerbation.

만성 호흡기 질환이란 비감염성의 기도나 폐 조직의 만성 질환이며, 주된 것으로, 천식, 만성 폐쇄성 폐 질환(COPD),기관지 확장증, 간질성 폐질환, 만성 기관지염 및 폐기종 등을 들 수 있다. 천식 또는 COPD이란 주로 담배의 연기나 오염된 대기와 같은 유해 물질을 장기간에 흡입함으로써 발병하는 폐의 염증성 질환이며, 진행성의 기류 폐쇄를 보인다. 더욱이, COPD의 유병률 및 사망률은 세계적으로 높은 수준(세계 사망 원인의 제4위, WHO 조사, 2004년)으로 향후 수십 년간은 환자수가 증가할 것으로 예측되고 있다. 또한, COPD라는 질환이 일반적으로 널리 알려져 있지 않은 상태이기 때문에, 잠재적인 환자도 상당수 있는 것으로 생각된다.Chronic respiratory diseases are chronic diseases of non-infectious airways or lung tissue, and the main ones include asthma, chronic obstructive pulmonary disease (COPD), bronchiectasis, interstitial lung disease, chronic bronchitis and emphysema. Asthma or COPD is an inflammatory disease of the lungs caused primarily by prolonged inhalation of harmful substances, such as cigarette smoke and contaminated air, with progressive airflow obstruction. Moreover, the prevalence and mortality rates of COPD are globally high (4th place in global mortality, WHO survey, 2004), which is expected to increase the number of patients over the next few decades. In addition, since a condition called COPD is generally not widely known, a large number of potential patients are considered.

또한, 만성 호흡기 질환 가운데, 간질성 폐질환은 폐의 간질 조직에 염증이 생긴 결과, 염증 조직의 섬유화가 일어남으로써 발병한다. 폐에는 약 3억개라고도 하는 폐포에 대기를 취입하고, 이 폐포에 얽혀 붙은 모세 혈관을 통해 가스 교환이 행해지고 있지만, 이들을 둘러싸고 지지하는 조직이 간질이다. 간질이 섬유화하면 폐 전체가 딱딱해져 폐의 정상적인 팽창이나 수축을 방해하여 폐활량이 저하하는 동시에, 폐포와 모세 혈관 간에 행해지는 가스 교환의 효율도 저하한다. In addition, among chronic respiratory diseases, interstitial lung diseases are caused by fibrosis of inflamed tissues as a result of inflammation of the interstitial tissues of the lungs. In the lungs, air is blown into alveoli, also called about 300 million, and gas exchange is performed through capillaries entangled in the alveoli, but the tissue surrounding and supporting them is epilepsy. Fibrosis of the epilepsy hardens the entire lung, impedes the normal expansion and contraction of the lungs, lowers the lung capacity and reduces the efficiency of gas exchange between the alveoli and capillaries.

만성 호흡기 질환은 세계적으로 높은 사망의 원인이 되고 있으며, 흡연, 대기오염 등으로 급격히 늘고 있는 추세이다. 이러한 만성폐쇄성 폐질환의 원인은 독성 분자나 가스에 대한 폐의 비정상적 만성 염증 반응으로, 흡연, 도시화와 공해, 호흡기 감염증 등의 여러 가지 인자들이 복합적으로 관여한다. Chronic respiratory disease is the leading cause of death worldwide, and is rapidly increasing due to smoking and air pollution. The cause of the chronic obstructive pulmonary disease is an abnormal chronic inflammatory response of the lung to toxic molecules or gases, which is a combination of several factors, such as smoking, urbanization and pollution, respiratory infections.

만성 호흡기 질환은 발병되더라도 급속한 악화는 지속적인 치료를 통해 억제할 수 있으나, 약물 복용의 불규칙성, 미세 먼지의 급격한 증가, 만성 호흡기 질환으로 인한 직접적인 관련 증상의 늦은 발견, 호흡기 바이러스의 갑작스러운 발생, 급격한 온도 변화 등으로 인하여, 환자는 급격히 약화되어 단시간에 사망에 이를 수 있다. Although chronic respiratory disease develops, rapid deterioration can be suppressed by continuous treatment, but irregularities in drug use, rapid increase in fine dust, late detection of direct related symptoms due to chronic respiratory disease, sudden development of respiratory virus, and sudden temperature Changes, etc., can cause the patient to rapidly weaken and lead to death in a short time.

따라서, 악화의 발생을 예측하여 악화 발생을 예방하거나 조기에 증상을 치료하는 치료를 시작함으로써, 악화에 의해 발생하는 손해를 경감시키는 것이 바람직하다.Therefore, it is desirable to reduce the damage caused by exacerbation by anticipating the occurrence of exacerbation and preventing the occurrence of exacerbation or starting treatment to treat symptoms early.

이를 위해, 임상학적 파라미터의 조합 구성은 환자의 급성 악화를 예측하는데 사용되나, 이러한 임상학적 파라미터는 각각의 환자에게 적용시키기에 충분히 정확한 예측은 아니다. 더욱이, 환자는 병원 방문 후에 상술한 요인들에 의해 급성 악화의 가능성에 노출될 수 있음에도, 스스로 급성 악화의 가능성을 예측할 수 없으므로, 환자가 급성 악화 후에 병원을 방문하면 예상치 않는 결과를 초래할 수 있다. 병원 방문 전이라도 급성 악화의 가능성을 스스로 예측하기 위한 툴의 개발이 필요하다. To this end, a combination of clinical parameters is used to predict acute exacerbation of a patient, but such clinical parameters are not accurate enough to apply to each patient. Moreover, although the patient may be exposed to the possibility of acute exacerbation by the aforementioned factors after the hospital visit, the possibility of acute exacerbation by themselves cannot be predicted, and therefore, the patient visiting the hospital after the acute exacerbation may have unexpected results. Even before a hospital visit, the development of tools to predict the possibility of acute exacerbation is needed.

본 발명이 이루고자 하는 기술적 과제는 모든 병원의 정보를 통합적으로 관리하는 국가 기관 또는 보험 회사 등의 모든 만성 호흡기 질환 환자의 방대한 데이터에 기초하여 개별 환자의 급성 악화를 예측함으로써, 환자 스스로가 급성 악화의 발생 전에 악화를 미리 예방하는 만성 호흡기 질환의 급성 악화 예측용 애플리케이션을 제공하는데 있다. The technical problem to be achieved by the present invention is to predict the acute deterioration of individual patients based on the vast amount of data of all chronic respiratory disease patients, such as national agencies or insurance companies that collectively manage the information of all hospitals, so that patients themselves It is to provide an application for predicting acute exacerbation of chronic respiratory disease that prevents exacerbation before it occurs.

본 발명의 목적은 이상에서 언급된 목적으로 제한되지 않으며, 언급되지 않은 다른 목적들은 아래의 기재로부터 당업자에게 명확하게 이해될 수 있을 것이다. The object of the present invention is not limited to the above-mentioned object, and other objects not mentioned will be clearly understood by those skilled in the art from the following description.

상기 기술적 과제를 이루기 위한 본 발명의 일 양태에 따르면, 만성 호흡기 질환의 급성 악화 예측용 애플리케이션은 만성 호흡기 질환 급성 악화의 예측 모델에서 이용하는 값들로서, 만성 호흡기 질환 환자들의 병력 데이터, 임상 양상 데이터, 유전자 데이터를 저장하는 모든 병원 서버로부터 환자 의료 정보들을 수집하는 의료 기록용 통합 서버, 바이러스 정보 서버, 기상 데이터 서버, 소셜 데이터 서버, 생활 패턴 제공 서버 및 위치 정보 서버로부터의 변수들로부터 선택된 변수들과 상기 선택된 변수들에 결합되는 파라미터 계수들을 포함하는 방정식으로 설정된 제 1 모형, 및 상기 변수들의 적어도 일부를 포함하는 머신 러닝(machine learning) 알고리즘으로 설정된 제 2 모형을 수립하는 단계와, 상기 서버들로부터, 상기 제 1 및 제 2 모형들 각각에서 선택된 동일 기간 내의 변수들과 상기 동일 기간 내의 만성 호흡기 질환의 급성 악화의 실제 발생 여부에 따른 실제 결과값들을 추출하여 상기 모형들에서의 급성 악화의 발생 여부를 나타내는예측 결과값과 상기 실제 결과값들을 비교하고, 비교 결과 예측 결과값의 정확도가 높은 모형을 예측 모델로 설정하는 단계와, 만성 호흡기 질환의 환자의 급성 악화 발생 여부에 대한 예측 요청에 의해, 상기 서버들로부터, 상기 예측 모델에서 결정된 상기 환자의 변수들이 발생한 시점마다 상기 변수들을 데이터 세트로 구성하는 적어도 하나의 명세서를 생성하는 단계와, 상기 명세서의 변수들을 상기 예측 모델에 입력하여 상기 환자의 만성 호흡기 질환 환자의 급성 악화의 발생을 예측하는 단계, 및 예측 결과값을 상기 환자, 의료 기록용 통합 서버 및 병원 서버에 적어도 하나에 제공하는 단계를 실행시키고, 상기 제 1 및 제 2 모형들에서 공통되는 변수들은 상기 의료 기록용 통합 서버로부터 입수된 병력 데이터 및 상기 바이러스 정보 서버로부터 입수된 호흡기 바이러스 데이터를 적어도 포함하고, 상기 병력 데이터는 환자의 만성 호흡기 질환의 약물 복용 및 복용 빈도, 응급실 이용 빈도, 입원 빈도, 요양 일수, 진료 일수, 만성 호흡기 질환 급성 악화 발생 빈도를 적어도 포함한다. According to an aspect of the present invention for achieving the above technical problem, the application for predicting acute exacerbation of chronic respiratory disease is a value used in the predictive model of acute exacerbation of chronic respiratory disease, history data, clinical picture data, genes of patients with chronic respiratory disease Variables selected from variables from an integrated server for medical records, virus information server, weather data server, social data server, life pattern providing server and location information server that collects patient medical information from all hospital servers storing data; Establishing a first model set with an equation comprising parameter coefficients coupled to selected variables, and a second model set with a machine learning algorithm including at least some of the variables, from the servers, In each of the first and second models By extracting the actual results according to the selected variables within the same period and the actual occurrence of acute exacerbation of chronic respiratory disease within the same period, the predicted and indicative results indicating whether acute exacerbation occurs in the models Comparing, and setting the model having a high accuracy of the prediction result as a prediction model and requesting a prediction of whether acute deterioration of a patient of chronic respiratory disease has occurred, from the servers, the determination determined in the prediction model. Generating at least one specification comprising the variables in the data set at each time point of occurrence of the patient's variables, and inputting the parameters of the specification into the prediction model to predict the occurrence of acute exacerbation of a patient with chronic respiratory disease of the patient And predicting results of the patient, the integrated server for medical records, and the bottle. And providing at least one to the server, wherein the variables common to the first and second models include at least one of medical history data obtained from the integrated medical record server and respiratory virus data obtained from the virus information server. The medical history data includes at least the frequency of taking and taking the drug of the patient, the frequency of emergency room use, the frequency of hospitalization, the number of days of treatment, the number of days of treatment, and the frequency of occurrence of chronic acute respiratory disease.

다른 실시예에서, 상기 호흡기 바이러스 데이터는 상기 환자의 병력 데이터 또는 임상 양상 데이터와 관련된 변수들의 발생 시점보다 소정 기간 전에 수집되는 바이러스 검출률이며, 아데노 바이러스(ADenoVirus; 'ADV'), 파라인플루엔자 바이러스(ParaInfluenza Virus; 'PIV'), 호흡기세포융합 바이러스(Respiratory Syncytial Virus; 'RSV'), H1N1pdm 바이러스, 인간 코로나 바이러스(human CoronaVirus; 'hCoV'), 인간 라이노 바이러스(human Rhion Virus; 'hRV'), 인간 보카 바이러스(human BOcaVirus; 'hBOV'), hEV 바이러스 및 IFV 바이러스 중 적어도 하나일 수 있다. In another embodiment, the respiratory virus data is a virus detection rate that is collected a predetermined time before the time of occurrence of variables related to the medical history data or clinical pattern data of the patient, adenovirus (ADenoVirus; 'ADV'), parainfluenza virus (ParaInfluenza) Virus ('PIV'), Respiratory Syncytial Virus ('RSV'), H1N1pdm virus, human CoronaVirus ('hCoV'), human Rhion Virus ('hRV'), human It may be at least one of human BOcaVirus ('hBOV'), hEV virus and IFV virus.

또 다른 실시예에서, 상기 병력 데이터의 COPD 약물 복용은 서로 상이한 성분의 복수 약제이며, 상기 복용 빈도는 약제 별 복용 빈도일 수 있다. In another embodiment, the COPD drug dosage of the medical history data is a plurality of drugs of different components, and the dosage frequency may be a dosage frequency for each drug.

또 다른 실시예에서,상기 제 1 모형은 상기 급성 악화의 발생 여부를 나타내는 예측 결과값과 상기 변수들과 상호 연관되는(correlated) 것을 반영하는 모델을 이용할 수 있다. In another embodiment, the first model may use a model reflecting a predicted result indicating whether the acute deterioration has occurred and correlated with the variables.

상기 병력 데이터는 천식 또는 COPD(Chronic Obstructive pulmonary Disease) 여부를 포함하는 관련 증상 발생 빈도, 및 허혈성심질환, 폐암, 골다공증, 우울증, 관절염, 당뇨병, 위식도 역류, 기흉, 심부전, 고혈압, 빈혈, 대사증후군 중 적어도 하나를 포함하는 동반 질환 빈도를 더 가질 수 있다. The medical history data includes the incidence of related symptoms, including asthma or Chronic Obstructive Pulmonary Disease (COPD), and ischemic heart disease, lung cancer, osteoporosis, depression, arthritis, diabetes, gastroesophageal reflux, pneumothorax, heart failure, hypertension, anemia, metabolic syndrome. It may further have a comorbid disease frequency comprising at least one of.

또 다른 실시예에서, 상기 임상 양상 데이터는 평균 흡연량, 흡연 이력, 1초 노력성 호기량(forced expiratory volume in 1 second; FEV1%), 환자 자가 진단의 질의에 대한 응답을 계량화한 CAT(COPD Assessment Test) 중 적어도 하나를 포함할 수 있다. In another embodiment, the clinical modality data is a COPD Assessment Test that quantifies the mean smoking volume, smoking history, forced expiratory volume in 1 second (FEV1%), and response to questions of patient self-diagnosis. It may include at least one of).

또 다른 실시예에서, 상기 기상 데이터 서버로부터 제공되는 변수들은 상기 환자의 병력 데이터 또는 상기 임상 양상 데이터와 관련된 변수들의 발생 시점보다 소정 기간 전에 수집되는 데이터로서, 미세 먼지 누적량, 최저 기온 누적량, 최저 기온 변화량, 강수량 누적량, 최고 기온 누적량, 최고 기온 변화량, 일교차, 일별 최대 풍속, 평균 습도, 일조량 중 적어도 하나를 포함할 수 있다. In another embodiment, the variables provided from the meteorological data server are data collected before a time point of occurrence of variables related to the medical history data or the clinical pattern data of the patient, and include fine dust accumulation amount, minimum temperature accumulation amount, and minimum temperature. It may include at least one of a change amount, a cumulative amount of precipitation, a maximum temperature accumulation amount, a maximum temperature change amount, a day crossing, a daily maximum wind speed, an average humidity, and a sunshine amount.

또 다른 실시예에서, 상기 소셜 데이터 서버로부터 제공되는 변수들은 상기 환자의 병력 데이터 또는 상기 임상 양상 데이터와 관련된 변수들의 발생 시점을 포함하는 소정 기간에 수집되는 데이터로서, 검색 엔진 및 소셜 데이터에서 발견되는 만성 호흡기 질환 관련 용어, 만성 호흡기 질환에 의한 증상 관련 용어, 만성 호흡기 질환에 따른 질환 관련 용어의 빈도일 수 있다. In another embodiment, the variables provided from the social data server are data collected over a period of time, including when the occurrence of variables related to the patient's medical history data or the clinical modality data, is found in search engines and social data. Term related to chronic respiratory disease, symptom related to chronic respiratory disease, and disease related term to chronic respiratory disease.

또 다른 실시예에서, 상기 생활 패턴 제공 서버로부터 제공되는 변수들은 상기 환자들의 카드 구매 정보들을 포함하고, 상기 위치 정보 서버는 상기 환자가 위치한 장소의 대기질 정보를 포함할 수 있다. In another embodiment, the variables provided from the life pattern providing server may include card purchase information of the patients, and the location information server may include air quality information of the location where the patient is located.

또 다른 실시예에서, 상기 제 1 모형을 수립하는 단계에서 파라미터 계수들(β)을 결정하는 것은 하기 식 1에 따른 계수 추정식을 0으로 만드는 상기 파라미터 계수들을 구함으로써 추정할 수 있다. In another embodiment, determining the parameter coefficients β in the step of establishing the first model may be estimated by obtaining the parameter coefficients that make the coefficient estimation equation according to Equation 1 equal to zero.

[식 1]

[Equation 1]

(i는 환자별 식별자이고, j는 환자별 명세서 식별자이며,

는 i번째 환자의 j번째의 명세서로서변수들의 데이터 세트(

)로 구성되며,

는 명세서

에 따른 실제 급성 악화 발생 여부에 따른 실제 결과값이며, (i is patient-specific identifier, j is patient-specific statement identifier,

Is the jth specification of the i patient and the data set of variables (

),

Specification

According to whether the actual acute exacerbation occurred

β는 변수들과 상관되는 파라미터 계수들의 세트(

)이고,

이고,

이며,

(평균)이고, β is the set of parameter coefficients that correlate

)ego,

ego,

Is,

(Average)

이며,

(분산)인 경우,

이고,

는 명세서와 급성 악화 발생 여부에 따른 값 간의 상관성을 해결하기 위한 시변(time varing) 상관 행렬로서 기본 행렬임)

Is,

(Dispersion),

ego,

Is a time varing correlation matrix to solve the correlation between the specification and the value of acute deterioration, which is the base matrix.)

또한, 상기 예측 모델이 상기 제 1 모형으로 설정된 경우, 상기 예측 결과값을 제공하는 단계 후에, 상기 예측 모델에 따라 급성 악화의 발생 여부가 예측된 환자의 실제 급성 악화 여부의 결과값을 상기 의료 기록용 통합 서버 및 상기 애플리케이션이 내장된 예측 서버에 기록하는 단계와, 상기 의료 기록용 통합 서버, 상기 기상 데이터 서버, 상기 바이러스 정보 서버, 상기 소셜 데이터 서버, 상기 생활 패턴 제공 서버, 상기 위치 정보 서버, 상기 예측 서버로부터 모든 변수들 및 만성 호흡기 질환의 급성 악화의 실제 발생 여부에 따른 실제 결과값들을 수집하여 모든 환자 별 명세서들을 생성하는 단계와, 상기 예측 모델의 계수 추정식을 통해, 상기 명세서의 변수들로 구성된 데이터 세트와 결과값들에 기초하여 변수들에 결합되는 파라미터 계수들을 재추정하는 단계, 및 상기 실제 결과값들에 대한 영향도를 평가하여 상기 실제 결과값들에 영향을 미치는 변수들과 파라미터 계수를 재결정하는 단계를 더 포함하고, 상기 파라미터 계수들의 재추정은 상기 식 1에 따른 계수 추정식을 이용할 수 있다.In addition, when the prediction model is set to the first model, after providing the prediction result value, the medical record of the result of the actual acute deterioration of the patient predicted whether or not the occurrence of acute deterioration according to the prediction model A recording server, an integrated server for the application, and a prediction server in which the application is embedded; the integrated server for medical records, the weather data server, the virus information server, the social data server, the life pattern providing server, the location information server, Collecting all variables from the prediction server and actual results according to the actual occurrence of acute exacerbation of chronic respiratory disease to generate all patient-specific specifications, and through the coefficient estimation formula of the prediction model, the variables of the specification Parameter coefficients that are coupled to the variables based on the data set and result values. Re-estimating, and evaluating the influence on the actual result values to re-determine parameters and parameter coefficients that affect the actual result values, wherein the re-estimation of the parameter coefficients is expressed by Equation 1 above: The coefficient estimation equation according to can be used.

또 다른 실시예에서, 상기 제 2 모형을 수립하는 단계는, 상기 서버들로부터 모든 변수들 및 급성 악화의 실제 발생 여부에 따른 실제 결과값들을 수집하여 모든 환자 별 명세서들을 생성하는 단계와, 상기 모든 변수들 및 상기 변수들마다 대응하는 실제 결과값들로부터 랜덤하게(randomly) 복원 추출하여 복수의 샘플 데이터 세트들을 생성하는 단계와, 하나의 샘플 데이터 세트의 변수들로부터 독립 변수들을 랜덤하게 선택하는 단계와, 상기 선택된 독립 변수들 중 실제 결과값들을 가장 잘 분류하는 독립 변수들의 탐색 및 대응 가중치들의 산출하는 단계와, 상기 탐색된 독립 변수들 중 분류의 소정 방식에 부합하는 1 번째 세대의 노드 및 이로부터 분기되는 2 번째 세대의 노드와 관련된 독립 변수를 선정하는 노드 분리를 수행하며 기 설정된 세대에 도달할 때까지 분류 트리(tree)를 생성하는 단계와, 상기 분류 트리가 기 설정된 개수로 생성되지 않은 경우, 상기 복수의 샘플 데이터 세트 중 상기 단계들이 진행되지 않은 세트들에 대해 상기 단계들을 반복하는 단계, 및 상기 분류 트리가 기 설정된 개수로 생성된 경우, 상기 복수의 분류 트리를 구조 유사도 별로 분류하여 개수를 파악하고, 최대 개수의 분류 트리를 제 2 모형으로 수립하는 단계를 포함할 수 있다. In another embodiment, establishing the second model includes collecting all variables from the servers and actual results according to the actual occurrence of acute exacerbation and generating all patient specific specifications; Randomly restoring and extracting the variable and the actual result values corresponding to the variables to generate a plurality of sample data sets, and randomly selecting independent variables from the variables of one sample data set. Searching for and calculating corresponding weights of independent variables that best classify actual results among the selected independent variables, and a node of a first generation corresponding to a predetermined manner of classification among the searched independent variables, and Node separation that selects independent variables related to the second generation of nodes branching from Generating a classification tree until reaching the step; and if the classification tree is not generated in a predetermined number, repeating the steps on the sets of the plurality of sample data sets that are not in progress. And when the classification tree is generated with a predetermined number, classifying the plurality of classification trees by structural similarity to determine the number, and establishing a maximum number of classification trees as a second model.

기타 실시예들의 구체적인 사항들은 상세한 설명 및 도면들에 포함되어 있다. Specific details of other embodiments are included in the detailed description and the drawings.

본 발명에 따르면, 모든 병원의 정보를 통합적으로 관리하는 국가 기관 또는 보험 회사 등의 모든 환자의 방대한 데이터에 기초하여 개별 환자의 만성 호흡기 질환의 급성 악화를 예측함으로써, 환자 스스로가 급성 악화의 발생 전에 악화를 미리 대처하면서 예방할 수 있다According to the present invention, based on the vast data of all patients, such as national institutions or insurance companies, which collectively manage the information of all hospitals, the patient predicts the acute exacerbation of chronic respiratory disease of an individual patient before the patient himself or herself develops acute exacerbation. We can prevent aggravation beforehand

또한, 만성 호흡기 질환의 급성 악화에 영향을 주는 변수들에 대하여 평가하여 실질적으로 급성 악화를 유발할 가능성이 높은 변수들에 기초하여 예측함으로써, 예측 정확도를 향상시킬 수 있다. In addition, the accuracy of prediction can be improved by evaluating variables that affect acute exacerbation of chronic respiratory disease and predicting based on variables that are likely to cause acute exacerbation substantially.

더욱이, 유발 가능성이 높은 변수들과 함께, 이와 상관되는 파라미터 계수들의 방정식으로 구성된 제 1 모형과 머신 러닝(machine learing) 알고리즘으로 구성된 제 2 모형 중 정확한 예측을 나타내는 모형을 예측 모델로 설정함으로써 급성 악화의 예측이 정확하게 수행될 수 있다. Moreover, acute exacerbation by setting the predictive model as the predictive model among the first model composed of equations of the parameter coefficients correlated with the probable variables and the second model composed of machine learing algorithms. Can be accurately performed.

아울러, 검색 엔진, 및 SNS, 카페, 블로그 등의 개인화된 소셜 데이터로부터 검출되는 만성 호흡기 질환 관련 용어, 이에 의한 증상 관련 용어, 이에 따른 질환 관련 용어의 빈도와 소셜 데이터의 트렌드를 예측하는데 활용함으로써, 다른 만성 호흡기 질환 보유 환자의 추이를 간접적으로 파악하여 급성 악화를 보다 정확하게 예상할 수 있다.In addition, by using it to predict the frequency of chronic respiratory disease-related terminology, symptoms-related terminology, and related disease terminology detected from search engines and personalized social data such as SNS, cafes and blogs, By indirectly identifying trends in patients with other chronic respiratory diseases, acute exacerbations can be predicted more accurately.

또한, 개별 환자의 유전자 정보, 담배, 술 등에 대한 환자의 구매를 통한 생활 패턴 정보 및 환자 실제 위치한 장소의 대기질 장소도 급성 악화의 예측시에 이용되어, 급성 악화 예측의 정확도가 향상될 수 있다. In addition, the genetic information of the individual patient, lifestyle information through the purchase of the patient for cigarettes, alcohol, etc., and the air quality place of the actual place where the patient is located may also be used in the prediction of the acute deterioration, thereby improving the accuracy of the acute deterioration prediction. .

이에 더하여, 애플리케이션으로 예측한 환자에 대한 실제 급성 악화의 여부를 병원 또는 의료 기록용 통합 서버로부터 추적하고, 이를 이용하여 예측 모델의 변수들과 파라미터 계수들 또는 가중치를 조정함으로써, 예상치 않는 변수들을 반영할 수 있으며, 예측 모델의 더 정교하게 발전시킬 수 있다. In addition, the application tracks the actual acute deterioration of the patient's predicted patient from a hospital or integrated server for medical records and uses it to adjust the variables and parameter coefficients or weights of the predictive model to reflect unexpected variables. And more sophisticated development of predictive models.

도 1은 본 발명의 일 실시예에 따른 만성 호흡기 질환의 급성 악화 예측용 애플리케이션이 구현된 예측 서버를 포함하는 전체 네트워크를 개략적으로 도시한 도면이다.
도 2는 예측 서버에서 내장된 만성 호흡기 질환의 급성 악화 예측용 애플리케이션의 개략적인 구성도이다.
도 3은 모델 구성부의 상세 구성도이다.
도 4은 제 1 모형을 수립하는 프로세스에 관한 흐름도이다.
도 5는 제 2 모형을 수립하는 프로세스에 관한 흐름도이다.
도 6은 예측 모델을 판정하는 프로세스에 관한 흐름도이다.
도 7은 개별 환자의 요청에 따라 만성 호흡기 질환의 급성 악화를 예측하는 프로세스에 관한 흐름도이다.
도 8은 예측 모델이 제 1 모형인 경우에 예측 모델을 갱신하는 프로세스에 관한 흐름도이다. 1 is a diagram schematically illustrating an entire network including a prediction server in which an application for predicting acute exacerbation of chronic respiratory disease according to an embodiment of the present invention is implemented.
2 is a schematic diagram of an application for predicting acute exacerbation of chronic respiratory disease embedded in a prediction server.
3 is a detailed configuration diagram of the model configuration unit.
4 is a flowchart of a process for establishing a first model.
5 is a flow diagram of a process for establishing a second model.
6 is a flowchart of a process for determining a prediction model.
7 is a flow diagram of a process for predicting acute exacerbation of chronic respiratory disease at the request of an individual patient.
8 is a flowchart of a process of updating a prediction model when the prediction model is a first model.

이하, 첨부한 도면들 및 후술되어 있는 내용을 참조하여 본 발명의 바람직한 실시예들을 상세히 설명한다. 그러나, 본 발명은 여기서 설명되어지는 실시예들에 한정되지 않고 다른 형태로 구체화될 수도 있다. 오히려, 여기서 소개되는 실시예들은 개시된 내용이 철저하고 완전해질 수 있도록 그리고 당업자에게 본 발명의 사상이 충분히 전달될 수 있도록 하기 위해 제공되어지는 것이다. 명세서 전체에 걸쳐서 동일한 참조번호들은 동일한 구성요소들을 나타낸다. 한편, 본 명세서에서 사용된 용어는 실시예들을 설명하기 위한 것이며 본 발명을 제한하고자 하는 것은 아니다. 본 명세서에서, 단수형은 문구에서 특별히 언급되지 않는 한 복수형도 포함된다. 명세서에서 사용되는 "포함한다(comprises)" 및/또는 "포함하는(comprising)"은 언급된 구성요소, 단계, 동작 및/또는 소자가 하나 이상의 다른 구성요소, 단계, 동작 및/또는 소자의 존재 또는 추가를 배제하지 않는다. Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings and the contents described below. However, the present invention is not limited to the embodiments described herein and may be embodied in other forms. Rather, the embodiments introduced herein are provided to ensure that the disclosed subject matter is thorough and complete, and that the scope of the invention to those skilled in the art will fully convey. Like numbers refer to like elements throughout. Meanwhile, the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. In this specification, the singular forms also include the plural unless specifically stated otherwise in the text. As used herein, “comprises” and / or “comprising” refers to the presence of one or more other components, steps, operations and / or elements in which the mentioned components, steps, operations and / or elements are known. Or does not exclude additions.

또한, "부"내지 "모듈" 이란, 일반적으로 논리적으로 분리 가능한 소프트웨어(컴퓨터 프로그램), 하드웨어 등의 부품을 가리킨다. 따라서, 본 실시형태에 있어서의 모듈은 컴퓨터 프로그램에 있어서의 모듈뿐만 아니라, 하드웨어 구성에 있어서의 모듈도 가리킨다. 그 때문에, 본 실시형태는, 그것들을 모듈로서 기능시키기 위한 컴퓨터 프로그램(컴퓨터에 각각의 단계를 실행시키기 위한 프로그램, 컴퓨터를 각각의 수단으로서 기능시키기 위한 프로그램, 컴퓨터에 각각의 기능을 실현시키기 위한 프로그램), 시스템 및 방법의 설명도 겸하고 있다. 다만, 설명의 형편상, "저장한다", "저장시킨다", 이것들과 동등한 문언을 이용하지만, 이들 문언은, 실시형태가 컴퓨터 프로그램일 경우에는, 기억 장치에 기억시키는, 또는 기억 장치에 기억시키는 것과 같이 제어하는 것을 의미한다. 또한, "부" 내지 "모듈"은 기능에 일대일로 대응하고 있어도 되지만, 실장에 있어서는, 1 모듈을 1 프로그램으로 구성해도 되고, 복수 모듈을 1프로그램으로 구성해도 되고, 반대로 1 모듈을 복수 프로그램으로 구성해도 된다. 또한, 복수 모듈은 1 컴퓨터에 의해 실행되어도 되고, 분산 또는 병렬 환경에 있어서의 컴퓨터에 의해 1 모듈이 복수 컴퓨터로 실행되어도 된다. 또한, 1개의 모듈에 다른 모듈이 포함되어 있어도 된다. 또한, 이하, "접속"이란 물리적인 접속 외에, 논리적인 접속(데이터의 주고받기, 지시, 데이터간의 참조 관계 등)일 경우에도 채용한다. "소정의" 또는 "미리 정해진" 이란, 대상으로 하고 있는 처리 전에 정해져 있음을 말하고, 본 실시형태에 의한 처리가 개시되기 전은 물론이고, 본 실시형태에 의한 처리가 개시된 후에도, 대상으로 하고 있는 처리 전이면, 그 때의 상황, 상태에 따라, 또는 그때까지의 상황, 상태에 따라 정해지는 것의 의미를 포함해서 이용한다.In addition, "part" to "module" generally refer to components, such as software (computer program) and hardware which are logically separable. Therefore, the module in the present embodiment indicates not only the module in the computer program but also the module in the hardware configuration. Therefore, in the present embodiment, a computer program for making them function as a module (a program for executing each step of the computer, a program for making the computer function as each means, a program for realizing each function in the computer) And a description of the system and method. However, for the sake of explanation, the words "save" and "save", which are equivalent to these, are used. However, when the embodiment is a computer program, these words are stored in the storage device or stored in the storage device. It means to control as. In addition, although "unit" to "module" may correspond one-to-one to a function, in implementation, one module may be comprised by one program, multiple modules may be comprised by one program, and conversely, one module may be comprised by multiple programs. You may comprise. The multiple modules may be executed by one computer, or one module may be executed by multiple computers by a computer in a distributed or parallel environment. In addition, another module may be included in one module. In addition, in the following, the term "connection" is also employed in the case of a logical connection (data exchange, instruction, reference relationship between data, etc.) in addition to the physical connection. The term "predetermined" or "predetermined" means that it is determined before the target processing, and not only before the processing according to the present embodiment is started but also after the processing according to the present embodiment is started. If it is before processing, it uses including the meaning of what is decided according to the situation, state at that time, or the situation, state until then.

또한, 시스템 또는 장치란, 복수의 컴퓨터, 하드웨어, 장치 등이 네트워크(일대 일 대응의 통신 접속을 포함함) 등의 통신 수단에 의해 접속되어 구성되는 것 외에, 1개의 컴퓨터, 하드웨어, 장치 등에 의해 실현될 경우도 포함된다. In addition, a system or apparatus means that a plurality of computers, hardware, devices, and the like are connected and configured by a communication means such as a network (including a one-to-one communication connection). This is also the case when it is realized.

또한, 각 부 내지 각 모듈에 의한 처리마다 또는 각 부 내지 모듈 내에서 복수의 처리를 행할 경우는 그 처리마다, 대상이 되는 정보를 기억 장치(메모리)로부터 판독 입력하고, 그 처리를 행한 후에, 처리 결과를 기억 장치에 기입하는 것이다. 따라서, 처리 전의 기억 장치로부터의 판독 입력, 처리 후의 기억 장치에의 기입에 관해서는, 설명을 생략할 경우가 있다. 또한, 여기에서의 기억 장치로서는, 하드디스크, RAM(RandoM Access Memory), 외부 기억 매체, 통신 회선을 통한 기억 장치, CPU(Central Processing Unit) 내의 레지스터 등을 포함하고 있어도 된다.In the case where a plurality of processes are performed for each process by each unit or each module or within each unit or module, the target information is read out from the storage device (memory) for each process, and then the process is performed. The result of the processing is written into the storage device. Therefore, the description may be omitted for reading input from the storage device before the processing and writing to the storage device after the processing. The storage device here may include a hard disk, a random access memory (RAM), an external storage medium, a storage device via a communication line, a register in a central processing unit (CPU), and the like.

이하, 도 1 및 도 2를 참조하여, 본 발명의 일 실시예에 따른 만성 호흡기 질환의 급성 악화 예측용 애플리케이션에 대하여 상세히 설명하기로 한다. 도 1은 본 발명의 일 실시예에 따른 만성 호흡기 질환의 급성 악화 예측용 애플리케이션이 구현된 예측 서버를 포함하는 전체 네트워크를 개략적으로 도시한 도면이고, 도 2는 예측 서버에서 내장된 만성 호흡기 질환의 급성 악화 예측용 애플리케이션의 개략적인 구성도이다. 도 3은 모델 구성부의 상세 구성도이다. 1 and 2, an application for predicting acute exacerbation of chronic respiratory disease according to an embodiment of the present invention will be described in detail. 1 is a diagram schematically showing an entire network including a prediction server in which an application for predicting acute exacerbation of chronic respiratory disease according to an embodiment of the present invention is implemented, and FIG. 2 is a diagram of a chronic respiratory disease embedded in a prediction server. This is a schematic diagram of an application for predicting acute deterioration. 3 is a detailed configuration diagram of the model configuration unit.

이하에서는 설명의 편의상 만성 호흡기 질환의 급성 악화 예측용 애플리케이션(210)이 예측 서버(200)에 내장되는 것을 일례로 기술하기로 한다. 그러나, 만성 호흡기 질환의 급성 악화 예측용 애플리케이션은 클라우드 서버 또는 다른 네트워크 서버 등에 내장되어 실행되고, 상기 애플리케이션과 연동된 사용자용 프로그램을 통해 사용자의 모바일 또는 유선 단말(180, 190)에 서비스할 수 있다. 또한, 도 2에 도시된 만성 호흡기 질환의 급성 악화 예측용 애플리케이션의 각 부는 예측 서버(200) 및 사용자 단말들(180, 190)에 분산되어 처리될 수 있다. Hereinafter, for convenience of description, an example in which the application 210 for predicting acute exacerbation of chronic respiratory disease is embedded in the prediction server 200 will be described as an example. However, an application for predicting acute exacerbation of chronic respiratory disease may be embedded in a cloud server or other network server, and executed, and may be serviced to a user's mobile or wired terminal 180 or 190 through a user program linked with the application. . In addition, each part of the application for predicting acute exacerbation of chronic respiratory disease illustrated in FIG. 2 may be distributed and processed in the prediction server 200 and the user terminals 180 and 190.

또한, 만성 호흡기 질환은 천식, 만성 폐쇄성 폐 질환(COPD),기관지 확장증, 간질성 폐질환, 만성 기관지염 및 폐기종 등일 수 있으나, 본 실시예에서는 환자가 천식 또는 COPD를 보유하고 있는 것으로 예시하기로 하며, 만성 호흡기 질환의 다른 병일 경우에는 각 서버(110 내지 170)가 해당 병에 적합한 데이터들을 제공한다. In addition, the chronic respiratory disease may be asthma, chronic obstructive pulmonary disease (COPD), bronchiectasis, interstitial lung disease, chronic bronchitis and emphysema, but in the present embodiment will be illustrated as having a patient with asthma or COPD For other diseases of chronic respiratory disease, each server 110-170 provides data suitable for that disease.

시스템(100)은 만성 호흡기 질환의 급성 악화 예측용 애플리케이션(210)을 이용하여 서버들에서 제공하는 의료 정보, 호흡기 관련 바이러스 정보, 기상 데이터, 소셜 데이터, 생활 패턴 정보, 위치 정보로부터 유추되는 대기질 정보를 수집하여 예측 모델을 구성하고, 이 예측 모델로 만성 호흡기 질환 중 어느 하나를 갖고 있는 개별 환자의 급성 악화를 예측한다. 다양한 서버들, 사용자 모바일 단말(180), 사용자 유선 단말(190) 및 애플리케이션이 내장되는 예측 서버(200)가 시스템(100)에 제공된다. 서버들은 의료 기록용 통합 서버(110), 병원 서버(120), 기상 데이터 서버(130), 바이러스 정보 서버(140), 소셜 데이터 서버(150), 생활 패턴 제공 서버(160) 및 위치 정보 서버(170)를 포함할 수 있다. 이들 구성요소는 유, 무선 접속에 의한 네트워크(195)를 통해 연결되고, 이들 구성요소 간의 데이터 교환은 네트워크(195)를 경유하여 수행된다. The system 100 uses the application 210 for predicting acute exacerbation of chronic respiratory disease, and the air quality inferred from medical information, respiratory virus information, weather data, social data, life pattern information, and location information provided by servers. Information is gathered to construct a predictive model, which predicts acute exacerbations of individual patients with any of chronic respiratory diseases. Various servers, a user mobile terminal 180, a user wired terminal 190, and a prediction server 200 in which an application is embedded are provided to the system 100. Servers are integrated server 110 for medical records, hospital server 120, weather data server 130, virus information server 140, social data server 150, life pattern providing server 160 and location information server ( 170). These components are connected via a network 195 by wired or wireless connection, and data exchange between these components is performed via the network 195.

도 1은 이해의 편의상 병원 서버(120), 소셜 데이터 서버(150)과 함께, 모바일 또는 유선 단말(180, 190)이 하나인 것으로 예시하고 있으나, 환자가 방문하는 모든 병원의 서버, 검색가능한 다양한 포털 사이트, 소셜 네트워크 서비스(SNS), 카페, 블로그 등의 복수의 소셜 데이터 서버 및 복수의 사용자 단말들을 압축하여 도시한 것에 불과하다.1 illustrates a hospital server 120 and a social data server 150 together with a mobile or wired terminal 180 or 190 as one for convenience of understanding, but the servers of all hospitals visited by a patient are various. It merely illustrates a plurality of social data servers and a plurality of user terminals such as a portal site, a social network service (SNS), a cafe, a blog, and the like.

의료 기록용 통합 서버(110)는 천식 또는 COPD 환자들의 병력 데이터, 임상 양상 데이터 및 유전자 데이터를 저장하는 모든 병원 서버로부터 환자 의료 정보들을 수집한다. 의료 기록용 통합 서버(110)는 각 병원마다 저장된 환자별 의료 정보를 통합적으로 관리하는 서버이며, 예컨대 국가 기관 또는 보험 회사 등의 서버일 수 있다. 한국의 경우, 의료 기록용 통합 서버(110)는 건강보험심사평가원(health insurance review & assessment service)일 수 있다. Integration server 110 for medical records collects patient medical information from all hospital servers that store history data, clinical picture data and genetic data of asthma or COPD patients. The integrated medical server 110 for medical records is a server that collectively manages patient-specific medical information stored in each hospital, and may be, for example, a server such as a national agency or an insurance company. In Korea, the integrated server 110 for medical records may be a health insurance review & assessment service.

병원 서버(120)는 개별 천식 또는 COPD 환자가 치료받기 위해 방문하는 병원에 설치된 서버이며, 환자가 병원으로부터 서비스받은 진료의 기록, 처방 약물, 입원 기록, 검사 기록, 문진 기록, 유전자 데이터 등을 저장할 수 있다. 병원 서버(120)는 구체적으로 개별 환자의 병력 데이터, 임상 양상 데이터 및 유전자 데이터를 저장한다. 환자를 치료하는 모든 병원 서버들은 이하에 기술된 병력 데이터들, 임상 양상 데이터들 및 유전자 데이터를 의료 기록용 통합 서버(110)에 전달한다. 또한, 병원 서버(120)는 병력 데이터들과 임상 양상 데이터들이 발생할 때마다 실제 급성 악화가 발생한지 여부를 기록하고, 상기 데이터들과 함께 의료 기록용 통합 서버(110)에 전송한다. 천식 또는 COPD 등의 만성 호흡기 질환 및 그 급성 악화는 개인의 유전자에 따라 발병될 확률이 증가되며, 급성 악화에 영향을 미치는 유전자 데이터 역시 예측 모델의 변수에 활용된다. The hospital server 120 is a server installed in a hospital where an individual asthma or COPD patient visits to be treated, and stores a record, prescription medication, hospitalization record, examination record, questionnaire record, genetic data, etc., which the patient has received from the hospital. Can be. The hospital server 120 specifically stores medical history data, clinical modal data, and genetic data of individual patients. All hospital servers treating the patient deliver the medical history data, clinical modal data and genetic data described below to the integrated server 110 for medical records. In addition, the hospital server 120 records whether or not the actual acute deterioration has occurred whenever the medical history data and clinical pattern data occur, and transmits the data to the integrated server 110 for medical records. Chronic respiratory diseases, such as asthma or COPD, and their acute exacerbations increase the likelihood of developing according to individual genes, and genetic data that affect acute exacerbations are also used as variables in predictive models.

병력 데이터는 천식, COPD 혹은 이들의 급성 악화로 인하여 서비스받은 진단, 처방, 치료에 관한 데이터서로서, 천식 또는 COPD 약물 복용 및 복용 빈도, 응급실 이용 빈도, 입원 빈도, 요양 일수, 진료 일수, 천식 또는 COPD 급성 악화 발생 빈도, 관련 증상 여부, 동반 질환 여부 등을 포함할 수 있다. Medical history data is data about diagnoses, prescriptions, and treatments that have been serviced by asthma, COPD, or their acute exacerbations, such as frequency of taking and taking asthma or COPD drugs, frequency of emergency room use, hospitalization, days of care, days of care, asthma or The frequency of acute exacerbation of COPD may include the presence of related symptoms and the presence of comorbid conditions.

응급실 이용 빈도는 과거 1년 또는 2년 동안에 발생한 횟수일 수 있으며, 입원 빈도, 요양 일수, 진료 일수, 급성 악화 발생 빈도 역시 과거 1년 또는 2년 동안에 발생한 데이터일 수 있다. The frequency of emergency room use may be the number of occurrences in the past year or two years, and the frequency of hospitalizations, the number of days of care, the number of days of treatment, and the frequency of acute exacerbations may also be data from the past one or two years.

관련 증상은 과거 1년 또는 2년 동안 천식 또는 COPD로 인하여 직접적으로 유발되는 증상으로서 COPD 또는 천식의 발생 등일 수 있으며, 동반 질환은 과거 1년 또는 2년 동안 이들로 인하여 발생되는 다른 질환으로서 허혈성심질환, 폐암, 골다공증, 우울증, 관절염, 당뇨병, 위식도 역류, 기흉, 심부전, 고혈압, 빈혈, 대사증후군 중 적어도 하나를 포함할 수 있다. Related symptoms are symptoms that are directly caused by asthma or COPD for the past year or two years, such as the occurrence of COPD or asthma, and the accompanying disease is an ischemic heart disease caused by them for the past one or two years. And lung cancer, osteoporosis, depression, arthritis, diabetes, gastroesophageal reflux, pneumothorax, heart failure, hypertension, anemia, metabolic syndrome.

환자의 상태에 따라 처방되는 약제가 상이하므로, 천식 또는 COPD 급성 악화가 발생할지 여부는 약제에 의해 판단될 수 있다. 이에 따라, 천식 또는 COPD 약물 복용은 서로 상이한 성분의 복수 약제이며, 복용 빈도는 약제 별 복용 빈도일 수 있다. 구체적으로 COPD인 경우에, 약제는 LAMA(Long Acting Muscarinic Antagonist), SABA(Short Acting Beta-2 Agonist), SAMA(Short Acting Muscarinic Antagonist), 티오필린(theophylin), ICSLABA, LABA, 시스브론크(sysbronch), SAMASABA, LABALAMA일 수 있다. Since the medications prescribed vary depending on the condition of the patient, it may be determined by the medication whether asthma or COPD acute exacerbation will occur. Accordingly, taking asthma or COPD drugs is a plurality of drugs of different components, and the dosage frequency may be the frequency of each drug. Specifically, in the case of COPD, the drug may be a long acting muscarinic antagonist (LAMA), a short acting beta-2 agonist (SABA), a short acting muscarinic antagonist (SAMA), a thiophylin, a ICSLABA, a LABA, or a sysbronch. ), SAMASABA, LABALAMA.

임상 양상 데이터는 병원에서 천식 또는 COPD 환자에게 실시한 검사 및 문진에 대한 자료로서, 평균 흡연량, 흡연 이력, 1초 노력성 호기량(forced expiratory volume in 1 second; FEV1%), 환자 자가 진단의 질의에 대한 응답을 계량화한 CAT(COPD Assessment Test) 중 적어도 하나를 포함할 수 있다. 상세하게는 CAT는 환자가 일상에서 기침, 가래, 가슴 답답함, 심리적인 상태 등과 관련된 질의에 대하여 수치화된 형태로 응답함으로써 집계된다.Clinical picture data are data on examinations and questionnaires performed in hospitals for patients with asthma or COPD, including average smoking volume, smoking history, forced expiratory volume in 1 second (FEV1%), and questions about patient self-diagnosis. It may include at least one of a COPD Assessment Test (CAT) that quantifies the response. Specifically, CAT is aggregated by the patient responding in a quantified form to questions related to coughing, sputum, chest tightness, psychological status, and the like in everyday life.

이후에 상세히 설명하겠으나, 의료 기록용 통합 서버(110)에서 관리되는 병력 데이터, 임상 양상 데이터 및 유전자 데이터의 전술한 사항들은 애플리케이션에 의해 예측 서버(200)로 수집되며, 애플리케이션에서 구현되는 예측 모델, 제 1 모형의 계수 추정식 또는 제 2 모형 등에 이용하기 위해 생성되는 명세서의 변수들로 구성된다. 명세서는 예측 서버(200)에서 생성되는 시점을 발생 시점으로 채용하지 않고, 병력 또는 임상 양상 데이터의 변수들이 병원 등의 의료 기관에서 기록한 시점을 발생 시점으로 채용한다. As will be described in detail later, the above-described matters of medical history data, clinical modal data, and genetic data managed by the integrated server 110 for medical records are collected by the application to the prediction server 200, and the prediction model implemented in the application, It consists of variables in the specification generated for use in coefficient estimation equation of the first model or the second model. The specification does not employ a time point generated by the prediction server 200 as a generation time point, and employs a time point when the variables of the medical history or clinical pattern data are recorded by a medical institution such as a hospital.

기상 데이터 서버(130)는 천식 또는 COPD 급성 악화에 영향을 줄 수 있는 기상 정보를 애플리케이션(210)에 전송할 수 있다. The weather data server 130 may transmit weather information to the application 210 that may affect asthma or COPD acute exacerbations.

기상 데이터 서버(130)는 의료 기록용 통합 서버(110)로부터 접수된 병력 데이터 또는 임상 양상 데이터와 관련된 변수들의 발생 시점보다 소정 기간 전에 수집되는 데이터로서, 미세 먼지 누적량, 최저 기온 누적량, 최저 기온 변화량, 강수량 누적량 중 적어도 하나를 포함할 수 있다. 예를 들면, 미세 먼지 누적량, 최저 기온 변화량, 강수량 누적량, 최고 기온 누적량, 최고 기온 변화량, 일교차, 일별 최대 풍속, 평균 습도 및 일조량 등은 병력 또는 임상 양상 데이터에서의 변수들의 발생 시점 전 며칠 전에 발생한 데이터이며, 병력 또는 임상 양상 데이터의 발생 시점과 상응하는 명세서에 변수들 형태로 첨가될 수 있다. The meteorological data server 130 is data collected before a point of time of occurrence of variables related to medical history data or clinical pattern data received from the integrated medical server 110 for medical records, and includes a fine dust accumulation amount, a minimum temperature accumulation amount, and a minimum temperature change amount. It may include at least one of the cumulative amount of precipitation. For example, fine dust accumulation, minimum temperature change, precipitation accumulation, maximum temperature accumulation, maximum temperature change, daily crossover, daily maximum wind speed, average humidity and sunshine amount occur a few days before the onset of variables in the history or clinical pattern data. Data and may be added in the form of variables to the specification corresponding to the time of occurrence of the history or clinical picture data.

바이러스 정보 서버(140)는 천식 또는 COPD 급성 악화에 영향을 미칠 수 있는 호흡기 바이러스 데이터를 애플리케이션(210)에 전송한다. The virus information server 140 transmits respiratory virus data to the application 210 that may affect asthma or COPD acute exacerbations.

호흡기 바이러스 데이터는 환자의 병력 데이터 또는 임상 양상 데이터와 관련된 변수들의 발생 시점보다 소정 기간(예컨대, 1주 또는 4주) 전에 수집되는 바이러스 검출률이며, 아데노 바이러스(ADenoVirus; 'ADV'), 파라인플루엔자 바이러스(ParaInfluenza Virus; 'PIV'), 호흡기세포융합 바이러스(Respiratory Syncytial Virus; 'RSV'), H1N1pdm 바이러스, 인간 코로나 바이러스(human CoronaVirus; 'hCoV'), 인간 라이노 바이러스(human Rhion Virus; 'hRV'), 인간 보카 바이러스(human BOcaVirus; 'hBOV'), hEV 바이러스 및 IFV 바이러스 중 적어도 하나일 수 있다. Respiratory virus data is the rate of virus detection collected a predetermined time period (e.g., one or four weeks) prior to the onset of variables associated with the patient's medical history or clinical manifestation data, such as adenovirus (ADV), parainfluenza virus. ParaInfluenza Virus ('PIV'), Respiratory Syncytial Virus ('RSV'), H1N1pdm virus, human CoronaVirus ('hCoV'), human Rhion Virus ('hRV') , Human BOcaVirus ('hBOV'), hEV virus and IFV virus.

소셜 데이터 서버(150)는 검색가능한 다양한 포털 사이트일 수 있으며, 사용자 단말들(180, 190)의 상호 간에 행해지는 디지털 액티비티(activity), 예를 들면 메시지 교환, 텍스트, 이미지, 비디오, 오디오 및 각종 데이터 의 공유 등을 구현할 수 있는 서비스 사이트일 수 있다. 예컨대, 서비스 사이트는 소셜 네트워크 서비스(SNS), 카페, 블로그 등일 수 있다. The social data server 150 may be a variety of searchable portal sites, where digital activities, such as message exchange, text, images, video, audio, and the like, are performed between the user terminals 180 and 190. It may be a service site that can implement data sharing. For example, the service site may be a social network service (SNS), a cafe, a blog, or the like.

소셜 데이터 서버(150)는 빅데이터로 교환되는 검색 엔진 등에서 이루어지는 천식 또는 COPD 급성 악화와 관련된 소셜 데이터를 예측 서버(200)에 전송할 수 있다. 소셜 데이터 서버(150)는 환자의 병력 데이터와 관련된 변수들의 발생 시점을 포함하는 소정 기간에 수집되는 소셜 데이터로서, 검색 엔진 및 소셜 데이터에서 발견되는 만성 호흡기 질환 관련 용어, 이에 의한 증상 관련 용어, 이에 따른 질환 관련 용어의 빈도일 수 있다. 관련 용어는 예를 들면, COPD, 천식, 폐기종, 만성기관지염, 호흡 곤란, 급성 악화, 독감, 숨차는 증상 등일 수 있다.The social data server 150 may transmit social data related to acute exacerbation of asthma or COPD generated in a search engine exchanged with big data to the prediction server 200. The social data server 150 is social data collected at a predetermined time period including a time point of occurrence of variables related to the medical history data of the patient, and the term related to chronic respiratory disease, and thus, related to symptoms found in search engines and social data, Frequency of disease related terms. Related terms may be, for example, COPD, asthma, emphysema, chronic bronchitis, shortness of breath, acute exacerbation, flu, breathing symptoms, and the like.

호흡기 바이러스 데이터 및 검색 관련 소셜 데이터는 병력 또는 임상 양상 데이터의 발생 시점과 상응하는 명세서에 변수들 형태로 첨가될 수 있다.Respiratory virus data and search-related social data may be added in the form of variables to the specification corresponding to the time of occurrence of the history or clinical profile data.

만성 호흡기 질환 관련 트렌드 데이터를 이의 급성 악화 예측에 사용하면, 다른 환자의 추이 및 관심도를 간접적으로 파악하여 환자 자신의 급성 악화 가능성을 보다 정확하게 예측할 수 있다.Trend data related to chronic respiratory disease can be used to predict acute exacerbation of a patient and indirectly understand the trends and interests of other patients to more accurately predict the likelihood of acute exacerbation of the patient himself.

생활 패턴 제공 서버(160)는 의료 기록용 통합 서버(110)로부터 접수된 병력 데이터 또는 임상 양상 데이터와 관련된 변수들의 발생 시점보다 소정 기간 전에 수집되는 데이터로서, 환자들의 카드 구매 정보들을 포함할 수 있다. 예를 들면, 생활 패턴 제공 서버(160)는 개별 환자가 사용하는 카드사의 서버 등일 수 있으며, 특히 급성 악화에 큰 영향을 미치는 담배, 술 등의 구매일시, 구매량, 구매 빈도 등이 카드 구매 정보로부터 추출될 수 있다. The life pattern providing server 160 is data collected before a generation time of variables related to medical history data or clinical pattern data received from the integrated medical server 110 for medical records, and may include card purchase information of patients. . For example, the life pattern providing server 160 may be a server of a card company used by an individual patient. In particular, the purchase date, amount, and frequency of purchase of cigarettes and alcohol, which greatly affect acute deterioration, may be obtained from the card purchase information. Can be extracted.

위치 정보 서버(170)는 개별 환자의 모바일 단말(180)으로부터 수신되는 GPS 데이터로부터 유추되는 정보들을 제공하는 통신사 또는 대기질 정보 서비스사등 일 수 있다. 상기 유추되는 정보는 급성 악화 발생 여부에 대한 요청시까지 추적되는 환자 위치에 따른 대기질 정보, 예컨대 환자가 실제 위치한 특정 장소에서의 미세 먼지, 초미세 먼지 등의 데이터일 수 있다. The location information server 170 may be a communication company or an air quality information service company that provides information inferred from GPS data received from the mobile terminal 180 of an individual patient. The inferred information may be air quality information according to the location of the patient being tracked until a request for acute deterioration occurs, for example, data such as fine dust and ultrafine dust at a specific place where the patient is actually located.

상술한 카드 구매 정보 및 대기질 정보는 병력 또는 임상 양상 데이터에서의 변수들의 발생 시점까지 발생한 데이터이며, 병력 또는 임상 양상 데이터의 발생 시점과 상응하는 명세서에 변수들 형태로 첨가될 수 있다. The card purchase information and air quality information described above are data generated until the occurrence of the variables in the medical history or clinical manifestation data, and may be added in the form of variables to the specification corresponding to the occurrence of the medical history or clinical manifestation data.

사용자 모바일 및 유선 단말들(180, 190)은 COPD 환자가 외부와 통신 가능한 단말로서, 예를 들면 셀룰러(cellular) 폰, 스마트 폰, 태블릿 컴퓨터, 랩탑(laptop) 컴퓨터 또는 데스크탑(desktop) 컴퓨터일 수 있다. User mobile and wired terminals 180 and 190 are terminals with which the COPD patient can communicate with the outside, for example, may be a cellular phone, a smart phone, a tablet computer, a laptop computer or a desktop computer. have.

예측 서버(200)는 만성 호흡기 질환의 급성 악화 예측용 애플리케이션(210)을 내장한다. 애플리케이션(210)은 급성 악화의 예측을 위한 예측 모델과 함께, 각 서버로부터 입력된 환자 별 명세서들의 변수들에 근거하여 예측 모델의 후보군인 제 1 모형 및 제 2 모형을 포함한다. 제 1 모형은 각 서버로부터의 변수들로부터 선택된 변수들과 상기 선택된 변수들에 결합되는 파라미터 계수들을 포함하는 방정식으로 설정되며, 제 2 모형은 상기 변수들의 적어도 일부를 포함하는 머신 러닝(machine learning) 알고리즘으로 구성된다. Prediction server 200 has an application 210 for predicting acute exacerbation of chronic respiratory disease. The application 210 includes a first model and a second model, which are candidate groups of the prediction model, based on variables of patient-specific specifications input from each server, together with a prediction model for predicting acute deterioration. The first model is set up with an equation comprising variables selected from the variables from each server and parameter coefficients coupled to the selected variables, and the second model includes at least some of the variables. It consists of an algorithm.

구체적으로, 예측 서버(200)는 천식 또는 COPD 등의 만성 호흡기 질환을 가진 환자의 급성 악화 발생 여부에 대한 예측 요청을 접수하고 사용자의 인증을 수행하는 입력부(220), 수집부(230), 모델 구성부(240), 분석부(250), 갱신부(260), 제공부(270) 및 저장부(280)를 포함할 수 있다. In detail, the prediction server 200 receives a prediction request regarding whether acute deterioration occurs in a patient having a chronic respiratory disease such as asthma or COPD, and inputs 220, a collecting unit 230, and a model to authenticate a user. The component 240, the analyzer 250, the updater 260, the provider 270, and the storage 280 may be included.

수집부(230)는 각 서버로부터 예측 모델을 구축하기 위한 변수들과 예측 요청한 환자와 관련된 변수들을 수집하여 변수들의 데이터 세트로 구성되는 명세서를 생성한다. Collecting unit 230 generates a specification consisting of a data set of variables by collecting the variables related to the prediction request patient and the variables for building the prediction model from each server.

구체적으로, 수집부(230)는 예측 모델을 구축하기 위해, 환자 의료 기록용 통합 서버(110)로부터 입수되는 모든 환자의 병력 데이터, 임상 양상 데이터 및 유전자 데이터의 변수들에 포함되는 것으로서, 환자의 만성 호흡기 질환 관련 약물 복용 및 복용 빈도, 응급실 이용 빈도, 입원 빈도, 진료 일수, 만성 호흡기 질환의 급성 악화 발생 빈도, 관련 증상 발생 빈도, 동반 질환 빈도, 흡연 이력, FEV1%, CAT 등을 입수하고, 변수들의 발생 시점마다 명세서를 생성할 수 있다. Specifically, the collection unit 230 is included in the variables of the patient's medical history data, clinical profile data and genetic data obtained from the integration server 110 for patient medical records in order to build a predictive model, Frequency of taking and taking drugs related to chronic respiratory disease, frequency of emergency room use, hospitalization, days of treatment, frequency of acute exacerbation of chronic respiratory disease, frequency of related symptoms, accompanying disease frequency, smoking history, FEV1%, CAT, etc. The specification may be generated for each occurrence of the variables.

또한, 수집부(230)는 예측 모델을 구축하기 위해, 기상 데이터 서버(130), 바이러스 정보 서버(140) 및 소셜 데이터 서버(150)로부터, 병력 데이터 등의 발생 시점 전후에 존재하는 기상 정보, 호흡기 바이러스 데이터 및 소셜 데이터를 해당 발생 시점의 명세서에 변수 형태로 첨부될 수 있다. In addition, the collection unit 230, from the weather data server 130, the virus information server 140 and the social data server 150, the weather information existing before and after the occurrence of the medical history data, to build a prediction model, Respiratory virus data and social data may be appended in the form of variables to the specification of the time of occurrence.

수집부(230)는 예측 모델을 구축하기 위해, 모든 환자의 명세서 별로 천식 또는 COPD 급성 악화의 실제 발생 여부도 수집할 수 있으며, 병력 데이터에 포함된 급성 악화 발생 빈도를 이용할 수도 있다. The collection unit 230 may collect whether or not asthma or COPD acute exacerbation actually occurs for each patient's specification in order to build a predictive model, and may use the acute exacerbation frequency included in the medical history data.

명세서는 병력 또는 임상 양상 데이터의 발생 시점마다 작성할 수도 있으나, 소정 기간 동안 서로 다른 변수들이 발생하면, 해당 기간 동안의 변수들을 그룹핑한 데이터 세트를 하나의 명세서로 작성할 수 있거나, 어떤 변수가 변화가 있는 시점에서 특정 변수의 변화가 없으면, 미변화된 변수는 어떤 변수의 발생 시점의 명세서에 재차 기록될 수 있다. The specification may be prepared for each occurrence of medical history or clinical manifestation data. However, if different variables occur during a certain period of time, a data set grouping the variables during the period may be prepared as one specification, or if any variable is changed. If there is no change of a particular variable at that point in time, the unchanged variable may be recorded again in the specification at the time of occurrence of any variable.

모든 환자의 명세서 별 데이터 세트와 급성 악화의 실제 발생 여부를 나타내는 실제 결과값이 모델 구성부(240)로 전달되어, 예측 모델이 설정된 경우, 수집부(230)는 특정 환자가 급성 악화의 예측 요청에 따라, 소정 조건에 부합하는 변수들을 각 서버로부터 수집하여 변수의 발생 시점마다 특정 환자의 명세서를 생성시킨다. Specification-specific data sets of all patients and actual results indicating whether actual acute deterioration has occurred are passed to the model component 240, and when a predictive model is set, the collection unit 230 requests that a specific patient predict the acute deterioration. According to the present invention, variables meeting specific conditions are collected from each server to generate a specification of a specific patient at each occurrence of the variable.

한편, 모델 구성부(240)는 제 1 및 제 2 모형들을 각각 수립하고, 이들 중 예측 정확도가 높은 모형을 예측 모델로 설정한다. On the other hand, the model configuration unit 240 establishes each of the first and second models, and sets a model with high prediction accuracy among them as a prediction model.

구체적으로, 모델 구성부(240)는 제 1 모형 수립부(242), 제 2 모형 수립부(244) 및 판정부(246)를 포함할 수 있다. In detail, the model constructer 240 may include a first model establisher 242, a second model establisher 244, and a determiner 246.

제 1 모형 수립부(242)는 각 변수마다 상응하여 소정 연산 형태로 결합하는 파라미터 계수들을 계수 추정식에 의해 추정한다. 제 1 모형 수립부(242)는 선택된 변수들과 이에 결합되는 파라미터 계수들을 포함하는 방정식으로 설정된다. The first model establishment unit 242 estimates the parameter coefficients corresponding to each variable in a predetermined calculation form by a coefficient estimation equation. The first model establishment unit 242 is set to an equation including the selected variables and parameter coefficients coupled thereto.

이 경우에, 제 1 모형은 각 변수마다 상응하여 결합되는 파라미터 계수를 이용하고 명세서의 데이터 세트를 구성하는 변수들을 입력값으로 채용한다. 제 1 모형은 급성 악화의 발생 여부를 나타내는 예측값이 변수들과 상호 연관되는(correlated) 것을 반영하는 모델링을 이용할 수 있으며, 예컨대 일반화된 추정 방정식(genearalized estimating equations)일 수 있다. 급성 악화가 발생한다는 예측 후에 급성 악화가 실제로 발생되면, 예측 결과값과 실제 결과값 등은 다음 변수들에 영향을 주므로, 예측 모델은 전술한 모델링을 사용할 필요가 있다. In this case, the first model uses parameter coefficients correspondingly combined for each variable and employs as inputs the variables that make up the data set of the specification. The first model may use modeling that reflects that the predictive value indicating whether acute deterioration has occurred is correlated with the variables, for example, may be generalized estimating equations. If acute deterioration actually occurs after the prediction that an acute deterioration occurs, the predictive model needs to use the above-described modeling since the predicted result value and the actual result value affect the following variables.

전술한 방정식에 의하면, 제 1 모형 수립부(242)는 모든 환자의 명세서 별 데이터 세트와 실제 결과값을 계수 추정식에 입력하여, 예측 모델에서 요구되는 파라미터 계수들을 추정할 수 있다. According to the above-described equation, the first model establishment unit 242 may estimate the parameter coefficients required in the prediction model by inputting the data set for each specification and the actual result value of every patient in the coefficient estimation equation.

제 1 모형 수립부(242)는 모든 환자의 명세서를 구성하는 변수들, 추정된 파라미터 계수들 및 실제 결과값을 분석하여 높은 예측력을 갖는 변수들과 파라미터 계수들을 결정할 수 있다. 이러한 분석은 실제 결과값들에 대한 영향도를 평가하는 것으로서, 파라미터 계수의 분산, 신뢰도, 정규분포 분석시 요구되는 Pr>|z| 등에 수행될 수 있다. The first model establishment unit 242 may determine variables and parameter coefficients having a high predictive power by analyzing variables, estimated parameter coefficients, and actual result values constituting all patient specifications. This analysis evaluates the influence on the actual results. Pr> | z | Or the like.

제 1 모형 수립부(242)는 모든 환자의 명세서를 구성하는 데이터 세트의 변수들 중 높은 예측력을 갖는 변수들과 파라미터 계수들을 결정할 수 있다. The first model establishment unit 242 may determine parameters and parameter coefficients having high predictive power among the variables of the data set constituting the specification of all patients.

전술한 제 1 모형의 수립 과정은 도 4를 통해 상세히 설명하기로 한다. The above-described process of establishing the first model will be described in detail with reference to FIG. 4.

또한, 제 2 모형 수립부(244)는 변수들의 적어도 일부를 포함하는 머신 러닝(machine learning) 알고리즘으로 설정된 제 2 모형을 수립한다. In addition, the second model establishment unit 244 establishes a second model set by a machine learning algorithm including at least some of the variables.

적어도 일부 변수들은 환자의 만성 호흡기 질환 관련 약물 복용, 복용 빈도, 응급실 이용 빈도, 입원 빈도, 진료 일수, 만성 호흡기 질환의 급성 악화 발생 빈도, 호흡기 바이러스 데이터를 포함한다. At least some variables include patients taking chronic respiratory disease related medications, frequency of use, frequency of emergency room use, hospitalization, days of treatment, frequency of acute exacerbations of chronic respiratory disease, and respiratory viral data.

구체적으로, 제 2 모형은 앙상블 분류 트리(tree) 알고리즘으로서, 랜덤 포레스트 등을 이용할 수 있으며, 제 2 모형의 자세한 수립 과정은 도 5를 통해 설명하기로 한다. Specifically, the second model is an ensemble classification tree algorithm, and a random forest may be used. A detailed process of establishing the second model will be described with reference to FIG. 5.

판정부(246)는 서버들(110 내지 170)로부터, 제 1 및 제 2 모형들 각각에서 선택된 동일 기간 내의 변수들과 이 기간 내의 만성 호흡기 질환의 급성 악화의 실제 발생 여부에 따른 실제 결과값들을 추출하여, 모형들에서의 급성 악화의 발생 여부를 나타내는예측 결과값과 실제 결과값들을 비교하고, 비교 결과 예측 결과값의 정확도가 높은 모형을 예측 모델로 설정한다. 이에 따라, 판정부(246)는 최종적인 예측 모델을 생성하고, 분석부(250)에서 요청되는 급성 악화의 예측에 이용될 수 있다. The determination unit 246 obtains, from the servers 110 to 170, actual results according to the variables within the same period selected in each of the first and second models and the actual occurrence of acute exacerbation of chronic respiratory disease within this period. The results are compared with the predicted results indicating the occurrence of acute deterioration in the models and the actual results, and the model with high accuracy of the comparison result prediction result is set as the prediction model. Accordingly, the determination unit 246 may generate a final prediction model and may be used to predict the acute deterioration requested by the analysis unit 250.

분석부(250)는 사용자 단말들(180,190)을 통한 환자의 예측 요청에 의해, 모델 구성부(240)에 제공된 예측 모델을 이용하여 급성 악화를 예측한다. 분석부(250)는 수집부(230)로부터 전달된 요청한 천식 또는 COPD 환자의 명세서에 기초하여 급성 악화의 예측 결과값을 산출한다. The analysis unit 250 predicts acute deterioration by using a prediction model provided to the model configuration unit 240 by a prediction request of the patient through the user terminals 180 and 190. The analyzer 250 calculates a predicted result of the acute exacerbation based on the specification of the requested asthma or COPD patient transmitted from the collector 230.

갱신부(260)는 예측 모델이 설정된 후에, 예측 요청된 환자의 실제 결과값과 예측 요청이 없는 환자의 급성 악화 결과값을 재분석하여 예측 모델을 갱신할 수 있다.After the prediction model is set, the updater 260 may update the prediction model by re-analyzing the actual result value of the patient who has been requested for prediction and the acute exacerbation result value of the patient who does not have a prediction request.

이와 관련하여, 수집부(230)는 의료 기록용 통합 서버(110) 및 저장부(280)에 예측된 환자의 실제 결과값을 접수하고, 의료 기록용 통합 서버(110), 기상 데이터 서버(130), 바이러스 정보 서버(140), 소셜 데이터 서버(150), 생활 패턴 제공 서버(160), 위치 정보 서버(170), 저장부(280)부터 모든 변수들 및 급성 악화의 실제 발생 여부에 따른 실제 결과값들을 수집하여 모든 환자 별 명세서들을 생성할 수 있다. In this regard, the collection unit 230 receives the actual results of the patient predicted in the integrated medical record server 110 and the storage unit 280, and the integrated medical record server 110, weather data server 130 for medical records ), From the virus information server 140, the social data server 150, the life pattern providing server 160, the location information server 170, and the storage unit 280, according to the actual occurrence of all variables and acute deterioration. The results can be collected to generate all patient-specific specifications.

갱신부(260)는 모델 구성부(240)에서 확정된 제 1 모형의 계수 추정식 또는 제 2 모형의 앙상블 분류 트리 알고리즘 등을 통해, 명세서의 변수들로 구성된 데이터 세트와 실제 결과값들에 기초하여 변수들에 파라미터 계수들 또는 가중치을 재추정할 수 있다. The updating unit 260 is based on a data set composed of variables of the specification and actual result values through a coefficient estimation equation of the first model or an ensemble classification tree algorithm of the second model determined by the model configuration unit 240. Parameter coefficients or weights can be reestimated to the variables.

갱신부(260)는 모델 구성부(240)에서 설명한 분석과 실질적으로 동일하게, 실제 결과값들에 대한 영향도를 평가하여 상기 실제 결과값들에 영향을 미치는 변수들과 함께, 파라미터 계수 또는 가중치들을 재결정할 수 있다. 갱신부(260)는 갱신된 예측 모델을 분석부(250)에 제공한다. The updater 260 evaluates the degree of influence on the actual result values in substantially the same way as the analysis described in the model constituent 240, together with the parameters affecting the actual result values, parameter coefficients or weights. You can re-determine them. The updater 260 provides the updated prediction model to the analyzer 250.

제공부(270)는 분석부(250)에서 수행된 예측 결과값을 환자의 단말들(160, 170), 의료 기록용 통합 서버(110) 및 병원 서버(120)에 적어도 하나에 제공한다. The provider 270 provides the predicted result value performed by the analyzer 250 to at least one of the terminal terminals 160 and 170, the medical record integration server 110, and the hospital server 120 of the patient.

저장부(280)는 분석부(250)에서 수행된 예측 결과값과 함께, 예측 모델에서 사용된 변수들과 함께, 파라미터 계수들 또는 가중치들을 저장하고, 예측 후에 급성 악화가 실제 발생하였는지 여부에 대한 실제 결과값을 저장할 수 있다. The storage unit 280 stores the parameter coefficients or weights together with the variables used in the prediction model together with the prediction result value performed by the analysis unit 250, and determines whether or not acute deterioration actually occurred after the prediction. You can save the actual result.

이하, 도 1 내지 도 4를 참조하여, 만성 호흡기 질환의 급성 악화 예측용 애플리케이션에서 수행되는 본 발명의 실시예인 제 1 모형을 수립하는 프로세스에 관하여 설명하기로 한다. 도 4은 제 1 모형을 수립하는 프로세스에 관한 흐름도이다. 1 to 4, a process for establishing a first model, which is an embodiment of the present invention performed in an application for predicting acute exacerbation of chronic respiratory disease, will be described. 4 is a flowchart of a process for establishing a first model.

먼저, 수집부(230)는 의료 기록용 통합 서버(110), 기상 데이터 서버(130), 바이러스 정보 서버(140), 소셜 데이터 서버(150), 생활 패턴 제공 서버(160), 위치 정보 서버(170)로부터 모든 천식 또는 COPD 환자들의 모든 변수들(Xij,p) 및 급성 악화 여부에 따른 실제 결과값(Yij)을 수집하여 모든 환자 별 명세서들(Xij)을 생성시킨다(S410). First, the collection unit 230 is a medical record integrated server 110, meteorological data server 130, virus information server 140, social data server 150, life pattern providing server 160, location information server ( From 170) all the variables (Xij, p) of all asthma or COPD patients and the actual result value (Yij) according to whether acute exacerbation is collected to generate all patient-specific specifications (Xij) (S410).

여기서, i는 환자 별 식별자이며, j는 환자별 명세서 식별자이고, Xij,p는 명세서 Xij의 데이터 세트를 구성하는 p번째 변수를 의미한다. Here, i is a patient-specific identifier, j is a patient-specific specification identifier, and Xij, p means a p-th variable constituting a data set of the specification Xij.

구체적으로, 수집부(230)는 환자 의료 기록용 통합 서버(110)로부터, 모든 환자의 병력 데이터, 임상 양상 데이터, 유전자 데이터의 변수들에 포함되는 천식 또는 COPD 환자의 약물 복용 및 복용 빈도, 응급실 이용 빈도, 입원 빈도, 진료 일수, 급성 악화 발생 빈도, 관련 증상 발생 빈도, 동반 질환 빈도, 흡연 이력, FEV1%, CAT 등을 입수하고, 변수들의 발생 시점마다 명세서를 생성한다. Specifically, the collection unit 230 from the integration server 110 for patient medical records, the history of all patients, clinical pattern data, the frequency of taking or taking the drug of asthma or COPD patients included in the variables of the genetic data, emergency room Use frequency, hospitalization frequency, number of days of treatment, frequency of acute exacerbation, frequency of related symptoms, frequency of concomitant diseases, smoking history, FEV1%, CAT, etc. are obtained, and a statement is generated for each occurrence of variables.

또한, 수집부(230)는 기상 데이터 서버(130), 바이러스 정보 서버(140) 및 소셜 데이터 서버(150), 생활 패턴 제공 서버(160), 위치 정보 서버(170)로부터, 병력 데이터 등의 발생 시점 전후에 존재하는 기상 정보, 호흡기 바이러스 데이터, 소셜 데이터, 카드 구매 정보 및 환자가 실제 위치한 장소의 대기질 정보를 해당 발생 시점의 명세서에 변수 형태로 첨부된다. In addition, the collection unit 230 is generated from the weather data server 130, virus information server 140 and social data server 150, life pattern providing server 160, location information server 170, such as medical history data Weather information, respiratory virus data, social data, card purchase information, and air quality information where the patient is actually located before and after the time point are attached to the specification at the time of occurrence in variable form.

수집부(230)는 모든 천식 또는 COPD 환자의 명세서 별로 급성 악화의 실제 발생 여부도 수집할 수 있으며, 병력 데이터에 포함된 급성 악화 발생 빈도를 이용할 수도 있다. The collection unit 230 may collect whether or not the actual occurrence of acute exacerbation by the specification of all asthma or COPD patients, and may use the frequency of acute exacerbations included in the medical history data.

다음으로, 제 1 모형 수립부(242)는 급성 악화의 예측 모델의 계수 추정식을 통해, 수집부(230)에서 생성된 명세서(Xij)의 변수들(Xij,p)로 구성된 데이테 세트와 실제 결과값(Yij)에 기초하여, 변수들(Xij,p)에 소정 연산 형태로 결합되는 파라미터 계수들(βp)를 추정한다(S420). 상세하게는 파라미터 계수(βp)는 각 변수(Xij,p)마다 상응하여 소정 연산 형태로 결합하는 계수이다. Next, the first model establishment unit 242 and the data set consisting of the variables (Xij, p) of the specification (Xij) generated by the collector 230 through the coefficient estimation formula of the predictive model of acute deterioration and Based on the actual result value Yij, the parameter coefficients βp coupled to the variables Xij, p in a predetermined operation form are estimated (S420). In detail, the parameter coefficient β is a coefficient that is combined for each variable Xij, p in a predetermined calculation form.

제 1 모형 수립부(242)에서 구축하는 제 1 모형은 급성 악화의 발생 여부를 나타내는 예측값과 변수들과 상호 연관되는(correlated) 것을 반영하는 모델링을 이용할 수 있으며, 본 실시예에서는 일반화된 추정 방정식(genearalized estimating equations)를 예로 든다.The first model constructed by the first model establishment unit 242 may use modeling that reflects correlated correlations with variables and predictive values indicating whether acute deterioration has occurred. In this embodiment, a generalized estimation equation Take (genearalized estimating equations) as an example.

전술한 방정식에 의하면, 제 1 모형 수립부(242)는 모든 환자의 명세서(Xij) 별 데이터 세트와 실제 결과값(Yij)을 계수 추정식에 입력하여, 제 1 모형에서 요구되는 파라미터 계수들(βp)을 추정할 수 있다.According to the above equation, the first model establishment unit 242 inputs the data set for each patient's specification (Xij) and the actual result value (Yij) to the coefficient estimating formula, and calculates the parameter coefficients required in the first model ( βp) can be estimated.

계수 추정식은 하기 식 1에 따르며, 파리미터 계수들(βp)은 식 1을 0으로 만드는 값을 구함으로써 추정된다. The coefficient estimating formula is in accordance with Equation 1 below, and the parameter coefficients βp are estimated by finding a value that makes Equation 1 zero.

[식 1]

[Equation 1]

(i는 환자별 식별자이고, j는 환자별 명세서 식별자이며,

는 i번째 환자의 j번째의 명세서로서변수들의 데이터 세트(

)로 구성되며,

는 명세서

Is the jth specification of the i patient and the data set of variables (

),

Specification

According to whether the actual acute exacerbation occurred

β는 변수들과 상관되는 파라미터 계수들의 세트(

)이고,

이고,

이며,

(평균)이고, β is the set of parameter coefficients that correlate

)ego,

ego,

Is,

(Average)

이며,

(분산)인 경우,

이고,

Is,

(Dispersion),

ego,

계속해서, 제 1 모형 수립부(242)는 실제 결과값들(Yij)에 대한 영향도를 평가하여 실제 결과값들(Yij)에 영향을 주는 변수들(Xij,p)과 파라미터 계수들(βp)를 결정한다(S430). Subsequently, the first model establishment unit 242 evaluates the degree of influence on the actual result values Yij and the variables Xij, p and parameter coefficients βp that affect the actual result values Yij. Determine (S430).

이러한 평가 분석은 실제 결과값들(Yij)에 대한 영향도를 평가하는 것으로서, 파라미터 계수의 분산, 신뢰도, 정규분포 분석시 요구되는 Pr>|z| 등에 수행될 수 있다.This evaluation analysis evaluates the influence on the actual results (Yij). Pr> | z | Or the like.

본 발명자는 평가 분석을 통해, 천식 또는 COPD 급성 악화에 높은 영향을 주는 변수들이 환자의 천식 또는COPD 약물 복용 및 복용 빈도, 응급실 이용 빈도, 입원 빈도, 진료 일수, 급성 악화 발생 빈도를 포함하는 병력 데이터와 함께, 호흡기 바이러스 데이터인 것을 확인하였다. Through an analysis of the assessment, the inventors have reviewed medical history data in which variables that have a high impact on acute asthma or COPD acute exacerbations include patients' frequency of taking and taking asthma or COPD drugs, frequency of emergency room use, hospitalization, days of care, and frequency of acute exacerbations. Together with the respiratory virus data.

호흡기 바이러스 데이터는 환자의 병력 데이터와 관련된 변수들의 발생 시점보다 소정 기간 전에 수집되는 바이러스 검출률이며, ADV, PIV, RSV, H1N1pdm 바이러스, hCoV, hRV, hBOV 및 hEV 바이러스 중 적어도 하나와 관련된다. Respiratory virus data is the rate of virus detection collected a predetermined time before the onset of variables related to the patient's medical history data and is associated with at least one of ADV, PIV, RSV, H1N1pdm virus, hCoV, hRV, hBOV and hEV virus.

다음으로, 제 1 모형 수립부(242)는 결정된 변수들(Xij,p)과 파라미터 계수들(βp)로 구성된 예측 모델을 최종 설정하고, 분석부(250)에 제공한다(S440). Next, the first model establishment unit 242 finally sets a prediction model composed of the determined variables Xij and p and the parameter coefficients βp and provides the analysis model 250 to the analysis unit 250 (S440).

예측 모델은 하기 식 2이며, 식 1에서 설명한 바와 같이, β는 파라미터 계수들(βp)을 구성하는 세트 형태이고, A는 천식 또는 COPD 급성 악화의 예측 결과값이다. The prediction model is Equation 2 below, and as described in Equation 1, β is in the form of a set of parameter coefficients βp, and A is a predicted result of asthma or COPD acute exacerbation.

[식 2]

[Equation 2]

본 실시예에 따르면, 천식 또는 COPD 급성 악화에 영향을 주는 변수들에 대하여 평가하여 실질적으로 급성 악화를 유발할 가능성이 높은 변수들에 기초하여 예측함으로써, 예측 정확도를 향상시킬 수 있다. 더욱이, 유발 가능성이 높은 변수들 중에서도 이와 상관되는 파라미터 계수들을 조정함으로써 예측에 활용되는 변수들의 영향도를 더욱 정밀하게 분석함으로써 급성 악화의 예측이 정확하게 될 수 있다.According to the present embodiment, prediction accuracy may be improved by evaluating variables affecting asthma or COPD acute exacerbation based on variables that are likely to cause acute exacerbation substantially. Furthermore, the prediction of acute deterioration can be accurate by analyzing the influence of variables used in the prediction more precisely by adjusting the parameter coefficients that are correlated with among the more likely variables.

또한, 검색 엔진, 및 SNS, 카페, 블로그 등의 개인화된 소셜 데이터로부터 검출되는 만성 호흡기 질환 관련 용어, 이에 의한 증상 관련 용어, 이에 따른 질환 관련 용어의 빈도와 소셜 데이터의 트렌드를 예측 모델의 변수로 최종적으로 포함하는 경우에, 다른 만성 호흡기 질환의 환자의 추이를 간접적으로 파악하여 급성 악화를 보다 정확하게 예상할 수 있다.In addition, the frequency of the chronic respiratory disease related term, the symptom related term, the related disease term, and the trend of social data detected from search engines and personalized social data such as SNS, cafe, blog, etc. are used as variables of the prediction model. When finally included, acute exacerbation can be predicted more accurately by indirectly identifying the trend of patients with other chronic respiratory diseases.

이에 더하여, 개별 환자의 유전자 정보, 담배, 술 등에 대한 환자의 구매를 통한 생활 패턴 정보 및 환자 실제 위치한 장소의 대기질 장소도 급성 악화의 제 1 모형에 이용됨으로써, 예측 모델이 제 1 모형으로 활용되는 경우, 급성 악화 예측의 정확도가 향상될 수 있다. In addition, the genetic information of individual patients, lifestyle information through purchases of cigarettes, alcohol, etc., and air quality locations of actual places where patients are located are also used in the first model of acute deterioration, so that the predictive model is used as the first model. If so, the accuracy of the acute exacerbation prediction can be improved.

이하, 도 1 내지 도 3 및 도 5를 참조하여, 만성 호흡기 질환의 급성 악화 예측용 애플리케이션에서 수행되는 본 발명의 실시예인 제 2 모형을 수립하는 프로세스에 관하여 설명하기로 한다. 도 5는 제 2 모형을 수립하는 프로세스에 관한 흐름도이다. 본 실시예에서, 서버들(110 내지 170)로부터의 변수들, 급성 악화 여부에 따른 실제 결과값, 이들을 데이터 세트 형태로 구성한 명세서들은 도 4의 실시예에서 설명한 참조부호를 차용하기로 한다. 또한, 본 실시예에서 구축되는 제 2 모형이 머신 러닝의 기법 중 앙상블 분류 트리 알고리즘으로서 랜덤 포레스트인 것으로 예시한다.1 to 3 and 5, a process of establishing a second model, which is an embodiment of the present invention performed in an application for predicting acute exacerbation of chronic respiratory disease, will be described. 5 is a flow diagram of a process for establishing a second model. In the present embodiment, the variables from the servers 110 to 170, the actual result values according to acute deterioration, and the specifications configuring them in the form of data sets will be referred to by the reference numerals described in the embodiment of FIG. In addition, it is illustrated that the second model constructed in this embodiment is a random forest as an ensemble classification tree algorithm among the techniques of machine learning.

먼저, 수집부(230)는 서버들(110 내지 170)로부터 모든 천식 또는 COPD 환자들의 모든 변수들(Xij,p)및 급성 악화 여부에 따른 실제 결과값(Yij)을 수집하여 모든 환자 별 명세서들(Xij)을 생성시킨다(S510). First, the collection unit 230 collects all the variables (Xij, p) of all asthma or COPD patients and the actual result value (Yij) according to whether or not acute exacerbation from the servers (110 to 170) to all patient-specific specifications (Xij) is generated (S510).

서버들(110 내지 170)로부터 수집되는 변수들(Xij,p)은 도 4의 실시예에서 설명한 것과 동일하여 상세 설명은 생략한다. Variables (Xij, p) collected from the servers (110 to 170) are the same as described in the embodiment of Figure 4 and will not be described in detail.

다음으로, 제 2 모형 수립부(244)는 모든 변수들(Xij,p) 및 상기 변수들마다 대응하는 실제 결과값들(Yij)로부터 랜덤하게(randomly) 복원 추출하여 복수의 샘플 데이터 세트들을 생성한다(S520). 본 실시예에서는 샘플 데이터 세트마다 분류 트리를 생성하고, 복수의 분류 트리 중 최적의 트리만 선택한다. Next, the second model establishment unit 244 generates a plurality of sample data sets by randomly restoring and extracting all the variables Xij, p and the corresponding actual result values Yij for each of the variables. (S520). In this embodiment, a classification tree is generated for each sample data set, and only an optimal tree is selected from among a plurality of classification trees.

계속해서, 제 2 모형 수립부(244)는 하나의 샘플 데이터 세트의 변수들로부터 독립 변수들을 랜덤하게 선택한다(S530)Subsequently, the second model establishment unit 244 randomly selects independent variables from variables of one sample data set (S530).

이어서, 제 2 모형 수립부(244)는 선택된 독립 변수들 중 실제 결과값들(Yij)을 가장 잘 분류하는 독립 변수들의 탐색 및 대응 가중치들의 산출한다(S540). Subsequently, the second model establishment unit 244 searches for independent variables that best classify actual result values Yij among the selected independent variables and calculates corresponding weights (S540).

제 2 모형 수립부(244)는 탐색된 독립 변수들 중 분류의 소정 방식에 부합하는 1 번째 세대의 노드 및 이로부터 분기되는 2 번째 세대의 노드와 관련된 독립 변수를 선정하는 노드 분리를 수행하며, 분류 트리를 생성한다(S550). The second model establishment unit 244 performs node separation for selecting an independent variable related to a node of the first generation and a second generation node branching therefrom among the searched independent variables, A classification tree is generated (S550).

다음으로, 제 2 모형 수립부(244)는 분류 트리가 기 설정된 세대에 도달하지 않았으면, 도달할 때까지 S530 내지 S550의 단계를 반복하여 분류 트리를 생성한다(S560). Next, if the classification tree does not reach the preset generation, the second model establishment unit 244 generates the classification tree by repeating steps S530 to S550 until reaching the preset generation (S560).

하나의 분류 트리가 생성된 후에, 제 2 모형 수립부(244)는 분류 트리가 기 설정된 개수로 생성되지 않은 경우, 복수의 샘플 데이터 세트 중 상기 단계들 S530 내지 S560이 진행되지 않은 세트들에 대해 S530 내지 S560 단계들을 반복한다(S570). After one classification tree is generated, if the classification tree is not generated in a predetermined number, the second model establishment unit 244 may be configured for the sets for which the steps S530 to S560 of the plurality of sample data sets are not performed. Steps S530 to S560 are repeated (S570).

다음으로, 분류 트리가 기 설정된 개수로 생성된 경우, 제 2 모형 수립부(244)는 복수의 분류 트리를 구조 유사도 별로 분류하여 개수를 파악하고, 최대 개수의 분류 트리를 제 2 모형으로 수립한다(S580). Next, when the classification tree is generated with a predetermined number, the second model establishment unit 244 classifies the plurality of classification trees by structural similarity, identifies the number, and establishes the maximum number of classification trees as the second model. (S580).

이와 같이 수립된 제 2 모형은 만성 호흡기 질환의 급성 악화에 영향을 미치는 정도가 높아 선별된 변수들(Xij,p)과 이에 결합되는 가중치들로 구성된 머신 러닝 알고리즘으로 구현된다. 선별된 변수들(Xij,p)은 제 1 모형과 마찬가지로, 환자의 만성 호흡기 질환의 약물 복용 및 복용 빈도, 응급실 이용 빈도, 입원 빈도, 진료 일수, 급성 악화 발생 빈도를 포함하는 병력 데이터와 함께, 호흡기 바이러스 데이터를 적어도 포함한다. The second model established as described above is implemented with a machine learning algorithm composed of selected variables (Xij, p) and weights associated with them because they have a high degree of influence on acute exacerbation of chronic respiratory disease. The screened variables (Xij, p), like the first model, together with medical history data including the frequency of drug use and dose of patients with chronic respiratory disease, the frequency of emergency room use, the frequency of hospitalization, the number of days of treatment, and the frequency of acute exacerbations, At least respiratory virus data.

이하, 도 1 내지 도 3 및 도 6을 참조하여, 만성 호흡기 질환의 급성 악화 예측용 애플리케이션에서 수행되는 본 발명의 실시예인 예측 모델을 확정하는 과정에 대해 설명하기로 한다. 도 6은 예측 모델을 판정하는 프로세스에 관한 흐름도이다. Hereinafter, referring to FIGS. 1 to 3 and 6, a process of determining a prediction model that is an embodiment of the present invention performed in an application for predicting acute exacerbation of chronic respiratory disease will be described. 6 is a flowchart of a process for determining a prediction model.

판정부(246)는 환자 별 명세서들(Xij)로부터 제 1 및 제 2 모형에서 결정된 소정 기간 내의 변수들 (Xij, p)과 이에 대응하는 실제 결과값들(Yij)을 추출한다(S610). The determination unit 246 extracts the variables Xij and p within the predetermined period determined in the first and second models from the patient-specific specifications Xij and the actual result values Yij corresponding thereto (S610).

다음으로, 판정부(246)는 소정 기간 내의 변수들 (Xij, p)에 따른 제 1 및 제 2 모형의 예측 결과값과 실제 결과값들(Yij)을 비교한다(S620). Next, the determination unit 246 compares the prediction result values and the actual result values Yij of the first and second models according to the variables Xij and p within a predetermined period (S620).

이어서, 판정부(246)는 비교 결과, 양 모형 중 예측 결과값의 정확도가 높은 모형을 예측 모델로 최종 설정한다(S630). Subsequently, the determination unit 246 finally sets, as a prediction model, a model having high accuracy of the prediction result value between the two models (S630).

이에 의하면, 유발 가능성이 높은 변수들과 함께, 이와 상관되는 파라미터 계수들의 방정식으로 구성된 제 1 모형과 머신 러닝 알고리즘으로 구성된 제 2 모형 중 정확한 예측을 나타내는 모형을 예측 모델로 설정함으로써 급성 악화의 예측이 정확하게 수행될 수 있다. According to this, the prediction model of acute exacerbation is established by setting the predictive model as the predictive model among the first model composed of equations of the parameter coefficients correlated with the most likely variables and the second model composed of machine learning algorithms. Can be performed correctly.

이하, 도 1 내지 도 3 및 도 7을 참조하여, 만성 호흡기 질환의 급성 악화 예측용 애플리케이션에서 수행되는 본 발명의 실시예인 급성 악화를 예측하는 프로세스에 관하여 설명하기로 한다. 도 7은 개별 환자의 요청에 따라 만성 호흡기 질환의 급성 악화를 예측하는 프로세스에 관한 흐름도이다. 1 to 3 and 7, a process for predicting acute exacerbation, which is an embodiment of the present invention performed in an application for predicting acute exacerbation of chronic respiratory disease, will be described. 7 is a flow diagram of a process for predicting acute exacerbation of chronic respiratory disease at the request of an individual patient.

먼저, 입력부(220)는 예측을 희망하는 만성 호흡기 질환 보유 환자의 급성 악화 발생의 여부를 요청하는 것을 접수한다(S710).First, the input unit 220 receives a request for whether or not acute deterioration occurs in a patient having a chronic respiratory disease who wishes to predict (S710).

이어서, 수집부(230)는 예측 요청한 환자와 관련된 변수들 중 예측 모델에 요구하는 변수들을 각 서버(110 내지 170)로부터 수집하여 변수의 발생 시점마다 해당 환자의 명세서를 생성시킨다(S720).Subsequently, the collection unit 230 collects variables required for the prediction model among the variables related to the patient who has requested the prediction from each server 110 to 170 and generates a specification of the patient at each occurrence of the variable (S720).

복수의 명세서를 구성하는 변수들의 데이터 세트는 도 4 또는 도 5와 실질적으로 동일하게 구성된다. The data set of variables constituting the plurality of specifications is configured substantially the same as in FIG. 4 or FIG. 5.

다음으로, 분석부(250)는 최종 설정된 예측 모델에서, 명세서의 변수들로 구성된 데이터 세트와 함께, 파라미터 계수들(제 1 모형이 예측 모델인 경우) 또는가중치들(제 2 모형이 예측 모델인 경우)을 이용하여 해당 환자의 급성 악화 발생 여부에 대한 예측 결과값을 산출한다(S730). 이 경우에, 예측 결과값은 단순히 발생 유무 또는 발생에 대한 위험 정도를 수치화/레벨화 등의 표기로 산출될 수 있다. Next, the analysis unit 250, together with the data set composed of the parameters of the specification in the finally set prediction model, parameter coefficients (when the first model is a prediction model) or weights (the second model is a prediction model) Case) is used to calculate a predicted result of acute deterioration of the patient (S730). In this case, the predicted result value may simply be calculated by notation of occurrence or risk of occurrence by notation such as digitization / leveling.

이어서, 제공부(270)는 요청한 환자의 단말들(180, 190), 의료 기록용 통합 서버 및 해당 환자의 병원 서버(120) 중 적어도 하나에 예측 결과값을 제공하고, 저장부(280)는 예측 결과값과 관련 명세서들을 저장시킨다(S740). Subsequently, the provider 270 provides the prediction result value to at least one of the terminals 180 and 190 of the requested patient, the integrated server for medical records, and the hospital server 120 of the patient, and the storage unit 280 The prediction result value and related specifications are stored (S740).

본 실시예에 따르면, 모든 병원의 정보를 통합적으로 관리하는 의료 기록용 통합 서버(110)에 저장된 모든 천식 또는 COPD 환자의 방대한 데이터에 기초한 급성 악화의 예측 모델을 사용함으로써, 환자 스스로가 급성 악화의 발생 전에 악화를 미리 대처하면서 예방할 수 있다According to the present embodiment, by using a predictive model of acute exacerbation based on the vast amount of data of all asthma or COPD patients stored in the integrated medical record server 110 for managing the information of all the hospitals, the patient himself can We can prevent aggravation before it occurs

도 4 내지 6에서도 설명한 바와 같이, 예측 모델은 급성 악화에 높은 영향력을 갖는 변수들과 함께, 소셜 데이터와 관련된 변수들도 편입하고 있으므로, 급성 악화를 더 정밀하게 예측할 수 있다. As described with reference to FIGS. 4 to 6, the prediction model includes variables related to social data as well as variables having a high influence on the acute exacerbation, thereby more accurately predicting the acute deterioration.

이하, 도 1, 도 2 및 도 8을 참조하여, 만성 호흡기 질환의 급성 악화 예측용 애플리케이션에서 수행되는 본 발명의 실시예인 예측 모델을 갱신하는 프로세스에 관하여 설명하기로 한다. 여기서는 예측 모델이 제 1 모형인 점으로 예시하며, 도 8은 예측 모델이 제 1 모형인 경우에 예측 모델을 갱신하는 프로세스에 관한 흐름도이다. 1, 2, and 8, a process of updating a prediction model, which is an embodiment of the present invention, performed in an application for predicting acute exacerbation of chronic respiratory disease will be described. Here, the prediction model is illustrated as being the first model, and FIG. 8 is a flowchart illustrating a process of updating the prediction model when the prediction model is the first model.

우선, 제공부(270)는 예측 모델에 따라 급성 악화의 발생 여부가 판정된 환자의 실제 발생 여부에 대한 실제 결과값을 해당 환자의 병원 서버(120) 및 의료 기록용 통합 서버(110)에 기록하고, 저장부(280) 역시 실제 결과값을 저장한다(S810). First, the provider 270 records the actual result value of the actual occurrence of the patient is determined whether the acute exacerbation according to the prediction model to the hospital server 120 and the integrated server 110 for medical records of the patient. In addition, the storage unit 280 also stores the actual result value (S810).

다음으로, 수집부(230)는 의료 기록용 통합 서버(110) 및 저장부(280)에 예측된 환자의 실제 결과값을 접수하고, 의료 기록용 통합 서버(110), 기상 데이터 서버(130), 바이러스 정보 서버(140), 소셜 데이터 서버(150), 생활 패턴 제공 서버(160), 위치 정보 서버((170), 저장부(280)부터 모든 변수들(Xij,p) 및 급성 악화의 실제 발생 여부에 따른 실제 결과값들(Yij)을 수집하여 모든 환자 별 명세서들(Xij)을 생성한다(S820). 수집부(230)의 동작은 도 4의 S410 단계와 실질적으로 마찬가지이고, Xij, Xij,p, Yij 및 이하의 βp는 모순되지 않는 범위에서 도 4에서의 명칭과 실질적으로 동일하다. Next, the collection unit 230 receives the actual result value of the patient predicted in the medical record integrated server 110 and the storage unit 280, the medical record integrated server 110, meteorological data server 130 From the virus information server 140, the social data server 150, the life pattern providing server 160, the location information server 170, the storage unit 280, all the variables (Xij, p) and the actual condition of the acute deterioration. The actual result values Yij according to the occurrence are collected and all the patient-specific specifications Xij are generated at step S820. Operation of the collection unit 230 is substantially the same as step S410 of FIG. Xij, p, Yij and the following βp are substantially the same as the names in FIG. 4 within a contradiction.

이어서, 갱신부(260)는 도 4에서 설명한 식 1과 같은 계수 추정식을 통해, 명세서 (Xij)의 변수들(Xij,p)로 구성된 데이터 세트와 실제 결과값들(Yij)에 기초하여 변수들(Xij,p)에 소정 연산 형태로 결합하는 파라미터 계수들(βp)을 재추정한다(S830). Subsequently, the updater 260 performs a variable based on a data set composed of the variables Xij and p of the specification Xij and actual result values Yij through a coefficient estimation equation as in Equation 1 described with reference to FIG. 4. The parameter coefficients βp coupled to the fields Xij, p in a predetermined operation form (S830).

계속해서, 갱신부(260)는 도 4의 S430에서 설명한 평가 분석과 실질적으로 동일하게, 실제 결과값들(Yij)에 대한 영향도를 평가하여 실제 결과값들(Yij)에 영향을 미치는 변수들(Xij,p)과 파라미터 계수(βp)를 재결정한다(S840). Subsequently, the updater 260 evaluates the degree of influence on the actual result values Yij, substantially the same as the evaluation analysis described in S430 of FIG. 4, and affects the actual result values Yij. (Xij, p) and the parameter coefficient βp are recrystallized (S840).

갱신부(260)는 재결정된 변수들(Xij,p)과 파라미터 계수(βp)을 식 2에 따른 제 1 모형에 반영함으로써, 예측 모델을 갱신한다(S850).The updater 260 updates the prediction model by reflecting the re-determined variables Xij, p and the parameter coefficient βp in the first model according to Equation 2 (S850).

본 실시예에 의하면, 애플리케이션으로 예측한 환자에 대한 실제 급성 악화의 여부를 병원 또는 의료 기록용 통합 서버로부터 추적하고, 이를 이용하여 예측 모델의 변수들과 파라미터 계수들을 조정함으로써, 예상치 않는 변수들을 반영할 수 있으며, 예측 모델의 더 정교하게 발전시킬 수 있다.According to this embodiment, the actual acute deterioration of the patient predicted by the application is tracked from the integrated server for hospital or medical records, and the adjusted values and parameter coefficients of the predictive model are used to reflect unexpected variables. And more sophisticated development of predictive models.

본 실시예에서는 예측 모델이 제 1 모형인 경우의 갱신 과정을 예시하였으나, 예측 모델이 제 2 모형인 경우에, 도 5 및 도 8의 적절한 조합을 통해, 갱신 당시에 적합하게 선정된 변수들(Xij,p) 과 가중치들로 구성된 분류 트리로 갱신될 수 있다. In the present exemplary embodiment, an update process when the predictive model is the first model is illustrated. However, when the predictive model is the second model, variables appropriately selected at the time of updating through appropriate combinations of FIGS. 5 and 8 (Xij) , p) and a classification tree composed of weights.

도 2 및 도 3에 도시된 만성 호흡기 질환의 급성 악화 예측용 애플리케이션(210)을 구성하는 구성요소 또는 도 4 내지 도 8에 도시된 실시예들에 따른 단계는 그 기능을 실현시키는 프로그램의 형태로 컴퓨터 판독가능한 기록 매체에 기록될 수 있다. 여기에서, 컴퓨터 판독 가능한 기록 매체란, 데이터나 프로그램 등의 정보를 전기적, 자기적, 광학적, 기계적, 또는 화학적 작용에 의해 축적하고, 컴퓨터에서 판독할 수 있는 기록 매체를 말한다. 이러한 기록 매체 중 컴퓨터로부터 분리 가능한 것으로서는, 예를 들면, 휴대용 스토리지(portalbe storage), 플렉시블 디스크, 광자기 디스크, CD-ROM, CD-R/W, DVD, DAT, 메모리 카드 등이 있다. 또한, 모바일 디바이스 및 컴퓨터에 고정된 기록 매체로서 SSD(Solid State Disk), 하드디스크나 ROM 등이 있다.The components constituting the application 210 for predicting acute exacerbation of chronic respiratory disease shown in FIGS. 2 and 3 or the steps according to the embodiments shown in FIGS. 4 to 8 are in the form of a program for realizing the function. It can be recorded on a computer readable recording medium. Here, the computer-readable recording medium refers to a recording medium in which information such as data and programs are accumulated by electrical, magnetic, optical, mechanical, or chemical actions, and can be read by a computer. Examples of such recording media that can be separated from a computer include, for example, portable storage, flexible disks, magneto-optical disks, CD-ROMs, CD-R / Ws, DVDs, DATs, memory cards, and the like. Further, solid state disks (SSDs), hard disks, and ROMs may be used as recording media fixed to mobile devices and computers.

또한, 이상에서, 본 발명의 실시예를 구성하는 모든 구성 요소들이 하나로 결합되어 동작하는 것으로 설명되었다고 해서, 본 발명이 반드시 이러한 실시예에 한정되는 것은 아니다. 즉, 본 발명의 목적 범위 안에서라면, 그 모든 구성 요소들이 하나 이상으로 선택적으로 결합하여 동작할 수도 있다. 또한, 그 모든 구성 요소들이 각각 하나의 독립적인 하드웨어로 구현될 수 있지만, 각 구성 요소들의 일부 또는 전부가 선택적으로 조합되어 하나 또는 복수 개의 하드웨어에서 조합된 일부 또는 전부의 기능을 수행하는 프로그램 모듈을 갖는 컴퓨터 프로그램으로서 구현될 수도 있다.In addition, in the above, it is described that all the components constituting the embodiment of the present invention are combined to operate as one, the present invention is not necessarily limited to these embodiments. In other words, within the scope of the present invention, all of the components may be selectively operated in combination with one or more. In addition, although all the components may be implemented as one independent hardware, each or some of the components may be selectively combined to perform a program module which performs some or all functions combined in one or a plurality of hardware. It may be implemented as a computer program having.

이상에서 대표적인 실시예를 통하여 본 발명에 대하여 상세하게 설명하였으나, 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자는 상술한 실시예에 대하여 본 발명의 범주에서 벗어나지 않는 한도 내에서 다양한 변형이 가능함을 이해할 것이다. 그러므로 본 발명의 권리 범위는 설명된 실시예에 국한되어 정해져서는 안 되며, 후술하는 특허청구범위뿐만 아니라 이 특허청구범위와 균등 개념으로부터 도출되는 모든 변경 또는 변형된 형태에 의하여 정해져야 한다.Although the present invention has been described in detail through the representative embodiments, those skilled in the art to which the present invention pertains can make various modifications without departing from the scope of the present invention. I will understand. Therefore, the scope of the present invention should not be limited to the described embodiments, but should be defined by all changes or modifications derived from the claims and the equivalent concepts as well as the following claims.

110: 의료 기록용 통합 서버 120: 병원 서버
130: 기상 데이터 서버 140: 바이러스 정보 서버
150: 소셜 데이터 서버 160: 생활 패턴 제공 서버
170: 위치 정보 서버 180: 사용자 모바일 단말
190: 사용자 유선 단말 200: 예측 서버
210: 애플리케이션 내장부 220: 입력부
230: 수집부 240: 모델 구성부
242: 제 1 모형 수립부 244: 제 2 모형 수립부
246: 판정부 250: 분석부
260: 갱신부 270: 제공부
280: 저장부110: integrated server for medical records 120: hospital server
130: weather data server 140: virus information server
150: social data server 160: life pattern providing server
170: location information server 180: user mobile terminal
190: user wired terminal 200: prediction server
210: application built-in unit 220: input unit
230: collector 240: model configuration
242: first model establishment unit 244: second model establishment unit
246: determination unit 250: analysis unit
260: update unit 270: provider
280: storage unit

Claims

하드웨어와 결합되어 매체에 저장된 만성 호흡기 질환의 급성 악화 예측용 애플리케이션에 있어서,
만성 호흡기 질환 급성 악화의 예측 모델에서 이용하는 값들로서, 만성 호흡기 질환 환자들의 병력 데이터, 임상 양상 데이터, 유전자 데이터를 저장하는 모든 병원 서버로부터 환자 의료 정보들을 수집하는 의료 기록용 통합 서버, 바이러스 정보 서버, 기상 데이터 서버, 소셜 데이터 서버, 생활 패턴 제공 서버 및 위치 정보 서버로부터의 변수들로부터 선택된 변수들과 상기 선택된 변수들에 결합되는 파라미터 계수들을 포함하는 방정식으로 설정된 제 1 모형, 및 상기 변수들의 적어도 일부를 포함하는 머신 러닝(machine learning) 알고리즘으로 설정된 제 2 모형을 수립하는 단계;
상기 서버들로부터, 상기 제 1 및 제 2 모형들 각각에서 선택된 동일 기간 내의 변수들과 상기 동일 기간 내의 만성 호흡기 질환의 급성 악화의 실제 발생 여부에 따른 실제 결과값들을 추출하여 상기 모형들에서의 급성 악화의 발생 여부를 나타내는예측 결과값과 상기 실제 결과값들을 비교하고, 비교 결과 예측 결과값의 정확도가 높은 모형을 예측 모델로 설정하는 단계;
만성 호흡기 질환 환자의 급성 악화 발생 여부에 대한 예측 요청에 의해, 상기 서버들로부터, 상기 예측 모델에서 결정된 상기 환자의 변수들이 발생한 시점마다 상기 변수들을 데이터 세트로 구성하는 적어도 하나의 명세서를 생성하는 단계;
상기 명세서의 변수들을 상기 예측 모델에 입력하여 상기 환자의 만성 호흡기 질환 환자의 급성 악화의 발생을 예측하는 단계; 및
예측 결과값을 상기 환자, 의료 기록용 통합 서버 및 병원 서버에 적어도 하나에 제공하는 단계를 실행시키고,
상기 제 1 및 제 2 모형들에서 공통되는 변수들은 상기 의료 기록용 통합 서버로부터 입수된 병력 데이터 및 상기 바이러스 정보 서버로부터 입수된 호흡기 바이러스 데이터를 적어도 포함하고, 상기 병력 데이터는 환자의 만성 호흡기 질환의 약물 복용 및 복용 빈도, 응급실 이용 빈도, 입원 빈도, 요양 일수, 진료 일수, 만성 호흡기 질환의 급성 악화 발생 빈도를 적어도 포함하는 것인 매체에 저장된 만성 호흡기 질환의 급성 악화 예측용 애플리케이션.In applications for predicting acute exacerbation of chronic respiratory disease stored in media in combination with hardware,
Values used in predictive models of acute exacerbation of chronic respiratory disease include: an integrated server for medical records, virus information server, which collects patient medical information from all hospital servers storing history data, clinical picture data and genetic data of patients with chronic respiratory disease. A first model set with an equation comprising variables selected from variables from a weather data server, a social data server, a life pattern providing server and a location information server and parameter coefficients coupled to the selected variables, and at least some of the variables Establishing a second model set by a machine learning algorithm comprising a;
From the servers, acute in the models is extracted by extracting actual results according to the variables within the same time period selected in each of the first and second models and the actual occurrence of acute exacerbation of chronic respiratory disease within the same time period. Comparing the prediction result value indicating whether deterioration has occurred with the actual result values, and setting a model having a high accuracy of the comparison result prediction result value as a prediction model;
Generating, by the server, at least one specification for configuring the variables as a data set at each time point of occurrence of the variables of the patient determined in the prediction model, by a request for prediction of whether an acute exacerbation of a chronic respiratory disease patient occurs. ;
Predicting the occurrence of acute exacerbation of a patient of chronic respiratory disease of the patient by inputting variables of the specification to the predictive model; And
Providing the predicted result value to at least one of the patient, the integrated server for medical records, and the hospital server,
Variables common to the first and second models include at least medical history data obtained from the integrated medical record server and respiratory virus data obtained from the virus information server, wherein the medical history data may be used to describe chronic respiratory disease in a patient. Application for predicting acute exacerbation of chronic respiratory disease stored in a medium, including at least a frequency of taking and taking a drug, frequency of emergency room use, frequency of hospitalization, days of care, days of treatment, frequency of occurrence of acute exacerbation of chronic respiratory disease.

제 1 항에 있어서,
상기 호흡기 바이러스 데이터는 상기 환자의 병력 데이터 또는 임상 양상 데이터와 관련된 변수들의 발생 시점보다 소정 기간 전에 수집되는 바이러스 검출률이며, 아데노 바이러스(ADenoVirus; 'ADV'), 파라인플루엔자 바이러스(ParaInfluenza Virus; 'PIV'), 호흡기세포융합 바이러스(Respiratory Syncytial Virus; 'RSV'), H1N1pdm 바이러스, 인간 코로나 바이러스(human CoronaVirus; 'hCoV'), 인간 라이노 바이러스(human Rhion Virus; 'hRV'), 인간 보카 바이러스(human BOcaVirus; 'hBOV'), hEV 바이러스 및 IFV 바이러스 중 적어도 하나인 만성 호흡기 질환의 급성 악화 예측용 애플리케이션. The method of claim 1,
The respiratory virus data is a virus detection rate collected a predetermined time before the time of occurrence of the variables associated with the patient's medical history data or clinical picture data, adenovirus (ADenoVirus; 'ADV'), ParaInfluenza Virus ('PIV') ), Respiratory Syncytial Virus ('RSV'), H1N1pdm virus, human CoronaVirus ('hCoV'), human Rhion Virus ('hRV'), human BOcaVirus 'hBOV'), an application for predicting acute exacerbation of chronic respiratory disease, which is at least one of hEV virus and IFV virus.

제 1 항에 있어서,
상기 병력 데이터의 만성 호흡기 질환 약물 복용은 서로 상이한 성분의 복수 약제이며, 상기 복용 빈도는 약제 별 복용 빈도인 만성 호흡기 질환의 급성 악화 예측용 애플리케이션. The method of claim 1,
Chronic respiratory disease drug administration of the medical history data is a plurality of drugs of different components, the dose frequency is the application frequency for the acute exacerbation of chronic respiratory disease is the application frequency.

제 1 항에 있어서,
상기 제 1 모형은 상기 급성 악화의 발생 여부를 나타내는 예측 결과값과 상기 변수들과 상호 연관되는(correlated) 것을 반영하는 모델을 이용하는 만성 호흡기 질환의 급성 악화 예측용 애플리케이션. The method of claim 1,
The first model is an application for predicting acute exacerbation of chronic respiratory disease using a model that reflects the predicted value indicating whether the acute exacerbation has occurred and correlated with the variables.

제 1 항에 있어서,
상기 병력 데이터는 천식 또는 COPD(Chronic Obstructive pulmonary Disease) 여부를 포함하는 관련 증상 발생 빈도, 및 허혈성심질환, 폐암, 골다공증, 우울증, 관절염, 당뇨병, 위식도 역류, 기흉, 심부전, 고혈압, 빈혈, 대사증후군 중 적어도 하나를 포함하는 동반 질환 빈도를 더 갖는 만성 호흡기 질환의 급성 악화 예측용 애플리케이션. The method of claim 1,
The medical history data includes the incidence of related symptoms, including asthma or Chronic Obstructive Pulmonary Disease (COPD), and ischemic heart disease, lung cancer, osteoporosis, depression, arthritis, diabetes, gastroesophageal reflux, pneumothorax, heart failure, hypertension, anemia, metabolic syndrome. Application for predicting acute exacerbation of chronic respiratory disease with a further associated disease frequency comprising at least one of.

제 1 항에 있어서,
상기 임상 양상 데이터는 평균 흡연량, 흡연 이력, 1초 노력성 호기량 (forced expiratory volume in 1 second; FEV1%), 환자 자가 진단의 질의에 대한 응답을 계량화한 CAT(만성 호흡기 질환 Assessment Test) 중 적어도 하나를 포함하는 만성 호흡기 질환의 급성 악화 예측용 애플리케이션. The method of claim 1,
The clinical picture data includes at least one of mean smoking volume, smoking history, forced expiratory volume in 1 second (FEV1%), and CAT (Acute Chronic Respiratory Disease Assessment Test), which quantifies the response to the question of patient self-diagnosis. Application for predicting acute exacerbation of chronic respiratory disease comprising a.

제 1 항에 있어서,
상기 기상 데이터 서버로부터 제공되는 변수들은 상기 환자의 병력 데이터 또는 상기 임상 양상 데이터와 관련된 변수들의 발생 시점보다 소정 기간 전에 수집되는 데이터로서, 미세 먼지 누적량, 최저 기온 누적량, 최저 기온 변화량, 강수량 누적량, 최고 기온 누적량, 최고 기온 변화량, 일교차, 일별 최대 풍속, 평균 습도, 일조량 중 적어도 하나를 포함하는 만성 호흡기 질환의 급성 악화 예측용 애플리케이션. The method of claim 1,
Variables provided from the meteorological data server are data collected at a predetermined time before the occurrence of the history data of the patient or the variables related to the clinical pattern data, and include fine dust accumulation amount, minimum temperature accumulation amount, minimum temperature change amount, precipitation accumulation amount, highest value. Application for predicting acute exacerbation of chronic respiratory disease, including at least one of temperature accumulation, peak temperature change, day crossing, daily maximum wind speed, average humidity, and sunshine.

제 1 항에 있어서,
상기 소셜 데이터 서버로부터 제공되는 변수들은 상기 환자의 병력 데이터 또는 상기 임상 양상 데이터와 관련된 변수들의 발생 시점을 포함하는 소정 기간에 수집되는 데이터로서, 검색 엔진 및 소셜 데이터에서 발견되는 만성 호흡기 질환 관련 용어, 만성 호흡기 질환에 의한 증상 관련 용어, 만성 호흡기 질환에 따른 질환 관련 용어의 빈도인 만성 호흡기 질환의 급성 악화 예측용 애플리케이션. The method of claim 1,
The variables provided from the social data server are data collected over a period of time including the time of occurrence of the patient's history data or variables related to the clinical modality data, terms related to chronic respiratory disease found in search engines and social data, Application for predicting acute exacerbation of chronic respiratory disease, which is a frequency of symptoms related to chronic respiratory disease and disease related terms according to chronic respiratory disease.

제 1 항에 있어서,
상기 생활 패턴 제공 서버로부터 제공되는 변수들은 상기 환자들의 카드 구매 정보들을 포함하고, 상기 위치 정보 서버는 상기 환자가 위치한 장소의 대기질 정보를 포함하는 만성 호흡기 질환의 급성 악화 예측용 애플리케이션. The method of claim 1,
The variables provided from the life pattern providing server include card purchase information of the patients, and the location information server includes air quality information of a location where the patient is located.

제 1 항에 있어서,
상기 제 1 모형을 수립하는 단계에서 파라미터 계수들(β)을 결정하는 것은 하기 식 1에 따른 계수 추정식을 0으로 만드는 상기 파라미터 계수들을 구함으로써 추정하는 만성 호흡기 질환의 급성 악화 예측용 애플리케이션.

[식 1]
(i는 환자별 식별자이고, j는 환자별 명세서 식별자이며,

는 i번째 환자의 j번째의 명세서로서변수들의 데이터 세트(

)로 구성되며,

는 명세서

에 따른 실제 급성 악화 발생 여부에 따른 실제 결과값이며,
β는 변수들과 상관되는 파라미터 계수들의 세트(

)이고,

이고,

이며,

(평균)이고,

이며,

(분산)인 경우,

이고,

는 명세서와 급성 악화 발생 여부에 따른 값 간의 상관성을 해결하기 위한 시변(time varing) 상관 행렬로서 기본 행렬임)The method of claim 1,
Determining the parameter coefficients (β) in the step of establishing the first model is estimated by obtaining the parameter coefficients that make the coefficient estimation equation according to Equation 1 below.

[Equation 1]
(i is patient-specific identifier, j is patient-specific statement identifier,

Is the jth specification of the i patient and the data set of variables (

),

Specification

According to whether the actual acute exacerbation occurred
β is the set of parameter coefficients that correlate

)ego,

ego,

Is,

(Average)

Is,

(Dispersion),

ego,

제 10항에 있어서,
상기 예측 모델이 상기 제 1 모형으로 설정된 경우, 상기 예측 결과값을 제공하는 단계 후에,
상기 예측 모델에 따라 급성 악화의 발생 여부가 예측된 환자의 실제 급성 악화 여부의 실제 결과값을 상기 의료 기록용 통합 서버 및 상기 애플리케이션이 내장된 예측 서버에 기록하는 단계;
상기 의료 기록용 통합 서버, 상기 기상 데이터 서버, 상기 바이러스 정보 서버, 상기 소셜 데이터 서버, 상기 생활 패턴 제공 서버, 상기 위치 정보 서버, 상기 예측 서버로부터 모든 변수들 및 만성 호흡기 질환의 급성 악화의 실제 발생 여부에 따른 실제 결과값들을 수집하여 모든 환자 별 명세서들을 생성하는 단계;
상기 예측 모델의 계수 추정식을 통해, 상기 명세서의 변수들로 구성된 데이터 세트와 실제 결과값들에 기초하여 변수들에 결합되는 파라미터 계수들을 재추정하는 단계; 및
상기 실제 결과값들에 대한 영향도를 평가하여 상기 실제 결과값들에 영향을 미치는 변수들과 파라미터 계수를 재결정하는 단계를 더 포함하되,
상기 파라미터 계수들의 재추정은 상기 식 1에 따른 계수 추정식을 이용하는 만성 호흡기 질환의 급성 악화 예측용 애플리케이션.The method of claim 10,
If the prediction model is set to the first model, after providing the prediction result value,
Recording the actual result value of the actual acute exacerbation of the patient whose predicted occurrence of the acute exacerbation according to the prediction model is recorded in the integrated server for medical record and the prediction server in which the application is embedded;
Actual occurrence of all variables and acute exacerbation of chronic respiratory disease from the integrated server for medical records, the meteorological data server, the virus information server, the social data server, the life pattern providing server, the location information server, the prediction server Generating all patient-specific specifications by collecting actual results according to whether or not;
Re-estimating, through coefficient estimation of the prediction model, parameter coefficients coupled to the variables based on a data set consisting of the variables of the specification and actual results; And
Evaluating the influence on the actual result values to re-determine parameters and parameter coefficients affecting the actual result values,
The re-estimation of the parameter coefficients is an application for predicting acute exacerbation of chronic respiratory disease using the coefficient estimation equation according to Equation 1.

상기 제 2 모형을 수립하는 단계는
상기 서버들로부터 모든 변수들 및 급성 악화의 실제 발생 여부에 따른 실제 결과값들을 수집하여 모든 환자 별 명세서들을 생성하는 단계;
상기 모든 변수들 및 상기 변수들마다 대응하는 실제 결과값들로부터 랜덤하게(randomly) 복원 추출하여 복수의 샘플 데이터 세트들을 생성하는 단계;
하나의 샘플 데이터 세트의 변수들로부터 독립 변수들을 랜덤하게 선택하는 단계;
상기 선택된 독립 변수들 중 실제 결과값들을 가장 잘 분류하는 독립 변수들의 탐색 및 대응 가중치들의 산출하는 단계;
상기 탐색된 독립 변수들 중 분류의 소정 방식에 부합하는 1 번째 세대의 노드 및 이로부터 분기되는 2 번째 세대의 노드와 관련된 독립 변수를 선정하는 노드 분리를 수행하며 기 설정된 세대에 도달할 때까지 분류 트리(tree)를 생성하는 단계;
상기 분류 트리가 기 설정된 개수로 생성되지 않은 경우, 상기 복수의 샘플 데이터 세트 중 상기 단계들이 진행되지 않은 세트들에 대해 상기 단계들을 반복하는 단계; 및
상기 분류 트리가 기 설정된 개수로 생성된 경우, 상기 복수의 분류 트리를 구조 유사도 별로 분류하여 개수를 파악하고, 최대 개수의 분류 트리를 제 2 모형으로 수립하는 단계를 포함하는 만성 호흡기 질환의 급성 악화 예측용 애플리케이션. Establishing the second model
Generating all patient specific specifications from the servers by collecting all results and actual results according to the actual occurrence of acute exacerbation;
Generating a plurality of sample data sets by randomly restoring extraction from all the variables and corresponding actual result values for each of the variables;
Randomly selecting independent variables from variables of one sample data set;
Searching for independent variables that best classify actual results among the selected independent variables and calculating corresponding weights;
A node separation is performed to select independent variables related to the first generation node and the second generation node branching therefrom among the searched independent variables, and are classified until a preset generation is reached. Creating a tree;
If the classification tree is not generated in a predetermined number, repeating the steps with respect to sets of the plurality of sample data sets that have not been performed; And
When the classification tree is generated with a predetermined number, acute exacerbation of chronic respiratory disease, including classifying the plurality of classification trees by structural similarity, and establishing a maximum number of classification trees as a second model. Prediction application.