KR102293160B1

KR102293160B1 - A device for predicting dementia, a device for generating a predictive model, and a program for predicting dementia

Info

Publication number: KR102293160B1
Application number: KR1020207037398A
Authority: KR
Inventors: 히로요시 도요시바; 히데후미 우치야마; 다이시로 기시모토; 게이 후나키; 요코 스가; 쇼고 홋타; 다카노리 후지타; 마사루 미무라
Original assignee: 가부시키가이샤 프론테오; 각고호우징 게이오기주크
Priority date: 2018-09-12
Filing date: 2019-07-03
Publication date: 2021-08-24
Also published as: ES2963236T3; JP6733891B2; WO2020054186A1; KR20210003944A; EP3835972C0; EP3835972A1; CN112470143A; JP2020042659A; US20210313070A1; EP3835972A4; EP3835972B1

Abstract

인지증(認知症)의 중증도가 기지(旣知) m인의 환자가 행한 자유회화의 내용을 나타내는 m개의 문장으로부터 n개의 단어를 추출하고, m개의 문장 및 n개의 단어 사이의 관계성을 반영한 관계성 지표값을 산출하는 관계성 지표값 산출부(100A)와, 1개의 문장에 대하여 n개의 관계성 지표값으로 이루어지는 문장지표값군를 바탕으로 인지증의 중증도를 예측하기 위한 예측 모델을 생성하는 예측 모델 생성부(14A)와, 예측용 데이터 입력부(20)에 의해 입력되는 문장으로부터 관계성 지표값 산출부(100A)에 의해 산출되는 관계성 지표값을 예측 모델에 적용함으로써, 예측 대상의 문장으로부터 환자의 인지증 중증도를 예측하는 인지증 예측부(21A)를 구비하고, 미니 멘탈 스테이트 검사를 행하지 않고 인지증의 중증도를 예측할 수 있도록 한다.A relationship reflecting the relationship between m sentences and n words by extracting n words from m sentences representing the content of free conversations conducted by patients with known severity of dementia. Prediction generating a predictive model for predicting the severity of dementia based on the relational index value calculation unit 100A for calculating the gender index value and the sentence index value group consisting of n relational index values for one sentence By applying the relation index value calculated by the relation index value calculation unit 100A from the sentence input by the model generation unit 14A and the prediction data input unit 20 to the prediction model, the prediction target sentence A dementia prediction unit 21A for predicting the severity of dementia of a patient is provided, and the severity of dementia can be predicted without performing a mini-mental state test.

Description

인지증 예측 장치, 예측 모델 생성 장치 및 인지증 예측용 프로그램A device for predicting dementia, a device for generating a predictive model, and a program for predicting dementia

본 발명은, 인지증(認知症) 예측 장치, 예측 모델 생성 장치 및 인지증 예측용 프로그램에 관한 것이며, 특히, 환자의 인지증 중증도(환자가 인지증에 걸려 있을 가능성을 포함함)을 예측하는 기술 및 이 예측에 사용하는 예측 모델을 생성하는 기술에 관한 것이다.The present invention relates to an apparatus for predicting dementia, an apparatus for generating a predictive model, and a program for predicting dementia, in particular, predicting the severity of dementia (including the possibility that the patient has dementia) It relates to techniques and techniques for generating predictive models for use in making these predictions.

인지증은 인구의 고령화와 함께 증가의 일로를 걷고 있고, 의료의 문제일뿐만 아니라 커다란 사회적 과제로 되어 있다. 인지증의 치료에 있어서, 인지증의 조기발견 및 중증도의 평가는 매우 중요하다. 현재, 인지증의 스크리닝 검사나 중증도의 평가에 미니 멘탈 스테이트 검사(Mini-Mental State Examination: MMSE)가 일상 임상에서 널리 사용되고 있다. MMSE는, 소재식, 기억력, 주의력(계산력), 언어적 능력, 구성력(도형적 능력) 등을 보는 11항목 30점 만점의 질문으로 이루어지는 인지 기능검사이다. 30점 중, 27점 이하는 경도 인지장애(Mild Cognitive Impairment: MCI)의 의혹, 23점 이하는 인지증의 의혹이 있다고 한다.The dementia is increasing along with the aging of the population, and it is not only a medical problem but also a great social problem. In the treatment of dementia, early detection and evaluation of the severity of dementia are very important. Currently, a mini-mental state examination (MMSE) is widely used in daily clinical practice for screening tests for dementia or evaluation of the severity. The MMSE is a cognitive function test consisting of 11 items out of 30 questions that look at material knowledge, memory, attention (calculation), verbal ability, and construction (graphical ability). Out of 30 points, a score of 27 or less indicates a suspicion of mild cognitive impairment (MCI), and a score of 23 or less indicates a suspicion of dementia.

종래, MMSE의 평가 항목마다 평가를 행하여 인지증의 이환(罹患) 가능성을 판정하고, 그 판정 결과에 기초하여 간호지원을 하는 시스템이 알려져 있다(예를 들면, 특허문헌 1 참조). 특허문헌 1에 기재된 시스템에서는, 피개호자의 신체적 또는 정신적 건강 상태를 MMSE에 의해 조사하고, 그 조사 결과로부터 피개호자의 건강 상태 평가를 행한다. 그리고, 피개호자의 건강 상태 평가에 따라 음성 또는 영상을 작성하여 보호자에 전달하고, 보호자가 그 전달된 음성 또는 영상에 기초하여 피개호자를 간호한다. 그 후, 피개호자의 신체적 또는 정신적 건강 상태를 재조사하여 피개호자의 건강 상태 재평가를 행한다. 조사에 관해서는, 기억 장애, 소재식, ADL(일상 작업 동작 능력), 신체기능의 4항목의 관점에서 행하는 것이 기재되어 있다.Conventionally, there is known a system that evaluates each evaluation item of MMSE, determines the morbidity of dementia, and provides nursing support based on the determination result (for example, refer to Patent Document 1). In the system described in Patent Document 1, the physical or mental health state of a caregiver is investigated by MMSE, and the health status of the caregiver is evaluated from the investigation result. Then, according to the health status evaluation of the caregiver, a voice or image is prepared and transmitted to the guardian, and the guardian takes care of the caregiver based on the transmitted voice or image. Thereafter, the physical or mental health status of the caregiver is reexamined, and the health status of the caregiver is reevaluated. Regarding the investigation, it is described that it is performed from the viewpoint of 4 items: memory impairment, whereabouts, ADL (daily work and movement ability), and physical function.

일본공개특허 제2002-251467호 공보Japanese Laid-Open Patent Publication No. 2002-251467

MMSE는, 재현성이 높은 테스트로서 널리 알려져 있다. 그러나, 동일한 환자에 대하여 테스트를 복수 회 행하면, 그 연습 효과에 의해 환자가 질문의 내용을 기억하게 되어, 정확한 득점을 계측할 수없게 된다. 이 때문에, 인지증의 중증도를 빈번하게 측정하는 것이 어려운 문제가 있었다. 상기한 특허문헌 1에 기재된 시스템은, 이와 같이 MMSE가 반복적인 사용에 맞지 않는 문제는 조금도 고려되어 있지 않다.MMSE is widely known as a highly reproducible test. However, if the test is performed a plurality of times on the same patient, the patient remembers the contents of the question due to the practice effect, and it becomes impossible to measure an accurate score. For this reason, it was difficult to measure the severity of dementia frequently. The system described in Patent Document 1 described above does not consider the problem that MMSE is not suitable for repeated use.

본 발명은, 이와 같은 문제점을 해결하기 위해 이루어진 것이며, 인지증의 중증도를 반복적으로 측정하는 경우라도, 환자에 의한 연습 효과를 배제한 측정 결과를 얻을 수 있도록 하는 것을 목적으로 한다.The present invention has been made to solve such a problem, and it is an object of the present invention to obtain a measurement result excluding the effect of practice by a patient even when the severity of dementia is repeatedly measured.

상기한 문제점을 해결하기 위하여, 본 발명의 인지증 예측 장치에서는, 인지증의 중증도가 기지(旣知)인 복수 인(人)의 환자가 행한 자유회화의 내용을 각각 나타낸 복수의 문장을 학습용 데이터로서 입력하고, 상기 입력된 복수의 문장을 형태소 해석하여 복수의 분해 요소를 추출하고, 복수의 문장을 각각 소정의 룰에 따라 q차원으로 벡터화함으로써, q개의 축 성분으로 이루어지는 복수의 문장 벡터를 산출하고 또한, 복수의 분해 요소를 각각 소정의 룰에 따라 q차원으로 벡터화함으로써, q개의 축 성분으로 이루어지는 복수의 요소 벡터를 산출하고, 또한 복수의 문장 벡터와 복수의 요소 벡터와의 내적을 각각 계산함으로써, 복수의 문장 및 복수의 분해 요소 사이의 관계성을 반영한 관계성 지표값을 산출한다. 그리고, 1개의 문장에 대하여 복수의 관계성 지표값으로 이루어지는 문장지표값군를 바탕으로 인지증의 중증도를 예측하기 위한 예측 모델을 생성하도록 하고 있다. 예측 대상으로 하는 환자에 대하여 인지증의 중증도를 예측할 때는, 예측 대상으로 하는 환자가 행한 자유회화의 내용을 나타낸 문장을 예측용 데이터로서 입력하고, 상기 입력한 예측용 데이터에 대하여 요소 추출, 문장 벡터 산출, 요소 벡터 산출 및 지표값 산출의 각처리를 실행함으로써 얻어지는 관계성 지표값을 예측 모델에 적용함으로써, 예측 대상으로 하는 환자의 인지증 중증도를 예측하도록 하고 있다.In order to solve the above problems, in the dementia prediction device of the present invention, a plurality of sentences each representing the contents of free conversations performed by a plurality of patients whose severity of dementia is known is used as learning data. A plurality of sentence vectors composed of q axis components are calculated by inputting the inputted sentences as In addition, by vectorizing the plurality of decomposition elements in q dimensions according to a predetermined rule, a plurality of element vectors composed of q axis components are calculated, and the dot product of the plurality of sentence vectors and the plurality of element vectors is calculated, respectively. By doing so, a relationship index value reflecting the relationship between the plurality of sentences and the plurality of decomposition elements is calculated. In addition, a predictive model for predicting the severity of dementia is generated based on a sentence index value group comprising a plurality of relational index values for one sentence. When predicting the severity of dementia with respect to a patient as a prediction target, a sentence representing the content of free conversation performed by the patient as a prediction target is input as prediction data, and element extraction and sentence vectors are performed with respect to the input prediction data. By applying the relation index value obtained by executing each process of calculation, element vector calculation, and index value calculation to the predictive model, the severity of cognitive impairment of the patient to be predicted is predicted.

상기한 바와 같이 구성한 본 발명에 의하면, 환자가 행한 자유회화를 해석함으로써 인지증의 중증도가 예측되므로, 미니 멘탈 스테이트 검사(MMSE)를 행할 필요가 없다. 이 때문에, 인지증의 중증도를 반복적으로 측정하는 경우라도, 환자에 의한 연습 효과를 배제한 측정 결과(예측 결과)를 얻을 수 있다. 특히, 환자가 인지증에 걸려 있으면, 인지증 특유의 회화 특징이 자유회화 중에 보여지게 되고, 이와 같은 회화 특징이 반영된 상태로 관계성 지표값이 산출되고, 상기 관계성 지표값을 사용하여 예측 모델이 생성되므로, 환자가 행한 자유회화로부터 인지증의 중증도를 예측할 수 있다.According to the present invention constructed as described above, since the severity of dementia is predicted by analyzing the free conversation performed by the patient, there is no need to perform a mini mental state examination (MMSE). For this reason, even when it is a case of repeatedly measuring the severity of dementia, the measurement result (prediction result) which excluded the exercise effect by a patient can be obtained. In particular, if the patient suffers from dementia, speech characteristics peculiar to dementia are shown during free conversation, a relation index value is calculated in a state in which such speech characteristics are reflected, and a predictive model using the relationship index value Since this is generated, the severity of dementia can be predicted from the free conversation performed by the patient.

도 1은 제1 실시예에 의한 인지증 예측 장치의 기능 구성예를 나타내는 블록도이다.
도 2는 제1 실시예에 의한 문장지표값군의 설명도이다.
도 3은 제1 실시예에 의한 인지증 예측 장치의 동작예를 나타내는 흐름도이다.
도 4는 제2 실시형태에 의한 인지증 예측 장치의 기능 구성예를 나타내는 블록도이다.
도 5는 제2 실시형태에 의한 품사 추출부의 처리 내용을 예시하는 도면이다.
도 6은 제2 실시형태에 의한 품사 추출부가 추출하는 품사의 예를 나타내는 도면이다.
도 7은 제3 실시형태에 의한 인지증 예측 장치의 기능 구성예를 나타내는 블록도이다.
도 8은 제3 실시형태에 의한 예측 모델 생성부의 처리 내용을 예시하는 도면이다.
도 9는 제4 실시형태에 의한 인지증 예측 장치의 기능 구성예를 나타내는 블록도이다.
도 10은 제4 실시형태에 의한 인지증 예측 장치의 기능 구성예를 나타내는 블록도이다.
도 11은 제5 실시형태에 의한 인지증 예측 장치의 기능 구성예를 나타내는 블록도이다.
도 12는 인지증 예측 장치의 변형예를 나타내는 블록도이다.Fig. 1 is a block diagram showing an example of the functional configuration of an apparatus for predicting dementia according to a first embodiment.
2 is an explanatory diagram of a sentence index value group according to the first embodiment.
3 is a flowchart illustrating an operation example of the apparatus for predicting dementia according to the first embodiment.
Fig. 4 is a block diagram showing an example of the functional configuration of the dementia prediction apparatus according to the second embodiment.
5 is a diagram illustrating processing contents of the part-of-speech extraction unit according to the second embodiment.
6 is a diagram illustrating an example of the part-of-speech extracted by the part-of-speech extraction unit according to the second embodiment.
Fig. 7 is a block diagram showing an example of the functional configuration of the dementia prediction device according to the third embodiment.
8 is a diagram illustrating processing contents of a predictive model generation unit according to the third embodiment.
9 is a block diagram showing an example of the functional configuration of the dementia prediction apparatus according to the fourth embodiment.
Fig. 10 is a block diagram showing an example of the functional configuration of the dementia prediction apparatus according to the fourth embodiment.
11 is a block diagram showing an example of the functional configuration of the dementia prediction apparatus according to the fifth embodiment.
12 is a block diagram showing a modified example of the dementia prediction apparatus.

(제1 실시예)(Example 1)

이하, 본 발명에 의한 제1 실시예를 도면에 기초하여 설명한다. 도 1은, 제1 실시예에 의한 인지증 예측 장치의 기능 구성예를 나타내는 블록도이다. 제1 실시예에 의한 인지증 예측 장치는, 그 기능 구성으로서, 학습용 데이터 입력부(10), 단어 추출부(11A), 벡터 산출부(12A), 지표값 산출부(13A), 예측 모델 생성부(14A), 예측용 데이터 입력부(20) 및 인지증 예측부(21A)를 구비하여 구성되어 있다. 벡터 산출부(12A)는, 보다 구체적인 기능 구성으로서, 문장 벡터 산출부(121) 및 단어 벡터 산출부(122)를 구비하고 있다. 또한, 본 실시형태의 인지증 예측 장치는, 기억 매체로서, 예측 모델 기억부(30A)를 구비하고 있다. 그리고, 이하의 설명의 편의 상, 단어 추출부(11A), 벡터 산출부(12A) 및 지표값 산출부(13A)로 구성되는 부분을 관계성 지표값 산출부(100A)로 칭한다.Hereinafter, a first embodiment according to the present invention will be described with reference to the drawings. Fig. 1 is a block diagram showing an example of the functional configuration of the dementia prediction apparatus according to the first embodiment. The dementia prediction apparatus according to the first embodiment has, as its functional configuration, a learning data input unit 10, a word extraction unit 11A, a vector calculation unit 12A, an index value calculation unit 13A, and a predictive model generation unit. (14A), the prediction data input part 20, and the dementia prediction part 21A are provided, and it is comprised. The vector calculation unit 12A includes a sentence vector calculation unit 121 and a word vector calculation unit 122 as a more specific functional configuration. Moreover, the dementia prediction apparatus of this embodiment is equipped with the predictive model storage part 30A as a storage medium. In addition, for the convenience of the following description, the part comprised by the word extraction part 11A, the vector calculation part 12A, and the index value calculation part 13A is called the relationship index value calculation part 100A.

관계성 지표값 산출부(100A)는, 문장에 관한 문장 데이터를 입력하고, 문장과 그 안에 포함되는 단어의 관계성을 반영한 관계성 지표값을 산출하여 출력하는 것이다. 또한, 본 실시형태의 인지증 예측 장치는, 환자가 행한 자유회화의 내용을 나타낸 문장을 관계성 지표값 산출부(100A)가 해석하고, 그에 따라 산출되는 관계성 지표값을 사용하여, 환자에 의한 자유회화의 내용으로부터 환자의 인지증 중증도를 예측하는 것이다. 그리고, 학습용 데이터 입력부(10), 관계성 지표값 산출부(100A) 및 예측 모델 생성부(14A)에 의해, 본 발명의 예측 모델 생성 장치가 구성된다.The relation index value calculation unit 100A inputs sentence data related to a sentence, calculates and outputs a relation indicator value reflecting the relation between the sentence and a word included therein. Further, in the dementia prediction apparatus of the present embodiment, the relational index value calculation unit 100A interprets the sentence indicating the contents of the free conversation performed by the patient, and uses the relational index value calculated accordingly to give the patient It is to predict the severity of cognitive impairment of the patient from the contents of free conversation by And the predictive model generating apparatus of this invention is comprised by the data input part 10 for learning, the relationship index value calculating part 100A, and the predictive model generating part 14A.

본 명세서에 있어서 「문장」이란, 일반적으로 구점으로 구획되는 1개의 문장(센텐스(sentence))을 2개 이상 포함한 것을 일컫는다. 특히, 본 명세서에서는, 의사와 환자 사이에서 행해지는 일련의 자유회화(대화의 연속) 중에서 환자가 발화(發話)한 복수 회의 발언 내용(복수의 문장에 상당)을 1개의 문장(텍스트)로서 취급한다. 즉, 1인의 환자 1회의 자유회화(일련의 대화)에 대하여, 복수의 문장을 포함하는 1개의 문장이 정의된다.In this specification, a "sentence" generally refers to a sentence including two or more of one sentence (sentence) divided by a period. In particular, in this specification, among a series of free conversations (continuation of conversation) performed between a doctor and a patient, the contents of a plurality of speeches uttered by the patient (corresponding to a plurality of sentences) are treated as one sentence (text). do. That is, one sentence including a plurality of sentences is defined for one free conversation (series of conversations) per patient.

도 1에 나타낸 각 기능 블록은, 하드웨어, DSP(Digital Signal Processor), 소프트웨어의 어느 것에 의해서도 구성할 수 있다. 예를 들면, 소프트웨어에 의해 구성하는 경우, 상기 각 기능 블록은, 실제로는 컴퓨터의 CPU, RAM, ROM 등을 구비하여 구성되며, RAM이나 ROM, 하드디스크 또는 반도체 메모리 등의 기록 매체에 기억된 프로그램이 동작함으로써 실현된다.Each functional block shown in Fig. 1 can be configured by any of hardware, DSP (Digital Signal Processor), and software. For example, in the case of a software configuration, each functional block is actually configured with a computer CPU, RAM, ROM, etc., and is a program stored in a recording medium such as a RAM, ROM, hard disk or semiconductor memory. This is realized by operation.

학습용 데이터 입력부(10)는, 인지증의 중증도가 기지인 m 인(m은 2 이상의 임의의 정수)의 환자가 행한 자유회화의 내용을 각각 나타낸 m개의 문장을 학습용 데이터로서 입력한다. 예를 들면, 학습용 데이터 입력부(10)는, 사전에 트레이닝된 의사에 의해 MMSE의 스코어가 부여된 환자와 의사 사이에서 행해진 자유회화의 음성을 문자 데이터로 치환하고, 상기 문자 데이터에 포함되는 환자의 발화 부분의 문장을 학습용 데이터로서 입력한다. 이 경우에, 환자에 대하여 기지인 인지증의 중증도는, MMSE의 스코어의 값인 것을 의미한다. 학습용 데이터 입력부(10)는, m인의 환자의 자유회화의 각각으로부터 취득되는 m개의 문장을 복수의 학습용 데이터로서 입력한다.The learning data input unit 10 inputs, as learning data, m sentences each representing the contents of free conversations performed by a patient with a known severity of dementia m (m is an arbitrary integer greater than or equal to 2). For example, the learning data input unit 10 substitutes text data for voices of free conversations between a patient and a doctor, to which a score of MMSE was given by a doctor trained in advance, and The sentence of the utterance part is input as data for learning. In this case, the severity of cognitive impairment known to the patient is meant to be the value of the score of the MMSE. The data input unit for learning 10 inputs m sentences obtained from each of the free conversations of m patients as a plurality of data for learning.

환자와 의사 사이의 자유회화는, 예를 들면, 5∼10 분 정도의 문진 형식으로 행한다. 즉, 의사가 환자에 대하여 질문하고, 환자가 그 질문에 대하여 대답하는 형식의 대화를 반복적으로 행한다. 그리고, 그 때의 대화를 마이크에 의해 입력하고 녹음하고, 사람이 기록하고, 또는 자동음성인식 기술을 사용하여, 일련의 대화(자유회화)의 음성을 문자 데이터로 치환한다. 이 문자 데이터 중, 환자에 의한 발화 부분만을 추출하여 학습용 데이터로 한다. 그리고, 자유회화의 음성을 문자 데이터로 치환할 때, 환자에 의한 발화 부분만을 문자 데이터로 치환하도록 해도 된다.Free conversation between the patient and the doctor is conducted, for example, in the form of a questionnaire for about 5 to 10 minutes. That is, a conversation in the form of a doctor asking a question about a patient and the patient answering the question is repeatedly performed. Then, the conversation at that time is inputted by a microphone and recorded, recorded by a person, or by using automatic speech recognition technology, the voice of a series of conversations (free conversation) is replaced with text data. Of the text data, only the portion uttered by the patient is extracted and used as the learning data. In addition, when replacing the voice of a free conversation with text data, you may make it replace with text data only the part uttered by a patient.

단어 추출부(11A)는, 특허청구의 범위 「요소 추출부」의 일례이며, 학습용 데이터 입력부(10)에 의해 학습용 데이터로서 입력된 m개의 문장을 해석하여, 상기 m개의 문장으로부터 n개(n은 2 이상의 임의의 정수)의 단어(특허청구의 범위의 분해 요소에 상당)를 추출한다. 문장의 해석 방법으로서는, 예를 들면, 공지의 형태소 해석을 사용하는 것이 가능하다. 여기서, 단어 추출부(11A)는, 형태소 해석에 의해 분할되는 모든 품사의 형태소를 단어로서 추출하도록 해도 되고, 특정한 품사의 형태소만을 단어로서 추출하도록 해도 된다.The word extraction unit 11A is an example of the "element extraction unit" in the claims, and analyzes m sentences input as learning data by the learning data input unit 10, and extracts n (n) sentences from the m sentences. extracts the word (corresponding to the decomposition element of the claim) of an arbitrary integer greater than or equal to 2). As an analysis method of a sentence, it is possible to use a well-known morpheme analysis, for example. Here, the word extraction unit 11A may extract all morphemes of the part-of-speech divided by the morpheme analysis as words, or may extract only the morphemes of a specific part-of-speech as words.

그리고, m개의 문장 안에는, 동일한 단어가 복수 포함되어 있는 경우가 있다. 이 경우에, 단어 추출부(11A)는, 동일한 단어를 복수 개 추출하는 않고, 1개만 추출한다. 즉, 단어 추출부(11A)가 추출하는 n개의 단어는, n종류의 단어의 의미이다. 다만, 추출하는 n개의 단어에는, 각각 문장 중에서의 출현 빈도를 나타낸 정보가 부수되어 있다. 여기서, 단어 추출부(11A)는, m개의 문장으로부터 동일한 단어가 추출되는 빈도를 계측하고, 출현 빈도가 큰 쪽부터 n개(n종류)의 단어, 혹은 출현 빈도가 임계값 이상인 n개(n종류)의 단어를 추출하도록 해도 된다.In addition, a plurality of the same word may be included in m sentences. In this case, the word extraction unit 11A does not extract a plurality of identical words, but only one. That is, the n words extracted by the word extraction unit 11A are meanings of n kinds of words. However, information indicating the frequency of appearance in each sentence is attached to the n words to be extracted. Here, the word extraction unit 11A measures the frequency at which the same word is extracted from m sentences, and n words (n types) from the one with the largest appearance frequency, or n words with the appearance frequency greater than or equal to a threshold value (n) type) may be extracted.

인지증에 걸려 있는 환자에게는, 한번 이야기한 것을 몇 번이나 반복하는 경향이 관찰되는 경우가 있다. 또한, 인지증에 걸려 있는 환자에게는, 자발적인 말이 나오기 어렵고, 의사로부터의 질문에 대하여 같은 말을 연발하는 중복 회화(앵무새처럼 한 말을 그대로 흉내내는 것)의 경향이 보이는 경우도 있다. 이 때문에, 단어 추출부(11A)에 의해, 이와 같은 인지증 특유의 회화 특징을 포함한 자유회화의 문장으로부터 n개의 단어가 추출되게 된다.In patients with dementia, a tendency to repeat what has been said once may be observed. Also, in patients suffering from dementia, spontaneous speech is difficult to come out, and there is a tendency for repeated conversations (imitation of words spoken like a parrot) in which the same words are repeated in response to a question from a doctor in some cases. For this reason, by the word extraction part 11A, n words are extracted from the sentence of free conversation which contains the conversation characteristic peculiar to such dementia.

벡터 산출부(12A)는, m개의 문장 및 n개의 단어로부터, m개의 문장 벡터 및 n개의 단어 벡터를 산출한다. 여기서, 문장 벡터 산출부(121)는, 단어 추출부(11A)에 의한 해석 대상이 된 m개의 문장을 각각 소정의 룰에 따라 q차원(q는 2 이상의 임의의 정수)으로 벡터화함으로써, q개의 축 성분으로 이루어지는 m개의 문장 벡터를 산출한다. 또한, 단어 벡터 산출부(122)는, 단어 추출부(11A)에 의해 추출된 n개의 단어를 각각 소정의 룰에 따라 q차원으로 벡터화함으로써, q개의 축 성분으로 이루어지는 n개의 단어 벡터를 산출한다.The vector calculation unit 12A calculates m sentence vectors and n word vectors from the m sentences and n words. Here, the sentence vector calculation unit 121 vectorizes the m sentences, which are the objects of analysis by the word extraction unit 11A, into q dimensions (q is an arbitrary integer of 2 or more) according to a predetermined rule. Calculate m sentence vectors composed of axis components. Further, the word vector calculating unit 122 calculates n word vectors composed of q axis components by vectorizing the n words extracted by the word extracting unit 11A in q dimensions according to a predetermined rule, respectively. .

본 실시형태에서는, 일례로서, 이하와 같이 하여 문장 벡터 및 단어 벡터를 산출한다. 지금, m개의 문장과 n개의 단어로 이루어지는 집합 S=<d∈, w∈>를 생각한다. 여기서, 각 문장 d_i(i=1, 2, ··· , m) 및 각 단어 w_j(j=1, 2, ··· , n)에 대하여 각각 문장 벡터 d_i→ 및 단어 벡터 w_j→(이하에서는, 기호 "→"는 벡터인 것을 지칭함)을 관련시킨다. 그리고, 임의의 단어 w_j와 임의의 문장 d_i에 대하여, 하기 식(1)에 나타낸 확률 P(w_j|d_i)를 계산한다.In the present embodiment, as an example, sentence vectors and word vectors are calculated as follows. Now, consider a set S=<d∈, w∈> consisting of m sentences and n words. Here, for each sentence d _i (i=1, 2, ... , m) and each word w _j (j=1, 2, ... , n), respectively, the sentence vector d _i → and the word vector w _j → (hereafter, the symbol "→" refers to being a vector). Then, for an arbitrary word w _j and an arbitrary sentence d _i _{, the probability P(w j} |d _i ) shown in the following equation (1) is calculated.

[수식 1][Formula 1]

그리고, 이 확률 P(w_j|d_i)은, 예를 들면, 문장이나 문서를 단락(paragraph)·벡터에 의해 평가하는 것에 대하여 기술한 논문 「"Distributed Representations of Sentences and Documents" by Quoc Le and Tomas Mikolov, Google Inc, Proceedings of the 31st International Conference on Machine Learning Held in Bejing, China on 22-24 June 2014」에 개시되어 있는 확률 p에 따라 산출할 수 있는 값이다. 이 논문에는, 예를 들면, "the", "cat", "sat"라는 3개의 단어가 있을 때, 4개째의 단어로서 "on"을 예측한다고 하며, 그 예측 확률 p의 산출식이 게재되고 있다. 상기 논문에 기재되어 있는 확률 p(wt|wt-k, ··· , wt+k)는, 복수의 단어 wt-k, ··· , wt+k로부터 다른 1개의 단어 wt를 예측했을 때의 정답 확률이다.And, this probability P(w _j |d _i ) is, for example, a paper ""Distributed Representations of Sentences and Documents" by Quoc Le and Tomas Mikolov, Google Inc, Proceedings of the 31st International Conference on Machine Learning Held in Bejing, China on 22-24 June 2014” is a value that can be calculated according to the probability p. In this paper, for example, when there are three words "the", "cat", and "sat", it is said that "on" is predicted as the fourth word, and the calculation formula for the prediction probability p is published. . The probability p(wt|wt-k, ... , wt+k) described in the above paper is the probability of predicting another word wt from a plurality of words wt-k, ... , wt+k is the probability of a correct answer.

이에 비해, 본 실시형태에서 사용하는 식(1)에 나타내는 확률 P(w_j|d_i)는, m개의 문장 중 하나의 문장 d_i로부터, n개의 단어 중 하나의 단어 w_j가 예상되는 정답 확률을 나타내고 있다. 1개의 문장 d_i로부터 1개의 단어 w_j를 예측한다는 것은, 구체적으로는, 어떤 문장 d_i가 출현했을 때, 그 중에 단어 w_j가 포함될 가능성을 예측하는 것이다.On the other hand, the probability P(w _j |d _i ) shown in Equation (1) used in this embodiment is the correct answer in which one word w _j among n words is expected from one sentence d _{i among m sentences} represents the probability. Predicting one word w _j from one sentence d _i specifically predicts the possibility that the word w _j will be included in a certain sentence d _{i when it appears.}

식(1)에서는, e를 밑으로 하고, 단어 벡터 w→와 문장 벡터 d→의 내적값을 지수로 하는 지수함수값을 사용한다. 그리고, 예측 대상으로 하는 문장 d_i와 단어 w_j와의 조합으로부터 계산되는 지수함수값과, 문장 d_i와 n개의 단어 w_k(k=1, 2, ··· , n)의 각 조합으로부터 계산되는 n개의 지수함수값의 합계값의 비율을, 하나의 문장 d_i로부터 하나의 단어 w_j가 예상되는 정답 확률로서 계산하고 있다.In Equation (1), an exponential function value is used with e as the base and the dot product of the word vector w→ and the sentence vector d→ as the exponent. Then, the exponential function value calculated from the combination of the sentence d _i and the word w _j _{to be predicted, and the sentence d i} and n words w _k (k = 1, 2, ... , n) are calculated from each combination The ratio of the sum of n exponential function values is calculated as the probability of a correct answer predicted by one word w _j _{from one sentence d i .}

여기서, 단어 벡터 w_j→와 문장 벡터 d_i→의 내적값은, 단어 벡터 w_j→와 문장 벡터 d_i→의 방향으로 투영한 경우의 스칼라값, 즉 단어 벡터 w_j→가 가지고 있는 문장 벡터 d_i→의 방향의 성분값이라고도 할 수 있다. 이는, 단어 w_j가 문장 d_i에 기여하고 있는 정도를 나타내고 있다고 생각할 수 있다. 따라서, 이와 같은 내적을 이용하여 계산되는 지수함수값을 사용하여, n개의 단어 w_k(k=1, 2, ··· , n)에 대하여 계산되는 지수함수값의 합계에 대한, 1개의 단어 w_j에 대하여 계산되는 지수함수값의 비율을 구하는 것은, 1개의 문장 d_i로부터 n개의 단어 중 1개의 단어 w_j가 예상되는 정답 확률을 구하는 것에 상당한다.Here, the word vector w _j → the sentence vector inner product value of d _i → is, word vector w _j → the sentence vector d _i scalar value in the case of projecting the direction of →, i.e., the word vector w _j → have sentence vectors It can also be said to be a component value in the direction of d _{i →.} This can be considered to indicate the degree to which the word w _j contributes to the sentence d _{i .} Therefore, using the exponential function value calculated using such a dot product, one word for the sum of the exponential function values calculated for n _{words w k (k=1, 2, ... , n)} It is to obtain the ratio of the exponential value to be calculated for the w _j, which corresponds to the probability to obtain the correct answer which one word w _j of the n words from the estimated one sentence d _i.

그리고, 식(1)은, d_i와 w_j에 대하여 대칭이므로, n개의 단어 중 하나의 단어 w_j로부터, m개의 문장 중 하나의 문장 d_i가 예상되는 확률 P(d_i|w_j)를 계산해도 된다. 1개의 단어 w_j로부터 1개의 문장 d_i를 예측한다는 것은, 어떤 단어 w_j가 출현했을 때, 그것이 문장 d_i 중에 포함될 가능성을 예측하는 것이다. 이 경우에, 문장 벡터 d_i→와 단어 벡터 w_j→의 내적값은, 문장 벡터 d_i→와 단어 벡터 w_j→의 방향으로 투영한 경우의 스칼라값, 즉 문장 벡터 d_i→가 가지고 있는 단어 벡터 w_j→의 방향의 성분값이라고도 할 수 있다. 이는, 문장 d_i가 단어 w_j에 기여하고 있는 정도를 나타내고 있다고 생각할 수 있다.And, since Equation (1) is symmetric with respect to _{d i} and w _j , the probability P(d _i w _j ) that one sentence d _i _{among m sentences is expected from one word w j among n words} may be calculated. It is predicted that one sentence d _i from one word w _j, word w _j which, when the appearance, to predict the probability that it be included within the text of d _i. In this case, the sentence vector d _i → and word vector inner product value of w _j → is a scalar value of the sentence vector d _i → and word vector w _j → If the projection in the direction of, or sentence vector d _i → have It can also be said to be a component value in the direction of the word vector w _{j →.} This can be considered to indicate the degree to which the sentence d _i contributes to the word w _{j .}

그리고, 여기서는, 단어 벡터 w→와 문장 벡터 d→의 내적값을 지수로 하는 지수함수값을 사용하는 계산예를 나타냈으나, 지수함수값을 사용하는 것을 필수로 하는 것은 아니다. 단어 벡터 w→와 문장 벡터 d→의 내적값을 이용한 계산식이면 되며, 예를 들면, 내적값 그 자체(다만, 내적값이 항상 양의 값이 되도록 하기 위한 소정의 연산(예를 들면, 내적값+1)을 행하는 것을 포함함)의 비율에 의해 확률을 구하도록 해도 된다.Incidentally, here, a calculation example using an exponential function value using the dot product of the word vector w→ and the sentence vector d→ as an exponent is shown, but it is not essential to use the exponential function value. A calculation expression using the dot product of the word vector w→ and the sentence vector d→ is sufficient, for example, the dot product itself (however, a predetermined operation to ensure that the dot product is always a positive value (e.g., the dot product value) You may make it so that a probability is calculated|required by the ratio of +1)).

다음으로, 벡터 산출부(12A)는, 하기 식(2)에 나타낸 바와 같이, 상기 식(1)에 의해 산출되는 확률 P(w_j|d_i)를 모든 집합 S에 대하여 합계한 값 L을 최대화하도록 한 문장 벡터 d_i→ 및 단어 벡터 w_j→를 산출한다. 즉, 문장 벡터 산출부(121) 및 단어 벡터 산출부(122)은, 상기 식(1)에 의해 산출되는 확률 P(w_j|d_i)를, m개의 문장과 n개의 단어의 모든 조합에 대하여 산출하고, 이들을 합계한 값을 목표변수 L로 하고, 상기 목표변수 L을 최대화하는 문장 벡터 d_i→ 및 단어 벡터 w_j→를 산출한다.Next, as shown in the following equation (2), the vector calculating unit 12A calculates the value L obtained by summing the _{probability P(w j} |d _{i ) calculated by the above equation (1) for all sets S, as shown in the following equation (2).} A sentence vector d _i → and a word vector w _j → are calculated to be maximized. That is, the sentence vector calculating unit 121 and the word vector calculating unit 122 apply the probability P(w _j |d _i ) calculated by the above equation (1) to all combinations of m sentences and n words. is calculated, and the sum of them is used as the target variable L, and a sentence vector d _i → and a word vector w _j → maximizing the target variable L are calculated.

[수식 2][Formula 2]

m개의 문장과 n개의 단어의 모든 조합에 대하여 산출한 확률 P(w_j|d_i)의 합계값 L을 최대화한다는 것은, 어떤 문장 d_i(i=1, 2, ··· , m)로부터 어떤 단어 w_j(j=1, 2, ··· , n)가 예상되는 정답 확률을 최대화하는 것이다. 즉, 벡터 산출부(12A)는, 이 정답 확률이 최대화하는 문장 벡터 d_i→ 및 단어 벡터 w_j→를 산출하는 것이라고 할 수 있다.Maximizing the sum value L of _{the probability P(w j} |d _i ) calculated for all combinations of m sentences and n words _{is from a certain sentence d i} (i=1, 2, ... , m) What word w _j (j=1, 2, ... , n) maximizes the expected probability of correct answer. That is, it can be said that the vector calculation unit 12A calculates the sentence vector d _i → and the word vector w _{j → in which the probability of correct answer is maximized.}

여기서, 본 실시형태에서는, 전술한 바와 같이, 벡터 산출부(12A)는, m개의 문장 d_i를 각각 q차원으로 벡터화함으로써, q개의 축 성분으로 이루어지는 m개의 문장 벡터 d_i→를 산출하고 또한, n개의 단어를 각각 q차원으로 벡터화함으로써, q개의 축 성분으로 이루어지는 n개의 단어 벡터 w_j→를 산출한다. 이는, q개의 축 방향을 가변으로 하고, 전술한 목표변수 L이 최대화하는 문장 벡터 d_i→ 및 단어 벡터 w_j→를 산출하는 것에 상당한다.Here, in the present embodiment, as described above, the vector calculating unit 12A calculates _{m sentence vectors d i} → composed of q axial components by vectorizing the _{m sentences d i in the q dimension, and} , , and n words are vectorized in q dimensions to calculate _{n word vectors w j → composed of q axis components.} This corresponds to calculating the _{sentence vector d i} → and the word vector w _j → maximizing the above-described target variable L with variable q axis directions.

지표값 산출부(13A)는, 벡터 산출부(12A)에 의해 산출된 m개의 문장 벡터 d_i→와 n개의 단어 벡터 w_j→의 내적을 각각 계산함으로써, m개의 문장 d_i 및 n개의 단어 w_j 사이의 관계성을 반영한 m×n개의 관계성 지표값을 산출한다. 본 실시형태에서는, 지표값 산출부(13A)는, 하기 식(3)에 나타낸 바와 같이, m개의 문장 벡터 d_i→와 각 q개의 축 성분(d₁₁∼d_mq)을 각 요소로 하는 문장행렬 D와, n개의 단어 벡터 w_j→의 각 q개의 축 성분(w₁₁∼w_nq)을 각 요소로 하는 단어행렬 W의 곱을 계산함으로써, m×n개의 관계성 지표값을 각 요소로 하는 지표값 행렬 DW를 산출한다. 여기서, W^t는 단어행렬의 전치행렬이다.The index value calculation unit 13A calculates _{the dot product of the m sentence vectors d i} → and the n word vectors w _j → calculated by the vector calculation unit 12A, respectively, so that the m sentences d _i and n words are calculated. m × n relation index values reflecting the relation between w _{j are calculated.} In the present embodiment, the index value calculation unit 13A is a sentence using m sentence vectors d _i → and q axial components d _{11 to} d _mq as elements as shown in the following formula (3). matrix D and, for the n-word word vector by calculating a product of a matrix w, m × n number of relationships index value for each of the q-axis component (w ₁₁ ~w _nq) by each element of w → _j as the elements Calculate the index value matrix DW. Here, W ^t is the transpose matrix of the word matrix.

[수식 3][Equation 3]

이와 같이 하여 산출된 지표값 행렬 DW의 각 요소 dw_ij(i=1, 2, ··· , m, j=1, 2, ··· , n)는, 어느 단어가 어느 문장에 대하여 어느 정도 기여하고 있는 지를 나타낸 것이라고 할 수 있다. 예를 들면, 1행 2열의 요소 dw₁₂는, 단어 w₂가 문장 d₁에 대하여 어느 정도 기여하고 있는 것인가를 나타낸 값이다. 이로써, 지표값 행렬 DW의 각 행은 문장의 유사성을 평가하는 것으로서 사용하는 것이 가능하며, 각각의 열은 단어의 유사성을 평가하는 것으로서 사용하는 것이 가능하다. _{Each element dw ij} (i = 1, 2, ... , m, j = 1, 2, ... , n) of the index value matrix DW calculated in this way is, to some extent, a certain word with respect to a certain sentence. It can be said to indicate that they are contributing. For example, the element dw ₁₂ _{in row 1 and column 2} is a value indicating how much the word w _{2 contributes to the sentence d 1 .} Thus, each row of the index value matrix DW can be used as evaluating the similarity of sentences, and each column can be used as evaluating the similarity of words.

예측 모델 생성부(14A)는, 지표값 산출부(13A)에 의해 산출된 m×n개의 관계성 지표값을 사용하여, 1개의 문장 d_i에 대하여 n개의 관계성 지표값 dw_ij(j=1, 2, ··· , n)로 이루어지는 문장지표값군를 바탕으로 인지증의 중증도를 예측하기 위한 예측 모델을 생성한다. 여기서 예측하는 인지증의 중증도는, MMSE의 스코어의 값이다. 즉, 예측 모델 생성부(14A)는, MMSE의 스코어가 기지(旣知)(예를 들면, x점)인 환자의 자유회화를 바탕으로 산출되는 문장지표값군에 대해서는 x점에 가능한 근접한 스코어가 예측되는 예측 모델을 생성한다. 그리고, 예측 모델 생성부(14A)는, 생성한 예측 모델을 예측 모델 기억부(30A)에 기억시킨다.The predictive model generating unit 14A uses the m × n relational index values calculated by the index value calculating unit 13A, and for one sentence d _i , n relational index values dw _ij (j= A predictive model for predicting the severity of dementia is generated based on the sentence index value group consisting of 1, 2, ... , n). The severity of dementia predicted here is the value of the MMSE score. That is, the predictive model generation unit 14A, for the sentence index value group calculated based on the free conversation of the patient whose MMSE score is known (for example, x point), a score as close as possible to the x point. Create a predictive model to be predicted. Then, the predictive model generation unit 14A stores the generated predictive model in the predictive model storage unit 30A.

도 2는, 문장지표값군을 설명하기 위한 도면이다. 도 2에 나타낸 바와 같이, 문장지표값군은, 예를 들면, 첫번째 문장 d₁인 경우, 지표값 행렬 DW의 1행째에 포함되는 n개의 관계성 지표값 dw₁₁∼dw_1n이 이것에 해당한다. 마찬가지로, 두 번째 문장 d₂인 경우, 지표값 행렬 DW의 2행째에 포함되는 n개의 관계성 지표값 dw₂₁∼dw_2n이 이것에 해당한다. 이하, m번째의 문장 d_m에 대한 문장지표값군(n개의 관계성 지표값 dw_m1∼dw_mn)까지 동일하다.2 is a diagram for explaining a sentence index value group. As shown in FIG. 2 , the sentence index value group corresponds to, for example, the first sentence d ₁ _{, n relational index values dw 11 to} _{dw 1n} included in the first row of the index value matrix DW. Similarly, in the case of the second sentence d ₂ _{, n relational index values dw 21 to} _{dw 2n} included in the second row of the index value matrix DW correspond to this. Hereinafter, the same applies to the sentence index value group (n relational index values dw _{m1 to} _{dw mn} ) for the m-th sentence d _{m .}

예측 모델 생성부(14A)는, 지표값 산출부(13A)로 의해 산출된 m×n개의 관계성 지표값 dw₁₁∼dw_mn을 사용하여, 각 문장 d_i(i=1, 2, ··· , m)의 문장지표값군에 대하여 각각 인지증의 중증도와 관련이 있는 특징량을 산출하고, 상기 산출한 특징량에 기초하여, 1개의 문장지표값군으로부터 인지증의 중증도를 예측하기 위한 예측 모델을 생성한다. 기서, 예측 모델 생성부(14A)가 생성하는 예측 모델은, 문장 d_i의 문장지표값군을 입력으로 하여, MMSE의 스코어를 출력하는 학습 모델이다.The predictive model generation unit 14A uses the m × n relationship index values dw _{11 to} _{dw mn} calculated by the index value calculation unit 13A for each sentence d _i (i = 1, 2, ... A predictive model for calculating a feature amount related to the severity of dementia for each sentence index value group of , m), and predicting the severity of dementia from one sentence index value group based on the calculated feature amount create Note that the predictive model generated by the predictive model generation unit 14A is a learning model that outputs a score of the MMSE by inputting the sentence index value group of the sentence _{d i as an input.}

예를 들면, 예측 모델 생성부(14A)가 생성하는 예측 모델의 형태는, 회귀 모델(선형회귀, 로지스틱(logistic)회귀, 서포트벡터머신 등을 베이스로 하는 학습 모델), 트리 모델(결정 트리, 회귀 트리, 랜덤 포레스트, 구배(勾配) 부스팅 트리 등을 베이스로 하는 학습 모델), 뉴럴 네트워크 모델(퍼셉트론, 컨벌루션 뉴럴 네트워크, 재기형 뉴럴 네트워크, 잔차(殘差) 네트워크, RBF 네트워크, 확률적 뉴럴 네트워크, 스파이킹 뉴럴 네트워크, 복소(複素) 뉴럴 네트워크 등을 베이스로 하는 학습 모델), 베이즈 모델(베이즈 추론 등을 베이스로 하는 학습 모델), 클러스터링 모델(k최근접 이웃 알고리즘, 계층(階層)형 클러스터링, 비계층형 클러스터링, 토픽 모델 등을 베이스로 하는 학습 모델) 등 중 어느 하나로 하는 것이 가능하다. 그리고, 여기에서 예로 든 분류 모델은 일례에 지나지 않으며, 이것으로 한정되는 것은 아니다.For example, the form of the predictive model generated by the predictive model generation unit 14A is a regression model (a learning model based on linear regression, logistic regression, support vector machine, etc.), a tree model (decision tree, Learning models based on regression trees, random forests, gradient boosting trees, etc.), neural network models (perceptrons, convolutional neural networks, reconstructed neural networks, residual networks, RBF networks, stochastic neural networks, etc.) , spiking neural networks, learning models based on complex neural networks, etc.), Bayesian models (learning models based on Bayesian inference, etc.), clustering models (k-nearest neighbor algorithm, hierarchical) It is possible to use either type clustering, non-hierarchical clustering, or a learning model based on a topic model). In addition, the classification model cited here is only an example, and is not limited to this.

예측 모델 생성부(14A)가 예측 모델을 생성할 때 산출하는 특징량은, 소정의 알고리즘에 의해 산출되면 된다. 바꾸어 말하면, 예측 모델 생성부(14A)에 있어서 행하는 특징량의 산출 방법은, 임의로 설계가 가능하다. 예를 들면, 예측 모델 생성부(14A)는, 각 문장 d_i의 문장지표값군의 각각에 대하여, 가중치 계산에 의해 얻어지는 값이 인지증의 중증도를 나타낸 기지의 값(MMSE의 스코어)에 근접하도록 소정의 가중치 계산을 행하고, 문장지표값군에 대한 가중치를 특징량으로서 사용하여, 문장 d_i의 문장지표값군으로부터 인지증의 중증도(MMSE의 스코어)을 예측하기 위한 예측 모델을 생성한다.What is necessary is just to calculate the feature quantity calculated when the predictive model generation part 14A generate|occur|produces a predictive model by a predetermined algorithm. In other words, the calculation method of the feature amount performed in the predictive model generation unit 14A can be arbitrarily designed. For example, the predictive model generation unit 14A, for each _{of the sentence index value groups of each sentence d i} , is configured such that a value obtained by weight calculation is close to a known value (MMSE score) indicating the severity of dementia. A predictive model for predicting the severity of dementia (score of MMSE) is generated from the sentence index value group of the sentence _{d i} by performing a predetermined weight calculation and using the weight for the sentence index value group as a feature quantity.

즉, 지표값 행렬 DW의 1행째에 포함되는 n개의 관계성 지표값 dw₁₁∼dw_1n으로 이루어지는 첫번째의 문장 d₁의 문장지표값군에 대하여,That is, for the sentence index value group of the first sentence d ₁ _{consisting of n relational index values dw 11 to} _{dw 1n} included in the first row of the index value matrix DW,

a₁₁·dw₁₁+a₁₂·dw₁₂+… a_1n·dw_1n≒MMSE의 기지의 스코어가 되도록 가중치{a₁₁, a₁₂, ··· , a_1n}을 특징량으로서 산출한다. 또한, 지표값 행렬 DW의 2행째에 포함되는 n개의 관계성 지표값 dw₂₁∼dw_2n으로 이루어지는 두 번째 문장 d₂의 문장지표값군에 대하여,a ₁₁ ·dw ₁₁ +a ₁₂ ·dw ₁₂ +… The _{weight {a 11} , a ₁₂ , ... , a _1n } is calculated as a feature quantity _{so that a 1n} ·dw _1n ≒MMSE is a known score. In addition, with respect to the sentence index value group of the second sentence d ₂ consisting of _n _{relational index values dw 21 to} dw 2n included in the second row of the index value matrix DW,

a₂₁·dw₂₁+a₂₂·dw₂₂+… a_2n·dw_2n≒MMSE의 기지의 스코어가 되도록 가중치{a₂₁, a₂₂, ··· , a_2n}을 특징량으로서 산출한다. 이하 마찬가지로, m개째의 문장 d_m의 문장지표값군에 대하여,a ₂₁ ·dw ₂₁ +a ₂₂ ·dw ₂₂ +… A _{weight {a 21} , a ₂₂ , ... , a _2n } is calculated as a feature quantity _{so that a 2n} ·dw _2n ≒MMSE is a known score. Similarly, for the sentence index value group of the _{m-th sentence d m,}

a_m1·dw_m1+a_m2·dw_m2+… a_mn·dw_mn≒MMSE의 기지의 스코어가 되도록 가중치{a_m1, a_m2, ··· , a_mn}을 특징량으로서 산출한다. 그리고, 이들 특징량이 각각 MMSE의 기지의 스코어에 관련되는 예측 모델을 생성한다.a _m1 dw _m1 +a _m2 dw _m2 +… A _{weight {a m1} , a _m2 , ... , a _mn } is computed as a feature quantity so that it may become a known score of _{a mn} ·dw _{mn ≒MMSE.} Then, each of these feature quantities generates a predictive model that is related to a known score of the MMSE.

그리고, 여기서는 m조의 가중치{a₁₁, a₁₂, ··· , a_1n}, ··· , {a_m1, a_m2, ··· , a_mn}을 각각 특징량으로서 사용하는 예에 대하여 설명하였으나, 이것으로 한정되지 않는다. 예를 들면, m인의 환자에 대한 m개의 학습용 데이터로부터 얻어지는 m개의 문장지표값군 중, MMSE의 스코어가 동일한 환자의 학습용 데이터로부터 얻어지는 문장지표값군끼리에 공통되는 특징을 가진 1개 또는 복수의 가중치, 또는 상기 복수의 가중치를 사용한 소정의 연산값 등을 특징량으로서 추출하도록 해도 된다.Incidentally, here, an example in which m sets of weights {a ₁₁ , a ₁₂ , ... , a _1n }, ... , {a _m1 , a _m2 , ... , a _mn } is used as a feature will be described. However, it is not limited to this. For example, among m sentence index value groups obtained from m learning data for m patients, one or more weights having a characteristic common to sentence index value groups obtained from learning data for patients with the same MMSE score; Alternatively, a predetermined calculated value using the plurality of weights may be extracted as a feature quantity.

예측용 데이터 입력부(20)는, 예측 대상으로 하는 m' 인(m'는 1 이상의 임의의 정수)의 환자가 행한 자유회화의 내용을 각각 나타낸 m'개의 문장을 예측용 데이터로서 입력한다. 즉, 예측용 데이터 입력부(20)는, MMSE의 스코어가 미지인 환자와 의사 사이에서 행해진 자유회화의 음성을 문자 데이터로 치환하고, 상기 문자 데이터에 포함되는 환자의 발화 부분의 문장을 예측용 데이터로서 입력한다. 예측 대상의 환자에 대하여 의사와의 자유회화로부터 m'개의 문장을 취득하는 방법은, 학습 대상의 환자에 대하여 의사와의 자유회화로부터 m개의 문장을 취득하는 전술한 방법과 동일하다.The prediction data input unit 20 inputs, as prediction data, m' sentences each representing the contents of free conversations performed by the patient m' as the prediction target (m' is an arbitrary integer greater than or equal to 1). That is, the prediction data input unit 20 substitutes text data for voices of free conversations between a patient and a doctor whose MMSE score is unknown, and replaces the sentences of the patient's utterances included in the text data with the prediction data. input as The method of acquiring m' sentences from free conversation with a doctor with respect to the patient to be predicted is the same as the above-described method for acquiring m sentences from free conversation with the doctor for the patient as a learning target.

예측 대상으로 하는 환자는, 초진 환자라도 되고, 인지증의 의혹이 있다고 진단된 재진 환자라도 된다. 초진 환자를 예측 대상으로 하는 경우에는, 환자에 대하여 MMSE를 실시하지 않고, 환자와 의사 사이에서 문진에 의한 자유회화를 행하는 것뿐이며, 그 환자가 인지증의 의혹이 있는 지의 여부나, 인지증인 경우에는 그 중증도를 이하에 설명하는 바와 같이 예측할 수 있다. 한편, 재진 환자를 예측 대상으로 하는 경우도, 그 환자에 대하여 MMSE를 실시하지 않고, 환자와 의사 사이에서 문진에 의한 자유회화를 행하는 것만으로 인지증의 중증도를 예측할 수 있다. 이로써, 환자의 MMSE에 대한 연습 효과에 의한 영향을 받지 않고, 증상이 개선 또는 악화하고 있는 지를 판정하는 것이 가능하다.The patient made into the prediction object may be a first-visit patient, or a revisited patient diagnosed with suspicion of dementia may be sufficient as it. In the case of a first-time patient as a predictive target, no MMSE is performed on the patient, only free conversation is conducted between the patient and the doctor through an interview, and whether the patient is suspected of dementia, or if the patient is dementia Its severity can be predicted as described below. On the other hand, even when a revisited patient is a prediction target, the severity of dementia can be predicted only by conducting a free conversation between a patient and a doctor through a questionnaire, without performing an MMSE on the patient. Thereby, it is possible to determine whether a symptom is improving or worsening, without being affected by the effect of the practice with respect to the MMSE of a patient.

인지증 예측부(21A)는, 예측용 데이터 입력부(20)에 의해 입력된 예측용 데이터에 대하여 단어 추출부(11A), 문장 벡터 산출부(121), 단어 벡터 산출부(122) 및 지표값 산출부(13A)의 처리를 실행함으로써 얻어지는 관계성 지표값을, 예측 모델 생성부(14A)에 의해 생성된 예측 모델(예측 모델 기억부(30A)에 기억된 예측 모델)에 적용함으로써, 예측 대상으로 하는 m' 인의 환자에 대하여 인지증의 중증도를 예측한다.The dementia prediction unit 21A includes a word extraction unit 11A, a sentence vector calculation unit 121, a word vector calculation unit 122, and an index value with respect to the prediction data input by the prediction data input unit 20 . A prediction target by applying the relation index value obtained by executing the processing of the calculation unit 13A to the prediction model generated by the prediction model generation unit 14A (the prediction model stored in the prediction model storage unit 30A). Predict the severity of dementia for patients with m'

예를 들면, MMSE의 스코어가 미지인 m' 인의 환자의 자유회화로부터 취득된 m'개의 문장이 예측용 데이터 입력부(20)에 의해 예측용 데이터로서 입력된 경우, 인지증 예측부(21A)의 지시에 따라 이 m'개의 문장에 대하여 관계성 지표값 산출부(100A)의 처리를 실행함으로써, m'개의 문장지표값군을 얻는다. 인지증 예측부(21)는, 관계성 지표값 산출부(100A)에 의해 산출된 m'개의 문장지표값군을 예측 모델에 입력 데이터로서 부여함으로써, m' 인의 환자에 대한 인지증의 중증도를 각각 예측한다.For example, when m' sentences obtained from free conversation of a patient whose MMSE score is unknown m' are inputted as prediction data by the prediction data input unit 20, the cognitive impairment prediction unit 21A By executing the processing of the relation index value calculating unit 100A for the m' sentences according to the instruction, m' sentence index value groups are obtained. The dementia prediction unit 21 assigns the m' sentence index value groups calculated by the relationship index value calculation unit 100A as input data to the predictive model, so as to determine the severity of dementia for the m' patient, respectively. predict

이 예측 시에 단어 추출부(11A)는, 예측용 데이터 입력부(20)에 의해 예측용 데이터로서 입력된 m'개의 문장으로부터 n개의 단어를 추출한다. 단어 추출부(11A)가 예측 시에 m'개의 문장으로부터 추출하는 단어의 수는, 단어 추출부(11A)가 학습 시에 m개의 문장으로부터 추출하는 단어의 수 n과 동일하다. 그리고, 예를 들면, m'=1의 경우, 즉 1인의 환자의 자유회화에 의한 1개의 문장으로부터 n개의 단어를 추출하는 경우도 있다. 이에, 5∼10 분 정도의 문진 형식에 의한 자유회화 중에서 1인의 환자가 발언할 단어의 표준적인 종류를 미리 상정하고, 예측용 데이터에 1개의 문장으로부터 추출되는 n개의 단어와, 학습용 데이터의 m개의 문장으로부터 추출되는 n개의 단어와의 중첩(동일한 단어일 것)이 하나도 없는 사태가 생기지 않도록 n의 값을 결정하는 것이 바람직하다.At the time of this prediction, the word extraction part 11A extracts n words from the m' sentences input by the data input part 20 for prediction as data for prediction. The number of words extracted by the word extracting unit 11A from m' sentences in prediction is equal to the number n of words extracted by the word extracting unit 11A from m sentences in learning. And, for example, in the case of m'=1, that is, there is a case where n words are extracted from one sentence by the free conversation of one patient. Therefore, it is assumed in advance that the standard kind of words to be spoken by one patient among free conversations in the form of a questionnaire for about 5 to 10 minutes, n words extracted from one sentence in the prediction data, and m of the learning data It is preferable to determine the value of n so that there is no overlapping (the same word) with n words extracted from sentences.

또한, 예측 시에 문장 벡터 산출부(121)는, m'개의 문장을 각각 소정의 룰에 따라 q차원으로 벡터화함으로써, q개의 축 성분으로 이루어지는 m'개의 문장 벡터를 산출한다. 단어 벡터 산출부(122)은, n개의 단어를 각각 소정의 룰에 따라 q차원으로 벡터화함으로써, q개의 축 성분으로 이루어지는 n개의 단어 벡터를 산출한다. 지표값 산출부(13A)는, m'개의 문장 벡터와 n개의 단어 벡터의 내적을 각각 계산함으로써, m'개의 문장 및 n개의 단어 사이의 관계성을 반영한 m'×n개의 관계성 지표값을 산출한다. 인지증 예측부(21A)는, 지표값 산출부(13A)에 의해 산출된 m'개의 관계성 지표값을, 예측 모델 기억부(30A)에 기억된 예측 모델에 적용함으로써, 예측 대상으로 하는 m' 인의 환자에 대하여 인지증의 중증도를 예측한다.In addition, at the time of prediction, the sentence vector calculating unit 121 calculates m' sentence vectors composed of q axial components by vectorizing the m' sentences in q dimensions according to a predetermined rule, respectively. The word vector calculation unit 122 calculates n word vectors composed of q axis components by vectorizing each of the n words in q dimensions according to a predetermined rule. The index value calculation unit 13A calculates the dot product of the m' sentence vectors and the n word vectors, respectively, so that m' × n relation index values reflecting the relationship between the m' sentences and the n words are calculated. Calculate. The dementia prediction unit 21A applies the m' number of relational index values calculated by the index value calculation unit 13A to the predictive model stored in the predictive model storage unit 30A, so that m as a prediction target ' Predicting the severity of dementia for a patient.

그리고, 예측 시에서의 연산 부하의 경감을 목적으로 하여, 단어 벡터 산출부(122)에 의한 단어 벡터의 산출을 생략하고, 학습 시에 산출한 n개의 단어 벡터를 기억해 두고, 이것을 예측 시에 사용하도록 해도 된다. 이와 같이, 예측에 있어서 단어 벡터 산출부(122)이, 학습 시에 산출한 n개의 단어 벡터를 판독하여 이용하는 처리도, 예측용 데이터에 대하여 문장 벡터 산출부(122)의 처리를 실행하는 것의 일태양으로서 포함한다.Then, for the purpose of reducing the computational load at the time of prediction, the word vector calculation by the word vector calculation unit 122 is omitted, the n word vectors calculated at the time of learning are stored, and these are used at the time of prediction. you can do it In this way, in prediction, the processing by which the word vector calculation unit 122 reads and uses the n word vectors calculated at the time of learning is also one of executing the processing of the sentence vector calculation unit 122 on the prediction data. Included as the sun.

도 3은, 상기한 바와 같이 구성한 제1 실시예에 의한 인지증 예측 장치의 동작예를 나타내는 흐름도이다. 도 3의 (a)는, 예측 모델을 생성하는 학습 시의 동작예를 나타내고, 도 3의 (b)는, 생성된 예측 모델을 사용하여 인지증의 중증도의 예측을 행하는 예측 시의 동작예를 나타내고 있다.3 is a flowchart showing an operation example of the apparatus for predicting dementia according to the first embodiment configured as described above. Fig. 3 (a) shows an operation example at the time of learning to generate a predictive model, and Fig. 3 (b) shows an operation example at the time of prediction to predict the severity of dementia using the generated predictive model. is indicating

도 3의 (a)에 나타낸 학습에 있어서, 먼저, 학습용 데이터 입력부(10)는, 인지증의 중증도(MMSE의 스코어)가 기지인 m인의 환자가 행한 자유회화의 내용을 각각 나타낸 m개의 문장을 학습용 데이터로서 입력한다(단계 S1). 단어 추출부(11A)는, 학습용 데이터 입력부(10)에 의해 입력된 m개의 문장을 해석하여, 상기 m개의 문장으로부터 n개의 단어를 추출한다(단계 S2).In the learning shown in Fig. 3(a), first, the data input unit 10 for learning receives m sentences each representing the contents of free conversations performed by m patients whose severity of dementia (score of MMSE) is known. It is input as data for learning (step S1). The word extraction unit 11A analyzes the m sentences input by the learning data input unit 10 and extracts n words from the m sentences (step S2).

다음으로, 벡터 산출부(12A)는, 학습용 데이터 입력부(10)에 의해 입력된 m개의 문장 및 단어 추출부(11A)에 의해 추출된 n개의 단어로부터, m개의 문장 벡터 d_i→ 및 n개의 단어 벡터 w_j→를 산출한다(단계 S3). 그리고, 지표값 산출부(13A)는, m개의 문장 벡터 d_i→와 n개의 단어 벡터 w_j→의 내적을 각각 계산함으로써, m개의 문장 d_i 및 n개의 단어 w_j 사이의 관계성을 반영한 m×n개의 관계성 지표값(m×n개의 관계성 지표값을 각 요소로 하는 지표값 행렬 DW)을 산출한다(단계 S4).Next, the vector calculation unit 12A, from the m sentences input by the learning data input unit 10 and the n words extracted by the word extraction unit 11A, m sentence vectors d _i → and n A word vector w _j → is calculated (step S3). Then, the index value calculation unit 13A calculates the dot product of the _{m sentence vectors d i} → and the n word vectors w _j →, respectively, reflecting the relationship between the _{m sentences d i} and the n words w _j mxn relational index values (index value matrix DW having mxn relational index values as elements) are calculated (step S4).

또한, 예측 모델 생성부(14A)는, 이상과 같이 하여 m인의 환자에 대한 학습용 데이터로부터 관계성 지표값 산출부(100A)에 의해 산출된 m×n개의 관계성 지표값을 사용하여, 1개의 문장 d_i에 대하여 n개의 관계성 지표값 dw_ij로 이루어지는 문장지표값군를 바탕으로 인지증의 중증도를 예측하기 위한 예측 모델을 생성하고, 생성한 예측 모델을 예측 모델 기억부(30A)에 기억시킨다(단계 S5). 이상에 의해, 학습 시의 동작이 종료한다.In addition, the predictive model generation unit 14A uses m × n relation index values calculated by the relation index value calculation unit 100A from the learning data for m patients as described above, and uses one A predictive model for predicting the severity of dementia is generated based on the sentence index value group consisting of n relational index values dw _ij with respect to the sentence d _{i , and the generated predictive model is stored in the predictive model storage unit 30A} (Step S5). As a result, the operation at the time of learning ends.

도 3의 (b)에 나타낸 예측에 있어서, 먼저, 예측용 데이터 입력부(20)는, 예측 대상으로 하는 m' 인의 환자가 행한 자유회화의 내용을 각각 나타낸 m'개의 문장을 예측용 데이터로서 입력한다(단계 S11). 인지증 예측부(21A)는, 예측용 데이터 입력부(20)에 의해 입력된 예측용 데이터를 관계성 지표값 산출부(100A)에 공급하고, 관계성 지표값의 산출을 지시한다.In the prediction shown in Fig. 3(b), first, the prediction data input unit 20 inputs m' sentences each representing the contents of free conversations performed by the m' patients as prediction data as prediction data. (step S11). The dementia prediction unit 21A supplies the prediction data input by the prediction data input unit 20 to the relation index value calculation unit 100A, and instructs the calculation of the relation index value.

이 지시에 따라, 단어 추출부(11A)는, 예측용 데이터 입력부(20)에 의해 입력된 m'개의 문장을 해석하고, 상기 m'개의 문장으로부터 n개의 단어를 추출한다(단계 S12). 다음으로, 벡터 산출부(12A)는, 예측용 데이터 입력부(20)에 의해 입력된 m'개의 문장 및 단어 추출부(11A)에 의해 추출된 n개의 단어로부터, m'개의 문장 벡터 d_i→ 및 n개의 단어 벡터 w_j→를 산출한다(단계 S13).In accordance with this instruction, the word extraction unit 11A analyzes m' sentences input by the prediction data input unit 20, and extracts n words from the m' sentences (step S12). Next, the vector calculating unit 12A, from the m' sentences input by the prediction data input unit 20 and the n words extracted by the word extracting unit 11A, m' sentence vectors d _i → and n word vectors w _j → are calculated (step S13).

그리고, 지표값 산출부(13A)는, m'개의 문장 벡터 d_i→와 n개의 단어 벡터 w_j→의 내적을 각각 계산함으로써, m'개의 문장 d_i 및 n개의 단어 w_j 사이의 관계성을 반영한 m'×n개의 관계성 지표값(m'×n개의 관계성 지표값을 각 요소로 하는 지표값 행렬 DW)을 산출한다(단계 S14). 지표값 산출부(13A)는, 산출한 m'×n개의 관계성 지표값을 인지증 예측부(21A)에 공급한다.Then, the index value calculation unit 13A calculates _{the dot product of the m' sentence vectors d i} → and the n word vectors w _j →, respectively, so that the relationship between the _{m' sentences d i} and the n words w _j m'xn relational index values (index value matrix DW having m'xn relational index values as elements) are calculated (step S14). The index value calculation unit 13A supplies the calculated m'×n relational index values to the dementia prediction unit 21A.

인지증 예측부(21A)는, 관계성 지표값 산출부(100A)로부터 공급된 m'×n개의 관계성 지표값을 예측 모델 기억부(30A)에 기억된 예측 모델에 적용함으로써, 예측 대상으로 하는 m' 인의 환자에 대하여 인지증의 중증도를 예측한다(단계 S15). 이로써, 예측 시의 동작이 종료한다.The dementia prediction unit 21A applies the m'×n relational index values supplied from the relational index value calculation unit 100A to the predictive model stored in the predictive model storage unit 30A, thereby becoming a prediction target. The severity of cognitive impairment is predicted for the patients of m' (step S15). Thereby, the operation at the time of prediction ends.

이상 상세하게 설명한 바와 같이, 제1 실시예에서는, 인지증의 중증도가 기지인 환자가 행한 자유회화의 내용을 나타내는 m개의 문장을 학습용 데이터로서 입력하고, 상기 입력된 문장으로부터 산출한 문장 벡터와, 문장 내에 포함되는 단어로부터 산출한 단어 벡터의 내적을 계산함으로써, 문장 및 단어 사이의 관계성을 반영한 관계성 지표값을 산출하고, 이 관계성 지표값을 사용하여 예측 모델을 생성하고 있다. 또한, 예측 대상으로 하는 환자에 대하여 인지증의 중증도를 예측할 때는, 예측 대상으로 하는 환자가 행한 자유회화의 내용을 나타내는 m'개의 문장을 예측용 데이터로서 입력하고, 상기 입력한 예측용 데이터로부터 동일하게 하여 산출되는 관계성 지표값을 예측 모델에 적용함으로써, 예측 대상으로 하는 환자의 인지증 중증도를 예측하도록 하고 있다.As described in detail above, in the first embodiment, m sentences representing the content of free conversation performed by a patient with a known severity of dementia are input as learning data, and a sentence vector calculated from the inputted sentences; By calculating the dot product of the word vector calculated from the words included in the sentence, a relation index value reflecting the relation between the sentence and the word is calculated, and a predictive model is generated using the relation index value. In addition, when predicting the severity of dementia with respect to the prediction target patient, m' sentences representing the contents of free conversation performed by the prediction target patient are input as prediction data, and the same is obtained from the input prediction data. By applying the relation index value calculated by making the patient to the prediction model, the severity of cognitive impairment of the patient to be predicted is predicted.

이와 같이 구성한 제1 실시예에 의하면, 환자가 행한 자유회화를 해석함으로써 인지증의 중증도가 예측되므로, 미니 멘탈 스테이트 검사(MMSE)를 행할 필요가 없다. 이 때문에, 인지증의 중증도를 반복적으로 측정하는 경우라도, 환자에 의한 연습 효과를 배제한 측정 결과(예측 결과)를 얻을 수 있다. 특히, 환자가 인지증에 걸려 있으면, 반복 발언되는 단어 등을 포함하는 인지증 특유의 회화 특징이 자유회화 중에 보이게 되고, 이와 같은 회화 특징이 반영된 상태로 관계성 지표값이 산출되고, 상기 관계성 지표값을 사용하여 예측 모델이 생성되므로, 환자가 행한 자유회화로부터 인지증의 중증도를 예측할 수 있다.According to the first embodiment configured in this way, since the severity of dementia is predicted by analyzing the free conversation performed by the patient, there is no need to perform the mini mental state examination (MMSE). For this reason, even when it is a case of repeatedly measuring the severity of dementia, the measurement result (prediction result) which excluded the exercise effect by a patient can be obtained. In particular, if the patient is suffering from dementia, speech characteristics peculiar to dementia, including words repeatedly spoken, etc., are seen during free conversation, and a relation index value is calculated in a state in which such conversation characteristics are reflected, and the relation Since a predictive model is generated using the index value, the severity of dementia can be predicted from the free conversation performed by the patient.

(제2 실시형태)(Second embodiment)

다음으로, 본 발명에 의한 제2 실시형태를 도면에 기초하여 설명한다. 도 4는, 제2 실시형태에 의한 인지증 예측 장치의 기능 구성예를 나타내는 블록도이다. 이 도 4에 있어서, 도 1에 나타낸 부호와 동일한 부호를 부여한 것은 동일한 기능을 가지는 것이므로, 여기서는 중복되는 설명을 생략한다.Next, a second embodiment according to the present invention will be described with reference to the drawings. 4 is a block diagram showing an example of the functional configuration of the dementia prediction apparatus according to the second embodiment. In this FIG. 4, since the code|symbol same as the code|symbol shown in FIG. 1 has the same function, the overlapping description is abbreviate|omitted here.

도 4에 나타낸 바와 같이, 제2 실시형태에 의한 인지증 예측 장치는, 관계성 지표값 산출부(100A), 예측 모델 생성부(14A), 인지증 예측부(21A) 및 예측 모델 기억부(30A) 대신, 관계성 지표값 산출부(100B), 예측 모델 생성부(14B), 인지증 예측부(21B) 및 예측 모델 기억부(30B)를 구비하고 있다. 제2 실시형태에 의한 관계성 지표값 산출부(100B)는, 단어 추출부(11A), 벡터 산출부(12A) 및 지표값 산출부(13A) 대신, 품사 추출부(11B), 벡터 산출부(12B) 및 지표값 산출부(13B)를 구비하고 있다. 벡터 산출부(12B)는, 보다 구체적인 기능 구성으로서, 단어 벡터 산출부(122) 대신 품사 벡터 산출부(123)를 구비하고 있다. 그리고, 학습용 데이터 입력부(10), 관계성 지표값 산출부(100B) 및 예측 모델 생성부(14B)에 의해, 본 발명의 예측 모델 생성 장치가 구성된다.As shown in Fig. 4 , the dementia prediction device according to the second embodiment includes a relation index value calculation unit 100A, a predictive model generation unit 14A, a dementia prediction unit 21A, and a predictive model storage unit ( Instead of 30A), a relation index value calculation unit 100B, a predictive model generation unit 14B, a dementia prediction unit 21B, and a predictive model storage unit 30B are provided. The relation index value calculation unit 100B according to the second embodiment includes a part-of-speech extraction unit 11B and a vector calculation unit instead of the word extraction unit 11A, the vector calculation unit 12A, and the index value calculation unit 13A. 12B and an index value calculation unit 13B are provided. The vector calculation unit 12B includes a part-of-speech vector calculation unit 123 instead of the word vector calculation unit 122 as a more specific functional configuration. And the predictive model generating apparatus of this invention is comprised by the data input part 10 for learning, the relationship index value calculating part 100B, and the predictive model generating part 14B.

제2 실시형태에 의한 관계성 지표값 산출부(100B)는, 제1 실시예와 마찬가지의 문장에 대한 문장 데이터를 입력하고, 문장과 그 중에 포함되는 각 형태소의 품사의 관계성을 반영한 관계성 지표값을 산출하여 출력하는 것이다.The relation index value calculation unit 100B according to the second embodiment inputs sentence data for the same sentence as in the first embodiment, and reflects the relation between the sentence and each morpheme included therein. It calculates and outputs the index value.

품사 추출부(11B)는, 특허청구의 범위 「요소 추출부」의 일례이며, 학습용 데이터 입력부(10)에 의해 학습용 데이터로서 입력된 m개의 문장을 해석하고, 상기 m개의 문장으로부터 p개(p는 2 이상의 임의의 정수)의 품사(특허청구의 범위의 분해 요소에 상당)을 추출한다. 문장의 해석 방법으로서는, 예를 들면, 공지의 형태소 해석을 사용하는 것이 가능하다. 여기서, 품사 추출부(11B)는, 형태소 해석에 의해 분할되는 각 형태소에 대하여, 도 5의 (a)와 같이 단일인 형태소마다 1개의 품사를 추출하도록 해도 되고, 도 5의 (b)와 같이 연속하는 복수의 형태소마다 1세트의 품사를 추출하도록 해도 된다.The part-of-speech extraction unit 11B is an example of the "element extraction unit" in the claims, and interprets m sentences input as learning data by the learning data input unit 10, and from the m sentences, p (p) extracts a part-of-speech (corresponding to a decomposition element of a claim) of an arbitrary integer of 2 or more. As an analysis method of a sentence, it is possible to use a well-known morpheme analysis, for example. Here, the part-of-speech extraction unit 11B may extract one part-of-speech for each single morpheme as shown in FIG. One set of parts-of-speech may be extracted for each of a plurality of consecutive morphemes.

그리고, 본 실시형태에 있어서 추출하는 품사는, 동사, 형용사, 형용동사, 명사, 대명사, 수사, 연체사, 부사, 접속사, 감동사, 조동사, 조사와 같은 대분류뿐만 아니라, 도 6과 같이 중분류, 소분류, 세분류까지 세분화된 품사를 추출한다. 도 6은, 품사 추출부(11B)가 추출하는 품사의 일례를 나타낸 것이다. 여기에 나타낸 품사는 일례이며, 이것으로 한정되는 것은 아니다.And, the part-of-speech extracted in the present embodiment is not only a major classification such as a verb, an adjective, an adjective verb, a noun, a pronoun, a rhetoric, a compound verb, an adverb, a conjunction, an emotional verb, an auxiliary verb, a verb, but also a medium classification and a small classification as shown in FIG. , extracts the subdivided parts of speech up to the subcategory. 6 shows an example of the part-of-speech extracted by the part-of-speech extraction unit 11B. The part-of-speech shown here is an example, and is not limited thereto.

그리고, m개의 문장 중에는, 동일 품사(또는 동일 품사 세트)가 복수 포함되어 있는 경우가 있다. 이 경우에, 품사 추출부(11B)는, 동일 품사(또는 동일 품사 세트)를 복수 개 추출하지는 않고, 1개만 추출한다. 즉, 품사 추출부(11B)가 추출하는 p개(p세트를 포함하는 개념으로 한다. 이하 마찬가지임)의 품사는, p종류의 품사라는 의미이다. 다만, 추출하는 p개의 품사에는, 각각 문장 중에서의 출현 빈도를 나타낸 정보가 부수되어 있다.Also, in the m sentences, a plurality of the same part-of-speech (or the same part-of-speech set) may be included. In this case, the part-of-speech extraction unit 11B does not extract a plurality of the same part-of-speech (or the same part-of-speech set), but extracts only one. That is, the p parts of speech extracted by the part-of-speech extraction unit 11B (a concept including a p set; the same applies hereinafter) means that they are p types of parts-of-speech. However, information indicating the frequency of appearance in each sentence is attached to the p parts of speech to be extracted.

인지증에 걸려 있는 환자에게는, 고유 명사가 생각나지 않아 「저거」, 「이거」, 「그거」 등의 지시어를 다용하는 경향이 보이는 경우가 있다. 또한, 인지증에 걸려 있는 환자에게는, 다음 말이 나오지 않아 「저기」, 「음~」, 「에~」 등의 필러를 다용하는 경향이 보이는 경우가 있다. 이 때문에, 이와 같은 인지증 특유의 회화 특징에 따라, 자유회화의 문장 중에 몇 번이나 출현하는 동일 품사가 존재한다. 품사 추출부(11B)에 의해, 이와 같은 인지증 특유의 회화 특징을 포함한 자유회화의 문장으로부터 p개의 품사가 추출되게 된다.Patients suffering from dementia may tend to use descriptive words such as "that", "this", and "that" because they can't remember proper nouns. In addition, the patient suffering from dementia may have a tendency to use fillers such as "here", "um", and "e" because the next word does not come out. For this reason, according to the characteristic conversational characteristics of dementia, the same part-of-speech that appears several times in the sentences of free conversation exists. By the part-of-speech extraction unit 11B, p parts-of-speech are extracted from the sentences of free conversation including the conversational features peculiar to dementia.

벡터 산출부(12B)는, m개의 문장 및 p개의 품사로부터, m개의 문장 벡터 및 p개의 품사 벡터를 산출한다. 여기서, 문장 벡터 산출부(121)는, 품사 추출부(11B)에 의한 해석 대상이 된 m개의 문장을 각각 소정의 룰에 따라 q차원으로 벡터화함으로써, q개의 축 성분으로 이루어지는 m개의 문장 벡터를 산출한다. 또한, 품사 벡터 산출부(123)는, 품사 추출부(11B)에 의해 추출된 p개의 품사를 각각 소정의 룰에 따라 q차원으로 벡터화함으로써, q개의 축 성분으로 이루어지는 p개의 품사 벡터를 산출한다.The vector calculating unit 12B calculates m sentence vectors and p parts-of-speech vectors from the m sentences and p parts-of-speech. Here, the sentence vector calculation unit 121 vectorizes the m sentences, which are the objects of analysis by the part-of-speech extraction unit 11B, in q dimensions according to a predetermined rule, so that m sentence vectors composed of q axis components are obtained. Calculate. In addition, the part-of-speech vector calculation unit 123 calculates p parts-of-speech vectors comprising q axial components by vectorizing the p parts-of-speech extracted by the part-of-speech extraction unit 11B in q dimensions according to a predetermined rule, respectively. .

문장 벡터 및 품사 벡터의 산출법은, 제1 실시예와 동일하다. 즉, 제2 실시형태에 있어서, 벡터 산출부(12B)는, m개의 문장과 p개의 품사로 이루어지는 집합 S=<d∈, h∈>를 생각한다. 여기서, 각 문장 d_i(i=1, 2, ··· , m) 및 각 품사 h_j(j=1, 2, ··· , p)에 대하여 각각 문장 벡터 d_i→ 및 품사 벡터 h_j→를 관련시킨다. 그리고, 벡터 산출부(12B)는, 상기 식(1)과 동일하게 산출되는 확률 P(h_j|d_i)를, m개의 문장과 p개의 품사의 모든 조합에 대하여 산출하고, 이들을 합계한 값을 목표변수 L로 하고, 상기 목표변수 L을 최대화하는 문장 벡터 d_i→ 및 품사 벡터 h_j→를 산출한다.The calculation method of the sentence vector and the part-of-speech vector is the same as that of the first embodiment. That is, in the second embodiment, the vector calculating unit 12B considers a set S=<d∈, h∈> composed of m sentences and p parts-of-speech. Here, for each sentence d _i (i=1, 2, ... , m) and each part-of-speech h _j (j=1, 2, ... , p), the sentence vector d _i → and the part-of-speech vector h _j → relate to Then, the vector calculating unit 12B calculates the probability P(h _j |d _i ) calculated in the same manner as in Equation (1) for all combinations of m sentences and p parts of speech, and sums them up. is a target variable L, and a sentence vector d _i → and a part-of-speech vector h _j → maximizing the target variable L are calculated.

지표값 산출부(13B)는, 벡터 산출부(12B)에 의해 산출된 m개의 문장 벡터 d_i→와 p개의 품사 벡터 h_j→의 내적을 각각 계산함으로써, m개의 문장 d_i 및 p개의 품사 h_j 사이의 관계성을 반영한 m×p개의 관계성 지표값을 산출한다. 제2 실시형태에서는, 지표값 산출부(13B)는, 하기 식(4)에 나타낸 바와 같이, m개의 문장 벡터 d_i→의 각 q개의 축 성분(d₁₁∼d_mq)을 각 요소로 하는 문장행렬 D와, p개의 품사 벡터 h_j→의 각 q개의 축 성분(h₁₁∼h_pq)을 각 요소로 하는 품사행렬 H의 곱을 계산함으로써, m×p개의 관계성 지표값을 각 요소로 하는 지표값 행렬 DH를 산출한다. 여기서, H^t는 품사행렬의 전치행렬이다.The index value calculation unit 13B calculates _{the dot product of the m sentence vectors d i} → and the p parts of speech vector h _j → calculated by the vector calculation unit 12B, respectively, so that the m sentences d _i and p parts of speech are calculated. m×p relation index values reflecting the relation between h _{j are calculated.} In the second embodiment, the index value calculation unit 13B uses, as each element, q axial components d _{11 to} d _{mq of the} _{m sentence vectors d i → as shown in the following formula (4).} By calculating the product of the sentence matrix D and the part-of-speech matrix H with each q axis components ( h ₁₁ ∼ _{h pq} ) of the p parts-of-speech vectors h _{j →} to calculate the index value matrix DH. Here, H ^t is the transpose matrix of the part-of-speech matrix.

[수식 4][Equation 4]

예측 모델 생성부(14B)는, 지표값 산출부(13B)에 의해 산출된 m×p개의 관계성 지표값을 사용하여, 1개의 문장 d_i에 대하여 p개의 관계성 지표값 dh_ij(j=1, 2, ··· , p)로 이루어지는 문장지표값군를 바탕으로 인지증의 중증도(MMSE의 스코어의 값)을 예측하기 위한 예측 모델을 생성한다. 즉, 예측 모델 생성부(14B)는, 제1 실시예에서 설명한 것과 동일한 방법에 의해, MMSE의 스코어가 기지(예를 들면, x점)인 환자의 자유회화를 바탕으로 산출되는 문장지표값군에 대해서는 x점에 가능한 근접한 스코어가 예측되는 예측 모델을 생성한다. 그리고, 예측 모델 생성부(14B)는, 생성한 예측 모델을 예측 모델 기억부(30B)에 기억시킨다.The predictive model generating unit 14B uses the m×p relational index values calculated by the index value calculating unit 13B, and for one sentence d _i , p relational index values dh _ij (j = A predictive model for predicting the severity of dementia (the value of the score of MMSE) is generated based on the sentence index value group consisting of 1, 2, ... , p). That is, the predictive model generation unit 14B, by the same method as described in the first embodiment, the MMSE score is calculated based on the free conversation of the patient (for example, the x point) to the group of sentence index values. For this, a predictive model is generated in which a score as close as possible to the point x is predicted. Then, the predictive model generation unit 14B stores the generated predictive model in the predictive model storage unit 30B.

인지증 예측부(21B)는, 예측용 데이터 입력부(20)에 의해 입력된 예측용 데이터에 대하여 품사 추출부(11B), 문장 벡터 산출부(121), 품사 벡터 산출부(123) 및 지표값 산출부(13B)의 처리를 실행함으로써 얻어지는 관계성 지표값을, 예측 모델 생성부(14B)에 의해 생성된 예측 모델(예측 모델 기억부(30B)에 기억된 예측 모델)에 적용함으로써, 예측 대상으로 하는 m' 인의 환자에 대하여 인지증의 중증도를 예측한다.The dementia prediction unit 21B includes a part-of-speech extraction unit 11B, a sentence vector calculation unit 121, a part-of-speech vector calculation unit 123 and an index value with respect to the prediction data input by the prediction data input unit 20 . A prediction target by applying the relation index value obtained by executing the processing of the calculation unit 13B to the prediction model generated by the prediction model generation unit 14B (the prediction model stored in the prediction model storage unit 30B). Predict the severity of dementia for patients with m'

이상 상세하게 설명한 바와 같이, 제2 실시형태에서는, 인지증의 중증도가 기지인 환자가 행한 자유회화의 내용을 나타내는 m개의 문장을 학습용 데이터로서 입력하고, 상기 입력된 문장으로부터 산출한 문장 벡터와, 문장 내에 포함되는 형태소의 품사로부터 산출한 품사 벡터의 내적을 계산함으로써, 문장 및 품사 사이의 관계성을 반영한 관계성 지표값을 산출하고, 이 관계성 지표값을 사용하여 예측 모델을 생성하고 있다. 또한, 예측 대상으로 하는 환자에 대하여 인지증의 중증도를 예측할 때는, 예측 대상으로 하는 환자가 행한 자유회화의 내용을 나타내는 m'개의 문장을 예측용 데이터로서 입력하고, 상기 입력한 예측용 데이터로부터 동일하게 하여 산출되는 관계성 지표값을 예측 모델에 적용함으로써, 예측 대상으로 하는 환자의 인지증 중증도를 예측하도록 하고 있다.As described in detail above, in the second embodiment, m sentences representing the contents of free conversation performed by a patient with a known severity of dementia are input as learning data, and a sentence vector calculated from the inputted sentences; By calculating the dot product of the part-of-speech vector calculated from the part-of-speech of morphemes included in the sentence, a relation index value reflecting the relationship between the sentence and the part-of-speech is calculated, and a predictive model is generated using the relation indicator value. In addition, when predicting the severity of dementia with respect to the prediction target patient, m' sentences representing the contents of free conversation performed by the prediction target patient are input as prediction data, and the same is obtained from the input prediction data. By applying the relation index value calculated by making the patient to the prediction model, the severity of cognitive impairment of the patient to be predicted is predicted.

이와 같이 구성한 제2 실시형태에 있어서도, 환자가 행한 자유회화를 해석함으로써 인지증의 중증도가 예측되므로, 미니 멘탈 스테이트 검사(MMSE)를 행할 필요가 없다. 이 때문에, 인지증의 중증도를 반복적으로 측정하는 경우라도, 환자에 의한 연습 효과를 배제한 측정 결과(예측 결과)를 얻을 수 있다. 특히, 환자가 인지증에 걸려 있으면, 소정의 품사의 형태소를 많이 포함한 인지증 특유의 회화 특징이 자유회화 중에 보이게 되고, 이와 같은 회화 특징이 반영된 상태로 관계성 지표값이 산출되고, 상기 관계성 지표값을 사용하여 예측 모델이 생성되므로, 환자가 행한 자유회화로부터 인지증의 중증도를 예측할 수 있다.Also in the second embodiment configured in this way, since the severity of dementia is predicted by analyzing the free conversation performed by the patient, there is no need to perform a mini mental state examination (MMSE). For this reason, even when it is a case of repeatedly measuring the severity of dementia, the measurement result (prediction result) which excluded the exercise effect by a patient can be obtained. In particular, if the patient suffers from dementia, a conversational characteristic peculiar to dementia including many morphemes of a given part-of-speech is seen during free conversation, and a relation index value is calculated in a state in which the conversational characteristic is reflected, and the relation Since a predictive model is generated using the index value, the severity of dementia can be predicted from the free conversation performed by the patient.

(제3 실시형태)(Third embodiment)

다음으로, 본 발명에 의한 제3 실시형태를 도면에 기초하여 설명한다. 도 7은, 제3 실시형태에 의한 인지증 예측 장치의 기능 구성예를 나타내는 블록도이다. 이 도 7에 있어서, 도 4에 나타낸 부호와 동일한 부호를 부여한 것은 동일한 기능을 가지는 것이므로, 여기서는 중복되는 설명을 생략한다. 제3 실시형태는, 제1 실시예에서 설명한 문장 벡터와 단어 벡터로부터 산출하는 지표값 행렬 DW와, 제2 실시형태에서 설명한 문장 벡터와 품사 벡터로부터 산출하는 지표값 행렬 DH의 양쪽을 사용하는 것이다.Next, a third embodiment according to the present invention will be described with reference to the drawings. 7 is a block diagram showing an example of the functional configuration of the dementia prediction device according to the third embodiment. In this FIG. 7, since the code|symbol same as the code|symbol shown in FIG. 4 has the same function, the overlapping description is abbreviate|omitted here. The third embodiment uses both the index value matrix DW calculated from the sentence vector and word vector described in the first embodiment, and the index value matrix DH calculated from the sentence vector and the part-of-speech vector described in the second embodiment. .

도 7에 나타낸 바와 같이, 제3 실시형태에 의한 인지증 예측 장치는, 관계성 지표값 산출부(100B), 예측 모델 생성부(14B), 인지증 예측부(21B) 및 예측 모델 기억부(30B) 대신에, 관계성 지표값 산출부(100C), 예측 모델 생성부(14C), 인지증 예측부(21C) 및 예측 모델 기억부(30C)를 구비하고 있다. 제3 실시형태에 의한 관계성 지표값 산출부(100C)는, 단어 추출부(11A) 및 품사 추출부(11B)를 구비하고 또한, 벡터 산출부(12B) 및 지표값 산출부(13B) 대신 벡터 산출부(12C) 및 지표값 산출부(13C)를 구비하고 있다. 벡터 산출부(12C)는, 보다 구체적인 기능 구성으로서, 문장 벡터 산출부(121), 단어 벡터 산출부(122) 및 품사 벡터 산출부(123)를 구비하고 있다. 그리고, 학습용 데이터 입력부(10), 관계성 지표값 산출부(100C) 및 예측 모델 생성부(14C)에 의해, 본 발명의 예측 모델 생성 장치가 구성된다.As shown in FIG. 7 , the dementia prediction device according to the third embodiment includes a relation index value calculation unit 100B, a predictive model generation unit 14B, a dementia prediction unit 21B, and a predictive model storage unit ( Instead of 30B), a relation index value calculation unit 100C, a predictive model generation unit 14C, a dementia prediction unit 21C, and a predictive model storage unit 30C are provided. The relation index value calculation unit 100C according to the third embodiment includes a word extraction unit 11A and a part-of-speech extraction unit 11B, and instead of the vector calculation unit 12B and the index value calculation unit 13B A vector calculation unit 12C and an index value calculation unit 13C are provided. The vector calculation unit 12C includes a sentence vector calculation unit 121 , a word vector calculation unit 122 , and a part-of-speech vector calculation unit 123 as a more specific functional configuration. And the predictive model generating apparatus of this invention is comprised by the data input part 10 for learning, the relationship index value calculating part 100C, and the predictive model generating part 14C.

지표값 산출부(13C)는, 상기 식(3)에 나타낸 바와 같이, m개의 문장 벡터 d_i→와 n개의 단어 벡터 w_j→의 내적을 각각 계산함으로써, m개의 문장 d_i 및 n개의 단어 w_j 사이의 관계성을 반영한 m×n개의 관계성 지표값 dw_ij(제1 평가값행렬 DW라고 함)을 산출한다. 또한, 지표값 산출부(13C)는, 상기 식(4)에 나타낸 바와 같이, m개의 문장 벡터 d_i→와 p개의 품사 벡터 h_j→의 내적을 각각 계산함으로써, m개의 문장 d_i 및 p개의 품사 h_j 사이의 관계성을 반영한 m×p개의 관계성 지표값 dh_ij(제2 평가값행렬 DH라고 함)을 산출한다.The index value calculation unit 13C calculates the dot product of the _{m sentence vectors d i} → and the n word vectors w _j →, respectively, as shown in Equation (3) above, _{whereby the m sentences d i} and n words are calculated. m x n relation index values dw _ij (referred to as a first evaluation value matrix DW) reflecting the relation between w _{j are calculated.} Also, the index value calculation unit 13C calculates the dot product of the _{m sentence vectors d i} → and the p parts of speech vector h _j →, respectively, as shown in Equation (4) above, _{whereby the m sentences d i} and p m×p relation index values dh _ij (referred to as a second evaluation value matrix DH) reflecting the relation between the parts of speech h _{j are calculated.}

예측 모델 생성부(14C)는, 지표값 산출부(13C)로 의해 산출된 m×n개의 관계성 지표값 dw_ij 및 m×p개의 관계성 지표값 dh_ij를 사용하여, 1개의 문장 d_i에 대하여 n개의 관계성 지표값으로 이루어지는 문장지표값군 dw_ij(j=1, 2, ··· , n) 및 p개의 관계성 지표값으로 이루어지는 문장지표값군 dh_ij(j=1, 2, ··· , p)를 바탕으로 인지증의 중증도(MMSE의 스코어의 값)을 예측하기 위한 예측 모델을 생성한다. 그리고, 예측 모델 생성부(14C)는, 생성한 예측 모델을 예측 모델 기억부(30C)에 기억시킨다.The predictive model generation unit 14C uses the m × n relation index values dw _ij and the m × p relation index values dh _ij calculated by the index value calculation unit 13C to generate one sentence d _i dw _ij (j = 1, 2, ... , n) consisting of n relational index values and a sentence index value group dh _ij consisting of p relational index values (j = 1, 2, ... Create a predictive model for predicting the severity of cognitive impairment (value of the score of MMSE) based on ... , p). Then, the predictive model generation unit 14C stores the generated predictive model in the predictive model storage unit 30C.

여기서, 예측 모델 생성부(14C)가 2조의 문장지표값군 dw_ij, dh_ij를 어떻게 사용하여 예측 모델을 생성하는 지는, 임의로 설계하는 것이 가능하다. 예를 들면, 도 8의 (a)와 같이, 문장·단어 사이의 제1 지표값 행렬 DW와, 문장·품사 사이의 제2 지표값 행렬 DH를 가로(행 방향)로 배열하고, 동일 행 i에 속하는 문장지표값군 dw_ij, dh_ij를 연결하고, (n+p)개의 관계성 지표값을 포함하는 1개의 문장지표값군을 생성하고, 이 문장지표값군를 바탕으로 인지증의 중증도를 예측하기 위한 예측 모델을 생성하도록 해도 된다.Here, it is possible to arbitrarily design how the predictive model generating unit 14C generates the predictive model by using the two sets of sentence index value groups dw _ij and dh _{ij .} For example, as shown in Fig. 8(a), the first index value matrix DW between sentences and words and the second index value matrix DH between sentences and parts of speech are arranged horizontally (row direction) in the same row i Connect the sentence indicator value groups dw _ij , dh _ij belonging to , generate one sentence indicator value group including (n+p) relational indicator values, and predict the severity of cognitive impairment based on this sentence indicator value group You may make it generate a predictive model for

혹은, 도 8의 (b)와 같이, 문장·단어 사이의 제1 지표값 행렬 DW에 포함되는 i행째의 문장지표값군 dw_ij와, 문장·품사 사이의 제2 지표값 행렬 DH에 포함되는 동일한 i행째의 문장지표값군 dh_ij를 세로(열 방향)로 배열하여 2×n차원의 문장지표값군 행렬을 생성하고, 이 문장지표값군 행렬을 바탕으로 인지증의 중증도를 예측하기 위한 예측 모델을 생성하도록 해도 된다. 도 8의 (b)의 예에서는, n>p인 것을 상정하고, 2×n차원의 문장지표값군 행렬에서의 2행째의 행렬 성분에 대해서는, 문장지표값군 dh_ij의 값을 좌측으로 채워서 설정하고, 상기 2행째의 좌단으로부터 p개를 초과하는 행렬 성분은 모두 값을 0로 하고 있다.Alternatively, as shown in Fig. 8(b), the i-th sentence index value group dw _ij included in the first index value matrix DW between sentences and words, and the second index value matrix DH between sentences and parts of speech include the same By arranging the i-th sentence index value group dh _ij vertically (column direction), a 2×n-dimensional sentence indicator value group matrix is generated, and a predictive model for predicting the severity of dementia based on the sentence indicator value group matrix is generated you can do it In the example of Fig. 8 (b), it is assumed that n>p, and for the matrix component of the second row in the 2 × n-dimensional sentence indicator value group matrix, _{the value of the sentence indicator value group dh ij} is set by filling the left side, , all of the matrix components exceeding p from the left end of the second row have a value of 0.

그리고, m×n차원의 제1 지표값 행렬 DW에 대하여, 제4 실시형태에서 후술하는 바와 같은 차원 압축 처리를 행함으로써 m×p차원의 제1 지표값 행렬 DW_SVD를 생성하고, 이 차원 압축된 제1 지표값 행렬 DW_SVD에 포함되는 i행째의 문장지표값군dw_ij(j=1∼p)와, 제2 지표값 행렬 DH에 포함되는 i행째의 문장지표값군 dh_ij(j=1∼p)를 세로(열 방향)로 배열하여 2×p차원의 문장지표값군 행렬을 생성하고, 이 문장지표값군 행렬을 바탕으로 인지증의 중증도를 예측하기 위한 예측 모델을 생성하도록 해도 된다.Then, the mxn-dimensional first index value matrix DW is subjected to a dimensional compression process as described later in the fourth embodiment _{to generate an mxp-} dimensional first index value matrix DW SVD, and this dimensional compression The i-th sentence index value group dw _ij (j=1 to p) included in the first index value matrix DW _SVD _{, and the i-th sentence index value group dh ij} (j=1-p) included in the second index value matrix DH p) may be arranged vertically (column direction) to generate a 2xp-dimensional sentence index value group matrix, and a predictive model for predicting the severity of dementia may be generated based on the sentence index value group matrix.

또한 다른 예로서, 도 8의 (c)와 같이, 문장·단어 사이의 제1 지표값 행렬 DW에 포함되는 i행째의 문장지표값군 dw_ij를 1×n차원의 제1 문장지표값군 행렬로 하고, 문장·품사 사이의 제2 지표값 행렬 DH에 포함되는 동일한 i행째의 문장지표값군 dh_ij를 n×1차원의 제2 문장지표값군 행렬로 하여(다만, p개를 초과하고 n개에 미치지 않는 부족분의 행렬 성분의 값은 0로 함), 제1 문장지표값군 행렬과 제2 문장지표값군 행렬의 내적을 계산한다. 그리고, 계산된 값을 바탕으로 인지증의 중증도를 예측하기 위한 예측 모델을 생성하도록 해도 된다.As another example, as shown in FIG. 8(c), the i-th sentence index value group dw _ij included in the first index value matrix DW between sentences and words is a 1×n-dimensional first sentence index value group matrix, _{, using the same i-th sentence index value group dh ij} included in the second index value matrix DH between sentences and parts of speech as an n × 1-dimensional second sentence index value group matrix (provided that more than p and less than n The value of the matrix component of the insufficient deficit is set to 0), and the dot product of the first sentence indicator value group matrix and the second sentence indicator value group matrix is calculated. In addition, a predictive model for predicting the severity of dementia may be generated based on the calculated value.

이 경우에 대해서도, 문장·단어 사이의 제1 지표값 행렬 DW를 차원 압축하여 m×p차원의 제1 지표값 행렬 DW_SVD를 생성하고, 이 차원 압축된 제1 지표값 행렬 DW_SVD에 포함되는 i행째의 문장지표값군 dw_ij를 1×p차원의 제1 문장지표값군 행렬로 하고, 문장·품사 사이의 제2 지표값 행렬 DH에 포함되는 동일한 i행째의 문장지표값군 dh_ij를 p×1차원의 제2 문장지표값군 행렬로 하여, 제1 문장지표값군 행렬과 제2 문장지표값군 행렬의 내적을 계산하도록 해도 된다.Also in this case, the first index value matrix DW between sentences and words is dimensionally compressed to generate an m×p-dimensional first index value matrix DW _SVD , and is included in the dimensionally compressed first index value matrix DW _SVD. Let the i-th sentence index value group dw _ij be a 1×p-dimensional first sentence index value group matrix, and the same i-th sentence index value group dh _ij included in the second index value matrix DH between sentences and parts of speech is p×1 The dot product of the first sentence indicator value group matrix and the second sentence indicator value group matrix may be calculated as the second sentence indicator value group matrix of the dimension.

인지증 예측부(21C)는, 예측용 데이터 입력부(20)에 의해 입력된 예측용 데이터에 대하여 단어 추출부(11A), 품사 추출부(11B), 문장 벡터 산출부(121), 단어 벡터 산출부(122), 품사 벡터 산출부(123) 및 지표값 산출부(13C)의 처리를 실행함으로써 얻어지는 관계성 지표값을, 예측 모델 생성부(14C)에 의해 생성된 예측 모델(예측 모델 기억부(30C)에 기억된 예측 모델)에 적용함으로써, 예측 대상으로 하는 m' 인의 환자에 대하여 인지증의 중증도를 예측한다.The dementia prediction unit 21C includes a word extraction unit 11A, a part-of-speech extraction unit 11B, a sentence vector calculation unit 121, and a word vector calculation with respect to the prediction data input by the prediction data input unit 20 . The predictive model (predictive model storage unit) generated by the predictive model generating unit 14C is the relation index value obtained by executing the processing of the unit 122, the part-of-speech vector calculating unit 123, and the index value calculating unit 13C. By applying to the prediction model stored in (30C)), the severity of dementia is predicted for the patient of m' as the prediction target.

이상 상세하게 설명한 바와 같이, 제3 실시형태에서는, 인지증의 중증도가 기지인 환자가 행한 자유회화의 내용을 나타내는 m개의 문장을 학습용 데이터로서 입력하고, 상기 입력된 문장으로부터 산출한 문장 벡터와, 문장 내에 포함되는 단어로부터 산출한 단어 벡터의 내적을 계산함으로써, 문장 및 단어 사이의 관계성을 반영한 관계성 지표값을 산출하고 또한, 입력된 문장으로부터 산출한 문장 벡터와, 문장 내에 포함되는 형태소의 품사로부터 산출한 품사 벡터의 내적을 계산함으로써, 문장 및 품사 사이의 관계성을 반영한 관계성 지표값을 산출하고, 이들 관계성 지표값을 사용하여 예측 모델을 생성하고 있다. 또한, 예측 대상으로 하는 환자에 대하여 인지증의 중증도를 예측할 때는, 예측 대상으로 하는 환자가 행한 자유회화의 내용을 나타내는 m'개의 문장을 예측용 데이터로서 입력하고, 상기 입력한 예측용 데이터로부터 동일하게 하여 산출되는 관계성 지표값을 예측 모델에 적용함으로써, 예측 대상으로 하는 환자의 인지증 중증도를 예측하도록 하고 있다.As described in detail above, in the third embodiment, m sentences representing the content of free conversation performed by a patient with a known severity of dementia are input as learning data, and a sentence vector calculated from the inputted sentences; By calculating the dot product of the word vector calculated from the words included in the sentence, a relation index value reflecting the relationship between the sentence and the word is calculated, and the sentence vector calculated from the input sentence and the morpheme included in the sentence are calculated. By calculating the dot product of the part-of-speech vector calculated from the part-of-speech, a relationship index value reflecting the relationship between the sentence and the part-of-speech is calculated, and a predictive model is generated using these relationship index values. In addition, when predicting the severity of dementia with respect to the prediction target patient, m' sentences representing the contents of free conversation performed by the prediction target patient are input as prediction data, and the same is obtained from the input prediction data. By applying the relation index value calculated by making the patient to the prediction model, the severity of cognitive impairment of the patient to be predicted is predicted.

이와 같이 구성한 제3 실시형태에 있어서도, 환자가 행한 자유회화를 해석함으로써 인지증의 중증도가 예측되므로, 미니 멘탈 스테이트 검사(MMSE)를 행할 필요가 없다. 이 때문에, 인지증의 중증도를 반복적으로 측정하는 경우라도, 환자에 의한 연습 효과를 배제한 측정 결과(예측 결과)를 얻을 수 있다. 특히, 제3 실시형태에서는, 자유회화 중에서 사용되는 단어나 품사에 대하여 인지증 특유의 회화 특징이 반영된 상태로 관계성 지표값이 산출되고, 상기 관계성 지표값을 사용하여 예측 모델이 생성되므로, 환자가 행한 자유회화로부터 인지증의 중증도를 보다 정확하게 예측할 수 있다.Also in the third embodiment configured in this way, since the severity of dementia is predicted by analyzing the free conversation performed by the patient, there is no need to perform a mini mental state examination (MMSE). For this reason, even when it is a case of repeatedly measuring the severity of dementia, the measurement result (prediction result) which excluded the exercise effect by a patient can be obtained. In particular, in the third embodiment, since the relation index value is calculated in a state in which the speech characteristics peculiar to dementia are reflected with respect to the words or parts of speech used in free conversation, and a predictive model is generated using the relation index value, The severity of cognitive impairment can be predicted more accurately from the free conversation performed by the patient.

(제4 실시형태)(Fourth embodiment)

다음으로, 본 발명에 의한 제4 실시형태를 도면에 기초하여 설명한다. 도 9는, 제4 실시형태에 의한 인지증 예측 장치의 기능 구성예를 나타내는 블록도이다. 이 도 9에 있어서, 도 1에 나타낸 부호와 동일한 부호를 부여한 것은 동일한 기능을 가지는 것이므로, 여기서는 중복되는 설명을 생략한다. 그리고, 이하에서는, 제1 실시예에 대한 변형예로서 제4 실시형태를 설명하지만, 도 10의 (a), (b)에 각각 나타낸 바와 같이 제2 실시형태에 대한 변형예 또는 제3 실시형태에 대한 변형예로서도 동일하게 제4 실시형태를 적용할 수 있다.Next, a fourth embodiment according to the present invention will be described with reference to the drawings. Fig. 9 is a block diagram showing an example of the functional configuration of the dementia prediction device according to the fourth embodiment. In this FIG. 9, since the code|symbol same as the code|symbol shown in FIG. 1 has the same function, the overlapping description is abbreviate|omitted here. In the following, the fourth embodiment will be described as a modification to the first embodiment, but as shown in FIGS. The fourth embodiment can be applied similarly as a modified example for .

도 9에 나타낸 바와 같이, 제4 실시형태에 의한 인지증 예측 장치는, 관계성 지표값 산출부(100A), 예측 모델 생성부(14A), 인지증 예측부(21A) 및 예측 모델 기억부(30A) 대신, 관계성 지표값 산출부(100D), 예측 모델 생성부(14D), 인지증 예측부(21D) 및 예측 모델 기억부(30D)를 구비하고 있다. 제4 실시형태에 의한 관계성 지표값 산출부(100D)는, 도 1에 나타낸 구성뿐만 아니라 차원 압축부(15)를 더 포함하고 있다. 그리고, 학습용 데이터 입력부(10), 관계성 지표값 산출부(100D) 및 예측 모델 생성부(14D)에 의해, 본 발명의 예측 모델 생성 장치가 구성된다.As shown in Fig. 9 , the dementia prediction device according to the fourth embodiment includes a relation index value calculation unit 100A, a predictive model generation unit 14A, a dementia prediction unit 21A, and a predictive model storage unit ( Instead of 30A), a relation index value calculation unit 100D, a predictive model generation unit 14D, a dementia prediction unit 21D, and a predictive model storage unit 30D are provided. The relation index value calculation unit 100D according to the fourth embodiment further includes a dimension compression unit 15 in addition to the configuration shown in FIG. 1 . And the predictive model generating apparatus of this invention is comprised by the data input part 10 for learning, the relationship index value calculating part 100D, and the predictive model generating part 14D.

차원 압축부(15)는, 지표값 산출부(13A)에 의해 산출된 m×n개의 관계성 지표값을 사용하여, 소정의 차원 압축 처리를 행함으로써, m×k개(k는 1≤k<n을 만족시키는 임의의 정수)의 관계성 지표값을 산출한다. 차원 압축 처리는, 예를 들면, 행렬을 분해하는 방법으로서 공지의 특이값 분해(singular value decomposition: SVD)을 사용하는 것이 가능하다.The dimension compression unit 15 performs predetermined dimensional compression processing using the m × n relational index values calculated by the index value calculation unit 13A, whereby m × k pieces (k is 1≤k). (an arbitrary integer satisfying n) is calculated. The dimensional compression processing can use, for example, a known singular value decomposition (SVD) as a method of decomposing a matrix.

즉, 차원 압축부(15)는, 상기 식(3)과 같이 하여 산출된 평가값행렬 DW를, 3개의 행렬 U, S, V로 분해한다. 여기서, 행렬 U는 m×k차원의 좌특이 행렬이며, 각각의 열은 DW*DW^t의 고유 벡터이다(DW^t는 평가값행렬 DW의 전치행렬을 나타낸다). 행렬 S는 k×k차원의 정방행렬이며, 대각행렬 성분이 평가값 행렬 DW의 특이값을 나타내고, 그 이외의 값이 모두 0가 되어 있다. 행렬 V는 k×n차원의 우특이 행렬이며, 각 행은 DW^t*DW의 고유 벡터이다. 그리고, 압축 후의 차원 k는, 미리 정한 고정된 값으로 해도 되고, 임의의 값을 지정 가능하게 해도 된다.That is, the dimension compression unit 15 decomposes the evaluation value matrix DW calculated by the above formula (3) into three matrices U, S, and V. Here, the matrix U is an m×k-dimensional left singular matrix, and each column is an eigenvector ^{of DW*DW t} ^{(DW t} represents the transpose matrix of the evaluation value matrix DW). The matrix S is a k×k-dimensional square matrix, and the diagonal matrix components represent singular values of the evaluation value matrix DW, and all other values are 0. The matrix V is a k×n-dimensional right singular matrix, and each row is ^{an eigenvector of DW t} * DW. In addition, the dimension k after compression is good also as a predetermined fixed value, and may make it possible to designate an arbitrary value.

차원 압축부(15)는, 이상과 같이 하여 분해한 3개의 행렬 중, 우특이 행렬 V의 전치행렬 V^t에 의해 평가값행렬 DW를 변환함으로써, 평가값행렬 DW의 차원을 압축한다. 즉, m×n차원의 평가값행렬 DW와, n×k차원의 우특이 전치행렬V^t의 내적을 계산함으로써, m×n차원의 평가값행렬 DW를 m×k차원의 평가값행렬 DW_SVD로 차원 압축한다(DW_SVD=DW*V^t). 그리고, DW_SVD는 평가값행렬 DW를 SVD에 의해 차원 압축한 행렬을 나타내고, DW≒*S*V=DW_SVD*V의 관계가 성립한다.The dimension compression unit 15 compresses the dimension of the evaluation value matrix DW by transforming the evaluation value matrix DW with ^{the transpose matrix V t} of the right singularity matrix V among the three matrices decomposed as described above. That is, by calculating the dot product of an m×n-dimensional evaluation value matrix DW and an n×k-dimensional right singular transposition matrix V ^t , an m×n-dimensional evaluation value matrix DW is converted to an m×k-dimensional evaluation value matrix DW _SVD dimensionally compressed (DW _SVD =DW*V ^t ). And, DW _SVD represents a matrix obtained by dimensional compression of the evaluation value matrix DW by SVD, and _{the relationship DW≒*S*V=DW SVD} *V holds.

이와 같이, SVD의 방법을 사용하여 평가값행렬 DW의 차원을 압축함으로써, 평가값행렬 DW로 표현되는 특징을 가능한 손상시키지 않고 평가값행렬 DW를 저랭크 근사할 수 있다. 그리고, 여기서는 우특이 행렬 V의 전치행렬 V^t에 의해 평가값행렬 DW를 변환하는 예에 대하여 설명하였으나, m의 값과 n의 값이 일치하는 경우에는, 좌특이 행렬 U에 의해 평가값 행렬 DW를 변환하도록 해도 된다(DW_SVD=DW*U).As described above, by compressing the dimension of the evaluation value matrix DW using the SVD method, the evaluation value matrix DW can be approximated with a low rank without damaging the characteristics expressed by the evaluation value matrix DW as much as possible. ^{Here, an example of transforming the evaluation value matrix DW by the transpose matrix V t} of the right singularity matrix V has been described. However, when the value of m and the value of n match, the evaluation value matrix DW is determined by the left singularity matrix U may be converted to (DW _SVD =DW*U).

예측 모델 생성부(14D)는, 차원 압축부(15)에 의해 차원 압축된 m×k개의 관계성 지표값을 사용하여, 1개의 문장 d_i에 대하여 k개의 관계성 지표값 dw_ij(i=(1, 2), ··· , k)로 이루어지는 문장지표값군를 바탕으로 인지증의 중증도를 예측하기 위한 예측 모델을 생성한다. 그리고, 예측 모델 생성부(14D)는, 생성한 예측 모델을 예측 모델 기억부(30D)에 기억시킨다.Prediction model generating section (14D), the dimension with a dimension-compressed m × k of relationship index value by the compression unit (15), k of relationship indicators for one sentence d _i value dw _ij (i = A predictive model for predicting the severity of cognitive impairment is generated based on the sentence index value group consisting of (1,2), ... , k). Then, the predictive model generation unit 14D stores the generated predictive model in the predictive model storage unit 30D.

인지증 예측부(21D)는, 예측용 데이터 입력부(20)에 의해 입력된 예측용 데이터에 대하여 단어 추출부(11A), 문장 벡터 산출부(121), 단어 벡터 산출부(122), 지표값 산출부(13A) 및 차원 압축부(15)의 처리를 실행함으로써 얻어지는 관계성 지표값을, 예측 모델 생성부(14D)에 의해 생성된 예측 모델(예측 모델 기억부(30D)에 기억된 예측 모델)에 적용함으로써, 예측 대상으로 하는 m' 인의 환자에 대하여 인지증의 중증도를 예측한다.The dementia prediction unit 21D includes a word extraction unit 11A, a sentence vector calculation unit 121, a word vector calculation unit 122, and an index value with respect to the prediction data input by the prediction data input unit 20 . A predictive model (predictive model stored in the predictive model storage unit 30D) generated by the predictive model generating unit 14D for relation index values obtained by executing the processing of the calculation unit 13A and the dimension compression unit 15 ) to predict the severity of dementia for the patient of m' to be predicted.

상기 제1 실시예에서는, 5∼10 분 정도의 문진 형식에 의한 자유회화 중에서 1인의 환자가 발언할 단어의 표준적인 종류를 상정하여, n의 값을 선택할 필요가 있었다. n의 값이 작으면, 예측 대상으로 하는 1인의 환자가 발언하는 단어와, 학습용 데이터의 문장으로부터 추출한 n종류의 단어의 중첩이 적어지고, 중첩이 1개도 없어질 가능성도 있다. 또한, n개로부터 누락된 단어(단어 추출부(11)에 의해 추출되지 않는 단어)의 정보는 평가값행렬 DW에 가미되지 않게 된다. 이 때문에, n의 값이 작아질수록, 예측의 정밀도는 저하된다. 한편, 충분히 큰 n의 값을 선택하면, 중첩이 0개가 될 가능성은 적어지고, n개로부터 누락되는 단어도 적어지지만, 행렬의 사이즈가 커져 계산량이 증가한다. 또한, 출현 빈도가 낮은 단어도 특징량으로서 포함시킴으로써, 과학습을 일으키기 쉽다.In the first embodiment, it was necessary to select the value of n, assuming a standard kind of words to be spoken by one patient among free conversations in the form of a questionnaire for about 5 to 10 minutes. When the value of n is small, the overlap between the word spoken by one patient as the prediction target and the n types of words extracted from the sentences of the learning data is reduced, and there is a possibility that even one overlap is eliminated. In addition, information on words (words not extracted by the word extraction unit 11) missing from the n pieces is not added to the evaluation value matrix DW. For this reason, the smaller the value of n, the lower the prediction accuracy. On the other hand, if a sufficiently large value of n is selected, the probability of zero overlap is reduced, and the number of words omitted from n is reduced, but the size of the matrix increases and the amount of calculation increases. Moreover, by including a word with a low frequency of appearance as a feature quantity, it is easy to generate|occur|produce scientific learning.

이에 대하여, 제4 실시형태에 의하면, m개의 문장에 포함되는 대부분(예를 들면, 모두)의 단어를 n개의 단어로서 추출하여 평가값행렬 DW를 생성하고, 이 평가값행렬 DW로 표현되는 특징을 반영시킨 상태로 차원 압축한 평가값행렬 DW_SVD를 산출할 수 있다. 이로써, 적은 계산 부하로, 학습에 의한 예측 모델의 생성 및 상기 생성한 예측 모델을 사용한 인지증의 중증도로 예측을 보다 정확하게 행할 수 있다.On the other hand, according to the fourth embodiment, the evaluation value matrix DW is generated by extracting most (eg, all) words included in the m sentences as n words, and the characteristic expressed by the evaluation value matrix DW It is possible to calculate the dimension-compressed evaluation value matrix DW _{SVD in a state where .} Thereby, with a small computational load, it is possible to more accurately predict generation of a predictive model by learning and the severity of dementia using the generated predictive model.

그리고, 여기서는 차원 압축의 일례로서 SVD를 사용하는 예에 대하여 설명하였으나, 본 발명은 이것으로 한정되지 않는다. 예를 들면, 주성분분석(principal component analysis: PCA) 등 다른 차원 압축 방법을 사용해도 된다.Incidentally, an example of using SVD as an example of dimensional compression has been described here, but the present invention is not limited thereto. For example, other dimensional compression methods such as principal component analysis (PCA) may be used.

또한, 도 9에서는, 제1 실시예에 있어서 생성하는 문장·단어 사이의 평가값행렬 DW를 차원 압축하는 예에 대하여 설명하였으나, 도 10의 (a)와 같이 제2 실시형태에 있어서 생성하는 문장·품사 사이의 평가값행렬 DH를 차원 압축하는 경우도 동일하게 행하는 것이 가능하다. 이에 대하여, 도 10의 (b)와 같이 제3 실시형태에 있어서 생성하는 제1 평가값행렬 DW 및 제2 평가값행렬 DH를 차원 압축하는 경우에는, 하기와 같은 태양으로 행할 수 있다.In Fig. 9, an example of dimensionally compressing the evaluation value matrix DW between sentences and words generated in the first embodiment has been described. As shown in Fig. 10(a), a sentence generated in the second embodiment The same can be done in the case of dimensional compression of the evaluation value matrix DH between parts of speech. In contrast, when the first evaluation value matrix DW and the second evaluation value matrix DH generated in the third embodiment are dimensionally compressed as shown in Fig. 10B, it can be performed in the following manner.

예를 들면, 제1 평가값행렬 DW 및 제2 평가값행렬 DH의 각각에 대하여 개별적으로 차원 압축을 행할 수 있다. 즉, m×n차원의 제1 평가값행렬 DW를 m×k차원의 제1 평가값행렬 DW_SVD로 차원 압축하고 또한 m×p차원의 제2 평가값행렬 DH를 m×k차원의 제2 평가값행렬 DH_SVD로 차원 압축한다. 다른 예로서, 도 8의 (a)와 같이 제1 지표값 행렬 DW와 제2 지표값 행렬 DH를 옆으로 배열하여 m×(n+p)차원의 1개의 지표값 행렬을 생성하고, 이 생성한 지표값 행렬을 m×k차원의 평가값행렬로 차원 압축하도록 해도 된다.For example, dimensional compression may be performed individually for each of the first evaluation value matrix DW and the second evaluation value matrix DH. That is, the first evaluation value matrix DW of m × n dimension _{is dimensionally compressed into the first evaluation value matrix DW SVD} of m × k dimension, and the second evaluation value matrix DH of m × p dimension is converted into the second evaluation value matrix DH of m × k dimension. The evaluation value matrix DH _SVD is dimensionally compressed. As another example, as shown in (a) of FIG. 8 , one index value matrix of m×(n+p) dimension is generated by arranging the first index value matrix DW and the second index value matrix DH sideways, and this generation One index value matrix may be dimensionally compressed into an m x k-dimensional evaluation value matrix.

(제5 실시형태)(fifth embodiment)

다음으로, 본 발명에 의한 제5 실시형태를 도면에 기초하여 설명한다. 도 11은, 제5 실시형태에 의한 인지증 예측 장치의 기능 구성예를 나타내는 블록도이다. 이 도 11에 있어서, 도 1에 나타낸 부호와 동일한 부호를 부여한 것은 동일한 기능을 가지는 것이므로, 여기서는 중복되는 설명을 생략한다. 그리고, 이하에서는, 제1 실시예에 대한 변형예로서 제5 실시형태를 설명하지만, 제2 실시형태∼제4 실시형태 중 어느 하나에 대한 변형예로서도 동일하게 제5 실시형태를 적용할 수 있다.Next, a fifth embodiment according to the present invention will be described with reference to the drawings. 11 is a block diagram showing an example of the functional configuration of the dementia prediction apparatus according to the fifth embodiment. In this FIG. 11, since the code|symbol same as the code|symbol shown in FIG. 1 has the same function, the overlapping description is abbreviate|omitted here. Incidentally, the fifth embodiment will be described below as a modification to the first embodiment, but the fifth embodiment is similarly applicable as a modification to any one of the second to fourth embodiments. .

도 11에 나타낸 바와 같이, 제4 실시형태에 의한 인지증 예측 장치는, 학습용 데이터 입력부(10), 예측 모델 생성부(14A), 인지증 예측부(21A) 및 예측 모델 기억부(30A) 대신에, 학습용 데이터 입력부(10E), 예측 모델 생성부(14E), 인지증 예측부(21E) 및 예측 모델 기억부(30E)를 구비하고 있다. 그리고, 학습용 데이터 입력부(10E), 관계성 지표값 산출부(100A) 및 예측 모델 생성부(14E)에 의해, 본 발명의 예측 모델 생성 장치가 구성된다.As shown in Fig. 11 , the dementia prediction device according to the fourth embodiment replaces the learning data input unit 10, the predictive model generation unit 14A, the dementia prediction unit 21A, and the predictive model storage unit 30A. A data input unit 10E for learning, a prediction model generation unit 14E, a dementia prediction unit 21E, and a prediction model storage unit 30E are provided. And the predictive model generating apparatus of this invention is comprised by the data input part 10E for learning, the relationship index value calculating part 100A, and the predictive model generating part 14E.

학습용 데이터 입력부(10E)는, 인지증의 복수의 평가 항목마다 중증도가 기지인 m인의 환자가 행한 자유회화의 내용을 각각 나타낸 m개의 문장을 학습용 데이터로서 입력한다. 인지증의 복수의 평가 항목마다의 중증도는, MMSE의 5개의 평가 항목, 즉 소재식, 기억력, 주의력(계산력), 언어적 능력, 구성력(도형적 능력)마다의 각 스코어의 값인 것을 의미한다.The learning data input unit 10E inputs, as learning data, m sentences each representing the contents of free conversations performed by m patients of known severity for each of the plurality of evaluation items of dementia. The severity of each of the plurality of evaluation items of dementia means the value of each score for each of the five evaluation items of the MMSE, namely, whereabouts, memory, attention (calculation ability), verbal ability, and construction power (graphical ability).

예측 모델 생성부(14E)는, 관계성 지표값 산출부(100A)에 의해 산출된 m×n개의 관계성 지표값을 사용하여, 1개의 문장 d_i에 대하여 n개의 관계성 지표값 dw_ij(j=1, 2, ··· , n)로 이루어지는 문장지표값군를 바탕으로 인지증의 평가 항목마다의 중증도를 예측하기 위한 예측 모델을 생성한다. 여기서 예측하는 인지증의 중증도는, MMSE의 5개의 평가 항목마다의 스코어의 값이다.The predictive model generating unit 14E uses the m × n relational index values calculated by the relational index value calculating unit 100A, and for one sentence d _i , n relational index values dw _ij ( A predictive model for predicting the severity of each evaluation item of cognitive impairment is generated based on the sentence index value group consisting of j = 1, 2, ... , n). The severity of dementia predicted here is a value of the score for each of the five evaluation items of MMSE.

즉, 예측 모델 생성부(14E)는, MMSE의 소재식, 기억력, 주의력, 언어적 능력, 구성력의 각 스코어가 기지(예를 들면, 각각 ×1점, ×2점, ×3점, ×4점, ×5점)인 환자의 자유회화를 바탕으로 산출되는 문장지표값군에 대해서는, 각 평가 항목에 대하여 각각 ×1점, ×2점, ×3점, ×4점, ×5점에 가능한 근접한 스코어가 예측되는 예측 모델을 생성한다. 그리고, 예측 모델 생성부(14E)는, 생성한 예측 모델을 예측 모델 기억부(30E)에 기억시킨다.That is, the predictive model generation unit 14E knows each score of MMSE's whereabouts, memory, attention, verbal ability, and constructive power (for example, x1, x2, x3, x4, respectively). For the sentence index value group calculated based on the patient's free conversation with points, ×5), the closest possible score to ×1, ×2, ×3, ×4, and ×5 points for each evaluation item, respectively. Create a predictive model from which scores are predicted. Then, the predictive model generation unit 14E stores the generated predictive model in the predictive model storage unit 30E.

예측 모델 생성부(14E)는, 지표값 산출부(13A)에 의해 산출된 m×n개의 관계성 지표값 dw₁₁∼dw_mn을 사용하고, 각 문장 d_i(i=1, 2, ··· , m)의 문장지표값군에 대하여 각각 인지증의 평가 항목마다의 중증도와 관련이 있는 특징량을 평가 항목마다 산출하여, 상기 산출한 특징량에 기초하여, 1개의 문장지표값군으로부터 인지증의 평가 항목마다의 중증도를 예측하기 위한 예측 모델을 생성한다. 여기서, 예측 모델 생성부(14E)가 생성하는 예측 모델은, 문장 d_i의 문장지표값군을 입력으로 하여, MMSE의 평가 항목마다의 스코어를 해로서 출력하는 학습 모델이다.The predictive model generation unit 14E uses the m × n relation index values dw _{11 to} _{dw mn} calculated by the index value calculation unit 13A, and each sentence d _i (i = 1, 2, ... For each sentence index value group of , m), a feature amount related to the severity of each evaluation item of dementia is calculated for each evaluation item, and based on the calculated feature amount, from one sentence index value group, the A predictive model for predicting the severity of each evaluation item is generated. Here, the predictive model generated by the predictive model generation unit 14E is _{a learning model that takes the sentence index value group of the sentence d i} as an input and outputs the score for each evaluation item of the MMSE as a solution.

제5 실시형태에 있어서도, 예측 모델 생성부(14E)가 예측 모델을 생성할 때 산출하는 특징량은, 소정의 알고리즘에 의해 산출되는 것이면 된다. 바꾸어 말하면, 예측 모델 생성부(14E)에 있어서 행하는 특징량의 산출 방법은, 임의로 설계가 가능하다. 예를 들면, 예측 모델 생성부(15E)는, 각 문장 d_i의 문장지표값군의 각각에 대하여, 평가 항목마다, 가중치 계산에 의해 얻어지는 값이 인지증의 평가 항목마다의 중증도를 나타낸 기지의 값(MMSE의 평가 항목마다의 스코어)에 근접하도록 소정의 가중치 계산을 행하고, 문장지표값군에 대한 가중치를 평가 항목마다의 특징량으로서 사용하여, 문장 d_i의 문장지표값군으로부터 인지증의 평가 항목마다의 중증도(MMSE의 평가 항목마다의 스코어)을 예측하기 위한 예측 모델을 생성한다.Also in the fifth embodiment, the feature amount calculated by the predictive model generation unit 14E when generating the predictive model may be calculated by a predetermined algorithm. In other words, the calculation method of the feature amount performed in the predictive model generation unit 14E can be arbitrarily designed. For example, the predictive model generation unit 15E, for each _{of the sentence index value groups of each sentence d i} , for each evaluation item, a value obtained by weight calculation is a known value indicating the severity of each evaluation item of dementia. (Score for each evaluation item of the MMSE), a predetermined weight is calculated so as to approximate, and the weight for the sentence index value group is used as a feature amount for each evaluation item, and from the sentence index value group of the sentence _{d i for each evaluation item of dementia} A predictive model for predicting the severity (score for each evaluation item of MMSE) of

예를 들면, 예측 모델 생성부(14E)는, 문장 d_i의 문장지표값군에 대하다 n개의 가중치{a_i1, a_i2, ··· , a_in} 중, 어느 하나 또는 복수의 가중치를 특징량으로하여 제1 평가 항목(소재식)의 스코어를 예측하고, 다른 하나 또는 복수의 가중치를 특징량으로 하여 제2 평가 항목(기억력)의 스코어를 예측하고, 이하 동일하게 또 다른 하나 또는 복수의 가중치를 특징량으로 하여 제3 평가 항목∼제5 평가 항목(주의력, 언어적 능력, 구성력)의 스코어를 예측하는 예측 모델을 생성한다.For example, the predictive model generation unit 14E assigns any one or a plurality of weights out of _{n weights {a i1} , a _i2 , ... , a _in } to the sentence index value group of the _{sentence d i as a feature quantity.} to predict the score of the first evaluation item (locus), and predict the score of the second evaluation item (memory) by using another one or a plurality of weights as a feature amount, and similarly, another one or a plurality of weights A predictive model for predicting the scores of the third to fifth evaluation items (attention, verbal ability, and constructive power) is generated using .

인지증 예측부(21E)는, 예측용 데이터 입력부(20)에 의해 입력된 예측용 데이터에 대하여 단어 추출부(11A), 문장 벡터 산출부(121), 단어 벡터 산출부(122) 및 지표값 산출부(13A)의 처리를 실행함으로써 얻어지는 관계성 지표값을, 예측 모델 생성부(14E)에 의해 생성된 예측 모델(예측 모델 기억부(30E)에 기억된 예측 모델)에 적용함으로써, 예측 대상으로 하는 m' 인의 환자에 대하여 인지증의 평가 항목마다의 중증도를 예측한다.The dementia prediction unit 21E includes a word extraction unit 11A, a sentence vector calculation unit 121, a word vector calculation unit 122, and an index value with respect to the prediction data input by the prediction data input unit 20 . A prediction target by applying the relation index value obtained by executing the processing of the calculation unit 13A to the prediction model generated by the prediction model generation unit 14E (the prediction model stored in the prediction model storage unit 30E). The severity of each evaluation item of cognitive impairment is predicted for a patient of m' who is

이상과 같이 구성한 제5 실시형태에 의하면, 미니 멘탈 스테이트 검사(MMSE)를 행하지 않고, MMSE의 평가 항목마다의 스코어를 예측할 수 있다.According to the fifth embodiment configured as described above, the score for each evaluation item of the MMSE can be predicted without performing the mini-mental state test (MMSE).

그리고, 여기서는 MMSE의 5개의 평가 항목마다 스코어를 예측하는 예에 대하여 설명하였으나, 상기 5개의 평가 항목을 보다 세분화한 보다 많은 평가 항목마다 스코어를 예측하도록 해도 된다.In addition, although the example of predicting a score for every five evaluation items of MMSE was demonstrated here, you may make it predict the score for each more evaluation item which further subdivided the said five evaluation items.

상기 제1∼제5 실시형태에서는, 학습기와 예측기를 구비한 인지증 예측 장치에 대하여 예시하였으나, 학습기만을 구비한 예측 모델 생성 장치와, 예측기만을 구비한 인지증 예측 장치를 별도로 구성해도 된다. 학습기만을 구비한 예측 모델 생성 장치의 구성은, 상기 제1∼제5 실시형태에서 설명한 바와 같다. 한편, 예측기만을 구비한 인지증 예측 장치의 구성은, 예를 들면, 도 12에 나타낸 바와 같다.In the above first to fifth embodiments, the apparatus for predicting dementia provided with a learner and a predictor has been exemplified, but the apparatus for generating a predictive model equipped with only the learner and the apparatus for predicting dementia including only the predictor may be configured separately. The configuration of the predictive model generating apparatus including only the learner is as described in the first to fifth embodiments. On the other hand, the configuration of the dementia prediction apparatus including only the predictor is as shown in Fig. 12, for example.

도 12에 있어서, 제2 요소 추출부(11')는, 단어 추출부(11A), 품사 추출부(11B), 또는, 단어 추출부(11A) 및 품사 추출부(11B)의 조합 중 어느 하나와 동일한 기능을 가지는 것이다. 제2 문장 벡터 산출부(121')는, 문장 벡터 산출부(121)와 동일한 기능을 가지는 것이다. 제2 요소 벡터 산출부(120')는, 단어 벡터 산출부(122), 품사 벡터 산출부(123), 또는, 단어 벡터 산출부(122) 및 품사 벡터 산출부(123)의 조합 중 어느 하나와 동일한 기능을 가지는 것이다. 제2 지표값 산출부(13')는, 지표값 산출부(13A∼13E) 중 어느 하나와 동일한 기능을 가지는 것이다. 인지증 예측부(21')는, 인지증 예측부(21A∼21E) 중 어느 하나와 동일한 기능을 가지는 것이다. 예측 모델 기억부(30')는, 예측 모델 기억부(30A∼30E) 중 어느 하나와 동일한 예측 모델을 기억하는 것이다.In FIG. 12 , the second element extraction unit 11 ′ includes any one of a word extraction unit 11A, a part-of-speech extraction unit 11B, or a combination of the word extraction unit 11A and the part-of-speech extraction unit 11B. has the same function as The second sentence vector calculating unit 121 ′ has the same function as the sentence vector calculating unit 121 . The second element vector calculating unit 120 ′ is configured to be any one of the word vector calculating unit 122 , the part-of-speech vector calculating unit 123 , or a combination of the word vector calculating unit 122 and the part-of-speech vector calculating unit 123 . has the same function as The second index value calculation unit 13' has the same function as any one of the index value calculation units 13A to 13E. The dementia prediction unit 21' has the same function as any one of the dementia prediction units 21A to 21E. The predictive model storage unit 30' stores the same predictive model as any one of the predictive model storage units 30A to 30E.

또한, 상기 제1∼제5 실시형태에서는, 「인지증의 중증도」가 MMSE의 스코어인 경우의 예, 즉 MMSE의 스코어를 예측하는 예에 대하여 설명하였으나, 본 발명은 이것으로 한정되지 않는다. 예를 들면, 인지증의 중증도는, MMSE의 스코어의 최대값 미만 또한 2 이상의 수로 분류한 카테고리로 해도 된다. 예를 들면, MMSE의 스코어가 30∼27 점인 경우에는 인지증의 의혹 없음, 26∼22 점인 경우에는 경도 인지증 장애의 의혹 있음, 21점 이하인 경우에는 인지증의 의혹 있음과 같이 인지증의 중증도를 3개의 카테고리로 분류하고, 환자가 어느 분류에 해당하는지를 예측하도록 해도 된다.Moreover, in the said 1st - 5th embodiment, the example when "severity of dementia" is the score of MMSE, ie, the example of predicting the score of MMSE, was demonstrated, but this invention is not limited to this. For example, the severity of dementia may be classified into a category that is less than the maximum value of the MMSE score and is classified into a number of two or more. For example, if the MMSE score is 30 to 27, there is no suspicion of dementia, if it is 26 to 22, there is suspicion of mild cognitive impairment, and if it is 21 or less, there is suspicion of dementia. may be classified into three categories, and the patient may be predicted to which classification belongs.

이 경우에, 예를 들면, 제1 실시예에 있어서 예측 모델 생성부(14A)는, MMSE의 스코어가 30∼27 점인 것이 기지의 환자의 자유회화에 대응하는 문장 데이터를 바탕으로 산출되는 문장지표값군에 대해서는 「인지증의 의혹 없음」의 제1 카테고리로 분류되고, MMSE의 스코어가 26∼22 점인 것이 기지의 환자의 자유회화에 대응하는 문장 데이터를 바탕으로 산출되는 문장지표값군에 대해서는 「경도 인지증 장애의 의혹 있음」의 제2 카테고리로 분류되고, MMSE의 스코어가 21점 이하인 것이 기지의 환자의 자유회화에 대응하는 문장 데이터를 바탕으로 산출되는 문장지표값군에 대해서는 「인지증의 의혹 있음」의 제3 카테고리로 분류되는 예측 모델을 생성한다.In this case, for example, in the first embodiment, the predictive model generating unit 14A is a sentence index calculated on the basis of sentence data corresponding to the patient's free speech in which it is known that the MMSE score is 30 to 27 points. For the value group, it is classified into the first category of "no suspicion of dementia", and for the sentence index value group calculated based on the sentence data corresponding to the patient's free speech, it is known that the MMSE score is 26 to 22 points, "hardness" It is classified into the second category of “suspicious of cognitive impairment”, and for the sentence index value group calculated based on sentence data corresponding to the patient’s free speech, for which it is known that the MMSE score is 21 or less, “there is suspicion of dementia” Create a predictive model classified into the third category of

예를 들면, 예측 모델 생성부(14A)는, 각 문장 d_i의 문장지표값군에 대하여 각각 특징량을 산출하고, 상기 산출한 특징량의 값에 따라, 마르코프(markov) 연쇄 몬테카를로법에 의한 카테고리 분리의 최적화를 행함으로써, 각 문장 d_i를 복수의 카테고리로 분류하기 위한 예측 모델을 생성한다. 여기서, 예측 모델 생성부(14A)가 생성하는 예측 모델은, 문장지표값군을 입력으로 하여, 예측하고자 하는 복수의 카테고리 중 어느 하나를 해로서 출력하는 학습 모델이다. 혹은, 어느 하나의 카테고리로 분류되는 확률을 수치로서 출력하는 학습 모델로 해도 된다. 학습 모델의 형태는 임의이다.For example, the predictive model generation unit 14A _{calculates a feature amount for each sentence index value group of each sentence d i} , and according to the value of the calculated feature amount, the category by the Markov chain Monte Carlo method By optimizing the separation, _{a predictive model for classifying each sentence d i} into a plurality of categories is generated. Here, the predictive model generated by the predictive model generation unit 14A is a learning model that receives a sentence index value group as an input and outputs any one of a plurality of categories to be predicted as a solution. Or it is good also as a learning model which outputs the probability of being classified into any one category as a numerical value. The form of the learning model is arbitrary.

또한, 상기 제1∼제5 실시형태에서는, MMSE의 스코어를 기준으로 하여 인지증의 중증도를 예측하는 예에 대하여 설명하였으나, 본 발명은 이것으로 한정되지 않는다. 즉, MMSE의 스코어 이외로 인지증의 중증도를 파악하는 방법, 예를 들면, 개정판 하세가와(長谷川)식 간이 지능 평가 스케일(Hasegawa's Dementia Scale-Revised: HDS-R), ADAS-cog(Alzheimer's Disease Assessment Scale-cognitive subscale), CDR(Clinical Dementia Rating), CDT(Clock Drawing Test), COGNISTAT(Neurobehavioral Cognitive Status Examination), 세븐미니츠클리닝 등을 기준으로 하여 인지증의 중증도를 예측하도록 하는 것도 가능하다.Moreover, in the said 1st - 5th embodiment, although the example of predicting the severity of dementia based on the MMSE score was demonstrated, this invention is not limited to this. That is, a method for determining the severity of dementia other than the score of the MMSE, for example, the revised Hasegawa's Dementia Scale-Revised (HDS-R), ADAS-cog (Alzheimer's Disease Assessment Scale) -cognitive subscale), CDR (Clinical Dementia Rating), CDT (Clock Drawing Test), COGNISTAT (Neurobehavioral Cognitive Status Examination), Seven Minute Cleaning, etc., can be used to predict the severity of dementia.

또한, 상기 제1∼제5 실시형태에서는, 의사와 환자의 문진 형식에 의한 자유회화를 문자 데이터화하고, 이것을 인지증의 중증도에 관한 학습 및 예측에 이용하는 예에 대하여 설명하였으나, 본 발명은 이것으로 한정되지 않는다. 예를 들면, 환자가 일상생활 중에서 행하고 있는 자유회화를 문자 데이터화하고, 이것을 인지증의 중증도에 관한 학습 및 예측에 이용하도록 해도 된다.In addition, in the above first to fifth embodiments, examples have been described in which free conversations between doctors and patients in the form of questionnaires are converted into text data and used for learning and prediction regarding the severity of dementia. not limited For example, free conversations performed by the patient in daily life may be converted into text data, and this may be used for learning and predicting the severity of dementia.

그 외, 상기 제1∼제5 실시형태는, 모두 본 발명을 실시하는 데 있어서의 구체화의 일례를 나타낸 것에 지나지 않고, 이들에 의해 본 발명의 기술적 범위가 한정적으로 해석되어서는 안된다. 즉, 본 발명은 그 요지, 또는 그 주요한 특징으로부터 일탈하지 않고, 다양한 형태로 실시할 수 있다.In addition, the said 1st - 5th Embodiments all show only an example of embodiment in implementing this invention, and the technical scope of this invention should not be interpreted limitedly by these. That is, the present invention can be implemented in various forms without departing from the gist or main characteristics thereof.

10, 10E: 학습용 데이터 입력부
11A: 단어 추출부(요소 추출부)
11B: 품사 추출부(요소 추출부)
12A∼12E: 벡터 산출부
121: 문장 벡터 산출부(요소 벡터 산출부)
122: 단어 벡터 산출부(요소 벡터 산출부)
123: 품사 벡터 산출부(요소 벡터 산출부)
13A∼13C: 지표값 산출부
14A∼14E: 예측 모델 생성부
15: 차원 압축부
20: 예측용 데이터 입력부
21A∼21E: 인지증 예측부
30A∼30E: 예측 모델 기억부
100A∼100E: 관계성 지표값 산출부10, 10E: data input for training
11A: word extraction unit (element extraction unit)
11B: Part-of-speech extraction unit (element extraction unit)
12A to 12E: vector calculation unit
121: sentence vector calculation unit (element vector calculation unit)
122: word vector calculation unit (element vector calculation unit)
123: part-of-speech vector calculation unit (element vector calculation unit)
13A-13C: index value calculation unit
14A to 14E: predictive model generation unit
15: dimensional compression part
20: data input for prediction
21A-21E: dementia prediction unit
30A-30E: predictive model storage unit
100A~100E: Relational index value calculation unit

Claims

인지증(認知症)의 중증도가 기지(旣知)인 복수 인(人)의 환자가 행한 자유회화의 내용을 각각 나타낸 복수의 문장을 학습용 데이터로서 입력하는 학습용 데이터 입력부;
상기 학습용 데이터 입력부에 의해 상기 학습용 데이터로서 입력된 상기 복수의 문장을 형태소 해석하여, 상기 복수의 문장으로부터 복수의 분해 요소를 추출하는 요소 추출부;
상기 복수의 문장을 각각 소정의 룰에 따라 q차원(q는 2 이상의 임의의 정수)으로 벡터화함으로써, q개의 축 성분으로 이루어지는 복수의 문장 벡터를 산출하는 문장 벡터 산출부;
상기 복수의 분해 요소를 각각 소정의 룰에 따라 q차원으로 벡터화함으로써, q개의 축 성분으로 이루어지는 복수의 요소 벡터를 산출하는 요소 벡터 산출부;
상기 복수의 문장 벡터와 상기 복수의 요소 벡터의 내적을 각각 계산함으로써, 상기 복수의 문장 및 상기 복수의 분해 요소 사이의 관계성을 반영한 관계성 지표값을 산출하는 지표값 산출부;
상기 지표값 산출부에 의해 산출된 상기 관계성 지표값을 사용하여, 1개의 문장에 대하여 복수의 관계성 지표값으로 이루어지는 문장지표값군를 바탕으로 상기 인지증의 중증도를 예측하기 위한 예측 모델을 생성하는 예측 모델 생성부;
예측 대상으로 하는 환자가 행한 자유회화의 내용을 나타낸 문장을 예측용 데이터로서 입력하는 예측용 데이터 입력부; 및
상기 예측용 데이터 입력부에 의해 입력된 상기 예측용 데이터에 대하여 상기 요소 추출부, 상기 문장 벡터 산출부, 상기 요소 벡터 산출부 및 상기 지표값 산출부의 처리를 실행함으로써 얻어지는 관계성 지표값을, 상기 예측 모델 생성부에 의해 생성된 상기 예측 모델에 적용함으로써, 상기 예측 대상으로 하는 환자에 대하여 상기 인지증의 중증도를 예측하는 인지증 예측부;
를 포함하는 인지증 예측 장치.a learning data input unit for inputting, as learning data, a plurality of sentences each representing the contents of free conversations performed by a plurality of patients with known severity of dementia;
an element extraction unit for morphologically analyzing the plurality of sentences input as the learning data by the learning data input unit, and extracting a plurality of decomposition elements from the plurality of sentences;
a sentence vector calculating unit for calculating a plurality of sentence vectors composed of q axis components by vectorizing the plurality of sentences in a q dimension (q is an arbitrary integer greater than or equal to 2) according to a predetermined rule;
an element vector calculating unit for calculating a plurality of element vectors composed of q axis components by vectorizing the plurality of decomposition elements in q dimensions according to a predetermined rule;
an index value calculation unit for calculating a relationship index value reflecting the relationship between the plurality of sentences and the plurality of decomposition elements by calculating a dot product of the plurality of sentence vectors and the plurality of element vectors, respectively;
A predictive model for predicting the severity of dementia based on a sentence index value group comprising a plurality of relationship index values for one sentence is generated using the relation index value calculated by the index value calculation unit. a predictive model generation unit;
a prediction data input unit for inputting, as prediction data, a sentence representing the content of free conversation performed by a patient as a prediction target; and
A relation index value obtained by executing the processing of the element extraction unit, the sentence vector calculation unit, the element vector calculation unit, and the index value calculation unit on the prediction data input by the prediction data input unit is calculated as the prediction. a cognitive impairment predictor for predicting the severity of the dementia with respect to the patient to be predicted by applying to the predictive model generated by the model generating unit;
A dementia prediction device comprising a.

제1항에 있어서,
상기 학습용 데이터 입력부는, 인지증의 중증도가 기지인 m 인(m은 2 이상의 임의의 정수)의 환자가 행한 자유회화의 내용을 각각 나타낸 m개의 문장을 상기 학습용 데이터로서 입력하고,
상기 요소 추출부는, 상기 학습용 데이터 입력부에 의해 상기 학습용 데이터로서 입력된 상기 m개의 문장을 해석하여, 상기 m개의 문장으로부터 n개(n은 2 이상의 임의의 정수)의 단어를 추출하는 단어 추출부이며,
상기 문장 벡터 산출부는, 상기 m개의 문장을 각각 소정의 룰에 따라 q차원으로 벡터화함으로써, q개의 축 성분으로 이루어지는 m개의 문장 벡터를 산출하고,
상기 요소 벡터 산출부는, 상기 n개의 단어를 각각 소정의 룰에 따라 q차원으로 벡터화함으로써, q개의 축 성분으로 이루어지는 n개의 단어 벡터를 산출하는 단어 벡터 산출부이며,
상기 지표값 산출부는, 상기 m개의 문장 벡터와 상기 n개의 단어 벡터의 내적을 각각 계산함으로써, 상기 m개의 문장 및 상기 n개의 단어 사이의 관계성을 반영한 m×n개의 관계성 지표값을 산출하고,
상기 예측 모델 생성부는, 상기 지표값 산출부에 의해 산출된 상기 m×n개의 관계성 지표값을 사용하여, 1개의 문장에 대하여 n개의 관계성 지표값으로 이루어지는 문장지표값군를 바탕으로 상기 인지증의 중증도를 예측하기 위한 예측 모델을 생성하고,
상기 예측용 데이터 입력부는, 예측 대상으로 하는 m' 인(m'는 1 이상의 임의의 정수)의 환자가 행한 자유회화의 내용을 각각 나타낸 m'개의 문장을 예측용 데이터로서 입력하고,
상기 인지증 예측부는, 상기 예측용 데이터 입력부에 의해 입력된 상기 예측용 데이터에 대하여 상기 단어 추출부, 상기 문장 벡터 산출부, 상기 단어 벡터 산출부 및 상기 지표값 산출부의 처리를 실행함으로써 얻어지는 관계성 지표값을, 상기 예측 모델 생성부에 의해 생성된 상기 예측 모델에 적용함으로써, 상기 예측 대상으로 하는 m' 인의 환자에 대하여 상기 인지증의 중증도를 예측하는,
인지증 예측 장치.According to claim 1,
The learning data input unit inputs, as the learning data, m sentences each representing the contents of a free conversation performed by a patient with a known severity of dementia m (m is an arbitrary integer greater than or equal to 2),
The element extraction unit is a word extraction unit that interprets the m sentences input as the learning data by the learning data input unit and extracts n words (n is an arbitrary integer greater than or equal to 2) from the m sentences. ,
The sentence vector calculation unit calculates m sentence vectors composed of q axis components by vectorizing the m sentences in q dimensions according to a predetermined rule,
The element vector calculation unit is a word vector calculation unit that calculates n word vectors composed of q axis components by vectorizing the n words in q dimensions according to a predetermined rule,
The index value calculation unit calculates the dot product of the m sentence vectors and the n word vectors, respectively, thereby calculating m × n relational index values reflecting the relationship between the m sentences and the n words, ,
The predictive model generation unit, using the m × n relational index values calculated by the index value calculating unit, based on the sentence index value group consisting of n relational index values for one sentence, the cognitive impairment create a predictive model for predicting the severity of
The prediction data input unit inputs, as prediction data, m' sentences each representing the contents of a free conversation performed by a patient who is m' as a prediction target (m' is an arbitrary integer greater than or equal to 1),
The dementia prediction unit is a relation obtained by executing the processing of the word extraction unit, the sentence vector calculation unit, the word vector calculation unit, and the index value calculation unit on the prediction data input by the prediction data input unit. By applying an index value to the predictive model generated by the predictive model generating unit, predicting the severity of the dementia for the m' patient as the prediction target,
A cognitive predictor.

제1항에 있어서,
상기 학습용 데이터 입력부는, 인지증의 중증도가 기지인 m 인(m은 2 이상의 임의의 정수)의 환자가 행한 자유회화의 내용을 각각 나타낸 m개의 문장을 상기 학습용 데이터로서 입력하고,
상기 요소 추출부는, 상기 학습용 데이터 입력부에 의해 상기 학습용 데이터로서 입력된 상기 m개의 문장을 해석하여, 상기 m개의 문장으로부터 p개(p는 2 이상의 임의의 정수)의 품사를 추출하는 품사 추출부이며,
상기 문장 벡터 산출부는, 상기 m개의 문장을 각각 소정의 룰에 따라 q차원으로 벡터화함으로써, q개의 축 성분으로 이루어지는 m개의 문장 벡터를 산출하고,
상기 요소 벡터 산출부는, 상기 p개의 품사를 각각 소정의 룰에 따라 q차원으로 벡터화함으로써, q개의 축 성분으로 이루어지는 p개의 품사 벡터를 산출하는 품사 벡터 산출부이며,
상기 지표값 산출부는, 상기 m개의 문장 벡터와 상기 p개의 품사 벡터의 내적을 각각 계산함으로써, 상기 m개의 문장 및 상기 p개의 품사 사이의 관계성을 반영한 m×p개의 관계성 지표값을 산출하고,
상기 예측 모델 생성부는, 상기 지표값 산출부에 의해 산출된 상기 m×p개의 관계성 지표값을 사용하여, 1개의 문장에 대하여 p개의 관계성 지표값으로 이루어지는 문장지표값군를 바탕으로 상기 인지증의 중증도를 예측하기 위한 예측 모델을 생성하고,
상기 인지증 예측부는, 상기 예측용 데이터 입력부에 의해 입력된 상기 예측용 데이터에 대하여 상기 품사 추출부, 상기 문장 벡터 산출부, 상기 품사 벡터 산출부 및 상기 지표값 산출부의 처리를 실행함으로써 얻어지는 관계성 지표값을, 상기 예측 모델 생성부에 의해 생성된 상기 예측 모델에 적용함으로써, 상기 예측 대상으로 하는 m' 인의 환자에 대하여 상기 인지증의 중증도를 예측하는,
인지증 예측 장치.According to claim 1,
The learning data input unit inputs, as the learning data, m sentences each representing the contents of a free conversation performed by a patient with a known severity of dementia m (m is an arbitrary integer greater than or equal to 2),
The element extraction unit interprets the m sentences input as the learning data by the learning data input unit, and extracts p parts of speech (p is an arbitrary integer greater than or equal to 2) from the m sentences. ,
The sentence vector calculation unit calculates m sentence vectors composed of q axis components by vectorizing the m sentences in q dimensions according to a predetermined rule,
The element vector calculation unit is a part-of-speech vector calculation unit that calculates p parts-of-speech vectors composed of q axis components by vectorizing the p parts-of-speech in q dimensions according to a predetermined rule,
The index value calculating unit calculates m × p relation index values reflecting the relationship between the m sentences and the p parts of speech by calculating the dot products of the m sentence vectors and the p parts of speech vectors, respectively, ,
The predictive model generation unit, using the m × p relational index values calculated by the index value calculating unit, based on the sentence index value group consisting of p relational index values for one sentence, the cognitive impairment create a predictive model for predicting the severity of
A relation obtained by executing the processing of the part-of-speech extraction unit, the sentence vector calculation unit, the part-of-speech vector calculation unit, and the index value calculation unit with respect to the prediction data input by the prediction data input unit, in the dementia prediction unit By applying an index value to the predictive model generated by the predictive model generating unit, predicting the severity of the dementia for the m' patient as the prediction target,
A cognitive predictor.

제1항에 있어서,
상기 학습용 데이터 입력부는, 인지증의 중증도가 기지인 m 인(m은 2 이상의 임의의 정수)의 환자가 행한 자유회화의 내용을 각각 나타낸 m개의 문장을 상기 학습용 데이터로서 입력하고,
상기 요소 추출부는, 상기 학습용 데이터 입력부에 의해 상기 학습용 데이터로서 입력된 상기 m개의 문장을 해석하여, 상기 m개의 문장으로부터 n개(n은 2 이상의 임의의 정수)의 단어를 추출하는 단어 추출부와, 상기 학습용 데이터 입력부에 의해 상기 학습용 데이터로서 입력된 상기 m개의 문장을 해석하여, 상기 m개의 문장으로부터 p개(p는 2 이상의 임의의 정수)의 품사를 추출하는 품사 추출부를 포함하고,
상기 문장 벡터 산출부는, 상기 m개의 문장을 각각 소정의 룰에 따라 q차원으로 벡터화함으로써, q개의 축 성분으로 이루어지는 m개의 문장 벡터를 산출하고,
상기 요소 벡터 산출부는, 상기 n개의 단어를 각각 소정의 룰에 따라 q차원으로 벡터화함으로써, q개의 축 성분으로 이루어지는 n개의 단어 벡터를 산출하는 단어 벡터 산출부와, 상기 p개의 품사를 각각 소정의 룰에 따라 q차원으로 벡터화함으로써, q개의 축 성분으로 이루어지는 p개의 품사 벡터를 산출하는 품사 벡터 산출부를 포함하고,
상기 지표값 산출부는, 상기 m개의 문장 벡터와 상기 n개의 단어 벡터의 내적을 각각 계산함으로써, 상기 m개의 문장 및 상기 n개의 단어 사이의 관계성을 반영한 m×n개의 관계성 지표값을 산출하고 또한, 상기 m개의 문장 벡터와 상기 p개의 품사 벡터의 내적을 각각 계산함으로써, 상기 m개의 문장 및 상기 p개의 품사 사이의 관계성을 반영한 m×p개의 관계성 지표값을 산출하고,
상기 예측 모델 생성부는, 상기 지표값 산출부에 의해 산출된 상기 m×n개의 관계성 지표값 및 상기 m×p개의 관계성 지표값을 사용하여, 1개의 문장에 대하여 n개의 관계성 지표값으로 이루어지는 문장지표값군 및 p개의 관계성 지표값으로 이루어지는 문장지표값군를 바탕으로 상기 인지증의 중증도를 예측하기 위한 예측 모델을 생성하고,
상기 인지증 예측부는, 상기 예측용 데이터 입력부에 의해 입력된 상기 예측용 데이터에 대하여 상기 단어 추출부, 상기 품사 추출부, 상기 문장 벡터 산출부, 상기 단어 벡터 산출부, 상기 품사 벡터 산출부 및 상기 지표값 산출부의 처리를 실행함으로써 얻어지는 상기 관계성 지표값을, 상기 예측 모델 생성부에 의해 생성된 상기 예측 모델에 적용함으로써, 상기 예측 대상으로 하는 m' 인의 환자에 대하여 상기 인지증의 중증도를 예측하는,
인지증 예측 장치.According to claim 1,
The learning data input unit inputs, as the learning data, m sentences each representing the contents of a free conversation performed by a patient with a known severity of dementia m (m is an arbitrary integer greater than or equal to 2),
The element extraction unit interprets the m sentences input as the learning data by the learning data input unit, and extracts n words (n is an arbitrary integer greater than or equal to 2) from the m sentences; , a part-of-speech extraction unit that interprets the m sentences input as the learning data by the learning data input unit and extracts p parts of speech (p is an arbitrary integer greater than or equal to 2) from the m sentences,
The sentence vector calculation unit calculates m sentence vectors composed of q axis components by vectorizing the m sentences in q dimensions according to a predetermined rule,
The element vector calculation unit includes a word vector calculation unit that calculates n word vectors composed of q axis components by vectorizing the n words in q dimensions according to a predetermined rule, and sets the p parts of speech to a predetermined value, respectively. a part-of-speech vector calculating unit that calculates p parts-of-speech vectors composed of q axis components by vectorizing in q dimensions according to a rule,
The index value calculation unit calculates the dot product of the m sentence vectors and the n word vectors, respectively, thereby calculating m × n relational index values reflecting the relationship between the m sentences and the n words, In addition, by calculating the dot product of the m sentence vectors and the p parts-of-speech vectors, respectively, m × p relation index values reflecting the relation between the m sentences and the p parts-of-speech are calculated,
The predictive model generation unit, by using the m × n relational index values and the m × p relational index values calculated by the index value calculating unit, is configured as n relational index values for one sentence. generating a predictive model for predicting the severity of the dementia based on the sentence indicator value group consisting of the sentence indicator value group and the sentence indicator value group consisting of p relation indicator values;
The cognitive impairment prediction unit may include the word extraction unit, the part-of-speech extraction unit, the sentence vector calculation unit, the word vector calculation unit, the part-of-speech vector calculation unit, and the prediction data input by the prediction data input unit. By applying the relation index value obtained by executing the processing of the index value calculating unit to the predictive model generated by the predictive model generating unit, predicting the severity of the dementia for the m' patient as the prediction target doing,
A cognitive predictor.

제1항 내지 제4항 중 어느 한 항에 있어서,
상기 지표값 산출부에 의해 산출된 상기 관계성 지표값에 대하여 소정의 차원 압축 처리를 행함으로써, 차원 압축된 관계성 지표값을 산출하는 차원 압축부를 더 포함하고,
상기 예측 모델 생성부는, 상기 차원 압축부에 의해 차원 압축된 관계성 지표값을 사용하여, 1개의 문장에 대하여 복수의 관계성 지표값으로 이루어지는 문장지표값군를 바탕으로 상기 인지증의 중증도를 예측하기 위한 예측 모델을 생성하고,
상기 인지증 예측부는, 상기 지표값 산출부에 의해 산출된 관계성 지표값에 대하여 또한 상기 차원 압축부의 처리를 실행함으로써 얻어지는 관계성 지표값을, 상기 예측 모델 생성부에 의해 생성된 상기 예측 모델에 적용함으로써, 상기 예측 대상으로 하는 환자에 대하여 상기 인지증의 중증도를 예측하는, 인지증 예측 장치.5. The method according to any one of claims 1 to 4,
A dimension compression unit for calculating a dimensionally compressed relational index value by performing a predetermined dimensional compression process on the relational index value calculated by the index value calculating unit;
The predictive model generator is configured to predict the severity of the dementia based on a sentence index value group including a plurality of relationship index values for one sentence using the relationship index value dimensionally compressed by the dimension compression unit. create a predictive model for
The dementia prediction unit is configured to apply a relation index value obtained by further processing the dimension compression unit to the predictive model generated by the predictive model generation unit with respect to the relation index value calculated by the index value calculation unit. A dementia prediction apparatus for predicting the severity of the dementia with respect to the patient to be predicted by applying it.

제1항 내지 제4항 중 어느 한 항에 있어서,
상기 예측 모델 생성부는, 상기 문장지표값군에 대하여 상기 인지증의 중증도와 관련이 있는 특징량을 산출하고, 상기 산출한 특징량에 기초하여, 상기 문장지표값군으로부터 상기 인지증의 중증도를 예측하기 위한 상기 예측 모델을 생성하는, 인지증 예측 장치.5. The method according to any one of claims 1 to 4,
The predictive model generator calculates a feature amount related to the severity of the dementia with respect to the sentence index value group, and based on the calculated feature amount, predicts the severity of the dementia from the sentence index value group For generating the predictive model, dementia prediction device.

제6항에 있어서,
상기 예측 모델 생성부는, 상기 문장지표값군에 대하여, 가중치 계산에 의해 얻어지는 값이 상기 인지증의 중증도를 나타낸 기지의 값에 가까워지도록 소정의 가중치 계산을 행하고, 상기 문장지표값군에 대한 가중치를 상기 특징량으로서 사용하여, 상기 문장지표값군으로부터 상기 인지증의 중증도를 예측하기 위한 상기 예측 모델을 생성하는, 인지증 예측 장치.7. The method of claim 6,
The predictive model generation unit performs a predetermined weight calculation for the sentence index value group so that a value obtained by the weight calculation approaches a known value representing the severity of the dementia, and sets the weight for the sentence index value group as the feature A cognitive impairment prediction apparatus for generating the predictive model for predicting the severity of the dementia from the sentence index value group by using it as an amount.

제1항 내지 제4항 중 어느 한 항에 있어서,
상기 학습용 데이터 입력부는, 상기 인지증의 복수의 평가 항목마다 중증도가 기지인 복수 인의 환자가 행한 자유회화의 내용을 각각 나타낸 복수의 문장을 학습용 데이터로서 입력하고,
상기 예측 모델 생성부는, 상기 문장지표값군를 바탕으로 상기 인지증의 상기 평가 항목마다의 중증도를 예측하기 위한 예측 모델을 생성하고,
상기 인지증 예측부는, 상기 예측 대상으로 하는 환자에 대하여 상기 인지증의 상기 평가 항목마다의 중증도를 예측하는, 인지증 예측 장치.5. The method according to any one of claims 1 to 4,
The learning data input unit inputs, as learning data, a plurality of sentences each representing the content of free conversations performed by a plurality of patients with known severity for each of the plurality of evaluation items of the dementia,
The predictive model generation unit generates a predictive model for predicting the severity of each evaluation item of the cognitive impairment based on the sentence index value group,
The cognitive impairment prediction unit predicts a severity of the cognitive impairment for each evaluation item with respect to the prediction target patient.

제8항에 있어서,
상기 예측 모델 생성부는, 상기 문장지표값군에 대하여 상기 인지증의 상기 평가 항목마다의 중증도와 관련이 있는 특징량을 상기 평가 항목마다 산출하고, 상기 산출한 특징량에 기초하여, 상기 문장지표값군으로부터 상기 인지증의 상기 평가 항목마다의 중증도를 예측하기 위한 상기 예측 모델을 생성하는, 인지증 예측 장치.9. The method of claim 8,
The predictive model generation unit calculates, for each evaluation item, a feature amount related to the severity of each evaluation item of the cognitive impairment with respect to the sentence index value group, and based on the calculated feature amount, from the sentence index value group A cognitive impairment prediction device that generates the predictive model for predicting the severity of each of the evaluation items of the cognitive impairment.

제9항에 있어서,
상기 예측 모델 생성부는, 상기 문장지표값군에 대하여, 상기 평가 항목마다, 가중치 계산에 의해 얻어지는 값이 상기 인지증의 상기 평가 항목마다의 중증도를 나타낸 기지의 값에 가까워지도록 소정의 가중치 계산을 행하고, 상기 문장지표값군에 대한 가중치를 상기 평가 항목마다의 상기 특징량으로서 사용하여, 상기 문장지표값군으로부터 상기 인지증의 상기 평가 항목마다의 중증도를 예측하기 위한 상기 예측 모델을 생성하는, 인지증 예측 장치.10. The method of claim 9,
The predictive model generation unit performs a predetermined weight calculation for the sentence index value group so that, for each evaluation item, a value obtained by weight calculation is close to a known value indicating the severity of each evaluation item of the dementia, A cognitive impairment predicting device for generating the predictive model for predicting the severity of each evaluation item of the dementia from the sentence index value group by using a weight for the sentence index value group as the feature amount for each evaluation item .

제1항 내지 제4항 중 어느 한 항에 있어서,
상기 인지증의 중증도는, 미니 멘탈 스테이트 검사(Mini-Mental State Examination)의 스코어값인, 인지증 예측 장치.5. The method according to any one of claims 1 to 4,
The dementia prediction device, wherein the severity of dementia is a score value of a Mini-Mental State Examination.

제1항 내지 제4항 중 어느 한 항에 있어서,
상기 인지증의 중증도는, 미니 멘탈 스테이트 검사의 스코어의 최대값 미만 또한 2 이상의 수로 분류한 카테고리인, 인지증 예측 장치.5. The method according to any one of claims 1 to 4,
The dementia prediction device, wherein the severity of the dementia is a category classified into a number less than the maximum value of the score of the mini-mental state test and 2 or more.

인지증의 중증도가 기지인 복수 인의 환자가 행한 자유회화의 내용을 각각 나타낸 복수의 문장을 학습용 데이터로서 입력하는 학습용 데이터 입력부;
상기 학습용 데이터 입력부에 의해 상기 학습용 데이터로서 입력된 상기 복수의 문장을 형태소 해석하여, 상기 복수의 문장으로부터 복수의 분해 요소를 추출하는 요소 추출부;
상기 복수의 문장을 각각 소정의 룰에 따라 q차원(q는 2 이상의 임의의 정수)으로 벡터화함으로써, q개의 축 성분으로 이루어지는 복수의 문장 벡터를 산출하는 문장 벡터 산출부;
상기 복수의 분해 요소를 각각 소정의 룰에 따라 q차원으로 벡터화함으로써, q개의 축 성분으로 이루어지는 복수의 요소 벡터를 산출하는 요소 벡터 산출부;
상기 복수의 문장 벡터와 상기 복수의 요소 벡터의 내적을 각각 계산함으로써, 상기 복수의 문장 및 상기 복수의 분해 요소 사이의 관계성을 반영한 관계성 지표값을 산출하는 지표값 산출부; 및
상기 지표값 산출부에 의해 산출된 상기 관계성 지표값을 사용하여, 1개의 문장에 대하여 복수의 관계성 지표값으로 이루어지는 문장지표값군를 바탕으로 상기 인지증의 중증도를 예측하기 위한 예측 모델을 생성하는 예측 모델 생성부;
를 포함한 예측 모델 생성 장치.a learning data input unit for inputting, as learning data, a plurality of sentences each representing the contents of free conversation performed by a plurality of patients with known severity of dementia;
an element extraction unit for morphologically analyzing the plurality of sentences input as the learning data by the learning data input unit, and extracting a plurality of decomposition elements from the plurality of sentences;
a sentence vector calculating unit for calculating a plurality of sentence vectors composed of q axis components by vectorizing the plurality of sentences in a q dimension (q is an arbitrary integer greater than or equal to 2) according to a predetermined rule;
an element vector calculating unit for calculating a plurality of element vectors composed of q axis components by vectorizing the plurality of decomposition elements in q dimensions according to a predetermined rule;
an index value calculation unit for calculating a relationship index value reflecting the relationship between the plurality of sentences and the plurality of decomposition elements by calculating a dot product of the plurality of sentence vectors and the plurality of element vectors, respectively; and
A predictive model for predicting the severity of dementia based on a sentence index value group comprising a plurality of relationship index values for one sentence is generated using the relation index value calculated by the index value calculation unit. a predictive model generation unit;
Predictive model generation device, including

제13항에 있어서,
상기 학습용 데이터 입력부는, 상기 인지증의 복수의 평가 항목마다 중증도가 기지인 복수 인의 환자가 행한 자유회화의 내용을 각각 나타낸 복수의 문장을 학습용 데이터로서 입력하고,
상기 예측 모델 생성부는, 상기 문장지표값군를 바탕으로 상기 인지증의 상기 평가 항목마다의 중증도를 예측하기 위한 예측 모델을 생성하는, 예측 모델 생성 장치.14. The method of claim 13,
The learning data input unit inputs, as learning data, a plurality of sentences each representing the content of free conversations performed by a plurality of patients with known severity for each of the plurality of evaluation items of the dementia,
The predictive model generating unit is configured to generate a predictive model for predicting the severity of each evaluation item of the cognitive impairment based on the sentence index value group.

예측 대상으로 하는 환자가 행한 자유회화의 내용을 나타낸 1 이상의 문장을 예측용 데이터로서 입력하는 예측용 데이터 입력부;
상기 예측용 데이터 입력부에 의해 상기 예측용 데이터로서 입력된 상기 1 이상의 문장을 형태소 해석하여, 상기 1 이상의 문장으로부터 복수의 분해 요소를 추출하는 제2 요소 추출부;
상기 1 이상의 문장을 소정의 룰에 따라 q차원(q는 2 이상의 임의의 정수)으로 벡터화함으로써, q개의 축 성분으로 이루어지는 1 이상의 문장 벡터를 산출하는 제2 문장 벡터 산출부;
상기 복수의 분해 요소를 각각 소정의 룰에 따라 q차원으로 벡터화함으로써, q개의 축 성분으로 이루어지는 복수의 요소 벡터를 산출하는 제2 요소 벡터 산출부;
상기 1 이상의 문장 벡터와 상기 복수의 요소 벡터의 내적을 각각 계산함으로써, 상기 1 이상의 문장 및 상기 복수의 분해 요소 사이의 관계성을 반영한 관계성 지표값을 산출하는 제2 지표값 산출부; 및
상기 제2 지표값 산출부에 의해 산출된 관계성 지표값을, 청구항 13의 예측 모델 생성 장치에 의해 생성된 예측 모델에 적용함으로써, 상기 예측 대상으로 하는 환자에 대하여 상기 인지증의 중증도를 예측하는 인지증 예측부
를 포함한 인지증 예측 장치.a prediction data input unit for inputting, as prediction data, one or more sentences indicating the contents of free conversation performed by a patient as a prediction target;
a second element extraction unit that morphologically analyzes the one or more sentences input as the prediction data by the prediction data input unit, and extracts a plurality of decomposition elements from the one or more sentences;
a second sentence vector calculating unit for calculating one or more sentence vectors composed of q axis components by vectorizing the one or more sentences in q dimensions (q is an arbitrary integer of 2 or more) according to a predetermined rule;
a second element vector calculation unit for calculating a plurality of element vectors composed of q axis components by vectorizing the plurality of decomposition elements in q dimensions according to a predetermined rule;
a second index value calculation unit calculating a relationship index value reflecting the relationship between the one or more sentences and the plurality of decomposition elements by calculating a dot product of the one or more sentence vectors and the plurality of element vectors, respectively; and
predicting the severity of the dementia for the patient to be predicted by applying the relation index value calculated by the second index value calculating unit to the predictive model generated by the predictive model generating device of claim 13 dementia prediction unit
A cognitive prediction device, including

인지증의 중증도가 기지인 복수 인의 환자가 행한 자유회화의 내용을 각각 나타낸 복수의 문장을 학습용 데이터로서 입력하는 학습용 데이터 입력 수단;
상기 학습용 데이터 입력 수단에 의해 상기 학습용 데이터로서 입력된 상기 복수의 문장을 형태소 해석하여, 상기 복수의 문장으로부터 복수의 분해 요소를 추출하는 요소 추출 수단;
상기 복수의 문장을 각각 소정의 룰에 따라 q차원(q는 2 이상의 임의의 정수)으로 벡터화함으로써, q개의 축 성분으로 이루어지는 복수의 문장 벡터를 산출하는 문장 벡터 산출 수단;
상기 복수의 분해 요소를 각각 소정의 룰에 따라 q차원으로 벡터화함으로써, q개의 축 성분으로 이루어지는 복수의 요소 벡터를 산출하는 요소 벡터 산출 수단;
상기 복수의 문장 벡터와 상기 복수의 요소 벡터의 내적을 각각 계산함으로써, 상기 복수의 문장 및 상기 복수의 분해 요소 사이의 관계성을 반영한 관계성 지표값을 산출하는 지표값 산출 수단; 및
상기 지표값 산출 수단에 의해 산출된 상기 관계성 지표값을 사용하여, 1개의 문장에 대하여 복수의 관계성 지표값으로 이루어지는 문장지표값군를 바탕으로 상기 인지증의 중증도를 예측하기 위한 예측 모델을 생성하는 예측 모델 생성 수단;
으로서 컴퓨터를 기능시키기 위한, 기억 매체에 기억된 인지증 예측용 프로그램.learning data input means for inputting, as learning data, a plurality of sentences each representing the contents of free conversations performed by a plurality of patients with known severity of dementia;
element extraction means for morphologically analyzing the plurality of sentences input as the learning data by the learning data input means, and extracting a plurality of decomposition elements from the plurality of sentences;
sentence vector calculating means for calculating a plurality of sentence vectors composed of q axis components by vectorizing the plurality of sentences in a q dimension (q is an arbitrary integer greater than or equal to 2) according to a predetermined rule;
element vector calculating means for calculating a plurality of element vectors composed of q axis components by vectorizing the plurality of decomposition elements in q dimensions according to a predetermined rule;
index value calculation means for calculating a relationship index value reflecting the relationship between the plurality of sentences and the plurality of decomposition elements by calculating the dot product of the plurality of sentence vectors and the plurality of element vectors, respectively; and
A predictive model for predicting the severity of dementia based on a sentence index value group comprising a plurality of relationship index values for one sentence is generated using the relation index value calculated by the index value calculating means. means for generating a predictive model;
A program for predicting dementia stored in a storage medium for making a computer function as a

제16항에 있어서,
예측 대상으로 하는 환자가 행한 자유회화의 내용을 나타낸 문장을 예측용 데이터로서 입력하는 예측용 데이터 입력 수단; 및
상기 예측용 데이터 입력 수단에 의해 입력된 상기 예측용 데이터에 대하여 상기 요소 추출 수단, 상기 문장 벡터 산출 수단, 상기 요소 벡터 산출 수단 및 상기 지표값 산출 수단의 처리를 실행함으로써 얻어지는 관계성 지표값을, 상기 예측 모델 생성 수단에 의해 생성된 상기 예측 모델에 적용함으로써, 상기 예측 대상으로 하는 환자에 대하여 상기 인지증의 중증도를 예측하는 인지증 예측 수단;
으로서 컴퓨터를 더욱 기능시키기 위한, 기억 매체에 기억된 인지증 예측용 프로그램.17. The method of claim 16,
prediction data input means for inputting, as prediction data, a sentence representing the content of free conversation performed by a patient as a prediction target; and
a relation index value obtained by executing processing of the element extraction means, the sentence vector calculation means, the element vector calculation means, and the index value calculation means on the prediction data input by the prediction data input means; dementia prediction means for predicting the severity of the dementia with respect to the patient to be predicted by applying to the predictive model generated by the predictive model generating means;
A program for predicting dementia stored in a storage medium to further function as a computer.

예측 대상으로 하는 환자가 행한 자유회화의 내용을 나타낸 1 이상의 문장을 예측용 데이터로서 입력하는 예측용 데이터 입력 수단;
상기 예측용 데이터 입력 수단에 의해 상기 예측용 데이터로서 입력된 상기 1 이상의 문장을 형태소 해석하여, 상기 1 이상의 문장으로부터 복수의 분해 요소를 추출하는 제2 요소 추출 수단;
상기 1 이상의 문장을 소정의 룰에 따라 q차원(q는 2 이상의 임의의 정수)으로 벡터화함으로써, q개의 축 성분으로 이루어지는 1 이상의 문장 벡터를 산출하는 제2 문장 벡터 산출 수단;
상기 복수의 분해 요소를 각각 소정의 룰에 따라 q차원으로 벡터화함으로써, q개의 축 성분으로 이루어지는 복수의 요소 벡터를 산출하는 제2 요소 벡터 산출 수단;
상기 1 이상의 문장 벡터와 상기 복수의 요소 벡터의 내적을 각각 계산함으로써, 상기 1 이상의 문장 및 상기 복수의 분해 요소 사이의 관계성을 반영한 관계성 지표값을 산출하는 제2 지표값 산출 수단; 및
상기 제2 지표값 산출 수단에 의해 산출된 관계성 지표값을, 제16항에 기재된 예측 모델 생성 수단에 의해 생성된 예측 모델에 적용함으로써, 상기 예측 대상으로 하는 환자에 대하여 상기 인지증의 중증도를 예측하는 인지증 예측 수단;
으로서 컴퓨터를 기능시키기 위한, 기억 매체에 기억된 인지증 예측용 프로그램.prediction data input means for inputting, as prediction data, one or more sentences indicating the contents of free conversation performed by a patient as a prediction target;
second element extraction means for morphologically analyzing the one or more sentences input as the prediction data by the prediction data input means, and extracting a plurality of decomposition elements from the one or more sentences;
a second sentence vector calculating means for calculating one or more sentence vectors composed of q axis components by vectorizing the one or more sentences in q dimensions (q is an arbitrary integer of 2 or more) according to a predetermined rule;
second element vector calculating means for calculating a plurality of element vectors composed of q axis components by vectorizing the plurality of decomposition elements in q dimensions according to a predetermined rule;
a second index value calculating means for calculating a relationship index value reflecting the relationship between the one or more sentences and the plurality of decomposition elements by calculating a dot product of the one or more sentence vectors and the plurality of element vectors, respectively; and
By applying the relation index value calculated by the second index value calculating means to the predictive model generated by the predictive model generating means according to claim 16, the severity of the dementia for the patient to be predicted is determined. predictive dementia prediction means;
A program for predicting dementia stored in a storage medium for making a computer function as a