KR20130067758A

KR20130067758A - Apparatus and method for detecting human by using svm learning

Info

Publication number: KR20130067758A
Application number: KR1020110134652A
Authority: KR
Inventors: 고종국; 최진우; 박소희; 유장희
Original assignee: 한국전자통신연구원
Priority date: 2011-12-14
Filing date: 2011-12-14
Publication date: 2013-06-25

Abstract

PURPOSE: Human detection apparatus using SVM(Support Vector Machine) learning and a method thereof are provided to generate a histogram for a binary characteristic pattern, obtain a characteristic vector, and then detect a person from an input image, thereby obtaining a stable pattern through a cumulative sum-based analysis in spite of noise in the vicinity when obtaining a characteristic pattern for each pixel. CONSTITUTION: A learning image characteristic extraction block(112) extracts a binary learning characteristic pattern from an input learning image(102) and generates a learning characteristic vector. An SVM learning block(114) generates a classifier through the SVM learning of the generated learning characteristic vector. An image characteristic extraction block(132) extracts a binary input characteristic pattern from an input image(104) and generates an input characteristic vector. A human detection judgment block(136) detects a person by using a pre-set search region and the classifier stored in a database(120). [Reference numerals] (112) Learning image characteristic extraction block; (114) SVM learning block; (116) SVM determination weighted value generation block; (132) Input image characteristic extraction block; (134) Search area setting block; (136) Human detection judgment block

Description

ＳＶＭ학습을 이용한 사람 검출 장치 및 그 방법{APPARATUS AND METHOD FOR DETECTING HUMAN BY USING SVM LEARNING}Person detection device using SMB learning and its method {APPARATUS AND METHOD FOR DETECTING HUMAN BY USING SVM LEARNING}

본 발명은 카메라를 통해 획득되는 입력 영상으로부터 사람을 검출하는 기법에 관한 것으로, 더욱 상세하게는 영상 감시 등의 서비스를 위해 카메라를 통해 수집되는 영상으로부터 SVM(support vector machine) 학습 알고리즘을 이용하여 사람을 검출하는데 적합한 SVM 학습을 이용한 사람 검출 장치 및 그 방법에 관한 것이다.
The present invention relates to a technique for detecting a person from an input image obtained through a camera, and more particularly, using a support vector machine (SVM) learning algorithm from an image collected through a camera for a service such as video surveillance. The present invention relates to a human detection apparatus using SVM learning and a method thereof suitable for detecting the same.

일반적으로, CCTV(Closed-circuit television)로 통칭되는 카메라를 이용한 영상 감시 시스템에서는 카메라를 통해 획득되는 영상으로부터 사람을 검출하고, 추적하는 기술들을 통해 감시 서비스를 제공하고 있다.In general, a video surveillance system using a camera commonly referred to as a closed-circuit television (CCTV) provides a surveillance service through technologies for detecting and tracking a person from an image acquired through the camera.

이러한 영상 감시 시스템에서 사람을 검출하는 데 있어서, 기존의 방법들은 특징 추출을 위해 픽셀들의 기울기에 대한 방향 정보를 이용하여 패턴을 구성하는 방법을 사용하는데, 이러한 종래 방법과 관련한 선행특허로서는 대한민국 공개특허 제2009-0099349호(공개일 : 2009년 9월 22일)가 있다.In detecting a person in such a video surveillance system, conventional methods use a method of constructing a pattern by using direction information on the slope of pixels for feature extraction. 2009-0099349 (published: September 22, 2009).

즉, 선행특허에서는 사람 검출의 후보 영역인 움직임 영역을 크기가 서로 다른 다수의 기울기 영상으로 생성하며, 생성된 기울기 영상들의 셀 히스토그램을 이용하여 특징 벡터들을 합성하고 특징 벡터들을 기반으로 사람 영역을 인식하는 기술사상을 개시하고 있다.
That is, the prior patent generates a plurality of gradient images having different sizes of motion regions, which are candidate regions for human detection, synthesizes feature vectors using cell histograms of the generated gradient images, and recognizes human regions based on the feature vectors. The technical idea is disclosed.

대한민국 공개특허 제2009-0099349호(공개일 : 2009. 09. 22.)Republic of Korea Patent Publication No. 2009-0099349 (Published: 2009. 09. 22.)

그러나, 픽셀들의 기울기에 대한 방향 정보를 이용하여 패턴을 구성하는 전술한 종래의 선행특허는 방향에 특징 패턴의 종류가 많지 않아 다양한 특징 패턴을 묘사하지 못하는 근본적인 문제점을 가지며, 이러한 문제는 결국 사람 검출의 정확도를 저하시키는 요인으로 작용하고 있다.However, the above-mentioned prior art, which constitutes a pattern by using the direction information on the slope of the pixels, has a fundamental problem of not describing various feature patterns because there are not many kinds of feature patterns in the direction. It is acting as a factor to lower the accuracy of the.

따라서, 카메라로부터 획득한 영상으로부터 보다 정확하게 사람을 검출하기 위해서는 보다 더 다양한 특징 패턴을 묘사할 수 있는 새로운 기법의 출현이 절실한 실정이나, 현재로서는 이러한 새로운 기법에 대한 어떠한 제안도 제시도 없는 실정이다.
Therefore, in order to detect a person more accurately from an image obtained from a camera, the emergence of a new technique capable of describing more diverse feature patterns is urgently needed, but there is no suggestion for such a new technique.

본 발명은, 일 관점에 따라, 입력되는 학습 영상으로부터 학습 이진 특징 패턴을 추출하고, 상기 학습 이진 특징 패턴들에 대해 히스토그램을 생성한 후, 생성된 히스토그램을 기반으로 학습 특징 벡터를 생성하는 학습 영상 특징 추출 블록과, 생성된 상기 학습 특징 벡터를 SVM(support vector machine) 학습하여 분류기를 생성하는 SVM 학습 블록과, 생성된 상기 분류기에 대한 SVM 판별 가중치를 생성하여 데이터베이스에 저장하는 SVM 판별 가중치 생성 블록과, 입력 영상으로부터 입력 이진 특징 패턴을 추출하고, 상기 입력 이진 특징 패턴들에 대해 히스토그램을 생성한 후, 생성된 히스토그램을 기반으로 입력 특징 벡터를 생성하는 입력 영상 특징 추출 블록과, 상기 입력 특징 벡터가 생성된 전체의 입력 영상에 대한 탐색 영역을 설정하는 탐색 영역 설정 블록과, 설정된 탐색 영역과 상기 데이터베이스에 저장된 분류기를 이용하여 사람을 검출하는 사람 검출 판별 블록을 포함하는 SVM 학습을 이용한 사람 검출 장치를 제공한다.According to an aspect of the present invention, a learning image extracts a learning binary feature pattern from an input learning image, generates a histogram with respect to the learning binary feature patterns, and then generates a learning feature vector based on the generated histogram. A SVM learning block for generating a classifier by learning the feature extraction block, the SVM (support vector machine) of the generated learning feature vector, and an SVM discrimination weight generation block for generating and storing SVM discrimination weights for the generated classifiers in a database. An input image feature extraction block for extracting an input binary feature pattern from an input image, generating a histogram with respect to the input binary feature patterns, and generating an input feature vector based on the generated histogram, and the input feature vector Area setting block to set the search area for the entire input image in which And, it provides a human detection apparatus using the SVM learning, including man detection determination block for detecting a person using a classifier stored in the search area and the database is set.

본 발명은, 다른 관점에 따라, 입력되는 학습 영상으로부터 학습 이진 특징 패턴을 추출하는 과정과, 추출된 상기 학습 이진 특징 패턴들에 대한 히스토그램을 생성하고, 이 생성된 히스토그램을 기반으로 학습 특징 벡터를 생성하는 과정과, 생성된 상기 학습 특징 벡터를 SVM(support vector machine) 학습하여 분류기를 생성하는 과정과, 생성된 상기 분류기에 대한 SVM 판별 가중치를 생성하여 데이터베이스에 저장하는 과정과, 입력 영상으로부터 입력 이진 특징 패턴을 추출하는 과정과, 추출된 상기 입력 이진 특징 패턴들에 대해 히스토그램을 생성하고, 이 생성된 히스토그램을 기반으로 입력 특징 벡터를 생성하는 과정과, 상기 입력 특징 벡터가 생성된 전체의 입력 영상에 대한 탐색 영역을 설정하고, 설정된 탐색 영역과 상기 데이터베이스에 저장된 분류기를 이용하여 사람을 검출하는 과정을 포함하는 SVM 학습을 이용한 사람 검출 방법을 제공한다.
According to another aspect of the present invention, a process of extracting a learning binary feature pattern from an input learning image, generating a histogram of the extracted learning binary feature patterns, and generating a learning feature vector based on the generated histogram Generating, classifying the generated learning feature vector by a support vector machine (SVM), generating a classifier, generating and storing an SVM discrimination weight for the generated classifier, and storing it in a database; Extracting a binary feature pattern, generating a histogram with respect to the extracted input binary feature patterns, generating an input feature vector based on the generated histogram, and inputting the entire input feature vector. Set the search area for the image, and set the search area and the minutes stored in the database. Group provides a human detection method using the SVM learning comprises the step of detecting a person using.

본 발명은, 각 픽셀의 특징 패턴을 구할 때 누적합 기반의 변화 분석을 통해 이진 형태의 특징 패턴을 구하고, 이들에 대해 히스토그램을 생성하여 특징 벡터(학습 특징 벡터 및 입력 특징 벡터)를 구하는 방식으로 입력 영상으로부터 사람을 검출함으로써, 각 픽셀별로 특징 패턴을 구할 때 주변의 노이즈가 있더라도 누적합 기반의 분석을 통해 안정적인 패턴을 구할 수 있고, 기울기의 방향을 구하는 것에 비해 상대적으로 더 다양하고 차별성 있는 형태의 특징 패턴을 구할 수가 있으며, 이를 통해 상대적으로 차별성 있는 특징 패턴을 가지고 사람 검출을 수행하기 때문에 고정확한 사람 검출을 실현할 수 있다.
The present invention obtains a feature pattern (learning feature vector and input feature vector) by obtaining a feature pattern in binary form through cumulative sum-based change analysis when obtaining a feature pattern of each pixel, and generating a histogram for them. By detecting a person from the input image, a stable pattern can be obtained through cumulative sum-based analysis even if there is noise in the surroundings when the feature pattern for each pixel is obtained. It is possible to obtain a feature pattern of, and thus it is possible to realize a highly accurate person detection because it performs a person detection with a relatively distinct feature pattern.

도 1은 본 발명에 따른 SVM 학습을 이용한 사람 검출 장치의 블록구성도,
도 2는 본 발명에 따라 하나의 픽셀에 대한 특징 패턴을 구하기 위해 사용되는 이웃하는 8개의 픽셀 값들을 보여주는 예시도,
도 3은 본 발명에 따라 하나의 픽셀에 대한 누적합을 구하는 예를 설명하기 위해 도시한 예시도,
도 4는 본 발명에 따라 SVM 학습을 이용하여 입력 영상으로부터 사람을 검출하는 주요 과정을 도시한 순서도.1 is a block diagram of an apparatus for detecting a person using SVM learning according to the present invention;
2 is an exemplary diagram showing eight neighboring pixel values used to obtain a feature pattern for one pixel in accordance with the present invention;
3 is an exemplary diagram for explaining an example of obtaining a cumulative sum for one pixel according to the present invention;
4 is a flow chart illustrating the main process of detecting a person from an input image using SVM learning in accordance with the present invention.

본 발명의 기술요지는, 픽셀들의 기울기에 대한 방향 정보를 이용하여 패턴을 구성하는 전술한 종래 방식과는 달리, 누적합 기반의 변화 분석을 통해 학습 영상으로부터 추출한 학습 이진 특징 패턴들에 대한 히스토그램을 생성한 후, 히스토그램을 기반으로 학습 특징 벡터를 생성하고, 생성된 학습 특징 벡터를 SVM(support vector machine) 학습하여 분류기를 생성하며, 생성된 분류기에 대한 SVM 판별 가중치를 생성하여 데이터베이스에 저장하고, 입력 영상으로부터 추출한 입력 이진 특징 패턴들에 대해 히스토그램을 생성한 후, 히스토그램을 기반으로 입력 특징 벡터를 생성하며, 입력 특징 벡터가 생성된 전체의 입력 영상에 대한 탐색 영역을 설정하고, 설정된 탐색 영역과 데이터베이스에 저장된 분류기를 이용하여 사람을 검출한다는 것으로, 본 발명은 이러한 기술적 수단을 통해 목적으로 하는 바를 실현함으로써 종래 방식에서의 문제점을 효과적으로 개선할 수 있다.The technical spirit of the present invention is different from the above-described conventional method of constructing a pattern by using direction information on the slope of pixels, and a histogram of learning binary feature patterns extracted from a learning image through change-based analysis based on cumulative sum. After generating, generate a learning feature vector based on the histogram, generate a classifier by learning the generated learning feature vector (SVM), generate the SVM discrimination weight for the generated classifier, and store it in the database, After generating a histogram of the input binary feature patterns extracted from the input image, an input feature vector is generated based on the histogram, a search region for the entire input image in which the input feature vector is generated, and the set search region and The present invention detects a person using a classifier stored in a database. It is possible to effectively solve the problems in the conventional manner by realizing the object through these technical means.

아울러, 아래의 본 발명을 설명함에 있어서 공지 기능 또는 구성 등에 대한 구체적인 설명이 본 발명의 요지를 불필요하게 흐릴 수 있다고 판단되는 경우에는 그 상세한 설명을 생략할 것이다. 그리고, 후술되는 용어들은 본 발명에서의 기능을 고려하여 정의된 용어들인 것으로, 이는 사용자, 운용자 등의 의도 또는 관례 등에 따라 달라질 수 있음은 물론이다. 그러므로, 그 정의는 본 명세서의 전반에 걸쳐 기술되는 기술사상을 토대로 이루어져야 할 것이다.In the following description of the present invention, detailed description of known functions and configurations incorporated herein will be omitted when it may make the subject matter of the present invention rather unclear. In addition, terms to be described below are terms defined in consideration of functions in the present invention, which may be changed according to intention or custom of a user, an operator, or the like. Therefore, the definition should be based on the technical idea described throughout this specification.

이하, 첨부된 도면을 참조하여 본 발명의 바람직한 실시 예에 대하여 상세하게 설명한다.Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings.

도 1은 본 발명에 따른 SVM 학습을 이용한 사람 검출 장치의 블록구성도로서, 크게 구분해 볼 때, 영상 학습 모듈(110), SVM 데이터베이스(120) 및 사람 판별 모듈(130) 등을 포함할 수 있으며, 영상 학습 모듈(110)은 학습 영상 특징 추출 블록(112), SVM 학습 블록(114) 및 SVM 판별 가중치 생성 블록(116) 등을 포함할 수 있고, 사람 판별 모듈(130)은 입력 영상 특징 추출 블록(132), 탐색 영역 설정 블록(134) 및 사람 검출 판별 블록(136) 등을 포함한 수 있다.FIG. 1 is a block diagram of an apparatus for detecting a person using SVM learning according to the present invention. When divided into two, the image learning module 110, the SVM database 120, and the person determining module 130 may be included. The image learning module 110 may include a learning image feature extraction block 112, an SVM learning block 114, an SVM determination weight generation block 116, and the like. The person determination module 130 may include an input image feature. The extraction block 132, the search region setting block 134, the person detection determination block 136, and the like may be included.

도 1을 참조하면, 영상 학습 모듈(110) 내 학습 영상 특징 추출 블록(112)은 입력되는 학습 영상(102)으로부터 학습 이진 특징 패턴을 추출하고, 추출된 학습 이진 특징 패턴들에 대해 히스토그램을 생성하며, 생성된 히스토그램을 기반으로 학습 특징 벡터를 생성하는 등의 기능을 제공할 수 있다. 여기에서, 학습 영상(102)은, 예컨대 128 48 픽셀 사이즈의 영상일 수 있다.Referring to FIG. 1, the learning image feature extraction block 112 in the image learning module 110 extracts a learning binary feature pattern from an input learning image 102 and generates a histogram with respect to the extracted learning binary feature patterns. A function of generating a learning feature vector based on the generated histogram may be provided. Here, the training image 102 may be, for example, an image having a size of 128 48 pixels.

즉, 학습 영상 특징 추출 블록(112)은 각 픽셀별로 누적합 기반의 변화 분석을 통해 학습 이진 특징 패턴을 생성하는데, 일예로서 도 2에 도시된 바와 같이, 하나의 픽셀(x_i, y_j)의 이웃하는 8개의 픽셀{(x_i _-1, y_j _-1), (x_i, y_j _-1), (x_i ₊₁, y_j _-1), (x_i-1, y_j), (x_i ₊₁, y_j), (x_i _-1, y_j ₊₁), (x_i, y_j ₊₁), (x_i ₊₁, y_j ₊₁)} 값들을 이용하여 한 픽셀(x_i, y_j)에 대한 학습 이진 특징 패턴을 구한다. 보다 구체적으로, 이웃하는 8개의 픽셀 값들의 누적합을 구하고, 각 누적합의 최대값과 최소값을 구한 후, 최소값이 앞에 있고 최대값이 뒤에 오면 이를 증가로 판단하여 최소값과 최대값의 사이 값들을“1”로 설정하고 나머지 값들을“0”으로 설정하여 학습 이진 특징 패턴을 생성하며, 이와는 반대로 최대값이 앞에 있고 최소값이 뒤에 오면 이를 감소로 판단하여 최대값과 최소값의 사이 값들을“0”으로 설정하고 나머지 값들을“1”로 설정하여 학습 이진 특징 패턴을 생성할 수 있다. 여기에서, 누적합 기반의 변화 분석이라 함은 이웃하는 주변 8개의 픽셀의 누적합들을 계산했을 때 이들 값이 상대적으로 크거나 작거나 하는 변화를 분석하는 것을 의미한다.That is, the training image feature extraction block 112 generates a learning binary feature pattern through change analysis based on a cumulative sum for each pixel. As an example, as illustrated in FIG. 2, one pixel (x _i , y _j ) is used. Neighboring 8 pixels of ((x _i _-1 , y _j _-1) , (x _i , y _j _-1 ), (x _i ₊₁ , y _j _-1 ), (x _i-1 , y _j ) , (x _i ₊₁ , y _j ), (x _i _-1 , y _j ₊₁ ), (x _i , y _j ₊₁ ), (x _i ₊₁ , y _j ₊₁ )} Obtain the learning binary feature pattern for pixel (x _i , y _j ). More specifically, the cumulative sum of the neighboring eight pixel values is obtained, and the maximum and minimum values of each cumulative sum are obtained. 1 ”and the rest of the values to“ 0 ”to generate a learning binary feature pattern. On the contrary, if the maximum value is at the front and the minimum value is at the back, it is determined to be reduced and the values between the maximum and minimum values are set to“ 0 ”. Set up and set the rest of the values to "1" to generate a learning binary feature pattern. Here, the cumulative sum-based change analysis means analyzing changes in which these values are relatively large or small when the cumulative sums of neighboring eight pixels are calculated.

일예로서, 이웃하는 주변 8개의 각 픽셀(s0, s1, s2, s3, s4, s5, s6, s7)의 누적합들의 값이 10, 50, 78, 20, -40, -15, -8, -2 라고 가정할 때, 최대값이 78이고, 최소값이 40이며, 최대값이 앞에 있으므로, 최대값과 최소값 사이는 “0”으로 설정하고, 나머지는 “1”로 설정하게 되어, 패턴은 <11000111>이 된다.For example, the cumulative sums of the neighboring neighboring eight pixels s0, s1, s2, s3, s4, s5, s6, and s7 are 10, 50, 78, 20, -40, -15, -8, Assuming -2, the maximum value is 78, the minimum value is 40, and the maximum value is in front, so the maximum value and minimum value are set to “0” and the rest to “1”, and the pattern is < 11000111>.

또한, 학습 영상 특징 추출 블록(112)은 추출된 학습 이진 특징 패턴들에 대해 픽셀의 기울기의 크기 값을 구해진 학습 이진 특징 패턴의 크기 값으로 설정하여 히스토그램을 생성하고, 이와 같이 생성된 히스토그램을, 일예로서 도 3에 도시된 바와 같이, 16 16 사이즈의 블록에 대해 블록 히스토그램을 누적(즉, 각 패턴의 빈도 값을 누적)하며, 이렇게 누적된 패턴들을 모아서 전체 이미지에 대한 하나의 학습 특징 벡터를 구성한다.In addition, the training image feature extraction block 112 generates a histogram by setting the magnitude value of the gradient of the pixel with respect to the extracted learning binary feature patterns to the size value of the obtained learning binary feature pattern, and generates the histogram as described above. As an example, as shown in FIG. 3, a block histogram is accumulated (ie, a frequency value of each pattern) for 16 16 blocks, and the accumulated patterns are collected to form one learning feature vector for the entire image. Configure.

따라서, 16 16 영역의 크기로 오른쪽 및 아래쪽으로 8픽셀씩 이동하여 학습 특징 벡터를 추출하게 되므로, 학습 영상이 128 48 크기라고 가정할 때, 가로로 5개 블록과 세로로 15개 블록이 생성되며, 이 블록들에 대한 특징값을 하나로 합쳐서 하나의 전체 특징 벡터를 생성, 즉 128 48 크기의 사람 이미지 또는 비사람 이미지들에 대해 각각의 학습 특징 벡터를 생성하며, 이와 같이 생성되는 학습 특징 벡터는 SVM 학습 블록(114)으로 전달된다.Therefore, since the learning feature vector is extracted by moving 8 pixels to the right and the bottom by the size of 16 16 areas, assuming that the training image is 128 48 size, 5 blocks horizontally and 15 blocks vertically are generated. We combine the feature values for these blocks into one to generate one full feature vector, i.e. each learning feature vector for 128 or 48 human or non-human images. Passed to SVM learning block 114.

다음에, SVM 학습 블록(114)은 SVM(support vector machine) 학습기(또는 SVM 학습 알고리즘)를 이용하여 생성된 학습 특징 벡터를 학습함으로써 다차원 공간에 있는 사람의 특징 벡터점들과 비사람의 특징 벡터점들을 분류하는 분류기를 생성하여 SVM 판별 가중치 생성 블록(116)으로 전달하는 등의 기능을 제공할 수 있다.Next, the SVM learning block 114 learns the feature vector points of a person and the non-human feature vector in a multidimensional space by learning a learning feature vector generated using a support vector machine (SVM) learner (or SVM learning algorithm). A classifier that classifies the points may be generated and transferred to the SVM determination weight generation block 116.

여기에서, SVM은 두 클래스, 즉 사람과 비사람을 분류하는 하이퍼-플랜(hyper-plane)을 설계하는 것으로, 최적 하이퍼-플랜과 거리가 가장 가까운 서포트 벡터를 이용하여 최적 하이퍼-플랜을 나타내는 방법이며, 입력 데이터가 선형 분리가 가능하지 않은 경우에는 커널 함수를 이용하여 최적의 하이퍼-플랜을 얻을 수 있다. 이러한 SVM은 다차원 공간상에서 두 클래스(사람, 비사람)를 분류할 때 가장 대표적인 특성을 갖는 소수의 벡터들을 이용하여 두 클래스간의 거리를 최대화하면서 같은 클래스에 속하는 데이터들을 같은 쪽에 위치시켜 다차원의 최적 하이퍼-플랜을 찾아가면서 학습을 수행한다.Here, SVM designs a hyper-plane that classifies two classes, namely, people and non-people, and shows the optimal hyper-plan using a support vector closest to the optimal hyper-plan. If the input data cannot be linearly separated, an optimal hyper-plan can be obtained by using a kernel function. When classifying two classes (man, non-person) in multi-dimensional space, SVM maximizes the distance between two classes by using a few vectors with the most representative characteristics and places the data belonging to the same class on the same side. -Go to the plan and do the learning.

그리고, SVM 판별 가중치 생성 블록(116)은 SVM 학습 블록(114)에서의 학습을 통해 생성되는 분류기(다차원 공간에 있는 사람의 특징 벡터점들과 비사람의 특징 벡터점들을 분류하는 분류기)에 대한 SVM 판별 가중치, 즉 하나의 학습 특징 벡터에 있는 각 엘리먼트들과 결합되는 SVM 판별 가중치를 생성하여 SVM DB(120)에 저장하는 등의 기능을 제공할 수 있다. 여기에서, SVM 판별 가중치는 선형 판별 함수의 가중치 값으로 정의될 수 있다.In addition, the SVM determination weight generation block 116 is a classifier (classifier for classifying human feature vector points and non-human feature vector points in a multidimensional space) generated through learning in the SVM learning block 114. SVM determination weights, that is, SVM determination weights that are combined with each element in one learning feature vector, may be generated and stored in the SVM DB 120. Here, the SVM determination weight may be defined as a weight value of the linear determination function.

다음에, 사람 판별 모듈(130) 내 입력 영상 특징 추출 블록(132)은 카메라 등으로부터 전달되는 입력 영상(104)으로부터 입력 이진 특징 패턴을 추출하고, 추출된 입력 이진 특징 패턴들에 대해 히스토그램을 생성하며, 생성된 히스토그램을 기반으로 입력 특징 벡터를 생성하는 등의 기능을 제공할 수 있는데, 이러한 기능들은 학습 영상 특징 추출 블록(112)에서의 기능들과 실질적으로 동일하다. 따라서, 명세서의 간결화를 위한 불필요한 중복 기재를 피하기 위하여 여기에서는 입력 영상 특징 추출 블록의 기능에 대한 상세 설명을 생략한다. 여기에서, 입력 영상(104)은, 예컨대 640 480 사이즈의 영상일 수 있다.Next, the input image feature extraction block 132 in the person discrimination module 130 extracts an input binary feature pattern from the input image 104 transmitted from a camera or the like, and generates a histogram with respect to the extracted input binary feature patterns. In addition, a function of generating an input feature vector based on the generated histogram may be provided, and these functions are substantially the same as those of the learning image feature extraction block 112. Therefore, in order to avoid unnecessary duplication of description for the sake of brevity of the specification, detailed description of the function of the input image feature extraction block is omitted here. Here, the input image 104 may be, for example, an image of 640 480 size.

다음에, 탐색 영역 설정 블록(134)은 입력 특징 벡터가 생성된 전체의 입력 영상에 대한 탐색 영역을 설정, 즉 <0, 0>을 시작점으로 하여 영상의 오른쪽으로 그리고 왼쪽으로, 예컨대 128 48 크기의 부분을 탐색 영역으로 설정할 수 있는데, 실제의 사람 크기가 128 48 보다 작거나 혹은 클 수 있으므로, 128 48 탐색 사이즈를 다양하게 스캐일링(scaling)하여 설정할 수 있다.Next, the search region setting block 134 sets a search region for the entire input image in which the input feature vector is generated, i.e., to the right and left of the image, eg, 128 48 size with <0, 0> as the starting point. The portion of may be set as a search area, and since the actual person size may be smaller or larger than 128 48, the size of the 128 48 search may be variously scaled.

마지막으로, 사람 검출 판별 블록(136)은 설정된 탐색 영역에 기반하여 SVM DB(120)에 저장된 분류기를 이용하여 사람을 검출, 예컨대 SVM의 하이퍼-플랜(결정면)에 대해“0”보다 큰 값이 나오면 사람으로,“0”보다 작은 값으로 나오면 비사람으로 판별하는 방식으로 사람 검출을 판별하는 등의 기능을 제공할 수 있다. 즉, 입력 특징 벡터들을 SVM 판별 가중치들로 구성된 분류기에 넣어서 나온 결과값을 보고 사람과 비사람의 검출을 판별한다. 도 1의 미설명번호 106은 사람 검출 영역을 포함하는 출력 영상을 나타낸다.Finally, the person detection determination block 136 detects a person using a classifier stored in the SVM DB 120 based on the set search area, for example, a value greater than "0" for the hyper-plan of the SVM. When it comes out, it is possible to provide a function of discriminating the person detection in such a way that it is discriminated as a person and coming out as a value less than "0". That is, the detection of the person and the non-person is determined by looking at the result value obtained by inserting the input feature vectors into the classifier composed of the SVM discrimination weights. Reference numeral 106 of FIG. 1 denotes an output image including a human detection area.

다음에, 상술한 바와 같은 구성을 갖는 사람 검출 장치를 이용하여 SVM 학습을 통해 얻어진 분류기에 의거하여 입력 영상으로부터 사람을 검출하는 일련의 과정들에 대하여 설명한다.Next, a series of processes for detecting a person from an input image based on a classifier obtained through SVM learning using the person detection device having the above-described configuration will be described.

도 4는 본 발명에 따라 SVM 학습을 이용하여 입력 영상으로부터 사람을 검출하는 주요 과정을 도시한 순서도이다.4 is a flowchart illustrating a main process of detecting a person from an input image using SVM learning according to the present invention.

도 4를 참조하면, 학습 영상, 예컨대 128 48의 학습 영상(102)이 들어오면, 학습 영상 특징 추출 블록(112)에서는 입력되는 학습 영상(102)으로부터 학습 이진 특징 패턴을 추출, 즉 이웃하는 8개의 픽셀 값들의 누적합을 구하고, 각 누적합의 최대값과 최소값을 구한 후, 최소값이 앞에 있고 최대값이 뒤에 오면 이를 증가로 판단하여 최소값과 최대값의 사이 값들을“1”로 설정하고 나머지 값들을“0”으로 설정하거나 혹은 최대값이 앞에 있고 최소값이 뒤에 오면 이를 감소로 판단하여 최대값과 최소값의 사이 값들을“0”으로 설정하고 나머지 값들을“1”로 설정하는 방식으로 학습 이진 특징 패턴을 추출한다(단계 402).Referring to FIG. 4, when a learning image, for example, a learning image 102 of 128 48 is input, the learning image feature extraction block 112 extracts a learning binary feature pattern from an input learning image 102, that is, neighboring 8. The cumulative sum of the two pixel values is obtained, and the maximum and minimum values of each cumulative sum are calculated, and when the minimum value is present and the maximum value is followed, it is determined to be increased, and the values between the minimum value and the maximum value are set to “1” and the remaining values Learning binary feature by setting them to "0" or if the maximum value is in front and the minimum value is followed by a reduction, and the values between the maximum and minimum values are set to "0" and the remaining values are set to "1". The pattern is extracted (step 402).

다음에, 추출된 학습 이진 특징 패턴들에 대해 히스토그램을 생성한 후, 생성된 히스토그램을 기반으로 학습 특징 벡터를 생성, 즉 추출된 학습 이진 특징 패턴들에 대해 픽셀의 기울기의 크기 값을 구해진 학습 이진 특징 패턴의 크기 값으로 설정하여 히스토그램을 생성하며, 16 16 사이즈의 블록에 대해 블록 히스토그램을 누적(즉, 각 패턴의 빈도 값을 누적)한 패턴들을 모아서 전체 이미지에 대한 하나의 학습 특징 벡터를 생성한다(단계 404).Next, a histogram is generated for the extracted learning binary feature patterns, and then a learning feature vector is generated based on the generated histogram, that is, a learning binary obtained by obtaining a magnitude value of the slope of the pixel with respect to the extracted learning binary feature patterns. A histogram is generated by setting the size of the feature pattern, and a learning feature vector for the entire image is generated by collecting patterns obtained by accumulating block histograms (that is, accumulating frequency values of each pattern) for 16 16 size blocks. (Step 404).

이어서, SVM 학습 블록(114)에서는 SVM 학습 알고리즘을 이용하여 학습 특징 벡터를 학습함으로써 다차원 공간에 있는 사람의 특징 벡터점들과 비사람의 특징 벡터점들을 분류하는 분류기를 생성하며(단계 406), SVM 판별 가중치 생성 블록(116)에서는 SVM 학습을 통해 생성된 분류기, 즉 다차원 공간에 있는 사람의 특징 벡터점들과 비사람의 특징 벡터점들을 분류하는 분류기에 대한 SVM 판별 가중치, 즉 하나의 학습 특징 벡터에 있는 각 엘리먼트들과 결합되는 SVM 판별 가중치를 생성하여 SVM DB(120)에 저장한다(단계 408).Subsequently, the SVM learning block 114 generates a classifier that classifies the feature vector points of the person and the non-person feature vector points in the multidimensional space by learning the learning feature vectors using the SVM learning algorithm (step 406). In the SVM discriminant weight generation block 116, the SVM discriminant weights, i.e., one learning feature, for the classifier generated through SVM learning, that is, the classifier classifying the feature vector points of the person and the non-person feature vector points in the multidimensional space. An SVM discrimination weight associated with each element in the vector is generated and stored in SVM DB 120 (step 408).

다음에, 사람 검출을 위한 입력 영상이 도시 생략된 카메라 등으로부터 제공되면, 입력 영상 특징 추출 블록(132)에서는 상술한 단계(402)에서와 동일한 방법으로 입력 영상에 대한 입력 이진 특징 패턴들을 추출하고(단계 410), 상술한 단계(404)에서와 동일한 방법으로 입력 영상(104)에서 추출한 입력 이진 특징 패턴의 히스토그램에 기반하여 입력 특징 벡터를 생성한다(단계 412).Next, if an input image for detecting a person is provided from a camera or the like not shown, the input image feature extraction block 132 extracts the input binary feature patterns for the input image in the same manner as in step 402 described above. In operation 410, an input feature vector is generated based on the histogram of the input binary feature pattern extracted from the input image 104 in the same manner as in operation 404 (operation 412).

그리고, 탐색 영역 설정 블록(134)에서는 입력 특징 벡터가 생성된 전체의 입력 영상에 대한 탐색 영역을 설정, 즉 <0, 0>을 시작점으로 하여 영상의 오른쪽으로 그리고 왼쪽으로 탐색 영역을 설정하며(단계 414), 사람 검출 판별 블록(136)에서는 설정된 탐색 영역에 기반하여 SVM DB(120)에 저장된 분류기를 이용하여 사람을 검출, 예컨대 SVM의 하이퍼-플랜(결정면)에 대해“0”보다 큰 값이 나오면 사람 검출로 판별하고,“0”보다 작은 값으로 나오면 비사람으로 판별하는 방식으로 사람 검출을 판별한다(단계 416).In the search region setting block 134, the search region for the entire input image in which the input feature vector is generated is set, that is, the search region is set to the right and left of the image with <0, 0> as a starting point; In step 414, the person detection determination block 136 detects a person using a classifier stored in the SVM DB 120 based on the set search area, for example, a value greater than "0" for the hyper-plan of the SVM. Is detected, the person is detected. If the value is smaller than "0", the person is discriminated by non-person (step 416).

이상의 설명에서는 본 발명의 바람직한 실시 예들을 제시하여 설명하였으나 본 발명이 반드시 이에 한정되는 것은 아니며, 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자라면 본 발명의 기술적 사상을 벗어나지 않는 범위 내에서 여러 가지 치환, 변형 및 변경이 가능함을 쉽게 알 수 있을 것이다.
In the above description, the present invention has been described with reference to preferred embodiments, but the present invention is not necessarily limited thereto, and a person having ordinary skill in the art to which the present invention pertains can make various changes without departing from the technical spirit of the present invention. It will be readily appreciated that branch substitutions, modifications and variations are possible.

110 : 영상 학습 모듈 112 : 학습 영상 특징 추출 블록
114 : SVM 학습 블록 116 : SVM 판별 가중치 생성 블록
120 : SVM DB 130 : 사람 판별 모듈
132 : 입력 영상 특징 추출 블록 134 : 탐색 영역 설정 블록
136 : 사람 검출 판별 블록110: image learning module 112: learning image feature extraction block
114: SVM Learning Block 116: SVM Discrimination Weight Generation Block
120: SVM DB 130: person determination module
132: input image feature extraction block 134: search area setting block
136: person detection determination block

Claims

입력되는 학습 영상으로부터 학습 이진 특징 패턴을 추출하고, 상기 학습 이진 특징 패턴들에 대해 히스토그램을 생성한 후, 생성된 히스토그램을 기반으로 학습 특징 벡터를 생성하는 학습 영상 특징 추출 블록과,
생성된 상기 학습 특징 벡터를 SVM(support vector machine) 학습하여 분류기를 생성하는 SVM 학습 블록과,
생성된 상기 분류기에 대한 SVM 판별 가중치를 생성하여 데이터베이스에 저장하는 SVM 판별 가중치 생성 블록과,
입력 영상으로부터 입력 이진 특징 패턴을 추출하고, 상기 입력 이진 특징 패턴들에 대해 히스토그램을 생성한 후, 생성된 히스토그램을 기반으로 입력 특징 벡터를 생성하는 입력 영상 특징 추출 블록과,
상기 입력 특징 벡터가 생성된 전체의 입력 영상에 대한 탐색 영역을 설정하는 탐색 영역 설정 블록과,
설정된 탐색 영역과 상기 데이터베이스에 저장된 분류기를 이용하여 사람을 검출하는 사람 검출 판별 블록
을 포함하는 SVM 학습을 이용한 사람 검출 장치.
A learning image feature extraction block for extracting a learning binary feature pattern from an input learning image, generating a histogram with respect to the learning binary feature patterns, and generating a learning feature vector based on the generated histogram;
An SVM learning block for generating a classifier by learning the SVM (support vector machine),
An SVM determination weight generation block for generating an SVM determination weight for the generated classifier and storing it in a database;
An input image feature extraction block for extracting an input binary feature pattern from an input image, generating a histogram with respect to the input binary feature patterns, and generating an input feature vector based on the generated histogram;
A search region setting block configured to set a search region for the entire input image in which the input feature vector is generated;
A person detection determination block that detects a person using a set search area and a classifier stored in the database.
Person detection apparatus using SVM learning comprising a.

제 1 항에 있어서,
상기 학습 영상 특징 추출 블록은,
각 픽셀별로 누적합 기반의 변화 분석을 통해 상기 학습 이진 특징 패턴을 생성하는
SVM 학습을 이용한 사람 검출 장치.
The method of claim 1,
The learning video feature extraction block,
Generating the learning binary feature pattern through change analysis based on cumulative sum for each pixel
Person detection device using SVM learning.

제 2 항에 있어서,
상기 학습 영상 특징 추출 블록은,
한 픽셀의 이웃하는 8개의 픽셀 값들을 이용하여 상기 한 픽셀에 대한 상기 학습 이진 특징 패턴을 구하는
SVM 학습을 이용한 사람 검출 장치.
3. The method of claim 2,
The learning video feature extraction block,
Obtaining the learning binary feature pattern for the one pixel using neighboring eight pixel values of one pixel
Person detection device using SVM learning.

제 3 항에 있어서,
상기 학습 영상 특징 추출 블록은,
상기 이웃하는 8개의 픽셀 값들의 누적합을 구하고, 각 누적합의 최대값과 최소값 사이가 증가일 때 그 사이 값들을“1”로 설정하고 나머지 값들을“0”으로 설정하여 상기 학습 이진 특징 패턴을 생성하며, 상기 각 누적합의 최대값과 최소값 사이가 감소일 때 그 사이 값들을“0”으로 설정하고 나머지 값들을“1”로 설정하여 상기 학습 이진 특징 패턴을 생성하는
SVM 학습을 이용한 사람 검출 장치.
The method of claim 3, wherein
The learning video feature extraction block,
The learning binary feature pattern is obtained by calculating a cumulative sum of the neighboring eight pixel values, and setting the values between the values of "1" and the remaining values of "0" when the maximum value and the minimum value of each cumulative sum increase. And generating the learning binary feature pattern by setting values between the maximum and minimum values of each cumulative sum to "0" and setting the remaining values to "1".
Person detection device using SVM learning.

제 1 항에 있어서,
상기 학습 영상 특징 추출 블록은,
픽셀의 기울기의 크기 값을 구해진 상기 학습 이진 특징 패턴의 크기 값으로 설정하여 상기 히스토그램을 생성하는
SVM 학습을 이용한 사람 검출 장치.
The method of claim 1,
The learning video feature extraction block,
Generating the histogram by setting the magnitude value of the gradient of the pixel to the magnitude value of the learned binary feature pattern
Person detection device using SVM learning.

제 1 항에 있어서,
상기 SVM 학습 블록은,
SVM 학습기를 통해 다차원 공간에 있는 사람의 특징 벡터점들과 비사람의 특징 벡터점들을 분류하는 상기 분류기를 생성하는
SVM 학습을 이용한 사람 검출 장치.
The method of claim 1,
The SVM learning block,
An SVM learner generates the classifier that classifies the feature vector points of a person and the non-person feature vector points in a multidimensional space.
Person detection device using SVM learning.

제 1 항에 있어서,
상기 SVM 판별 가중치 생성 블록은,
하나의 학습 특징 벡터에 있는 각 엘리먼트들과 결합되는 상기 SVM 판별 가중치를 생성하는
SVM 학습을 이용한 사람 검출 장치.
The method of claim 1,
The SVM determination weight generation block,
Generating the SVM discriminant weight associated with each element in one learning feature vector
Person detection device using SVM learning.

제 1 항에 있어서,
상기 입력 영상 특징 추출 블록은,
각 픽셀별로 누적합 기반의 변화 분석을 통해 상기 입력 이진 특징 패턴을 생성하는
SVM 학습을 이용한 사람 검출 장치.
The method of claim 1,
The input image feature extraction block,
Generating the input binary feature pattern through change analysis based on a cumulative sum for each pixel
Person detection device using SVM learning.

제 8 항에 있어서,
상기 입력 영상 특징 추출 블록은,
한 픽셀의 이웃하는 8개의 픽셀 값들을 이용하여 상기 한 픽셀에 대한 상기 입력 이진 특징 패턴을 구하는
SVM 학습을 이용한 사람 검출 장치.
The method of claim 8,
The input image feature extraction block,
Obtaining the input binary feature pattern for the one pixel using neighboring eight pixel values of one pixel
Person detection device using SVM learning.

제 9 항에 있어서,
상기 입력 영상 특징 추출 블록은,
상기 이웃하는 8개의 픽셀 값들의 누적합을 구하고, 각 누적합의 최대값과 최소값 사이가 증가일 때 그 사이 값들을“1”로 설정하고 나머지 값들을“0”으로 설정하여 상기 입력 이진 특징 패턴을 생성하며, 상기 각 누적합의 최대값과 최소값 사이가 감소일 때 그 사이 값들을“0”으로 설정하고 나머지 값들을“1”로 설정하여 상기 입력 이진 특징 패턴을 생성하는
SVM 학습을 이용한 사람 검출 장치.
The method of claim 9,
The input image feature extraction block,
The cumulative sum of the neighboring eight pixel values is obtained, and when the difference between the maximum value and the minimum value of each cumulative sum is increased, the values between them are set to “1” and the remaining values are set to “0” to set the input binary feature pattern. Generating the input binary feature pattern by setting the values between the values "0" and the remaining values to "1" when the difference between the maximum value and the minimum value of each cumulative sum decreases.
Person detection device using SVM learning.

제 1 항에 있어서,
상기 입력 영상 특징 추출 블록은,
픽셀의 기울기의 크기 값을 구해진 상기 입력 이진 특징 패턴의 크기 값으로 설정하여 상기 히스토그램을 생성하는
SVM 학습을 이용한 사람 검출 장치.
The method of claim 1,
The input image feature extraction block,
Generating the histogram by setting the magnitude value of the gradient of the pixel to the magnitude value of the input binary feature pattern obtained
Person detection device using SVM learning.

입력되는 학습 영상으로부터 학습 이진 특징 패턴을 추출하는 과정과,
추출된 상기 학습 이진 특징 패턴들에 대한 히스토그램을 생성하고, 이 생성된 히스토그램을 기반으로 학습 특징 벡터를 생성하는 과정과,
생성된 상기 학습 특징 벡터를 SVM(support vector machine) 학습하여 분류기를 생성하는 과정과,
생성된 상기 분류기에 대한 SVM 판별 가중치를 생성하여 데이터베이스에 저장하는 과정과,
입력 영상으로부터 입력 이진 특징 패턴을 추출하는 과정과,
추출된 상기 입력 이진 특징 패턴들에 대해 히스토그램을 생성하고, 이 생성된 히스토그램을 기반으로 입력 특징 벡터를 생성하는 과정과,
상기 입력 특징 벡터가 생성된 전체의 입력 영상에 대한 탐색 영역을 설정하고, 설정된 탐색 영역과 상기 데이터베이스에 저장된 분류기를 이용하여 사람을 검출하는 과정
을 포함하는 SVM 학습을 이용한 사람 검출 방법.
Extracting a learning binary feature pattern from an input learning image;
Generating a histogram of the extracted learning binary feature patterns, and generating a learning feature vector based on the generated histogram;
Generating a classifier by learning the generated SNC (support vector machine);
Generating a SVM determination weight for the generated classifier and storing the weight in the database;
Extracting an input binary feature pattern from an input image;
Generating a histogram with respect to the extracted input binary feature patterns, and generating an input feature vector based on the generated histogram;
Setting a search area for the entire input image in which the input feature vector is generated, and detecting a person using the set search area and a classifier stored in the database
Person detection method using SVM learning comprising a.

제 12 항에 있어서,
상기 학습 이진 특징 패턴의 추출은,
각 픽셀별로 누적합 기반의 변화 분석을 통해 수행되는
SVM 학습을 이용한 사람 검출 방법.
13. The method of claim 12,
Extraction of the learning binary feature pattern,
Through change analysis based on cumulative sum for each pixel
Person detection method using SVM learning.

제 13 항에 있어서,
상기 학습 이진 특징 패턴은,
한 픽셀의 이웃하는 8개의 픽셀 값들을 이용하여 추출되는
SVM 학습을 이용한 사람 검출 방법.
The method of claim 13,
The learning binary feature pattern,
Extracted using neighboring eight pixel values of one pixel
Person detection method using SVM learning.

제 14 항에 있어서,
상기 학습 이진 특징 패턴은,
상기 이웃하는 8개의 픽셀 값들의 각 누적합의 최대 값과 최소 값 사이가 증가시에 그 사이 값들이 “1”로 설정되고, 감소시에 그 사이 값들이“0”으로 설정되는
SVM 학습을 이용한 사람 검출 방법.
15. The method of claim 14,
The learning binary feature pattern,
When the value between the maximum value and the minimum value of each cumulative sum of the neighboring eight pixel values increases, the values therebetween are set to "1", and the values between them are set to "0".
Person detection method using SVM learning.

제 12 항에 있어서,
상기 분류기는,
SVM 학습기를 통해 다차원 공간에 있는 사람의 특징 벡터점들과 비사람의 특징 벡터점들을 분류하는
SVM 학습을 이용한 사람 검출 방법.
13. The method of claim 12,
The classifier,
SVM learner classifies the feature vector points of human and non-human feature vector points in multidimensional space.
Person detection method using SVM learning.

제 12 항에 있어서,
상기 입력 이진 특징 패턴의 추출은,
각 픽셀별로 누적합 기반의 변화 분석을 통해 수행되는
SVM 학습을 이용한 사람 검출 방법.
13. The method of claim 12,
Extraction of the input binary feature pattern,
Through change analysis based on cumulative sum for each pixel
Person detection method using SVM learning.

제 17 항에 있어서,
상기 입력 이진 특징 패턴은,
한 픽셀의 이웃하는 8개의 픽셀 값들을 이용하여 추출되는
SVM 학습을 이용한 사람 검출 방법.
The method of claim 17,
The input binary feature pattern,
Extracted using neighboring eight pixel values of one pixel
Person detection method using SVM learning.

제 18 항에 있어서,
상기 입력 이진 특징 패턴은,
상기 이웃하는 8개의 픽셀 값들의 각 누적합의 최대값과 최소값 사이가 증가일 때 그 사이 값들이“1”로 설정되고 나머지 값들을“0”으로 설정되며, 상기 각 누적합의 최대값과 최소값 사이가 감소일 때 그 사이 값들이“0”으로 설정되고 나머지 값들이“1”로 설정되는
SVM 학습을 이용한 사람 검출 방법.
The method of claim 18,
The input binary feature pattern,
When the maximum value and the minimum value of each cumulative sum of the neighboring eight pixel values are increased, the values therebetween are set to "1" and the remaining values are set to "0". When decrementing, the values in between are set to "0" and the remaining values are set to "1".
Person detection method using SVM learning.

제 12 항에 있어서,
상기 입력 이진 특징 벡터에 대한 히스토그램은,
픽셀의 기울기의 크기 값을 구해진 상기 입력 이진 특징 패턴의 크기 값으로 설정하여 생성되는
SVM 학습을 이용한 사람 검출 방법.13. The method of claim 12,
The histogram for the input binary feature vector is
Generated by setting the magnitude value of the gradient of the pixel to the magnitude value of the input binary feature pattern obtained
Person detection method using SVM learning.