WO2004051573A2 - Method and apparatus for removing false edges from a segmented image - Google Patents

Method and apparatus for removing false edges from a segmented image Download PDF

Info

Publication number
WO2004051573A2
WO2004051573A2 PCT/IB2003/005677 IB0305677W WO2004051573A2 WO 2004051573 A2 WO2004051573 A2 WO 2004051573A2 IB 0305677 W IB0305677 W IB 0305677W WO 2004051573 A2 WO2004051573 A2 WO 2004051573A2
Authority
WO
WIPO (PCT)
Prior art keywords
pixel
segmentation
set forth
images
image
Prior art date
Application number
PCT/IB2003/005677
Other languages
French (fr)
Other versions
WO2004051573A3 (en
Inventor
Christiaan Varekamp
Original Assignee
Koninklijke Philips Electronics N.V.
U.S. Philips Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips Electronics N.V., U.S. Philips Corporation filed Critical Koninklijke Philips Electronics N.V.
Priority to AU2003283706A priority Critical patent/AU2003283706A1/en
Priority to US10/537,209 priority patent/US20060104535A1/en
Priority to EP03775687A priority patent/EP1570429A2/en
Priority to JP2004556701A priority patent/JP2006509292A/en
Publication of WO2004051573A2 publication Critical patent/WO2004051573A2/en
Publication of WO2004051573A3 publication Critical patent/WO2004051573A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/12Edge-based segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/97Determining parameters from multiple pictures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10016Video; Image sequence

Definitions

  • the present invention relates generally to the art of image and video processing. It particularly relates to region-based segmentation and filtering of images and video and will be described with particular reference thereto.
  • Video sequences are used to estimate the time-varying, three-dimensional (3D) structure of objects from the observed motion field.
  • Applications that benefit from a time-varying 3D reconstruction include vision-based control (robotics) , security systems, and the conversion of traditional monoscopic video (2D) for viewing on a stereoscopic (3D) television.
  • structure from motion methods are used to derive a depth map from two consecutive images in the video sequence.
  • Image segmentation is an important first step that often precedes other tasks such as segment based depth estimation.
  • image segmentation is the process of partitioning an image into a set of non-overlapping parts, or segments, that together correspond as much as possible to the physical objects that are present in the scene.
  • There are various ways of approaching the task of image segmentation including histogram-based segmentation, traditional edge-based segmentation, region-based segmentation, and hybrid segmentation.
  • one of the problems with any segmentation method is that false edges may occur in a segmented image. These false edges may occur for a number of reasons, including that the pixel color at the boundary between two objects may vary smoothly instead of abruptly, resulting in a thin elongated segment with two corresponding false edges instead of a single true edge.
  • the problem tends to occur at defocused object boundaries or in video material that has a reduced spatial resolution in one or more of the three color channels.
  • the problem of false edges is particularly troublesome with the conversion of traditional 2D video to 3D video for viewing on a 3D television
  • U.S. Patent No. 5,268,967 discloses a digital image processing method which automatically segments the desired regions in a digital radiographic image from the undesired regions. The method includes the steps of edge detection, block generation, block classification, block refinement and bit map generation.
  • U.S. Patent No. 5,025,478 discloses a method and apparatus for processing a picture signal for transmission in which the picture signal is applied to a segmentation device, which identifies regions of similar intensity.
  • the resulting region signal is applied to a modal filter in which region edges are straightened and then sent to an adaptive contour smoothing circuit where contour sections that are identified as false edges are smoothed.
  • the filtered signal is subtracted from the original luminance signal to produce a luminance texture signal which is encoded.
  • the region signal is encoded together with flags indicating which of the contours in the region signal represent false edges.
  • an imaging process apparatus is provided.
  • a segmenting means is provided for segmenting an image into a segmentation map including a plurality of pixel groups separated by edges including at least some false edges.
  • a filtering means is provided for filtering the segmentation map to remove the false edges, the filtering means outputting the filtered segmentation next to the segmentation means for presegmentation.
  • a method for processing one or more images is provided.
  • An image is segmented into a segmentation map including a plurality of pixel groups separated by edges including at least some false edges.
  • the segmentation map is filtered to remove the false edges.
  • the segmentation step is repeated to generate an output image.
  • One advantage of the present invention resides in improving the segmentation quality for the conversion of 2D video material to 3D video.
  • Another advantage of the present invention resides in improving video image segmentation quality at object edges.
  • Yet another advantage of the present invention resides in decreasing edge coding cost for image and video compression.
  • FIGURE 1 shows an image segmentation method with a false edge removal filter between segmentation steps .
  • FIGURE 2(a) shows an example of an input image.
  • FIGURE 2 (b) shows an example of an initial segmentation map with square regions of 5x5 pixels.
  • FIGURE 2 (c) shows an example of an output segmentation map with false edges.
  • FIGURE 2 (d) shows an example of a filtered segmentation map with false edges removed.
  • FIGURE 3 shows an exemplary false edge removal filtering method.
  • FIGURE 4 shows an example of a 5x5 pixel window, centered at pixel location ( i ,j ) .
  • An important step in converting 2D video to 3D video is the identification of image regions with homogeneous color, i.e., image segmentation. Depth discontinuities are assumed to coincide with the detected edges of homogeneous color regions. A single depth value is estimated for each color region. This depth estimation per region has the advantage that there exists per definition a large color contrast along the region boundary. The temporal stability of color edge positions is critical for the final quality of the depth maps. When the edges are not stable over time, an annoying flicker may be perceived by the viewer when the video is shown on a 3D color television.
  • a time-stable segmentation method is the first step in the conversion process from 2D to 3D video. Region-based image segmentation using a constant color model achieves this desired effect. This method of image segmentation is described in greater detail below.
  • the constant color model assumes that the time-varying image of an object region can be described in sufficient detail by the mean region color.
  • An image is represented by a vector-valued function of image coordinates:
  • segmenta tion I a region partition referred to as segmenta tion I consisting of a fixed number of regions N.
  • the optimal segmentation is defined as the segmentation that
  • Equations for a simple and efficient update of the error criterion when one sample is moved from one cluster to another cluster are derived by Richard 0. Duda, Peter E. Hart, and David G. Stork in "Pattern Classification,” pp. 548-549, John Wiley and Sons, Inc., New York, 2001. These derivations were applied in deriving the equations of the segmentation method.
  • the regularization term is based on a measure presented by C. Oliver and S. Quegan in "Understanding Synthetic Aperture Radar Images,” Artech- House, 1998.
  • the regularization term limits the influence that random signal fluctuations (such as sensor noise) have on the edge positions.
  • the error e(x, y) at pixel position (x,y) depends on the color value l(x,y) and on the region label l(x,y) :
  • m c is the mean color for region c and l(x,y) is the region label at position (x,y) in the region label map.
  • the subscript at the double vertical bars denotes the Euclidian norm.
  • the regularization term f(x,y) depends on the shape of regions:
  • the segmentation is initialized with a square tessellation. Given the initial segmentation, a change is made at a region boundary by assigning a boundary pixel to an adjoining region.
  • n ⁇ and n B are the number of pixels inside regions A and B respectively.
  • the proposed label change causes a corresponding change in the error function given by
  • the proposed label change from A to B at pixel (x, y) also changes the global regularization function f.
  • the proposed move affects f not only at (x, y) , but also at the 8-connected neighbor pixel positions of X, y) .
  • the change in regularization function is given by the sum
  • the proposed label change improves the fit criterion if ⁇ e+/c ⁇ / ⁇ 0.
  • regions are merged.
  • the above procedure for updating the segmentation map and accepting the proposed update when it improves the fit of model to data is done for each image in the sequence separately. Only after the merge step are the region mean values updated with a new image that is read from the video stream. The region fitting and merging starts again for the new image.
  • a region-based segmentation operation 30 takes as its inputs a color image 10 and an initial segmentation map 20.
  • the output of the segmentation operation 30 is a segmentation map 40, which shows the objects found in the image.
  • An example of the input color image 10 is illustrated in FIGURE 2(a).
  • An image is of a series of ovals decreasing in size as well as a series of rectangles decreasing in size.
  • the image is segmented into square regions of 5x5 pixels in the exemplary embodiment shown in FIGURE 2 (b) .
  • An example of the output segmentation map 40 is illustrated in FIGURE 2 (c) .
  • FIGURE 2(c) The result of applying the filter 50 to the image data as shown in FIGURE 2(c) is shown in FIGURE 2(d).
  • Image segmentation applications require a small number of regions with high edge accuracy. For example, accurate edges are a requirement for the accurate conversion of 2D monoscopic video to 3D steroscopic video.
  • segmentation is used for depth estimation and a single depth value is assigned to each region in the segmented image. The edge position and its temporal stability are then important for the perceptual quality of the 3D video.
  • the preferred embodiment includes the color image 10, the initial segmentation map 20, the segmentation step 30, the first output segmentation map 40, the false edge removal filter step 50, a filtered segmentation map 60, a second segmentation step 70, and a second output segmentation map 80.
  • the filter 50 operates on the segmentation map 40 and is thus independent of the color image 10.
  • the operation of the false edge removal filter 50 is described as follows. In a step 100, each pixel (i, ) of the output segmentation map 40 is labeled with a region number (or segment label) , depending on its color. The value assigned to each region number k is an arbitrary integer.
  • a histogram of the segment labels is computed inside a square window w.
  • the histogram is represented by the vector [ h k ] , l ⁇ k ⁇ n (ii),
  • h k is the frequency of region number k inside the window w
  • n is the total number of regions in the segmentation.
  • the frequency of occurrence for each region number is determined.
  • the most frequently occurring region number is determined.
  • a tiebreaker 160 is used, such as assigning the smallest of the equally frequent region numbers to the output segmentation or assigning the largest region number to the output segmentation.
  • FIGURE 4 is an illustration of an exemplary 5x5 pixel window 100, centered at pixel location ⁇ i ,j ) .
  • window sizes such as a 3x3 pixel window
  • FIGURE 4 is an illustration of an exemplary 5x5 pixel window 100, centered at pixel location ⁇ i ,j ) .
  • other window sizes such as a 3x3 pixel window
  • the filter operation gives as an output the number 3. This result can be verified by counting the frequency for each region number in the input window :
  • region numbers 3 and 4 both have a frequency of 7.
  • the false edge removal filter step 50 is repeated until all of the pixels ( i ,j ) in the segmentation map 40 have been analyzed.
  • region segmentation methods may be used so long as the method is able to iteratively fit (or update) the region boundaries given an initial segmentation.
  • the false edge removal filter 50 not only removes small and elongated regions, but can also distort region boundaries.
  • the distortion is corrected by running the segmentation operation 70 again after having applied the filter operation.
  • the filtered and segmented image map is loaded into the filtered segmentation map or memory space 60.
  • a second segmentation process 70 is performed to re-segment the map 60 to generation output map 80. Potentially, the filtering and segmenting steps are repeated one or more times .

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Image Processing (AREA)
  • Image Analysis (AREA)
  • Apparatus For Radiation Diagnosis (AREA)
  • Facsimile Image Signal Circuits (AREA)

Abstract

In a method for processing one or more images, an image is segmented into a segmentation map including a plurality of pixel groups separated by edges, including at least some false edges. The segmentation map is filtered to remove the false edges. The segmentation step is repeated to generate an output segmentation map.

Description

METHOD AND APPARATUS FOR REMOVING FALSE EDGES FROM A SEGMENTED IMAGE
The present invention relates generally to the art of image and video processing. It particularly relates to region-based segmentation and filtering of images and video and will be described with particular reference thereto. Video sequences are used to estimate the time-varying, three-dimensional (3D) structure of objects from the observed motion field. Applications that benefit from a time-varying 3D reconstruction include vision-based control (robotics) , security systems, and the conversion of traditional monoscopic video (2D) for viewing on a stereoscopic (3D) television. In this technology, structure from motion methods are used to derive a depth map from two consecutive images in the video sequence.
Image segmentation is an important first step that often precedes other tasks such as segment based depth estimation. Generally, image segmentation is the process of partitioning an image into a set of non-overlapping parts, or segments, that together correspond as much as possible to the physical objects that are present in the scene. There are various ways of approaching the task of image segmentation, including histogram-based segmentation, traditional edge-based segmentation, region-based segmentation, and hybrid segmentation. However, one of the problems with any segmentation method is that false edges may occur in a segmented image. These false edges may occur for a number of reasons, including that the pixel color at the boundary between two objects may vary smoothly instead of abruptly, resulting in a thin elongated segment with two corresponding false edges instead of a single true edge. The problem tends to occur at defocused object boundaries or in video material that has a reduced spatial resolution in one or more of the three color channels. The problem of false edges is particularly troublesome with the conversion of traditional 2D video to 3D video for viewing on a 3D television.
Several methods have been proposed to detect false edges in other applications. For example, U.S. Patent No. 5,268,967 discloses a digital image processing method which automatically segments the desired regions in a digital radiographic image from the undesired regions. The method includes the steps of edge detection, block generation, block classification, block refinement and bit map generation.
U.S. Patent No. 5,025,478 discloses a method and apparatus for processing a picture signal for transmission in which the picture signal is applied to a segmentation device, which identifies regions of similar intensity. The resulting region signal is applied to a modal filter in which region edges are straightened and then sent to an adaptive contour smoothing circuit where contour sections that are identified as false edges are smoothed. The filtered signal is subtracted from the original luminance signal to produce a luminance texture signal which is encoded. The region signal is encoded together with flags indicating which of the contours in the region signal represent false edges.
Published PCT application WO 00/77735 discloses an image seg enter that uses a progressive flood fill to fill incompletely bounded segments and scale transformations and guiding segmentation at one scale with segmentation results from another scale, detects edges using a composite image that is a composite of multiple color planes, generates edge chains using multiple classes of edge pixels, generates edge chains using the scale transformations, and filters false edges at one scale based on edges detected at another scale. However, the prior art only involves edge detection and/or smoothing of the false edges. None of the inventions actually remove the false edges from the segmented image, such as through the use of a filter that operates only on the segmentation map. The present invention contemplates an improved apparatus and method that overcomes the aforementioned limitations and others.
According to one aspect of the invention, an imaging process apparatus is provided. A segmenting means is provided for segmenting an image into a segmentation map including a plurality of pixel groups separated by edges including at least some false edges.
A filtering means is provided for filtering the segmentation map to remove the false edges, the filtering means outputting the filtered segmentation next to the segmentation means for presegmentation.
According to another aspect of the invention, a method for processing one or more images is provided. An image is segmented into a segmentation map including a plurality of pixel groups separated by edges including at least some false edges. The segmentation map is filtered to remove the false edges. The segmentation step is repeated to generate an output image.
One advantage of the present invention resides in improving the segmentation quality for the conversion of 2D video material to 3D video.
Another advantage of the present invention resides in improving video image segmentation quality at object edges.
Yet another advantage of the present invention resides in decreasing edge coding cost for image and video compression.
Numerous additional advantages and benefits of the present invention will become apparent to those of ordinary skill in the art upon reading the following detailed description of the preferred embodiment .
The invention may take form in various components and arrangements of components, and in various steps and arrangements of steps. The drawings are only for the purpose of illustrating preferred embodiments and are not to be considered as limiting the invention.
FIGURE 1 shows an image segmentation method with a false edge removal filter between segmentation steps .
FIGURE 2(a) shows an example of an input image. FIGURE 2 (b) shows an example of an initial segmentation map with square regions of 5x5 pixels.
FIGURE 2 (c) shows an example of an output segmentation map with false edges.
FIGURE 2 (d) shows an example of a filtered segmentation map with false edges removed.
FIGURE 3 shows an exemplary false edge removal filtering method. FIGURE 4 shows an example of a 5x5 pixel window, centered at pixel location ( i ,j ) .
An important step in converting 2D video to 3D video is the identification of image regions with homogeneous color, i.e., image segmentation. Depth discontinuities are assumed to coincide with the detected edges of homogeneous color regions. A single depth value is estimated for each color region. This depth estimation per region has the advantage that there exists per definition a large color contrast along the region boundary. The temporal stability of color edge positions is critical for the final quality of the depth maps. When the edges are not stable over time, an annoying flicker may be perceived by the viewer when the video is shown on a 3D color television. Thus, a time-stable segmentation method is the first step in the conversion process from 2D to 3D video. Region-based image segmentation using a constant color model achieves this desired effect. This method of image segmentation is described in greater detail below. The constant color model assumes that the time-varying image of an object region can be described in sufficient detail by the mean region color. An image is represented by a vector-valued function of image coordinates:
l(χ, y) = {r{x, y), g{χ,
Figure imgf000005_0001
b(x, y)) { 1 ) ,
where r(x,y) , g(x,y) and b x, y) are the red, green and blue color channel. The object is to find a region partition referred to as segmenta tion I consisting of a fixed number of regions N. The optimal segmentation is defined as the segmentation that
minimizes the sum of an error term plus a regularization term f (xry) over all pixels in the image:
Figure imgf000006_0001
where k is a regularization parameter that weights the importance of the regularization term. Equations for a simple and efficient update of the error criterion when one sample is moved from one cluster to another cluster are derived by Richard 0. Duda, Peter E. Hart, and David G. Stork in "Pattern Classification," pp. 548-549, John Wiley and Sons, Inc., New York, 2001. These derivations were applied in deriving the equations of the segmentation method. Note that the regularization term is based on a measure presented by C. Oliver and S. Quegan in "Understanding Synthetic Aperture Radar Images," Artech- House, 1998. The regularization term limits the influence that random signal fluctuations (such as sensor noise) have on the edge positions. The error e(x, y) at pixel position (x,y) depends on the color value l(x,y) and on the region label l(x,y) :
Figure imgf000006_0002
where mc is the mean color for region c and l(x,y) is the region label at position (x,y) in the region label map. The subscript at the double vertical bars denotes the Euclidian norm. The regularization term f(x,y) depends on the shape of regions:
Figure imgf000006_0003
where (x',y') are coordinates from the 8-connected neighbor pixels of (x,y) . The value of Z(A, B) depends on whether region labels A and B differ: if A ≠ B xx-{o 0 otherwise
Function f(x,y) has a straightforward interpretation. For a given pixel position(x, >), the function simply returns the number of
8-connected neighbor pixels that have a different region label.
The segmentation is initialized with a square tessellation. Given the initial segmentation, a change is made at a region boundary by assigning a boundary pixel to an adjoining region.
Suppose that a pixel with coordinates (x,y) currently in region with labels is tentatively moved to region with label B . Then the change in mean color for region A is:
Figure imgf000007_0001
and the change in mean color for region B is:
_l(x, )-m
Δm B
B nB + l (7)' where nΛ and nB are the number of pixels inside regions A and B respectively. The proposed label change causes a corresponding change in the error function given by
Figure imgf000007_0002
The proposed label change from A to B at pixel (x, y) also changes the global regularization function f. The proposed move affects f not only at (x, y) , but also at the 8-connected neighbor pixel positions of X, y) . The change in regularization function is given by the sum
Figure imgf000008_0001
where the summation is over all 8-connected neighbor positions denoted by . This simple form for the change Δ/ follows from
Figure imgf000008_0002
the fact that χ is symmetric:
χ(A,B) = χ{B,A). (10)
The proposed label change improves the fit criterion if Δe+/cΔ/<0.
Finally, regions are merged. The above procedure for updating the segmentation map and accepting the proposed update when it improves the fit of model to data is done for each image in the sequence separately. Only after the merge step are the region mean values updated with a new image that is read from the video stream. The region fitting and merging starts again for the new image.
With reference to FIGURE 1, a region-based segmentation operation 30, preferably based upon the constant color model, takes as its inputs a color image 10 and an initial segmentation map 20. The output of the segmentation operation 30 is a segmentation map 40, which shows the objects found in the image. An example of the input color image 10 is illustrated in FIGURE 2(a). There, an image is of a series of ovals decreasing in size as well as a series of rectangles decreasing in size. The image is segmented into square regions of 5x5 pixels in the exemplary embodiment shown in FIGURE 2 (b) . An example of the output segmentation map 40 is illustrated in FIGURE 2 (c) .
The false edges that may occur in a segmented image are best seen in FIGURE 2(c). These false edges can occur because of defocus at the boundary between two objects. False edges can also occur because many films have a reduced spacial resolution of the color channels.
Furthermore, color undersampling causes problems for segmentation algorithms. While a segmentation algorithm tries to detect edges with high accuracy, a spatial undersampling of the signal generally occurs and results in small and elongated regions near object boundaries. This unwanted effect is best illustrated in FIGURE 2(c) . Multiple edges, which are coded in white, are visible near object boundaries. These small and elongated regions are removed by adding a false edge removal filter step 50 between segmentation steps. The result of applying the filter 50 to the image data as shown in FIGURE 2(c) is shown in FIGURE 2(d).
Image segmentation applications require a small number of regions with high edge accuracy. For example, accurate edges are a requirement for the accurate conversion of 2D monoscopic video to 3D steroscopic video. For such an application, segmentation is used for depth estimation and a single depth value is assigned to each region in the segmented image. The edge position and its temporal stability are then important for the perceptual quality of the 3D video.
A solution to the problem of false edges is the addition of the false edge removal filter step 50 between segmentation operations. With reference to FIGURE 1, the preferred embodiment includes the color image 10, the initial segmentation map 20, the segmentation step 30, the first output segmentation map 40, the false edge removal filter step 50, a filtered segmentation map 60, a second segmentation step 70, and a second output segmentation map 80. The filter 50 operates on the segmentation map 40 and is thus independent of the color image 10. With reference to FIGURE 3, the operation of the false edge removal filter 50 is described as follows. In a step 100, each pixel (i, ) of the output segmentation map 40 is labeled with a region number (or segment label) , depending on its color. The value assigned to each region number k is an arbitrary integer. In a step 110, for each pixel ( i ,j ) a histogram of the segment labels is computed inside a square window w. The histogram is represented by the vector [ hk] , l<k<n (ii),
where hk is the frequency of region number k inside the window w, and n is the total number of regions in the segmentation. In a step 120, the frequency of occurrence for each region number is determined. In a step 130, the most frequently occurring region number is determined. In a step 140, a determination is made whether the histogram has a single maximum value. If so, in a step 150 the filtered segmentation map at pixel (i, ) is given by the region number Jmax for which the maximum occurs as follows:
kmax = arg max ( [ hk] ) (12) .
However, it may be the case that two or more region numbers have the same frequency and that this frequency is higher than the frequency of all other numbers inside the window w. In that situation, a tiebreaker 160 is used, such as assigning the smallest of the equally frequent region numbers to the output segmentation or assigning the largest region number to the output segmentation.
FIGURE 4 is an illustration of an exemplary 5x5 pixel window 100, centered at pixel location { i ,j ) . However, in the alternative, other window sizes, such as a 3x3 pixel window, are also contemplated. On the left-hand side of the filter operation is the window 100 with the input region numbers . Pixel locations containing an asterisk (*) lie outside the image plane. That is, the illustrated example is of the edge of the picture. Region numbers at these pixel locations are ignored when constructing the histogram. The filter operation gives as an output the number 3. This result can be verified by counting the frequency for each region number in the input window :
[ hk] = ( h h2 , h3 , hi . . . l hπ) = (6,0,7,7, ...,0) (13).
In this example, there is more than one global maximum value in the histogram. That is, region numbers 3 and 4 both have a frequency of 7. The smaller region number (k=3 ) is selected by the tiebreaker as the answer and assigned to the output segmentation at pixel location ( i ,j ) . However, in the alternative, the larger region number (J=4) could have also been selected and assigned to the output segmentation at pixel location (l,j ) . The false edge removal filter step 50 is repeated until all of the pixels ( i ,j ) in the segmentation map 40 have been analyzed.
Any number of region segmentation methods may be used so long as the method is able to iteratively fit (or update) the region boundaries given an initial segmentation. The false edge removal filter 50 not only removes small and elongated regions, but can also distort region boundaries. Thus, the distortion is corrected by running the segmentation operation 70 again after having applied the filter operation. The filtered and segmented image map is loaded into the filtered segmentation map or memory space 60. A second segmentation process 70 is performed to re-segment the map 60 to generation output map 80. Potentially, the filtering and segmenting steps are repeated one or more times . Applications for the false edge removal filter include improving the segmentation quality for the conversion of existing 2D video material to 3D video; improving video image quality at object edges (edge sharpening algorithms) ; and decreasing edge coding cost for image and video compression. The invention has been described with reference to the preferred embodiments. Obviously, modifications and alterations will occur to others upon reading and understanding the preceding detailed description. It is intended that the invention be construed as including all such modifications and alterations insofar as they come within the scope of the appended claims or the equivalents thereof.

Claims

Having thus described the preferred embodiments, the invention is now claimed to be:
1. An image processing apparatus comprising: a first segmentation means (30) for segmenting one or more images (10)' into an output segmentation map (40) including a plurality of pixel groups separated by edges including at least some false edges; a filtering means (50) for filtering the segmentation map (40) to remove the false edges, the filtering means (50) outputting the filtered segmentation (60) next to a second segmentation means (70) for re-segmentation.
2. The image processing apparatus as set forth in claim 1, wherein the first and second segmentation means (30, 70) use a constant color model, the constant color model including an identification means for identifying image regions with homogeneous color or grey scale.
3. The image processing apparatus as set forth in claim 1, wherein the pixel groups are initially rectangular shaped regions.
4. The image processing apparatus as set forth in claim 1, wherein the filtering means includes: a computing means (110) for computing a histogram (200) of the pixel labels inside a window surrounding a given pixel in the segmentation map; and a first determining means (120) for determining a frequency of occurrence for each pixel label in the window.
5. The image processing apparatus as set forth in claim 4, wherein the filtering means further includes: a second determining means (130) for determining a most frequently occurring pixel label in the histogram; an assigning means (150) for assigning to the given pixel in the output segmentation map (40) the pixel label which occurs most frequently.
6. The image processing apparatus as set forth in claim 5, further including a tie breaking means (160) for selecting one of:. a larger of equally, most frequently occurring labels, and a smaller of equal, most frequently occurring labels, to be assigned to the given pixel when two or more labels occur equally and most frequently.
7. The imaging processing apparatus as set forth in claim 5, further including a tie breaking means (160) for selecting the pixel label to be assigned to the given pixel where two or more pixel labels have the same frequency and the frequency is higher than the frequency of all other pixel labels inside the histogram.
8. The image processing apparatus as set forth in claim 4, wherein the window (110) is a square of 5x5 pixels.
9. The image processing apparatus as set forth in claim 1, wherein the one or more images (10) include frames of a two- dimensional video.
10. A method for processing one or more images, the method including: segmenting an image into a segmentation map including a plurality of pixel groups separated by edges including at least some false edges; filtering the segmentation map to remove the false edges; and repeating the segmenting step to generate an output image .
11. The method for processing one or more images as set forth in claim 10, further including repeating the region segmenting step and the filtering step a plurality of times to further refine the edges .
12. The method for processing one or more images as set forth in claim 10, wherein the segmenting of the image is region-based.
13. The method for processing one or more images as set forth in claim 12, wherein the region-based segmenting step uses a constant color model, the constant color model including the identification of image regions with homogeneous color.
14. The method for processing one or more images as set forth in claim 10, wherein the pixel groups are square regions of 5x5 pixels .
15. The method for processing one or more images as set forth in claim 10, wherein the filtering step includes: computing a histogram of the pixel labels inside a window for a given output pixel in the segmentation map; and determining the frequency of occurrence for each pixel label in the window.
16. The method for processing one or more images as set forth in claim 15, wherein the filtering further includes: determining a most frequently occurring label of the histogram; assigning to the output pixel the pixel label with the maximum occurrence .
17. The method for processing one or more images as set forth in claim 16, further including when more than one label occurs with equal most frequency assigning the given pixel one of: the smallest of the equally frequent labels, and the largest of the equally frequent labels.
18. The method for processing one or more images as set forth in claim 10, wherein the one or more images include frames of a two- dimensional video.
PCT/IB2003/005677 2002-12-05 2003-12-04 Method and apparatus for removing false edges from a segmented image WO2004051573A2 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
AU2003283706A AU2003283706A1 (en) 2002-12-05 2003-12-04 Method and apparatus for removing false edges from a segmented image
US10/537,209 US20060104535A1 (en) 2002-12-05 2003-12-04 Method and apparatus for removing false edges from a segmented image
EP03775687A EP1570429A2 (en) 2002-12-05 2003-12-04 Method and apparatus for removing false edges from a segmented image
JP2004556701A JP2006509292A (en) 2002-12-05 2003-12-04 Apparatus and method for removing false edges from a segmented image

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US43117102P 2002-12-05 2002-12-05
US60/431,171 2002-12-05

Publications (2)

Publication Number Publication Date
WO2004051573A2 true WO2004051573A2 (en) 2004-06-17
WO2004051573A3 WO2004051573A3 (en) 2005-03-17

Family

ID=32469598

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2003/005677 WO2004051573A2 (en) 2002-12-05 2003-12-04 Method and apparatus for removing false edges from a segmented image

Country Status (7)

Country Link
US (1) US20060104535A1 (en)
EP (1) EP1570429A2 (en)
JP (1) JP2006509292A (en)
KR (1) KR20050085355A (en)
CN (1) CN1720550A (en)
AU (1) AU2003283706A1 (en)
WO (1) WO2004051573A2 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007109632A2 (en) * 2006-03-17 2007-09-27 Qualcomm Incorporated Systems, methods, and apparatus for exposure control
EP1931150A1 (en) * 2006-12-04 2008-06-11 Koninklijke Philips Electronics N.V. Image processing system for processing combined image data and depth data
US8064705B2 (en) 2007-04-12 2011-11-22 Canon Kabushiki Kaisha Image processing apparatus and control method thereof
US8515172B2 (en) 2007-12-20 2013-08-20 Koninklijke Philips N.V. Segmentation of image data

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7840067B2 (en) * 2003-10-24 2010-11-23 Arcsoft, Inc. Color matching and color correction for images forming a panoramic image
US7551795B2 (en) * 2005-11-01 2009-06-23 Samsung Electronics Co., Ltd. Method and system for quantization artifact removal using super precision
US8090210B2 (en) 2006-03-30 2012-01-03 Samsung Electronics Co., Ltd. Recursive 3D super precision method for smoothly changing area
US8503796B2 (en) * 2006-12-29 2013-08-06 Ncr Corporation Method of validating a media item
US7925086B2 (en) * 2007-01-18 2011-04-12 Samsung Electronics Co, Ltd. Method and system for adaptive quantization layer reduction in image processing applications
DE102007021518B4 (en) * 2007-05-04 2009-01-29 Technische Universität Berlin Method of processing a video data set
WO2010036098A1 (en) * 2008-09-25 2010-04-01 Tele Atlas B.V. Method of and arrangement for blurring an image
US9007435B2 (en) * 2011-05-17 2015-04-14 Himax Technologies Limited Real-time depth-aware image enhancement system
US9582888B2 (en) * 2014-06-19 2017-02-28 Qualcomm Incorporated Structured light three-dimensional (3D) depth map based on content filtering
JP6316330B2 (en) * 2015-04-03 2018-04-25 コグネックス・コーポレーション Homography correction
CN105930843A (en) * 2016-04-19 2016-09-07 鲁东大学 Segmentation method and device of fuzzy video image
US10510148B2 (en) * 2017-12-18 2019-12-17 Hong Kong Applied Science And Technology Research Institute Co., Ltd. Systems and methods for block based edgel detection with false edge elimination
CN108235775B (en) * 2017-12-18 2021-06-15 香港应用科技研究院有限公司 System and method for block-based edge pixel detection with false edge elimination
TWI743746B (en) * 2020-04-16 2021-10-21 瑞昱半導體股份有限公司 Image processing method and image processing circuit

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB8906587D0 (en) * 1989-03-22 1989-05-04 Philips Electronic Associated Region/texture coding systems
US5268967A (en) * 1992-06-29 1993-12-07 Eastman Kodak Company Method for automatic foreground and background detection in digital radiographic images
US5659624A (en) * 1995-09-01 1997-08-19 Fazzari; Rodney J. High speed mass flow food sorting appartus for optically inspecting and sorting bulk food products
US6035060A (en) * 1997-02-14 2000-03-07 At&T Corp Method and apparatus for removing color artifacts in region-based coding
US6741655B1 (en) * 1997-05-05 2004-05-25 The Trustees Of Columbia University In The City Of New York Algorithms and system for object-oriented content-based video search
US6631212B1 (en) * 1999-09-13 2003-10-07 Eastman Kodak Company Twostage scheme for texture segmentation based on clustering using a first set of features and refinement using a second set of features
US7085401B2 (en) * 2001-10-31 2006-08-01 Infowrap Systems Ltd. Automatic object extraction
US7116820B2 (en) * 2003-04-28 2006-10-03 Hewlett-Packard Development Company, Lp. Detecting and correcting red-eye in a digital image

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
JIANHUA XUAN ET AL: "Segmentation of magnetic resonance brain image: integrating region growing and edge detection" PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING. (ICIP). WASHINGTON, OCT. 23 - 26, 1995, LOS ALAMITOS, IEEE COMP. SOC. PRESS, US, vol. VOL. 3, 23 October 1995 (1995-10-23), pages 544-547, XP010197242 ISBN: 0-7803-3122-2 *
KERMAD C; CHEHDI K; CARIOU C: "Image segmentation by iterative region-contour control minimizing a convergence criterion" PROCEEDINGS OF THIRD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING ICSP 96, 14 October 1996 (1996-10-14), - 18 October 1996 (1996-10-18) pages 1312-1315, XP002310879 BEIJING, CHINA *
ZUGAJ D ET AL: "A new approach of color images segmentation based on fusing region and edge segmentations outputs" PATTERN RECOGNITION, ELSEVIER, KIDLINGTON, GB, vol. 31, no. 2, February 1998 (1998-02), pages 105-113, XP004101082 ISSN: 0031-3203 *

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007109632A2 (en) * 2006-03-17 2007-09-27 Qualcomm Incorporated Systems, methods, and apparatus for exposure control
WO2007109632A3 (en) * 2006-03-17 2011-04-07 Qualcomm Incorporated Systems, methods, and apparatus for exposure control
US8107762B2 (en) 2006-03-17 2012-01-31 Qualcomm Incorporated Systems, methods, and apparatus for exposure control
US8824827B2 (en) 2006-03-17 2014-09-02 Qualcomm Incorporated Systems, methods, and apparatus for exposure control
EP1931150A1 (en) * 2006-12-04 2008-06-11 Koninklijke Philips Electronics N.V. Image processing system for processing combined image data and depth data
WO2008068707A2 (en) * 2006-12-04 2008-06-12 Koninklijke Philips Electronics N.V. Image processing system for processing combined image data and depth data
WO2008068707A3 (en) * 2006-12-04 2009-07-16 Koninkl Philips Electronics Nv Image processing system for processing combined image data and depth data
US9948943B2 (en) 2006-12-04 2018-04-17 Koninklijke Philips N.V. Image processing system for processing combined image data and depth data
US8064705B2 (en) 2007-04-12 2011-11-22 Canon Kabushiki Kaisha Image processing apparatus and control method thereof
US8515172B2 (en) 2007-12-20 2013-08-20 Koninklijke Philips N.V. Segmentation of image data

Also Published As

Publication number Publication date
EP1570429A2 (en) 2005-09-07
WO2004051573A3 (en) 2005-03-17
US20060104535A1 (en) 2006-05-18
KR20050085355A (en) 2005-08-29
JP2006509292A (en) 2006-03-16
CN1720550A (en) 2006-01-11
AU2003283706A8 (en) 2004-06-23
AU2003283706A1 (en) 2004-06-23

Similar Documents

Publication Publication Date Title
US20060104535A1 (en) Method and apparatus for removing false edges from a segmented image
EP2230855B1 (en) Synthesizing virtual images from texture and depth images
JP3862140B2 (en) Method and apparatus for segmenting a pixelated image, recording medium, program, and image capture device
US9137512B2 (en) Method and apparatus for estimating depth, and method and apparatus for converting 2D video to 3D video
US8384763B2 (en) Generating a depth map from a two-dimensional source image for stereoscopic and multiview imaging
EP2230640B1 (en) Method for filtering depth images
US6668097B1 (en) Method and apparatus for the reduction of artifact in decompressed images using morphological post-filtering
US9183617B2 (en) Methods, devices, and computer readable mediums for processing a digital picture
EP2230856A2 (en) Method for up-sampling images
EP1008106A1 (en) A method and apparatus for segmenting images prior to coding
EP3718306B1 (en) Cluster refinement for texture synthesis in video coding
US11323717B2 (en) Frequency adjustment for texture synthesis in video coding
JP2005151568A (en) Temporal smoothing apparatus and method for compositing intermediate image
EP1815441B1 (en) Rendering images based on image segmentation
JP2020506484A (en) Method and apparatus for processing image property maps
Xu et al. Depth map misalignment correction and dilation for DIBR view synthesis
US11252413B2 (en) Polynomial fitting for motion compensation and luminance reconstruction in texture synthesis
EP1863283B1 (en) A method and apparatus for frame interpolation
EP2525324A2 (en) Method and apparatus for generating a depth map and 3d video
WO2004097737A1 (en) Segmentation refinement
Xu et al. Watershed based depth map misalignment correction and foreground biased dilation for DIBR view synthesis
WO2023102189A2 (en) Iterative graph-based image enhancement using object separation
Lee et al. Depth resampling for mixed resolution multiview 3D videos
Ko et al. Effective reconstruction of stereoscopic image pair by using regularized adaptive window matching algorithm

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): BW GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2003775687

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 2006104535

Country of ref document: US

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 20038A49725

Country of ref document: CN

Ref document number: 1020057010121

Country of ref document: KR

Ref document number: 2004556701

Country of ref document: JP

Ref document number: 10537209

Country of ref document: US

WWP Wipo information: published in national office

Ref document number: 1020057010121

Country of ref document: KR

WWP Wipo information: published in national office

Ref document number: 2003775687

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 10537209

Country of ref document: US

WWW Wipo information: withdrawn in national office

Ref document number: 2003775687

Country of ref document: EP