CN106875397A - A kind of method for realizing interactive image segmentation, device and terminal - Google Patents
A kind of method for realizing interactive image segmentation, device and terminal Download PDFInfo
- Publication number
- CN106875397A CN106875397A CN201710004616.XA CN201710004616A CN106875397A CN 106875397 A CN106875397 A CN 106875397A CN 201710004616 A CN201710004616 A CN 201710004616A CN 106875397 A CN106875397 A CN 106875397A
- Authority
- CN
- China
- Prior art keywords
- pixel
- point
- mask artwork
- directed graph
- track
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Landscapes
- Image Analysis (AREA)
Abstract
Disclosed herein is a kind of method for realizing interactive image segmentation, device and terminal.Methods described includes:The first neighboring region that track will be smeared on original image or sketch the contours track is defined as mark zone, and the second neighboring region is defined as area-of-interest;Generate the input mask artwork of image segmentation algorithm:Using the pixel in mark zone as the foreground point in mask artwork, the pixel in area-of-interest outside mark zone is used as the background dot in mask artwork;Obtain depth map, the partitioning parameters of each pixel are determined according to depth map and mask artwork, build non-directed graph and the partitioning parameters of each pixel in mask artwork are mapped in non-directed graph, the non-directed graph is processed according to minimal cut maximum-flow algorithm, the mask artwork after fine segmentation is obtained, the corresponding image in foreground point in the mask artwork after fine segmentation is partitioned into from original image.The run time of algorithm can be shortened herein, the depth information using image improves the effect of image segmentation.
Description
Technical field
The present invention relates to technical field of image processing, more particularly to a kind of method for realizing interactive image segmentation, device
And terminal.
Background technology
Image segmentation, if refer to by plane picture according to the features such as color, texture, shape be divided into mutually it is disjunct
Dry region, this is a basic technology for practicality of image processing field.Existing image Segmentation Technology has dividing based on threshold value
Segmentation method, the dividing method based on edge, the dividing method based on region, the dividing method based on energy functional, based on graph theory
Dividing method etc..Comparing wherein in Graph-theoretical Approach well-known has GraphCut algorithms and its modified version GrabCut algorithms.
GraphCut algorithms and its modified version GrabCut algorithms, are the interactive image segmentation sides based on area marking
Method.GraphCut algorithms are based on Markov random field (Markov Random Field, MRF) energy minimization framework
A kind of algorithm, advantage is that can carry out global optimum's solution with reference to various knowwhies.GrabCut algorithms are to GraphCut
The improvement of algorithm, GrabCut algorithms by mark out on the original image foreground point (point on the destination object to be extracted) and
Background dot generates mask artwork, and gauss hybrid models (Gaussian is set up to prospect, background color space using artwork and mask artwork
Mixture Model, GMM), energy minimization is completed using the iterative algorithm that can be evolved in GMM parameter learnings, estimation procedure,
The foreground point in image and background dot are ruled out, the target image being made up of foreground point pixel is extracted from artwork.
When carrying out image segmentation using GrabCut algorithms on mobile phone, in order to reduce interactive complexity, generally to user
How to mark and be not strict with, therefore, in the case where the foreground point of user's mark is less, iterations may be a lot, algorithm
Run time is more long, have impact on the experience of user.On the other hand, the GrabCut algorithms in correlation technique are based on coloured image
Image segmentation is carried out, when the color characteristic of the destination object to be extracted is not obvious, using dividing that cromogram is split
Cut effect unsatisfactory.
The content of the invention
The technical problems to be solved by the invention are to provide a kind of method for realizing interactive image segmentation, device and end
End, can shorten the run time of algorithm, and the depth information using image improves the effect of image segmentation.
A kind of method for realizing interactive image segmentation is the embodiment of the invention provides, including:
After detecting the smearing track on original image or sketching the contours track, by the smearing track or the first of track is sketched the contours
Neighboring region is defined as mark zone, and the smearing track or the second neighboring region for sketching the contours track are defined as into area-of-interest,
The area-of-interest includes the mark zone;Generate the input mask artwork of image segmentation algorithm:To own in the mark zone
Pixel as the foreground point in the mask artwork, using the pixel outside mark zone in the area-of-interest as the mask artwork
In background dot;
The depth map comprising destination object depth information is obtained, the mask artwork is determined according to the depth map and mask artwork
The partitioning parameters of upper each pixel, the partitioning parameters are used to represent that pixel judgement is the probability and institute of foreground point or background dot
State the depth value difference of pixel and adjacent pixel;
Non-directed graph is built, the partitioning parameters of each pixel in the mask artwork is mapped in the non-directed graph, root
The non-directed graph is processed according to minimal cut-maximum-flow algorithm, obtains the mask artwork after fine segmentation;
Before being partitioned into the mask artwork after the fine segmentation from the original image that the destination object is obtained is shot
The corresponding image in sight spot.
Alternatively, the partitioning parameters of each pixel on the mask artwork are determined according to the depth map and mask artwork, including:
Gauss hybrid models GMM calculating is carried out by EM methods, the EM methods include E steps and M steps;Iteration runs E steps and M steps
Suddenly, the iterative process is stopped after iteration operation reaches the condition of convergence;Last time is performed described in the acquisition of M steps
The classification of pixel is defined as the classification of the pixel, and the pixel that last time performs the acquisition of M steps is belonged into certain cluster
Most probable value PmaxIt is defined as the area item partitioning parameters of the pixel, the area item partitioning parameters are that the pixel is sentenced
It is certainly foreground point or the probability of background dot;
Wherein, the E steps and M steps include following treatment respectively:
E steps:The position relationship between depth value and pixel according to each pixel on the mask artwork is by clustering
Same type of pixel is polymerized to one or more clusters, the GMM model of each cluster is determined;Wherein, the classification of pixel includes prospect
Point or background dot;The classification of cluster includes foreground point cluster or background dot cluster;
M steps:GMM model according to each cluster determines that each pixel belongs to the probability of each cluster, to any one
Pixel, the most probable value P according to the pixelmaxCorresponding cluster determines the classification of the pixel.
Alternatively, the partitioning parameters of each pixel on the mask artwork are determined according to the depth map and mask artwork, is also wrapped
Include:Depth value difference according to the pixel and adjacent pixel determines the border item partitioning parameters of the pixel;
Wherein, to any one pixel, by the depth value absolute difference between the pixel and each neighbouring pixel
Added up, then to adding up and being normalized, cumulative and as the pixel the border item point after being normalized
Cut parameter.
Alternatively, non-directed graph is built, the partitioning parameters of each pixel in the mask artwork is mapped to described undirected
In figure, including:
A non-directed graph is built, two hanging point Q are set outside the plane of the non-directed graph0And Q1, the hanging point Q0For
Virtual foreground point, the hanging point Q1It is virtual background point;Each is set up on the mask artwork in the plane of the non-directed graph
The mapping point of pixel, in mapping point and the hanging point Q of foreground point0Between set up line, mapping point and institute in background dot
State hanging point Q1Between set up line;
To any one pixel P in mask artworki, by the pixel PiArea item partitioning parameters as the non-directed graph
Middle mapping point Pi' weights, by the pixel PiBorder item partitioning parameters as mapping point P in the non-directed graphi' with it is hanging
Point Q0Or Q1Between line weights.
Alternatively, it is described the non-directed graph is processed according to minimal cut-maximum-flow algorithm, after obtaining fine segmentation
Mask artwork, including:
Iteration performs following step C and D steps, and iterative process is stopped after iteration operation reaches the condition of convergence, will be described
Each pixel in prospect point set Q is used as the foreground point in the mask artwork after fine segmentation;
Wherein, step C and D steps include following treatment respectively:
Step C:One part of pixel in non-directed graph is divided into and vacantly point Q0Similar foreground point, by being divided into prospect
The pixel of point constitutes prospect point set Q;
D steps:The weights summation of the prospect point set Q is calculated, the weights summation is institute in the prospect point set Q
Have foreground point weights and, along with all foreground points in the prospect point set Q and hanging point Q0Between line weights and;
Wherein, the condition of convergence be prospect point set Q weights summation less than threshold value and change tend towards stability.
Alternatively, the original image that the shooting destination object is obtained is cromogram, artwork master or infrared figure.
Alternatively, before the input mask artwork of generation image segmentation algorithm, also include:
If currently detected smearing track is directed to the first time mark of the original image, will be currently detected
The mark zone determined of smearing track as final mark zone, by currently detected smearing track determine it is interested
Region is used as final area-of-interest;
If currently detected smearing track is not directed to the first time mark of the original image, current inspection is judged
Whether have between the mark zone that the smearing track that the mark zone and last time that the smearing track for measuring is determined are detected is determined
Occur simultaneously, if occuring simultaneously, then the smearing rail that the mark zone determined currently detected smearing track is detected with the last time
The mark zone that mark is determined is merged, using the mark zone after merging as final mark zone;If do not occured simultaneously, ought
Before the mark zone determined of the smearing track that detects as final mark zone;
If currently detected smearing track is not directed to the first time mark of the original image, current inspection is judged
The area-of-interest that area-of-interest that the smearing track for measuring is determined is determined with the last smearing track for detecting it
Between whether have common factor, if occuring simultaneously, then the area-of-interest determined currently detected smearings track and last inspection
The area-of-interest that the smearing track for measuring is determined is merged, using the area-of-interest after merging as final interested
Region;If do not occured simultaneously, the area-of-interest that currently detected smearing track is determined is used as final interested
Region.
Alternatively, described image partitioning algorithm is GrabCut algorithms.
The embodiment of the present invention additionally provides a kind of device for realizing interactive image segmentation, including:
Pretreatment module, after detecting the smearing track on original image or sketching the contours track, by the smearing track
Or sketch the contours the first neighboring region of track and be defined as mark zone, the smearing track or the second neighboring region for sketching the contours track is true
It is set to area-of-interest, the area-of-interest includes the mark zone;Generate the input mask artwork of image segmentation algorithm:By institute
All of pixel is used as the foreground point in the mask artwork in stating mark zone, by the pixel outside mark zone in the area-of-interest
As the background dot in the mask artwork;
Partitioning parameters computing module, for obtaining the depth map comprising destination object depth information, according to the depth map
With the partitioning parameters that mask artwork determines each pixel on the mask artwork, it is prospect that the partitioning parameters are used to represent that pixel is adjudicated
The probability and the pixel of point or background dot and the depth value difference of adjacent pixel;
Mask artwork adjusting module, for building non-directed graph, the partitioning parameters of each pixel in the mask artwork is reflected
It is mapped in the non-directed graph, the non-directed graph is processed according to minimal cut-maximum-flow algorithm, obtains covering after fine segmentation
Mould figure;
Output module, after being partitioned into the fine segmentation from the original image that the destination object is obtained is shot
Mask artwork in the corresponding image in foreground point.
Alternatively, partitioning parameters computing module, for determining institute according to the depth map and mask artwork in the following ways
State the partitioning parameters of each pixel on mask artwork:Gauss hybrid models GMM calculating is carried out by EM methods, the EM methods include E
Step and M steps;Iteration runs E steps and M steps, and the iteration mistake is stopped after iteration operation reaches the condition of convergence
Journey;The classification that last time performs the pixel that M steps are obtained is defined as the classification of the pixel, last time is performed
The pixel that M steps are obtained belongs to the most probable value P of certain clustermaxIt is defined as the area item partitioning parameters of the pixel,
The area item partitioning parameters are that the pixel judgement is foreground point or the probability of background dot;
Wherein, the E steps and M steps include following treatment respectively:
E steps:The position relationship between depth value and pixel according to each pixel on the mask artwork is by clustering
Same type of pixel is polymerized to one or more clusters, the GMM model of each cluster is determined;Wherein, the classification of pixel includes prospect
Point or background dot;The classification of cluster includes foreground point cluster or background dot cluster;
M steps:GMM model according to each cluster determines that each pixel belongs to the probability of each cluster, to any one
Pixel, the most probable value P according to the pixelmaxCorresponding cluster determines the classification of the pixel.
Alternatively, partitioning parameters computing module, is additionally operable to be determined according to the depth map and mask artwork in the following ways
The partitioning parameters of each pixel on the mask artwork:Depth value difference according to the pixel and adjacent pixel determines the pixel
Border item partitioning parameters;
Wherein, to any one pixel, by the depth value absolute difference between the pixel and each neighbouring pixel
Added up, then to adding up and being normalized, cumulative and as the pixel the border item point after being normalized
Cut parameter.
Alternatively, mask artwork adjusting module, for building non-directed graph and will be every in the mask artwork in the following ways
The partitioning parameters of one pixel are mapped in the non-directed graph:
A non-directed graph is built, two hanging point Q are set outside the plane of the non-directed graph0And Q1, the hanging point Q0For
Virtual foreground point, the hanging point Q1It is virtual background point;Each is set up on the mask artwork in the plane of the non-directed graph
The mapping point of pixel, in mapping point and the hanging point Q of foreground point0Between set up line, mapping point and institute in background dot
State hanging point Q1Between set up line;
To any one pixel P in mask artworki, by the pixel PiArea item partitioning parameters as the non-directed graph
Middle mapping point Pi' weights, by the pixel PiBorder item partitioning parameters as mapping point P in the non-directed graphi' with it is hanging
Point Q0Or Q1Between line weights.
Alternatively, mask artwork adjusting module, in the following ways according to minimal cut-maximum-flow algorithm to described undirected
Figure is processed, and obtains the mask artwork after fine segmentation:
Iteration performs following step C and D steps, and iterative process is stopped after iteration operation reaches the condition of convergence, will be described
Each pixel in prospect point set Q is used as the foreground point in the mask artwork after fine segmentation;
Wherein, step C and D steps include following treatment respectively:
Step C:One part of pixel in non-directed graph is divided into and vacantly point Q0Similar foreground point, by being divided into prospect
The pixel of point constitutes prospect point set Q;
D steps:The weights summation of the prospect point set Q is calculated, the weights summation is institute in the prospect point set Q
Have foreground point weights and, along with all foreground points in the prospect point set Q and hanging point Q0Between line weights and;
Wherein, the condition of convergence be prospect point set Q weights summation less than threshold value and change tend towards stability.
Alternatively, the original image that the shooting destination object is obtained is cromogram, artwork master or infrared figure.
Alternatively, pretreatment module, was additionally operable to before the input mask artwork of generation image segmentation algorithm, to mark zone and
Area-of-interest is adjusted:
If currently detected smearing track is directed to the first time mark of the original image, will be currently detected
The mark zone determined of smearing track as final mark zone, by currently detected smearing track determine it is interested
Region is used as final area-of-interest;
If currently detected smearing track is not directed to the first time mark of the original image, current inspection is judged
Whether have between the mark zone that the smearing track that the mark zone and last time that the smearing track for measuring is determined are detected is determined
Occur simultaneously, if occuring simultaneously, then the smearing rail that the mark zone determined currently detected smearing track is detected with the last time
The mark zone that mark is determined is merged, using the mark zone after merging as final mark zone;If do not occured simultaneously, ought
Before the mark zone determined of the smearing track that detects as final mark zone;
If currently detected smearing track is not directed to the first time mark of the original image, current inspection is judged
The area-of-interest that area-of-interest that the smearing track for measuring is determined is determined with the last smearing track for detecting it
Between whether have common factor, if occuring simultaneously, then the area-of-interest determined currently detected smearings track and last inspection
The area-of-interest that the smearing track for measuring is determined is merged, using the area-of-interest after merging as final interested
Region;If do not occured simultaneously, the area-of-interest that currently detected smearing track is determined is used as final interested
Region.
Alternatively, described image partitioning algorithm is GrabCut algorithms.
The embodiment of the present invention additionally provides a kind of terminal, including the above-mentioned device for realizing interactive image segmentation.
Set forth herein a kind of method for realizing interactive image segmentation, device and terminal, rail will be smeared on original image
Mark sketches the contours the first neighboring region of track and is defined as mark zone, and the second neighboring region is defined as area-of-interest, and the sense is emerging
Interesting region includes the mark zone;Generate the input mask artwork of image segmentation algorithm:All of pixel in the mark zone is made
It is the foreground point in mask artwork, the pixel outside mark zone in the area-of-interest as the background dot in mask artwork is obtained
Depth map comprising destination object depth information, each pixel on the mask artwork is determined according to the depth map and mask artwork
Partitioning parameters, the partitioning parameters be used to representing pixel judgement for foreground point or background dot probability and the pixel with it is adjacent
The depth value difference of pixel, builds non-directed graph and the partitioning parameters of each pixel in the mask artwork is mapped into the nothing
In to figure, the non-directed graph is processed according to minimal cut-maximum-flow algorithm, the mask artwork after fine segmentation is obtained, from bat
Take the photograph and the corresponding figure in foreground point in the mask artwork after the fine segmentation is partitioned into the original image that the destination object is obtained
Picture.The prospect that the technical scheme of this paper can expand image segmentation algorithm mark by image preprocessing is counted out, and reduces image
The background of partitioning algorithm mark is counted out, so that shorten the run time of image segmentation algorithm, based on depth map operation image point
Algorithm is cut, so as to optimize the effect of image segmentation.
Brief description of the drawings
Fig. 1 is the hardware architecture diagram for realizing each optional mobile terminal of embodiment one of the invention;
Fig. 2 is the wireless communication system schematic diagram of mobile terminal as shown in Figure 1;
Fig. 3 is a kind of method flow diagram for realizing interactive image segmentation of the embodiment of the present invention 1;
Fig. 4 is a kind of schematic device for realizing interactive image segmentation of the embodiment of the present invention 2;
Fig. 5-a are the schematic diagram of original image and user's smearing track in present invention application example 1;
Fig. 5-b are by the mark zone (boundary rectangle extension) of smearing Track Pick-up and interested in present invention application example 1
The schematic diagram in region;
Fig. 5-c are the schematic diagram that the present invention applies the mask artwork generated by mark zone and area-of-interest in example 1;
Fig. 5-d are the schematic diagram of the depth map of present invention application example 1;
Fig. 5-e are the schematic diagram of the non-directed graph of present invention application example 1;
Fig. 5-f are the schematic diagram of the mask artwork after fine segmentation in present invention application example 1;
Fig. 5-g are the schematic diagram that the present invention applies the destination object being partitioned into example 1.
Fig. 6-a be the present invention application example 2 in by smearing Track Pick-up mark zone (seed growth) and area-of-interest
Schematic diagram;
Fig. 6-b-1 are the signal of the incipient activity point and peripheral neighborhood during seed growth in present invention application example 2
Figure;
Fig. 6-b-2 are the subregion that the incipient activity point in present invention application example 2 during seed growth grows into
Schematic diagram;
Fig. 6-b-3 are that incipient activity point and new moving point in present invention application example 2 during seed growth are common
With the schematic diagram in the region for growing into;
Fig. 6-c are the schematic diagram that the present invention applies the mask artwork generated by mark zone and area-of-interest in example 2;
Fig. 7-a are the schematic diagram that original image and user sketch the contours track in present invention application example 3;
Fig. 7-b be the present invention application example 3 in by sketch the contours Track Pick-up mark zone (being closed by mending line segment) and
The schematic diagram of area-of-interest;
Fig. 7-c are the schematic diagram that the present invention applies the mask artwork generated by mark zone and area-of-interest in example 3.
The realization of the object of the invention, functional characteristics and advantage will be described further referring to the drawings in conjunction with the embodiments.
Specific embodiment
Technical scheme is described in detail below in conjunction with drawings and Examples.
The mobile terminal of each embodiment of the application is realized referring now to Description of Drawings.In follow-up description, use
For represent element such as " module ", " part " or " unit " suffix only for being conducive to explanation of the invention, itself
Not specific meaning.Therefore, " module " can be used mixedly with " part ".
Mobile terminal can be implemented in a variety of manners.For example, the terminal described in the present invention can include such as moving
Phone, smart phone, notebook computer, digit broadcasting receiver, PDA (personal digital assistant), PAD (panel computer), PMP
The mobile terminal of (portable media player), guider etc. and such as numeral TV, desktop computer etc. are consolidated
Determine terminal.Hereinafter it is assumed that terminal is mobile terminal.However, it will be understood by those skilled in the art that, except being used in particular for movement
Outside the element of purpose, construction according to the embodiment of the present invention can also apply to the terminal of fixed type.
Fig. 1 is to realize the application one hardware architecture diagram of optional mobile terminal of each embodiment.
Mobile terminal 1 00 can include wireless communication unit 110, A/V (audio/video) input block 120, user input
Unit 130, sensing unit 140, output unit 150, memory 160, interface unit 170, controller 180 and power subsystem 190
Etc..
Fig. 1 shows the mobile terminal 1 00 with various assemblies, it should be understood that being not required for implementing all showing
The component for going out.More or less component can alternatively be implemented.The element of mobile terminal 1 00 will be discussed in more detail below.
Wireless communication unit 110 can generally include one or more assemblies, and it allows mobile terminal 1 00 and radio communication
Radio communication between system or network.For example, wireless communication unit 110 can include that broadcasting reception module 111, movement are logical
At least one of letter module 112, wireless Internet module 113, short range communication module 114 and location information module 115.
Broadcasting reception module 111 receives broadcast singal and/or broadcast via broadcast channel from external broadcast management server
Relevant information.Broadcast channel can include satellite channel and/or terrestrial channel.Broadcast management server can be generated and sent
The broadcast singal and/or broadcast related information generated before the server or reception of broadcast singal and/or broadcast related information
And send it to the server of terminal.Broadcast singal can include TV broadcast singals, radio signals, data broadcasting
Signal etc..And, broadcast singal may further include the broadcast singal combined with TV or radio signals.Broadcast phase
Pass information can also be provided via mobile communications network, and in said case, broadcast related information can be by mobile communication
Module 112 is received.Broadcast singal can exist in a variety of manners, for example, it can be with the electricity of DMB (DMB)
The form of sub- program guide (EPG), the electronic service guidebooks (ESG) of digital video broadcast-handheld (DVB-H) etc. and exist.Extensively
Broadcasting receiver module 111 can receive signal broadcast by using various types of broadcast systems.Especially, broadcasting reception module
111 can be by using such as multimedia broadcasting-ground (DMB-T), DMB-satellite (DMB-S), digital video
Broadcasting-Handheld (DVB-H), Radio Data System, the received terrestrial digital broadcasting integrated service of forward link media (MediaFLO@)
Etc. (ISDB-T) digit broadcasting system receives digital broadcasting.Broadcasting reception module 111 may be constructed such that and be adapted to provide for extensively
Broadcast the various broadcast systems and above-mentioned digit broadcasting system of signal.Via broadcasting reception module 111 receive broadcast singal and/
Or broadcast related information can be stored in memory 160 (or other types of storage medium).
Mobile communication module 112 sends radio signals to base station (for example, access point, node B etc.), exterior terminal
And at least one of server and/or receive from it radio signal.Such radio signal can be logical including voice
Words signal, video calling signal or the various types of data for sending and/or receiving according to text and/or Multimedia Message.
Wireless Internet module 113 supports the Wi-Fi (Wireless Internet Access) of mobile terminal.The module can be with internal or external
Be couple to terminal.Wi-Fi (Wireless Internet Access) technology involved by the module can include WLAN (WLAN) (Wi-Fi),
Wibro (WiMAX), Wimax (worldwide interoperability for microwave accesses), HSDPA (high-speed downlink packet access) etc..
Short range communication module 114 is the module for supporting junction service.Some examples of short-range communication technology include indigo plant
Tooth TM, radio frequency identification (RFID), Infrared Data Association (IrDA), ultra wide band (UWB), purple honeybee TM etc..
Location information module 115 is the module for checking or obtaining the positional information of mobile terminal.Location information module
115 typical case is GPS (global positioning system).According to current technology, GPS calculate from three or more satellites away from
Information application triangulation from information and correct time information and for calculating, so as to according to longitude, latitude and height
Degree calculates three-dimensional current location information exactly.Currently, three satellites are used simultaneously for calculating the method for position and temporal information
And the error of the position and temporal information for calculating is corrected by using an other satellite.Additionally, GPS can be by real-time
Ground Continuous plus current location information carrys out calculating speed information.
A/V input blocks 120 are used to receive audio or video signal.A/V input blocks 120 can include the He of camera 121
Microphone 122, the static images that 121 pairs, camera is obtained in Video Capture pattern or image capture mode by image capture apparatus
Or the view data of video is processed.Picture frame after treatment may be displayed on display unit 151.Processed through camera 121
Picture frame afterwards can be stored in memory 160 (or other storage mediums) or sent out via wireless communication unit 110
Send, two or more cameras 121 can be provided according to the construction of mobile terminal 1 00.Microphone 122 can be in telephone relation mould
In formula, logging mode, speech recognition mode etc. operational mode sound (voice data), and energy are received via microphone 122
Enough is voice data by such acoustic processing.Audio (voice) data after treatment can be in the case of telephone calling model
The form that being converted to can be sent to mobile communication base station via mobile communication module 112 is exported.Microphone 122 can be implemented various
The noise of type eliminates (or suppression) algorithm and is being received and making an uproar of producing during sending audio signal with eliminating (or suppression)
Sound or interference.
User input unit 130 can generate key input data to control mobile terminal 1 00 according to the order of user input
Various operations.User input unit 130 allow the various types of information of user input, and can include keyboard, metal dome,
Touch pad (for example, detection due to being touched caused by resistance, pressure, electric capacity etc. change sensitive component), roller, shake
Bar etc..Especially, when touch pad is superimposed upon on display unit 151 in the form of layer, touch-screen can be formed.
Sensing unit 140 detects the current state of mobile terminal 1 00, (for example, mobile terminal 1 00 opens or closes shape
State), the presence or absence of the contact (that is, touch input) of the position of mobile terminal 1 00, user for mobile terminal 1 00, mobile terminal
The acceleration or deceleration movement of 100 orientation, mobile terminal 1 00 and direction etc., and generate for controlling mobile terminal 1 00
The order of operation or signal.For example, when mobile terminal 1 00 is embodied as sliding-type mobile phone, sensing unit 140 can be sensed
The sliding-type phone is opened or closed.In addition, sensing unit 140 can detect whether power subsystem 190 provides electric power
Or whether interface unit 170 couples with external device (ED).Sensing unit 140 can include proximity transducer 141.
Interface unit 170 is connected the interface that can pass through with mobile terminal 1 00 as at least one external device (ED).For example,
External device (ED) can include wired or wireless head-band earphone port, external power source (or battery charger) port, wired or nothing
Line FPDP, memory card port, the port for connecting the device with identification module, audio input/output (I/O) end
Mouth, video i/o port, ear port etc..Identification module can be that storage uses each of mobile terminal 1 00 for verifying user
Kind of information and subscriber identification module (UIM), client identification module (SIM), Universal Subscriber identification module (USIM) can be included
Etc..In addition, the device (hereinafter referred to as " identifying device ") with identification module can take the form of smart card, therefore, know
Other device can be connected via port or other attachment means with mobile terminal 1 00.Interface unit 170 can be used for reception and come from
The input (for example, data message, electric power etc.) of the external device (ED) and input that will be received is transferred in mobile terminal 1 00
One or more elements can be used for transmitting data between mobile terminal 1 00 and external device (ED).
In addition, when mobile terminal 1 00 is connected with external base, interface unit 170 can serve as allowing by it by electricity
Power provides to the path of mobile terminal 1 00 from base or can serve as allowing the various command signals being input into from base to pass through it
It is transferred to the path of mobile terminal 1 00.Can serve as recognizing mobile terminal 1 00 from the various command signals or electric power of base input
Whether signal base on is accurately fitted within.Output unit 150 is configured to be provided with vision, audio and/or tactile manner
Output signal (for example, audio signal, vision signal, alarm signal, vibration signal etc.).Output unit 150 can include aobvious
Show unit 151, dio Output Modules 152, alarm unit 153 etc..
Display unit 151 may be displayed on the information processed in mobile terminal 1 00.For example, when mobile terminal 1 00 is in electricity
During words call mode, display unit 151 can show and converse or other communicate (for example, text messaging, multimedia file
Download etc.) related user interface (UI) or graphic user interface (GUI).When mobile terminal 1 00 is in video calling pattern
Or during image capture mode, display unit 151 can show the image of capture and/or the image of reception, show video or figure
UI or GUI of picture and correlation function etc..
Meanwhile, when display unit 151 and touch pad in the form of layer it is superposed on one another to form touch-screen when, display unit
151 can serve as input unit and output device.Display unit 151 can include liquid crystal display (LCD), thin film transistor (TFT)
In LCD (TFT-LCD), Organic Light Emitting Diode (OLED) display, flexible display, three-dimensional (3D) display etc. at least
It is a kind of.Some in these displays may be constructed such that transparence to allow user to be watched from outside, and this is properly termed as transparent
Display, typical transparent display can be, for example, TOLED (transparent organic light emitting diode) display etc..According to specific
Desired implementation method, mobile terminal 1 00 can include two or more display units (or other display devices), for example, moving
Dynamic terminal 100 can include outernal display unit (not shown) and inner display unit (not shown).Touch-screen can be used to detect
Touch input pressure and touch input position and touch input area.
Dio Output Modules 152 can be in call signal reception pattern, call mode, record mould in mobile terminal 1 00
It is that wireless communication unit 110 is received or in memory when under the isotypes such as formula, speech recognition mode, broadcast reception mode
In 160 store voice data transducing audio signal and be output as sound.And, dio Output Modules 152 can provide with
The audio output of the specific function correlation that mobile terminal 1 00 is performed is (for example, call signal receives sound, message sink sound etc.
Deng).Dio Output Modules 152 can include loudspeaker, buzzer etc..
Alarm unit 153 can provide output and be notified to mobile terminal 1 00 with by event.Typical event can be with
Including calling reception, message sink, key signals input, touch input etc..In addition to audio or video is exported, alarm unit
153 can in a different manner provide output with the generation of notification event.For example, alarm unit 153 can be in the form of vibrating
Output is provided, when calling, message or some other entrance communication (incoming communication) are received, alarm list
Unit 153 can provide tactile output (that is, vibrating) to notify to user.Exported by providing such tactile, even if
When in pocket of the mobile phone of user in user, user also can recognize that the generation of various events.Alarm unit 153
The output of the generation of notification event can be provided via display unit 151 or dio Output Modules 152.
Memory 160 can store software program for the treatment and control operation performed by controller 180 etc., Huo Zheke
Temporarily to store oneself data (for example, telephone directory, message, still image, video etc.) through exporting or will export.And
And, memory 160 can store the vibration of various modes on being exported when touching and being applied to touch-screen and audio signal
Data.
Memory 160 can include the storage medium of at least one type, and the storage medium includes flash memory, hard disk, many
Media card, card-type memory (for example, SD or DX memories etc.), random access storage device (RAM), static random-access storage
Device (SRAM), read-only storage (ROM), Electrically Erasable Read Only Memory (EEPROM), programmable read only memory
(PROM), magnetic storage, disk, CD etc..And, mobile terminal 1 00 can perform memory with by network connection
The network storage device cooperation of 160 store function.
The overall operation of the generally control mobile terminal of controller 180.For example, controller 180 is performed and voice call, data
Communication, video calling etc. related control and treatment.In addition, controller 180 can be included for reproducing (or playback) many matchmakers
The multi-media module 181 of volume data, multi-media module 181 can be constructed in controller 180, or can be structured as and control
Device 180 is separated.Controller 180 can be with execution pattern identifying processing, the handwriting input that will be performed on the touchscreen or picture
Draw input and be identified as character or image.
Power subsystem 190 receives external power or internal power under the control of controller 180 and provides operation each unit
Appropriate electric power needed for part and component.
Various implementation methods described herein can be with use such as computer software, hardware or its any combination of calculating
Machine computer-readable recording medium is implemented.Implement for hardware, implementation method described herein can be by using application-specific IC
(ASIC), digital signal processor (DSP), digital signal processing device (DSPD), programmable logic device (PLD), scene can
Programming gate array (FPGA), processor, controller, microcontroller, microprocessor, it is designed to perform function described herein
At least one in electronic unit is implemented, and in some cases, such implementation method can be implemented in controller 180.
For software implementation, the implementation method of such as process or function can with allow to perform the single of at least one function or operation
Software module is implemented.Software code can be come by the software application (or program) write with any appropriate programming language
Implement, software code can be stored in memory 160 and performed by controller 180.
So far, oneself according to its function through describing mobile terminal 1 00.In addition, the mobile terminal 1 00 in the embodiment of the present invention
Can be such as folded form, board-type, oscillating-type, sliding-type and other various types of mobile terminals, not do herein specifically
Limit.
Mobile terminal 1 00 as shown in Figure 1 may be constructed such that using via frame or packet transmission data it is all if any
Line and wireless communication system and satellite-based communication system are operated.
The communication system that mobile terminal wherein of the invention can be operated is described referring now to Fig. 2.
Such communication system can use different air interface and/or physical layer.For example, used by communication system
Air interface includes such as frequency division multiple access (FDMA), time division multiple acess (TDMA), CDMA (CDMA) and universal mobile communications system
System (UMTS) (especially, Long Term Evolution (LTE)), global system for mobile communications (GSM) etc..As non-limiting example, under
The description in face is related to cdma communication system, but such teaching is equally applicable to other types of system.
With reference to Fig. 2, cdma wireless communication system can include multiple mobile terminal 1s 00, multiple base station (BS) 270, base station
Controller (BSC) 275 and mobile switching centre (MSC) 280.MSC 280 is configured to and Public Switched Telephony Network (PSTN)
290 form interface.MSC 280 is also structured to be formed with the BSC 275 that can be couple to base station 270 via back haul link and connects
Mouthful.If any one in the interface that back haul link can be known according to Ganji is constructed, the interface can include such as Europe mark
Quasi- high power capacity digital circuit/Unite States Standard high power capacity digital circuit (E1/T1), asynchronous transfer mode (ATM), procotol
(IP), point-to-point protocol (PPP), frame relay, high-bit-rate digital subscriber line road (HDSL), Asymmetrical Digital Subscriber Line (ADSL)
Or all kinds digital subscriber line (xDSL).It will be appreciated that system can include multiple BSC 275 as shown in Figure 2.
Each BS 270 can service one or more subregions (or region), by multidirectional antenna or the day of sensing specific direction
Each subregion of line covering is radially away from BS 270.Or, each subregion can by two for diversity reception or more
Multiple antennas are covered.Each BS 270 may be constructed such that the multiple frequency distribution of support, and the distribution of each frequency has specific frequency
Spectrum (for example, 1.25MHz, 5MHz etc.).
What subregion and frequency were distributed intersects can be referred to as CDMA Channel.BS 270 can also be referred to as base station transceiver
System (BTS) or other equivalent terms.In this case, term " base station " can be used for broadly representing single BSC
275 and at least one BS 270.Base station can also be referred to as " cellular station ".Or, each subregion of specific BS 270 can be claimed
It is multiple cellular stations.
As shown in Figure 2, broadcast singal is sent to broadcsting transmitter (BT) 295 mobile terminal operated in system
100.Broadcasting reception module 111 as shown in Figure 1 is arranged at mobile terminal 1 00 to receive the broadcast sent by BT 295
Signal.In fig. 2 it is shown that several global positioning system (GPS) satellites 300.Satellite 300 helps position multiple mobile terminals
At least one of 100.
In fig. 2, multiple satellites 300 are depicted, it is understood that be, it is possible to use any number of satellite obtains useful
Location information.Location information module 115 as shown in Figure 1 is (such as:GPS) it is generally configured to coordinate to obtain with satellite 300
The location information that must be wanted.Substitute GPS tracking techniques or outside GPS tracking techniques, it is possible to use can track mobile whole
Other technologies of the position at end.In addition, at least one gps satellite 300 can optionally or additionally process satellite dmb biography
It is defeated.
Used as a typical operation of wireless communication system, BS 270 receives the reverse strand from various mobile terminal 1s 00
Road signal.Mobile terminal 1 00 generally participates in call, information receiving and transmitting and other types of communication.Each of certain base station reception is anti-
Processed in specific BS 270 to link signal.The data of acquisition are forwarded to the BSC 275 of correlation.BSC provides logical
Words resource allocation and the mobile management function of the coordination including the soft switching process between BS 270.BSC 275 will also be received
Data be routed to MSC 280, its provide for PSTN 290 formed interface extra route service.Similarly, PSTN
290 form interface with MSC 280, and MSC and BSC 275 form interface, and BSC 275 correspondingly controls BS 270 with by forward direction
Link signal is sent to mobile terminal 1 00.
Based on above-mentioned mobile terminal hardware configuration and communication system, the application method each embodiment is proposed.
As shown in figure 3, the embodiment of the present invention proposes a kind of method for realizing interactive image segmentation, including:
S310, after detecting the smearing track on original image or sketching the contours track, by the smearing track or sketches the contours track
The first neighboring region be defined as mark zone, by the smearing track or sketch the contours track the second neighboring region be defined as it is interested
Region, the area-of-interest includes the mark zone;Generate the input mask artwork of image segmentation algorithm:By in the mark zone
All of pixel covers the pixel outside mark zone in the area-of-interest as described as the foreground point in the mask artwork
Background dot in mould figure;
S320, obtains the depth map comprising destination object depth information, according to the depth map and mask artwork determine
The partitioning parameters of each pixel on mask artwork, the partitioning parameters are used to represent that pixel judgement is foreground point or the probability of background dot
And the depth value difference of the pixel and adjacent pixel;
S330, builds non-directed graph, and the partitioning parameters of each pixel in the mask artwork are mapped into the non-directed graph
In, the non-directed graph is processed according to minimal cut-maximum-flow algorithm, obtain the mask artwork after fine segmentation;
S340, the mask artwork after the fine segmentation is partitioned into from the original image that the destination object is obtained is shot
The corresponding image in middle foreground point;
Methods described can also include following features:
Wherein, it is two kinds of different mark modes to smear and sketch the contours;
Usually, it is the mark that carries out of interior zone in destination object to smear track, and it is along target pair to sketch the contours track
The mark that the exterior contour of elephant is carried out;
Wherein, mask artwork refers to that the part or all of pixel of a sub-picture has been carried out to be generated after prospect is distinguished with background
Mark figure, each pixel on the mask artwork is marked as foreground point or background dot.
In the present embodiment, described image partitioning algorithm is GrabCut algorithms.
In one embodiment, the first neighboring region of the smearing track is defined as mark zone, by the smearing
Second neighboring region of track is defined as area-of-interest, including:
The minimum enclosed rectangle of the smearing track is constructed, the length L according to the smearing track is external to the minimum
Rectangle is extended to form mark zone;
It is emerging according to rectangle template sense of the generation comprising the smearing track for rectangle template is applied mechanically in the smearing track
Interesting region (Region of Interest, ROI), if the area-of-interest does not include the mark zone, will be described
Area-of-interest is extended to comprising the mark zone;Or expansion process formation area-of-interest is carried out to the mark zone.
Wherein, the minimum enclosed rectangle of the construction smearing track, the length L according to the smearing track is to described
Minimum enclosed rectangle is extended to form mark zone, including:
Transverse and longitudinal coordinate value according to all pixels on the smearing track determines maximum abscissa value x_max, minimum horizontal stroke
Coordinate value x_min, maximum ordinate value y_max and minimum ordinate value y_min;
The minimum enclosed rectangle of the smearing track is constructed, four coordinates on summit of the minimum enclosed rectangle are successively
It is:(x_min, y_max), (x_min, y_min), (x_max, y_max), (x_max, y_min);
Length L according to the smearing track is extended to form mark zone to the minimum enclosed rectangle, the mark
Four apex coordinates in area are successively:(x_min-a, y_max+a), (x_min-a, y_min-a), (x_max+a, y_max+a),
(x_max+a,y_min-a);Wherein, a is adjustability coefficients, and a=L*b, b are constants;B can be empirical value.
In one embodiment, the first neighboring region of the smearing track is defined as mark zone, by the smearing
Second neighboring region of track is defined as area-of-interest, including:
The edge gradient information of original image is obtained, selected part pixel is used as kind from the pixel of the smearing track
Son, each seed forms a sub-regions to outgrowth respectively according to the edge gradient information of image, and all of seed is outside
Grow after the subregion for being formed merges and form an extended area, using the extended area as mark zone;
Features of shape according to the smearing track applies mechanically geometry template, according to the geometry template generation bag
Area-of-interest containing the smearing track, it is if the area-of-interest does not include the mark zone, the sense is emerging
Interesting region is extended to comprising the mark zone;Or expansion process formation area-of-interest is carried out to the mark zone.
Wherein, the geometry template includes:Rectangle template, polygon splat or oval template;
Wherein, described each seed forms a sub-regions according to the edge gradient information of image to outgrowth, including
Following step A-F is performed to each seed:
Step A:It is that the starting point sets energy value using the seed as starting point, the starting point is labeled as
Moving point;
Step B:Judge the current pixel with the presence or absence of labeled as moving point, be then to perform step C, otherwise perform step F;
Step C:To any one labeled as the pixel A of moving point, four, the upper and lower, left and right phase of the pixel A is judged
It is then to perform step D with the presence or absence of the pixel B not checked in adjoint point, otherwise performs step E;
Step D:The pixel B do not checked any one, if the pixel B meets growth conditions, by the picture
Plain B is labeled as new moving point, and the energy value that the energy value of the pixel B is set to pixel A is subtracted the edge gradient of pixel B
The difference being worth to, return to step C;If the pixel B is unsatisfactory for growth conditions, return to step C;Wherein, the growth
Condition is:Energy value of the edge gradient value of the pixel B less than or equal to the pixel A;
Step E:Judge that the pixel A has completed growth, the moving point mark of the pixel A is removed, by the pixel A
It is included into prospect point set, return to step B;
Step F:The subregion being made up of all pixels in the prospect point set is side of the seed according to image
The subregion that edge gradient information is formed to outgrowth.
In the present embodiment, first neighboring region for sketching the contours track is defined as mark zone, track is sketched the contours by described
The second neighboring region be defined as area-of-interest, including:
It is described sketch the contours track it is closed when, the region closed track of sketching the contours is defined as mark zone;Institute
State sketch the contours track it is not closed when, Seal treatment is done to the track of sketching the contours, if closed successfully, by closing after sketch the contours track
The region closed is defined as mark zone, if closing is unsuccessful, expansion process is carried out to the track of sketching the contours, will be described swollen
Region after swollen is defined as mark zone;
Geometry template is applied mechanically according to the features of shape for sketching the contours track, according to the geometry template generation bag
It is if the area-of-interest does not include the mark zone, the sense is emerging containing the area-of-interest for sketching the contours track
Interesting region is extended to comprising the mark zone;Or expansion process formation area-of-interest is carried out to the mark zone;
Wherein, Seal treatment is done to the track of sketching the contours, including:
If described the distance between beginning and end of track of sketching the contours is more than or equal to threshold value, obtain described in sketch the contours rail
The edge line in region between the beginning and end of mark, the edge line is superimposed with by the track of sketching the contours;If the hook
Le track is superimposed with the edge line can form closed area, then judge to close successfully, if the track of sketching the contours is superimposed with
The edge line can not form closed area, then judge closing failure;
If described the distance between beginning and end of track of sketching the contours is less than threshold value, between the beginning and end
Line segment connection is done, the closure of track is sketched the contours in completion.
In the present embodiment, the segmentation ginseng of each pixel on the mask artwork is determined according to the depth map and mask artwork
Number, including:
Gauss hybrid models GMM calculating is carried out by EM methods, the EM methods include E steps and M steps;Iteration runs E
Step and M steps, the iterative process is stopped after iteration operation reaches the condition of convergence;Last time is performed into M steps
The classification of the pixel for obtaining is defined as the classification of the pixel, and the pixel that last time performs the acquisition of M steps is returned
Belong to the most probable value P of certain clustermaxIt is defined as the area item partitioning parameters of the pixel, the area item partitioning parameters are
The pixel judgement is foreground point or the probability of background dot;
Wherein, the E steps and M steps include following treatment respectively:
E steps:The position relationship between depth value and pixel according to each pixel on the mask artwork is by clustering
Same type of pixel is polymerized to one or more clusters, the GMM model of each cluster is determined;Wherein, the classification of pixel includes prospect
Point or background dot;The classification of cluster includes foreground point cluster or background dot cluster;
M steps:GMM model according to each cluster determines that each pixel belongs to the probability of each cluster, to any one
Pixel, the most probable value P according to the pixelmaxCorresponding cluster determines the classification of the pixel;
Alternatively, the condition of convergence of the EM methods can be:Stop iteration when the number of times of iteration operation reaches threshold value
Process;
In the present embodiment, the segmentation ginseng of each pixel on the mask artwork is determined according to the depth map and mask artwork
Number, also includes:Depth value difference according to the pixel and adjacent pixel determines the border item partitioning parameters of the pixel;
Wherein, to any one pixel, by the depth value absolute difference between the pixel and each neighbouring pixel
Added up, then to adding up and being normalized, cumulative and as the pixel the border item point after being normalized
Cut parameter;
Alternatively, each pixel neighbouring with pixel can be 8 pixels on the pixel periphery;
In the present embodiment, non-directed graph is built, the partitioning parameters of each pixel in the mask artwork is mapped to institute
In stating non-directed graph, including:
A non-directed graph is built, two hanging point Q are set outside the plane of the non-directed graph0And Q1, the hanging point Q0For
Virtual foreground point, the hanging point Q1It is virtual background point;Each is set up on the mask artwork in the plane of the non-directed graph
The mapping point of pixel, in mapping point and the hanging point Q of foreground point0Between set up line, mapping point and institute in background dot
State hanging point Q1Between set up line;
To any one pixel P in mask artworki, by the pixel PiArea item partitioning parameters as the non-directed graph
Middle mapping point Pi' weights, by the pixel PiBorder item partitioning parameters as mapping point P in the non-directed graphi' with it is hanging
Point Q0Or Q1Between line weights;
In the present embodiment, it is described the non-directed graph is processed according to minimal cut-maximum-flow algorithm, obtain fine point
Mask artwork after cutting, including:
Iteration performs following step C and D steps, and iterative process is stopped after iteration operation reaches the condition of convergence, will be described
Each pixel in prospect point set Q is used as the foreground point in the mask artwork after fine segmentation;
Wherein, step C and D steps include following treatment respectively:
Step C:One part of pixel in non-directed graph is divided into and vacantly point Q0Similar foreground point, by being divided into prospect
The pixel of point constitutes prospect point set Q;
D steps:The weights summation of the prospect point set Q is calculated, the weights summation is institute in the prospect point set Q
Have foreground point weights and, along with all foreground points in the prospect point set Q and hanging point Q0Between line weights and;
Wherein, the condition of convergence be prospect point set Q weights summation less than threshold value and change tend towards stability;
Alternatively, the original image that the shooting destination object is obtained can be cromogram, artwork master or infrared figure.
In one embodiment, before the input mask artwork of generation image segmentation algorithm, methods described also includes:
If currently detected smearing track is directed to the first time mark of the original image, will be currently detected
The mark zone determined of smearing track as final mark zone, by currently detected smearing track determine it is interested
Region is used as final area-of-interest;
If currently detected smearing track is not directed to the first time mark of the original image, current inspection is judged
Whether have between the mark zone that the smearing track that the mark zone and last time that the smearing track for measuring is determined are detected is determined
Occur simultaneously, if occuring simultaneously, then the smearing rail that the mark zone determined currently detected smearing track is detected with the last time
The mark zone that mark is determined is merged, using the mark zone after merging as final mark zone;If do not occured simultaneously, ought
Before the mark zone determined of the smearing track that detects as final mark zone;
If currently detected smearing track is not directed to the first time mark of the original image, current inspection is judged
The area-of-interest that area-of-interest that the smearing track for measuring is determined is determined with the last smearing track for detecting it
Between whether have common factor, if occuring simultaneously, then the area-of-interest determined currently detected smearings track and last inspection
The area-of-interest that the smearing track for measuring is determined is merged, using the area-of-interest after merging as final interested
Region;If do not occured simultaneously, the area-of-interest that currently detected smearing track is determined is used as final interested
Region.
Wherein, in addition to first time smears, the corresponding mark zone of smearing each time afterwards and area-of-interest all with it is upper
Once smear the mark zone determined of track and area-of-interest be compared so that foreground point in input mask artwork and
Background dot is more accurate, improves the quality of segmentation figure picture.
In correlation technique, the foreground point in the input mask artwork of image segmentation algorithm is user's hand labeled, original graph
As behind upper removing foreground point other pixel mark be have that foreground point mark is less and input mask artwork size compared with
It is big to cause image segmentation algorithm to be to distinguish foreground point and the iterations of background dot to increase, Riming time of algorithm problem long.
After using the method for the embodiment of the present invention, expand the input mask artwork of image segmentation algorithm automatically by way of generating mark zone
The prospect of middle mark is counted out, and image segmentation algorithm mark can be reduced to substitute view picture original image by generating area-of-interest
The background of note is counted out, and it is the iterations for distinguishing foreground point and background dot that can reduce image segmentation algorithm, is significantly decreased
The run time of image segmentation algorithm.On the other hand, the technical scheme of the embodiment of the present invention calculates partitioning parameters using depth map,
When the color character of the destination object to be extracted is not obvious, the effect of image segmentation can be improved.
As shown in figure 4, the embodiment of the present invention proposes a kind of device for realizing interactive image segmentation, including:
Pretreatment module 401, after detecting the smearing track on original image or sketching the contours track, by the smearing rail
Mark sketches the contours the first neighboring region of track and is defined as mark zone, by the smearing track or sketches the contours the second neighboring region of track
It is defined as area-of-interest, the area-of-interest includes the mark zone;Generate the input mask artwork of image segmentation algorithm:Will
All of pixel is used as the foreground point in the mask artwork in the mark zone, by the picture outside mark zone in the area-of-interest
Element is used as the background dot in the mask artwork;
Partitioning parameters computing module 402, for obtaining the depth map comprising destination object depth information, according to the depth
Figure and mask artwork determine the partitioning parameters of each pixel on the mask artwork, and the partitioning parameters are used to represent that pixel judgement is preceding
The probability and the pixel of sight spot or background dot and the depth value difference of adjacent pixel;
Mask artwork adjusting module 403, for building non-directed graph, by the partitioning parameters of each pixel in the mask artwork
It is mapped in the non-directed graph, the non-directed graph is processed according to minimal cut-maximum-flow algorithm, after obtains fine segmentation
Mask artwork;
Output module 404, for being partitioned into described fine point from the original image that the destination object is obtained is shot
The corresponding image in foreground point in mask artwork after cutting;
Described device can also include following features:
Wherein, it is two kinds of different mark modes to smear and sketch the contours;
Usually, it is the mark that carries out of interior zone in destination object to smear track, and it is along target pair to sketch the contours track
The mark that the exterior contour of elephant is carried out;
Wherein, mask artwork refers to that the part or all of pixel of a sub-picture has been carried out to be generated after prospect is distinguished with background
Mark figure, each pixel on the mask artwork is marked as foreground point or background dot;
In the present embodiment, described image partitioning algorithm is GrabCut algorithms.
In one embodiment, pretreatment module, in the following ways by the first adjoining of the smearing track
Region is defined as mark zone, and the second neighboring region of the smearing track is defined as into area-of-interest:
The minimum enclosed rectangle of the smearing track is constructed, the length L according to the smearing track is external to the minimum
Rectangle is extended to form mark zone;
It is emerging according to rectangle template sense of the generation comprising the smearing track for rectangle template is applied mechanically in the smearing track
Interesting region (Region of Interest, ROI), if the area-of-interest does not include the mark zone, will be described
Area-of-interest is extended to comprising the mark zone;Or expansion process formation area-of-interest is carried out to the mark zone.
Wherein, the minimum enclosed rectangle of the construction smearing track, the length L according to the smearing track is to described
Minimum enclosed rectangle is extended to form mark zone, including:
Transverse and longitudinal coordinate value according to all pixels on the smearing track determines maximum abscissa value x_max, minimum horizontal stroke
Coordinate value x_min, maximum ordinate value y_max and minimum ordinate value y_min;
The minimum enclosed rectangle of the smearing track is constructed, four coordinates on summit of the minimum enclosed rectangle are successively
It is:(x_min, y_max), (x_min, y_min), (x_max, y_max), (x_max, y_min);
Length L according to the smearing track is extended to form mark zone to the minimum enclosed rectangle, the mark
Four apex coordinates in area are successively:(x_min-a, y_max+a), (x_min-a, y_min-a), (x_max+a, y_max+a),
(x_max+a,y_min-a);Wherein, a is adjustability coefficients, and a=L*b, b are constants;B can be empirical value.
In one embodiment, the pretreatment module, in the following ways by the first of the smearing track
Neighboring region is defined as mark zone, and the second neighboring region of the smearing track is defined as into area-of-interest:
The edge gradient information of original image is obtained, selected part pixel is used as kind from the pixel of the smearing track
Son, each seed forms a sub-regions to outgrowth respectively according to the edge gradient information of image, and all of seed is outside
Grow after the subregion for being formed merges and form an extended area, using the extended area as mark zone;
Features of shape according to the smearing track applies mechanically geometry template, according to the geometry template generation bag
Area-of-interest containing the smearing track, it is if the area-of-interest does not include the mark zone, the sense is emerging
Interesting region is extended to comprising the mark zone;Or expansion process formation area-of-interest is carried out to the mark zone.
Wherein, the geometry template includes:Rectangle template, polygon splat or oval template;
Wherein, described each seed forms a sub-regions according to the edge gradient information of image to outgrowth, including
Following step A-F is performed to each seed:
Step A:It is that the starting point sets energy value using the seed as starting point, the starting point is labeled as
Moving point;
Step B:Judge the current pixel with the presence or absence of labeled as moving point, be then to perform step C, otherwise perform step F;
Step C:To any one labeled as the pixel A of moving point, four, the upper and lower, left and right phase of the pixel A is judged
It is then to perform step D with the presence or absence of the pixel B not checked in adjoint point, otherwise performs step E;
Step D:The pixel B do not checked any one, if the pixel B meets growth conditions, by the picture
Plain B is labeled as new moving point, and the energy value that the energy value of the pixel B is set to pixel A is subtracted the edge gradient of pixel B
The difference being worth to, return to step C;If the pixel B is unsatisfactory for growth conditions, return to step C;Wherein, the growth
Condition is:Energy value of the edge gradient value of the pixel B less than or equal to the pixel A;
Step E:Judge that the pixel A has completed growth, the moving point mark of the pixel A is removed, by the pixel A
It is included into prospect point set, return to step B;
Step F:The subregion being made up of all pixels in the prospect point set is side of the seed according to image
The subregion that edge gradient information is formed to outgrowth.
In the present embodiment, pretreatment module, in the following ways by first neighboring region for sketching the contours track
It is defined as mark zone, second neighboring region for sketching the contours track is defined as area-of-interest:
It is described sketch the contours track it is closed when, the region closed track of sketching the contours is defined as mark zone;Institute
State sketch the contours track it is not closed when, Seal treatment is done to the track of sketching the contours, if closed successfully, by closing after sketch the contours track
The region closed is defined as mark zone, if closing is unsuccessful, expansion process is carried out to the track of sketching the contours, will be described swollen
Region after swollen is defined as mark zone;
Geometry template is applied mechanically according to the features of shape for sketching the contours track, according to the geometry template generation bag
It is if the area-of-interest does not include the mark zone, the sense is emerging containing the area-of-interest for sketching the contours track
Interesting region is extended to comprising the mark zone;Or expansion process formation area-of-interest is carried out to the mark zone;
Wherein, Seal treatment is done to the track of sketching the contours, including:
If described the distance between beginning and end of track of sketching the contours is more than or equal to threshold value, obtain described in sketch the contours rail
The edge line in region between the beginning and end of mark, the edge line is superimposed with by the track of sketching the contours;If the hook
Le track is superimposed with the edge line can form closed area, then judge to close successfully, if the track of sketching the contours is superimposed with
The edge line can not form closed area, then judge closing failure;
If described the distance between beginning and end of track of sketching the contours is less than threshold value, between the beginning and end
Line segment connection is done, the closure of track is sketched the contours in completion.
In the present embodiment, partitioning parameters computing module, in the following ways according to the depth map and mask artwork
Determine the partitioning parameters of each pixel on the mask artwork:
Gauss hybrid models GMM calculating is carried out by EM methods, the EM methods include E steps and M steps;Iteration runs E
Step and M steps, the iterative process is stopped after iteration operation reaches the condition of convergence;Last time is performed into M steps
The classification of the pixel for obtaining is defined as the classification of the pixel, and the pixel that last time performs the acquisition of M steps is returned
Belong to the most probable value P of certain clustermaxIt is defined as the area item partitioning parameters of the pixel, the area item partitioning parameters are
The pixel judgement is foreground point or the probability of background dot;
Wherein, the E steps and M steps include following treatment respectively:
E steps:The position relationship between depth value and pixel according to each pixel on the mask artwork is by clustering
Same type of pixel is polymerized to one or more clusters, the GMM model of each cluster is determined;Wherein, the classification of pixel includes prospect
Point or background dot;The classification of cluster includes foreground point cluster or background dot cluster;
M steps:GMM model according to each cluster determines that each pixel belongs to the probability of each cluster, to any one
Pixel, the most probable value P according to the pixelmaxCorresponding cluster determines the classification of the pixel;
Alternatively, the condition of convergence of the EM methods can be:Stop iteration when the number of times of iteration operation reaches threshold value
Process;
In the present embodiment, partitioning parameters computing module, is additionally operable in the following ways according to the depth map and mask
Figure determines the partitioning parameters of each pixel on the mask artwork:Depth value difference according to the pixel and adjacent pixel determines institute
State the border item partitioning parameters of pixel;
Wherein, to any one pixel, by the depth value absolute difference between the pixel and each neighbouring pixel
Added up, then to adding up and being normalized, cumulative and as the pixel the border item point after being normalized
Cut parameter;
Alternatively, each pixel neighbouring with pixel can be 8 pixels on the pixel periphery.
In the present embodiment, mask artwork adjusting module, for building non-directed graph in the following ways and by the mask artwork
In the partitioning parameters of each pixel be mapped in the non-directed graph:
A non-directed graph is built, two hanging point Q are set outside the plane of the non-directed graph0And Q1, the hanging point Q0For
Virtual foreground point, the hanging point Q1It is virtual background point;Each is set up on the mask artwork in the plane of the non-directed graph
The mapping point of pixel, in mapping point and the hanging point Q of foreground point0Between set up line, mapping point and institute in background dot
State hanging point Q1Between set up line;
To any one pixel P in mask artworki, by the pixel PiArea item partitioning parameters as the non-directed graph
Middle mapping point Pi' weights, by the pixel PiBorder item partitioning parameters as mapping point P in the non-directed graphi' with it is hanging
Point Q0Or Q1Between line weights.
In the present embodiment, mask artwork adjusting module, in the following ways according to minimal cut-maximum-flow algorithm to institute
State non-directed graph to be processed, obtain the mask artwork after fine segmentation:
Iteration performs following step C and D steps, and iterative process is stopped after iteration operation reaches the condition of convergence, will be described
Each pixel in prospect point set Q is used as the foreground point in the mask artwork after fine segmentation;
Wherein, step C and D steps include following treatment respectively:
Step C:One part of pixel in non-directed graph is divided into and vacantly point Q0Similar foreground point, by being divided into prospect
The pixel of point constitutes prospect point set Q;
D steps:The weights summation of the prospect point set Q is calculated, the weights summation is institute in the prospect point set Q
Have foreground point weights and, along with all foreground points in the prospect point set Q and hanging point Q0Between line weights and;
Wherein, the condition of convergence be prospect point set Q weights summation less than threshold value and change tend towards stability.
Alternatively, the original image that the shooting destination object is obtained is cromogram, artwork master or infrared figure.
In one embodiment, pretreatment module, was additionally operable to before the input mask artwork of generation image segmentation algorithm,
Mark zone and area-of-interest are adjusted:
If currently detected smearing track is directed to the first time mark of the original image, will be currently detected
The mark zone determined of smearing track as final mark zone, by currently detected smearing track determine it is interested
Region is used as final area-of-interest;
If currently detected smearing track is not directed to the first time mark of the original image, current inspection is judged
Whether have between the mark zone that the smearing track that the mark zone and last time that the smearing track for measuring is determined are detected is determined
Occur simultaneously, if occuring simultaneously, then the smearing rail that the mark zone determined currently detected smearing track is detected with the last time
The mark zone that mark is determined is merged, using the mark zone after merging as final mark zone;If do not occured simultaneously, ought
Before the mark zone determined of the smearing track that detects as final mark zone;
If currently detected smearing track is not directed to the first time mark of the original image, current inspection is judged
The area-of-interest that area-of-interest that the smearing track for measuring is determined is determined with the last smearing track for detecting it
Between whether have common factor, if occuring simultaneously, then the area-of-interest determined currently detected smearings track and last inspection
The area-of-interest that the smearing track for measuring is determined is merged, using the area-of-interest after merging as final interested
Region;If do not occured simultaneously, the area-of-interest that currently detected smearing track is determined is used as final interested
Region.
The method of the embodiment of the present invention, expands the input mask of image segmentation algorithm automatically by way of generating mark zone
The prospect marked in figure is counted out, and image segmentation algorithm can be reduced to substitute view picture original image by generating area-of-interest
The background of mark is counted out, and it is the iterations for distinguishing foreground point and background dot that can reduce image segmentation algorithm, is significantly subtracted
The run time of few image segmentation algorithm.On the other hand, the technical scheme of the embodiment of the present invention calculates segmentation ginseng using depth map
Number, when the color character of the destination object to be extracted is not obvious, can improve the effect of image segmentation.
Embodiment 3
The embodiment of the present invention also provides a kind of terminal, and the terminal includes the above-mentioned device for realizing interactive image segmentation.
Using example 1
User's destination object interested in oneself on the original image is smeared, using the preprocess method of this paper
Image is pre-processed, be may comprise steps of:
Step S501, detects user's selection destination object is marked by the way of smearing;
Such as, two buttons for being used to mark are provided on interface, one is " smearing ", and one is " sketching the contours ", if user
" smearing " button is clicked, has then been pre-processed to smearing track.
Step S502, detects user and is smeared on the original image;
Such as, as shown in Fig. 5-a, user is smeared on the original image, and destination object is " stapler ", wherein,
Original image is cromogram;
Step S503, constructs the minimum enclosed rectangle of the smearing track, and the length L according to the smearing track is to described
Minimum enclosed rectangle is extended to form mark zone;For rectangle template is applied mechanically in the smearing track, given birth to according to the rectangle template
Into the area-of-interest comprising the smearing track;If the area-of-interest does not include the mark zone, will be described
Area-of-interest is extended to comprising the mark zone;
Wherein, the minimum enclosed rectangle of the construction smearing track, the length L according to the smearing track is to described
Minimum enclosed rectangle is extended to form mark zone, including:
Transverse and longitudinal coordinate value according to all pixels on the smearing track determines maximum abscissa value x_max, minimum horizontal stroke
Coordinate value x_min, maximum ordinate value y_max and minimum ordinate value y_min;
The minimum enclosed rectangle of the smearing track is constructed, four coordinates on summit of the minimum enclosed rectangle are successively
It is:(x_min, y_max), (x_min, y_min), (x_max, y_max), (x_max, y_min);
Length L according to the smearing track is extended to form mark zone to the minimum enclosed rectangle, the mark
Four apex coordinates in area are successively:(x_min-a, y_max+a), (x_min-a, y_max-a), (x_max+a, y_max+a),
(x_max+a,y_min-a);Wherein, a is adjustability coefficients, and a=L*b, b are constants;
Such as, as shown in Fig. 5-b, it is first rectangle region comprising the smearing track to smear the corresponding mark zone in track
Domain, area-of-interest is second rectangular area comprising the mark zone, and the frame of the area-of-interest is represented by dashed line.
Step S504, the input mask artwork of generation image segmentation algorithm (GrabCut algorithms):To own in the mark zone
Pixel as the foreground point in mask artwork, using the pixel outside mark zone in the area-of-interest as the background in mask artwork
Point.
Such as, as shown in Fig. 5-c, the rectangle darker regions comprising smearing track are mark zones, and the mark zone is mask
Foreground point block in figure (input mask artwork);The edge of area-of-interest is indicated with dotted line frame, is removed in the dotted line frame
The part for going to mark zone (foreground point block) is the background dot block in mask artwork.
Step S505, obtains the depth map comprising destination object depth information;
As shown in Fig. 5-d, depth map is a figure comprising depth information, and the size with cromogram is consistent;Depth
In figure, farther out, the shallower part shooting distance of color is nearer for the deeper part shooting distance of color.
Step S506, the partitioning parameters of each pixel on the mask artwork, institute are determined according to the depth map and mask artwork
It is the probability and the pixel of foreground point or background dot and the depth of adjacent pixel that partitioning parameters are stated for representing pixel judgement
Value difference is different;
Wherein, to any one pixel on the mask artwork, the partitioning parameters of the pixel include area item partitioning parameters
With border item partitioning parameters;It is general for foreground point or background dot that the area item partitioning parameters of the pixel refer to pixel judgement
Rate;The border item partitioning parameters of the pixel refer to the depth value difference of the pixel and adjacent pixel;
Wherein, the area item partitioning parameters of each pixel on the mask artwork are determined in the following ways:
Gauss hybrid models GMM calculating is carried out by EM methods, the EM methods include E steps and M steps;Iteration runs E
Step and M steps, the iterative process is stopped after the number of times of iteration operation reaches predetermined number of times;Last time is performed
The classification of the pixel that M steps are obtained is defined as the classification of the pixel, and last time is performed into the picture that M steps are obtained
Element belongs to the most probable value P of certain clustermaxIt is defined as the area item partitioning parameters of the pixel, the area item segmentation ginseng
Number is that the pixel judgement is foreground point or the probability of background dot;
Wherein, the E steps and M steps include following treatment respectively:
E steps:The position relationship between depth value and pixel according to each pixel on the mask artwork is by clustering
Same type of pixel is polymerized to one or more clusters, the GMM model of each cluster is determined;Wherein, the classification of pixel includes prospect
Point or background dot;The classification of cluster includes foreground point cluster or background dot cluster;
M steps:GMM model according to each cluster determines that each pixel belongs to the probability of each cluster, to any one
Pixel, the most probable value P according to the pixelmaxCorresponding cluster determines the classification of the pixel;
Wherein, the border item partitioning parameters of each pixel on the mask artwork are determined in the following ways:To any one
Pixel, the depth value absolute difference between each pixel in the pixel and 8 neighbouring neighborhoods is added up, then to tired
Plus and be normalized, adding up and as the border item partitioning parameters of the pixel after being normalized;
Step S507, builds non-directed graph, and the partitioning parameters of each pixel in the mask artwork are mapped into the nothing
In to figure, the non-directed graph is processed according to minimal cut-maximum-flow algorithm, obtain the mask artwork after fine segmentation;
Wherein, non-directed graph sets two hanging point Q as shown in Fig. 5-e outside the plane of the non-directed graph0And Q1, it is described outstanding
Null point Q0It is virtual foreground point, the hanging point Q1It is virtual background point;The mask artwork is set up in the plane of the non-directed graph
The mapping point of upper each pixel, in mapping point and the hanging point Q of foreground point0Between set up line, in the mapping of background dot
Point and the hanging point Q1Between set up line;
To any one pixel P in mask artworki, by the pixel PiArea item partitioning parameters as the non-directed graph
Middle mapping point Pi' weights, by the pixel PiBorder item partitioning parameters as mapping point P in the non-directed graphi' with it is hanging
Point Q0Or Q1Between line weights.
Wherein, the non-directed graph is processed according to minimal cut-maximum-flow algorithm in the following ways, obtains fine point
Mask artwork after cutting:
Iteration performs following step C and D steps, and iterative process is stopped after iteration operation reaches the condition of convergence, will be described
Each pixel in prospect point set Q is used as the foreground point in the mask artwork after fine segmentation;
Wherein, step C and D steps include following treatment respectively:
Step C:One part of pixel in non-directed graph is divided into and vacantly point Q0Similar foreground point, by being divided into prospect
The pixel of point constitutes prospect point set Q;
D steps:The weights summation of the prospect point set Q is calculated, the weights summation is institute in the prospect point set Q
Have foreground point weights and, along with all foreground points in the prospect point set Q and hanging point Q0Between line weights and;
Wherein, the condition of convergence be prospect point set Q weights summation less than threshold value and change tend towards stability;
Wherein, the mask artwork after fine segmentation is as shown in Fig. 5-f, comprising smearing track and with target image (stapler)
The irregular darker regions of profile are mark zones, and the mark zone is the foreground point block in the mask artwork after fine segmentation;Sense
The part that mark zone (foreground point block) is removed in interest region is the background dot block in mask artwork.Mask after fine segmentation
Figure is compared with initial mask artwork, and the border between foreground point block and background dot block is more careful.
Step S508, is partitioned into covering after the fine segmentation from the original image that the destination object is obtained is shot
The corresponding image in foreground point in mould figure;
Wherein, destination object is partitioned into from the original cromogram according to the mask artwork after fine segmentation, is partitioned into
" stapler " image as shown in Fig. 5-g.
This application example is defined as mark zone by will smear the first rectangle neighboring region that track is extended to, by mark zone
In all pixels be labeled as foreground point, can expand automatically image segmentation algorithm input mask artwork in mark prospect points
Mesh, the second rectangle neighboring region being extended to by will smear track is defined as area-of-interest, will be marked in area-of-interest
Pixel outside area is labeled as background dot, can reduce the background marked in the input mask artwork of image segmentation algorithm and count out, on
It is the iterations for distinguishing foreground point and background dot to state treatment and can reduce image segmentation algorithm, significantly decreases image segmentation calculation
The run time of method.On the other hand, the technical scheme of this application example calculates partitioning parameters using depth map, when the mesh to be extracted
When the color character of mark object is not obvious, the effect of image segmentation can be improved.
Using example 2
User's destination object interested in oneself on the original image is smeared, using the preprocess method of this paper
Image is pre-processed, be may comprise steps of:
Step S601, detects user's selection destination object is marked by the way of smearing;
Such as, two buttons for being used to mark are provided on interface, one is " smearing ", and one is " sketching the contours ", if user
" smearing " button is clicked, has then been pre-processed to smearing track.
Step S602, detects user and is smeared on the original image;
Such as, as shown in Fig. 5-a, user is smeared on the original image, and destination object is " stapler ";
Step S603, obtains the edge gradient information of original image, the selected part picture from the pixel of the smearing track
Element is used as seed, and each seed forms a sub-regions to outgrowth respectively according to the edge gradient information of image, all of
Seed forms an extended area after merging to the subregion that outgrowth is formed, using the extended area as mark zone;According to
The features of shape of the smearing track applies mechanically geometry template, and the smearing rail is included according to the geometry template generation
The area-of-interest of mark;If the area-of-interest does not include the mark zone, the area-of-interest is extended to
Comprising the mark zone;
Wherein, as shown in Fig. 6-a, mark zone can be grown by the smearing track using seed growth method;Apply mechanically
Rectangle template area-of-interest of the generation comprising the smearing track;
Wherein, described each seed forms a sub-regions according to the edge gradient information of image to outgrowth, including
Following step A-F is performed to each seed:
Step A:It is that the starting point sets energy value using the seed as starting point, the starting point is labeled as
Moving point;
Step B:Judge the current pixel with the presence or absence of labeled as moving point, be then to perform step C, otherwise perform step F;
Step C:To any one labeled as the pixel A of moving point, four, the upper and lower, left and right phase of the pixel A is judged
It is then to perform step D with the presence or absence of the pixel B not checked in adjoint point, otherwise performs step E;
Step D:The pixel B do not checked any one, if the pixel B meets growth conditions, by the picture
Plain B is labeled as new moving point, and the energy value that the energy value of the pixel B is set to pixel A is subtracted the edge gradient of pixel B
The difference being worth to, return to step C;If the pixel B is unsatisfactory for growth conditions, return to step C;Wherein, the growth
Condition is:Energy value of the edge gradient value of the pixel B less than or equal to the pixel A;
Step E:Judge that the pixel A has completed growth, the moving point mark of the pixel A is removed, by the pixel A
It is included into prospect point set, return to step B;
Step F:The subregion being made up of all pixels in the prospect point set is side of the seed according to image
The subregion that edge gradient information is formed to outgrowth;
Growing state to a seed does description below:
As shown in accompanying drawing 6-b-1, on features of edge gradient maps, " the edge gradient value that precipitous 2 " represent pixel is 2, Grad
It is larger, belong to precipitous class, " the edge gradient value that gentle 1 " represents pixel is 1, and Grad is smaller, belongs to gentle class.
Seed is marked as moving point as starting point O, and moving point is represented with " * ";The energy value of starting point O can be with
It is set as 4;The difference that the energy value of starting point O is set can have influence on the size of growth district, and energy value is bigger, the area of growth
Domain is bigger.
As shown in accompanying drawing 6-b-2, when starting point O prepares to grow to the right, the edge gradient value of right-adjacent pixel is 1,
Meet growth conditions, so, first pixel on the right of starting point O is labeled as new moving point, the new moving point
Energy value is that the energy value (4) of starting point O subtracts the difference (3) that the edge gradient value (1) of the adjacent pixel is obtained.With same
Method, the adjacent pixel of starting point O upper and lower, left and right four direction is checked one by one, meets the adjacent picture of growth conditions
Element turns into new moving point, and four starting point O adjacent pixels are satisfied by growth conditions, are marked as new moving point.Rising
After initial point O completes to check, the moving point mark of the starting point O is removed.
To the moving point of each new mark, using the growing method same with starting point O, can be to the new work of external expansion
Moving point.All moving points stop growing after subregion schematic diagram as shown in accompanying drawing 6-b-3.
Step S604, the input mask artwork of generation image segmentation algorithm (GrabCut algorithms):To own in the mark zone
Pixel as the foreground point in mask artwork, using the pixel outside mark zone in the area-of-interest as the background in mask artwork
Point.
Such as, as shown in Fig. 6-c, the irregular darker regions comprising smearing track are mark zones, and the mark zone is to cover
Foreground point block in mould figure (input mask artwork);The edge of area-of-interest is indicated with dotted line frame, in the dotted line frame
The part for removing mark zone (foreground point block) is the background dot block in mask artwork.
Step S605, obtains the depth map comprising destination object depth information;
Wherein, depth map is a figure comprising depth information, and the size with cromogram is consistent;In depth map, face
Farther out, the shallower part shooting distance of color is nearer for the deeper part shooting distance of color.
Step S606, the partitioning parameters of each pixel on the mask artwork, institute are determined according to the depth map and mask artwork
It is the probability and the pixel of foreground point or background dot and the depth of adjacent pixel that partitioning parameters are stated for representing pixel judgement
Value difference is different;
Wherein, the step of method of the partitioning parameters of each pixel is with application example 1 on the mask artwork is specifically calculated
Correlation technique described in S506 is identical;
Step S607, builds non-directed graph, and the partitioning parameters of each pixel in the mask artwork are mapped into the nothing
In to figure, the non-directed graph is processed according to minimal cut-maximum-flow algorithm, obtain the mask artwork after fine segmentation;
Wherein, it is specific to build non-directed graph and partitioning parameters are mapped to the method in the non-directed graph, and according to most
It is small to cut the-method that is processed the non-directed graph of maximum-flow algorithm, related side the step of with application example 1 described in S507
Method is identical;
Wherein, the mask artwork after fine segmentation is compared with initial mask artwork, the foreground point of the mask artwork after fine segmentation
Border between background dot is clearer, careful.
Step S608, is partitioned into covering after the fine segmentation from the original image that the destination object is obtained is shot
The corresponding image in foreground point in mould figure;
Wherein, destination object can be partitioned into from the original cromogram according to the mask artwork after fine segmentation " to order
Book machine ".
This application example by will smear track as seed growth into neighboring region be defined as mark zone, by mark zone
In all pixels be labeled as foreground point, can expand automatically image segmentation algorithm input mask artwork in mark prospect points
Mesh, is defined as area-of-interest, by the pixel outside mark zone in area-of-interest by by the rectangle neighboring region for smearing track
Labeled as background dot, the background marked in the input mask artwork of image segmentation algorithm can be reduced and counted out, above-mentioned treatment can subtract
Few image segmentation algorithm is the iterations for distinguishing foreground point and background dot, when significantly decreasing the operation of image segmentation algorithm
Between.On the other hand, the technical scheme of this application example calculates partitioning parameters using depth map, when the color of the destination object to be extracted
When color feature is not obvious, the effect of image segmentation can be improved.
Using example 3
User's destination object interested in oneself on the original image is sketched the contours, using the preprocess method of this paper
Image is pre-processed, be may comprise steps of:
Step S701, detects user's selection destination object is marked by the way of sketching the contours;
Such as, two buttons for being used to mark are provided on interface, one is " smearing ", and one is " sketching the contours ", if user
" sketching the contours " button is clicked, has then been pre-processed to sketching the contours track.
Step S702, detects user and is sketched the contours on the original image;
Such as, as shown in Fig. 7-a, user is sketched the contours on the original image, and destination object is " stapler ";
Step S703, it is described sketch the contours track it is closed when, the region closed track of sketching the contours is defined as mark
Note area;It is described sketch the contours track it is not closed when, Seal treatment is done to the track of sketching the contours, if closed successfully, will closing after
The region closed track of sketching the contours be defined as mark zone, if closing is unsuccessful, the track of sketching the contours is carried out at expansion
Reason, mark zone is defined as by the region after the expansion;Geometry template is applied mechanically according to the features of shape for sketching the contours track,
The area-of-interest of track is sketched the contours comprising described according to the geometry template generation;If the area-of-interest is not wrapped
Containing the mark zone, then the area-of-interest is extended to comprising the mark zone;
Wherein, as shown in Fig. 7-b, if described the distance between beginning and end of track of sketching the contours is less than threshold value,
Line segment connection is done between the beginning and end, the closure of track is sketched the contours in completion, by the closure after track of sketching the contours closed
Region be defined as mark zone;Apply mechanically rectangle template generation and include the area-of-interest for sketching the contours track;
Step S704, the input mask artwork of generation image segmentation algorithm (GrabCut algorithms):To own in the mark zone
Pixel as GrabCut algorithms input mask artwork in foreground point, by the pixel mark outside mark zone in the area-of-interest
It is designated as the background dot in GrabCut algorithms input mask artwork;
Such as, as shown in Fig. 7-c, sketch the contours track be superimposed with line segment closing irregular darker regions be mark zone, it is described
Mark zone is the foreground point block in mask artwork (input mask artwork);The edge of area-of-interest is indicated with dotted line frame, institute
The part for stating removing mark zone (foreground point block) in dotted line frame is the background dot block in mask artwork;
Step S705, obtains the depth map comprising destination object depth information;
Wherein, depth map is a figure comprising depth information, and the size with cromogram is consistent;In depth map, face
Farther out, the shallower part shooting distance of color is nearer for the deeper part shooting distance of color.
Step S706, the partitioning parameters of each pixel on the mask artwork, institute are determined according to the depth map and mask artwork
It is the probability and the pixel of foreground point or background dot and the depth of adjacent pixel that partitioning parameters are stated for representing pixel judgement
Value difference is different;
Wherein, the step of method of the partitioning parameters of each pixel is with application example 1 on the mask artwork is specifically calculated
Correlation technique described in S506 is identical;
Step S707, builds non-directed graph, and the partitioning parameters of each pixel in the mask artwork are mapped into the nothing
In to figure, the non-directed graph is processed according to minimal cut-maximum-flow algorithm, obtain the mask artwork after fine segmentation;
Wherein, it is specific to build non-directed graph and partitioning parameters are mapped to the method in the non-directed graph, and according to most
It is small to cut the-method that is processed the non-directed graph of maximum-flow algorithm, related side the step of with application example 1 described in S507
Method is identical;
Wherein, the mask artwork after fine segmentation is compared with initial mask artwork, the foreground point of the mask artwork after fine segmentation
Border between background dot is clearer, careful.
Step S708, is partitioned into covering after the fine segmentation from the original image that the destination object is obtained is shot
The corresponding image in foreground point in mould figure;
Wherein, destination object can be partitioned into from the original cromogram according to the mask artwork after fine segmentation " to order
Book machine ".
This application example is defined as mark zone by will sketch the contours the region that track is closed, by all pixels mark in mark zone
Foreground point is designated as, can automatically be expanded the prospect marked in the input mask artwork of image segmentation algorithm and be counted out, by that will sketch the contours
The rectangle neighboring region of track is defined as area-of-interest, and the pixel outside mark zone in area-of-interest is labeled as into background dot,
The background marked in the input mask artwork of image segmentation algorithm can be reduced to count out, image segmentation can be reduced by above-mentioned treatment
Algorithm is the iterations for distinguishing foreground point and background dot, significantly decreases the run time of image segmentation algorithm.The opposing party
Face, the technical scheme of this application example calculates partitioning parameters using depth map, when the destination object to be extracted color character simultaneously
When unobvious, the effect of image segmentation can be improved.
It should be noted that herein, term " including ", "comprising" or its any other variant be intended to non-row
His property is included, so that process, method, article or device including a series of key elements not only include those key elements, and
And also include other key elements being not expressly set out, or also include for this process, method, article or device institute are intrinsic
Key element.In the absence of more restrictions, the key element limited by sentence "including a ...", it is not excluded that including institute
Also there is other identical element in process, method, article or the device of stating key element.
The embodiments of the present invention are for illustration only, and the quality of embodiment is not represented.
Through the above description of the embodiments, those skilled in the art can be understood that above-described embodiment side
Method can add the mode of required general hardware platform to realize by software, naturally it is also possible to by hardware, but in many cases
The former is more preferably implementation method.Based on such understanding, technical scheme is substantially done to prior art in other words
The part for going out contribution can be embodied in the form of software product, and the computer software product storage is in a storage medium
In (such as ROM/RAM, magnetic disc, CD), including some instructions are used to so that a station terminal equipment (can be mobile phone, computer, clothes
Business device, air-conditioner, or network equipment etc.) perform method described in each embodiment of the invention.
The preferred embodiments of the present invention are these are only, the scope of the claims of the invention is not thereby limited, it is every to utilize this hair
Equivalent structure or equivalent flow conversion that bright specification and accompanying drawing content are made, or directly or indirectly it is used in other related skills
Art field, is included within the scope of the present invention.
Claims (10)
1. a kind of method for realizing interactive image segmentation, including:
After detecting the smearing track on original image or sketching the contours track, by the smearing track or sketch the contours track first adjoining
Region is defined as mark zone, and the smearing track or the second neighboring region for sketching the contours track are defined as into area-of-interest, described
Area-of-interest includes the mark zone;Generate the input mask artwork of image segmentation algorithm:By all of picture in the mark zone
Element as the foreground point in the mask artwork, using the pixel outside mark zone in the area-of-interest as in the mask artwork
Background dot;
The depth map comprising destination object depth information is obtained, is determined according to the depth map and mask artwork each on the mask artwork
The partitioning parameters of individual pixel, the partitioning parameters are used to represent that pixel judgement is foreground point or the probability and the picture of background dot
The depth value difference of element and adjacent pixel;
Non-directed graph is built, the partitioning parameters of each pixel in the mask artwork is mapped in the non-directed graph, according to most
It is small cut-maximum-flow algorithm processed the non-directed graph, obtains the mask artwork after fine segmentation;
Foreground point in the mask artwork after the fine segmentation is partitioned into from the original image that the destination object is obtained is shot
Corresponding image.
2. method according to claim 1, it is characterised in that:
The partitioning parameters of each pixel on the mask artwork are determined according to the depth map and mask artwork, including:Enter by EM methods
Row gauss hybrid models GMM is calculated, and the EM methods include E steps and M steps;Iteration runs E steps and M steps, described
Iteration operation stops the iterative process after reaching the condition of convergence;Last time is performed into dividing for the pixel that M steps are obtained
Class is defined as the classification of the pixel, and the pixel that last time performs the acquisition of M steps is belonged into the most general of certain cluster
Rate value PmaxIt is defined as the area item partitioning parameters of the pixel, the area item partitioning parameters are that the pixel judgement is prospect
The probability of point or background dot;
Wherein, the E steps and M steps include following treatment respectively:
E steps:The position relationship between depth value and pixel according to each pixel on the mask artwork will be same by clustering
The pixel of type is polymerized to one or more clusters, determines the GMM model of each cluster;Wherein, the classification of pixel include foreground point or
Background dot;The classification of cluster includes foreground point cluster or background dot cluster;
M steps:GMM model according to each cluster determines that each pixel belongs to the probability of each cluster, to any one pixel,
Most probable value P according to the pixelmaxCorresponding cluster determines the classification of the pixel.
3. method according to claim 2, it is characterised in that:
The partitioning parameters of each pixel on the mask artwork are determined according to the depth map and mask artwork, is also included:According to described
Pixel determines the border item partitioning parameters of the pixel with the depth value difference of adjacent pixel;
Wherein, to any one pixel, the depth value absolute difference between the pixel and each neighbouring pixel is carried out
It is cumulative, then to adding up and being normalized, cumulative and as the pixel the border item after being normalized splits ginseng
Number.
4. method according to claim 3, it is characterised in that:
Non-directed graph is built, the partitioning parameters of each pixel in the mask artwork are mapped in the non-directed graph, including:
A non-directed graph is built, two hanging point Q are set outside the plane of the non-directed graph0And Q1, the hanging point Q0For virtual
Foreground point, the hanging point Q1It is virtual background point;Each pixel on the mask artwork is set up in the plane of the non-directed graph
Mapping point, in mapping point and the hanging point Q of foreground point0Between set up line, it is outstanding with described in the mapping point of background dot
Null point Q1Between set up line;
To any one pixel P in mask artworki, by the pixel PiArea item partitioning parameters as being reflected in the non-directed graph
Exit point Pi' weights, by the pixel PiBorder item partitioning parameters as mapping point P in the non-directed graphi' with hanging point Q0
Or Q1Between line weights.
5. method according to claim 4, it is characterised in that:
It is described the non-directed graph is processed according to minimal cut-maximum-flow algorithm, obtain the mask artwork after fine segmentation, bag
Include:
Iteration performs following step C and D steps, stops iterative process after iteration operation reaches the condition of convergence, by the prospect
Each pixel in point set Q is used as the foreground point in the mask artwork after fine segmentation;
Wherein, step C and D steps include following treatment respectively:
Step C:One part of pixel in non-directed graph is divided into and vacantly point Q0Similar foreground point, by being divided into foreground point
Pixel constitutes prospect point set Q;
D steps:The weights summation of the prospect point set Q is calculated, before the weights summation is all in the prospect point set Q
The weights at sight spot and, along with all foreground points in the prospect point set Q and hanging point Q0Between line weights and;
Wherein, the condition of convergence be prospect point set Q weights summation less than threshold value and change tend towards stability.
6. a kind of device for realizing interactive image segmentation, including:
Pretreatment module, after detecting the smearing track on original image or sketching the contours track, by the smearing track or hook
The first neighboring region for strangling track is defined as mark zone, and the smearing track or the second neighboring region for sketching the contours track are defined as
Area-of-interest, the area-of-interest includes the mark zone;Generate the input mask artwork of image segmentation algorithm:By the mark
All of pixel is used as the foreground point in the mask artwork in note area, using the pixel outside mark zone in the area-of-interest as
Background dot in the mask artwork;
Partitioning parameters computing module, for obtaining the depth map comprising destination object depth information, according to the depth map and covering
Mould figure determines the partitioning parameters of each pixel on the mask artwork, the partitioning parameters be used to representing pixel judgement for foreground point or
The depth value difference of the probability of background dot and the pixel and adjacent pixel;
Mask artwork adjusting module, for building non-directed graph, the partitioning parameters of each pixel in the mask artwork is mapped to
In the non-directed graph, the non-directed graph is processed according to minimal cut-maximum-flow algorithm, obtain the mask after fine segmentation
Figure;
Output module, for being partitioned into covering after the fine segmentation from the original image that the destination object is obtained is shot
The corresponding image in foreground point in mould figure.
7. device according to claim 6, it is characterised in that:
Partitioning parameters computing module, determines each on the mask artwork in the following ways according to the depth map and mask artwork
The partitioning parameters of individual pixel:Gauss hybrid models GMM calculating is carried out by EM methods, the EM methods include E steps and M steps;
Iteration runs E steps and M steps, and the iterative process is stopped after iteration operation reaches the condition of convergence;Will last time
The classification for performing the pixel that M steps are obtained is defined as the classification of the pixel, and last time is performed into the institute that M steps are obtained
State the most probable value P that pixel belongs to certain clustermaxIt is defined as the area item partitioning parameters of the pixel, the area item point
It is that the pixel judgement is foreground point or the probability of background dot to cut parameter;
Wherein, the E steps and M steps include following treatment respectively:
E steps:The position relationship between depth value and pixel according to each pixel on the mask artwork will be same by clustering
The pixel of type is polymerized to one or more clusters, determines the GMM model of each cluster;Wherein, the classification of pixel include foreground point or
Background dot;The classification of cluster includes foreground point cluster or background dot cluster;
M steps:GMM model according to each cluster determines that each pixel belongs to the probability of each cluster, to any one pixel,
Most probable value P according to the pixelmaxCorresponding cluster determines the classification of the pixel.
8. device according to claim 7, it is characterised in that:
Partitioning parameters computing module, is additionally operable to be determined on the mask artwork according to the depth map and mask artwork in the following ways
The partitioning parameters of each pixel:Depth value difference according to the pixel and adjacent pixel determines the border item segmentation of the pixel
Parameter;
Wherein, to any one pixel, the depth value absolute difference between the pixel and each neighbouring pixel is carried out
It is cumulative, then to adding up and being normalized, cumulative and as the pixel the border item after being normalized splits ginseng
Number.
9. device according to claim 8, it is characterised in that:
Mask artwork adjusting module, for building non-directed graph and dividing each pixel in the mask artwork in the following ways
Parameter is cut to be mapped in the non-directed graph:
A non-directed graph is built, two hanging point Q are set outside the plane of the non-directed graph0And Q1, the hanging point Q0For virtual
Foreground point, the hanging point Q1It is virtual background point;Each pixel on the mask artwork is set up in the plane of the non-directed graph
Mapping point, in mapping point and the hanging point Q of foreground point0Between set up line, it is outstanding with described in the mapping point of background dot
Null point Q1Between set up line;
To any one pixel P in mask artworki, by the pixel PiArea item partitioning parameters as being reflected in the non-directed graph
Exit point Pi' weights, by the pixel PiBorder item partitioning parameters as mapping point P in the non-directed graphi' with hanging point Q0
Or Q1Between line weights.
10. the device for realizing interactive image segmentation any one of a kind of terminal, including the claims 6-9.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710004616.XA CN106875397B (en) | 2017-01-04 | 2017-01-04 | Method, device and terminal for realizing interactive image segmentation |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710004616.XA CN106875397B (en) | 2017-01-04 | 2017-01-04 | Method, device and terminal for realizing interactive image segmentation |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106875397A true CN106875397A (en) | 2017-06-20 |
CN106875397B CN106875397B (en) | 2020-04-14 |
Family
ID=59164621
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710004616.XA Active CN106875397B (en) | 2017-01-04 | 2017-01-04 | Method, device and terminal for realizing interactive image segmentation |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106875397B (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110910401A (en) * | 2019-10-31 | 2020-03-24 | 五邑大学 | Semi-automatic image segmentation data annotation method, electronic device and storage medium |
WO2023071707A1 (en) * | 2021-10-29 | 2023-05-04 | 北京字节跳动网络技术有限公司 | Video image processing method and apparatus, electronic device, and storage medium |
US12014508B2 (en) | 2021-10-18 | 2024-06-18 | Ford Global Technologies, Llc | Distance determination from image data |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102903110A (en) * | 2012-09-29 | 2013-01-30 | 宁波大学 | Segmentation method for image with deep image information |
CN104063876A (en) * | 2014-01-10 | 2014-09-24 | 北京理工大学 | Interactive image segmentation method |
CN104463843A (en) * | 2014-10-31 | 2015-03-25 | 南京邮电大学 | Interactive image segmentation method of android system |
CN104820990A (en) * | 2015-05-15 | 2015-08-05 | 北京理工大学 | Interactive-type image-cutting system |
-
2017
- 2017-01-04 CN CN201710004616.XA patent/CN106875397B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102903110A (en) * | 2012-09-29 | 2013-01-30 | 宁波大学 | Segmentation method for image with deep image information |
CN104063876A (en) * | 2014-01-10 | 2014-09-24 | 北京理工大学 | Interactive image segmentation method |
CN104463843A (en) * | 2014-10-31 | 2015-03-25 | 南京邮电大学 | Interactive image segmentation method of android system |
CN104820990A (en) * | 2015-05-15 | 2015-08-05 | 北京理工大学 | Interactive-type image-cutting system |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110910401A (en) * | 2019-10-31 | 2020-03-24 | 五邑大学 | Semi-automatic image segmentation data annotation method, electronic device and storage medium |
US12014508B2 (en) | 2021-10-18 | 2024-06-18 | Ford Global Technologies, Llc | Distance determination from image data |
WO2023071707A1 (en) * | 2021-10-29 | 2023-05-04 | 北京字节跳动网络技术有限公司 | Video image processing method and apparatus, electronic device, and storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN106875397B (en) | 2020-04-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105354838B (en) | The depth information acquisition method and terminal of weak texture region in image | |
CN104835165B (en) | Image processing method and image processing device | |
CN106846345A (en) | A kind of method for realizing interactive image segmentation, device and terminal | |
CN106651867A (en) | Interactive image segmentation method and apparatus, and terminal | |
CN106886999A (en) | A kind of method for realizing interactive image segmentation, device and terminal | |
CN106898003A (en) | A kind of method for realizing interactive image segmentation, device and terminal | |
CN107018331A (en) | A kind of imaging method and mobile terminal based on dual camera | |
CN108022274A (en) | Image processing method, device, computer equipment and computer-readable recording medium | |
CN106875399A (en) | A kind of method for realizing interactive image segmentation, device and terminal | |
CN106898005A (en) | A kind of method for realizing interactive image segmentation, device and terminal | |
CN106846323A (en) | A kind of method for realizing interactive image segmentation, device and terminal | |
CN106778887A (en) | The terminal and method of sentence flag sequence are determined based on condition random field | |
CN106791111A (en) | A kind of images share method, device and terminal | |
CN106875397A (en) | A kind of method for realizing interactive image segmentation, device and terminal | |
CN106898002A (en) | A kind of method for realizing interactive image segmentation, device and terminal | |
CN106780516A (en) | A kind of method for realizing interactive image segmentation, device and terminal | |
CN106898004A (en) | A kind of preprocess method for realizing interactive image segmentation, device and terminal | |
CN106791119A (en) | A kind of photo processing method, device and terminal | |
CN106875398A (en) | A kind of method for realizing interactive image segmentation, device and terminal | |
CN106887009A (en) | A kind of method for realizing interactive image segmentation, device and terminal | |
CN106873981A (en) | A kind of icon processing method, device and terminal | |
CN106651762A (en) | Photo processing method, device and terminal | |
CN107071275A (en) | A kind of image combining method and terminal | |
CN106855946A (en) | A kind of image information acquisition method and apparatus | |
CN106780517A (en) | A kind of method for realizing interactive image segmentation, device and terminal |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |