CN104899572A - Content-detecting method and device, and terminal - Google Patents

Content-detecting method and device, and terminal Download PDF

Info

Publication number
CN104899572A
CN104899572A CN201510331116.8A CN201510331116A CN104899572A CN 104899572 A CN104899572 A CN 104899572A CN 201510331116 A CN201510331116 A CN 201510331116A CN 104899572 A CN104899572 A CN 104899572A
Authority
CN
China
Prior art keywords
scan image
content
relevant information
presumptive area
detection relevant
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510331116.8A
Other languages
Chinese (zh)
Other versions
CN104899572B (en
Inventor
张鹏
苏奕
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Samsung Electronics China R&D Center
Samsung Electronics Co Ltd
Original Assignee
Samsung Electronics China R&D Center
Samsung Electronics Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Samsung Electronics China R&D Center, Samsung Electronics Co Ltd filed Critical Samsung Electronics China R&D Center
Priority to CN201510331116.8A priority Critical patent/CN104899572B/en
Publication of CN104899572A publication Critical patent/CN104899572A/en
Application granted granted Critical
Publication of CN104899572B publication Critical patent/CN104899572B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • G06V30/41Analysis of document content
    • G06V30/414Extracting the geometrical structure, e.g. layout tree; Block segmentation, e.g. bounding boxes for graphics or text
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/22Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Graphics (AREA)
  • Geometry (AREA)
  • Artificial Intelligence (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a content-detecting method and device, and a terminal. The method in one embodiment comprises: acquiring a scanned image of content to be detected, wherein the scanned image satisfies a predetermined layout distribution rule; identifying and extracting the content of a predetermined area in the scanned image; determining whether the content of the predetermined area satisfies a corresponding predetermined rule; if not, setting the predetermined area as an error area; and marking the error area. The method automatically detects the format or the content of a large amount of content to be detected and improves detecting accuracy and detecting efficiency of the content.

Description

The method of Detection of content, device and terminal
Technical field
The application relates to field of computer technology, is specifically related to field of terminal technology, particularly relates to the method for Detection of content, device and terminal.
Background technology
In daily live and work, people need to fill in a large amount of stencil-type documents, such as some contracts, important official document, document, bill, Product labelling etc., these documents all have the strict requirement of filling in rule and form, if in the process of filling in, not do not fill according to the requirement of regulation, then the information of filling in may be caused invalid.Therefore, after completing such as foregoing, the content completed is detected, whether effective with some information in the content determined.
In the prior art, usually rely on and manually check the content of these documents, but the quantity of these documents is often very huge, if only go to check by manpower, workload will be very large, and easily occur mistake, make checking efficiency low.
Summary of the invention
This application provides a kind of method of Detection of content, device and terminal.
First aspect, this application provides a kind of method of Detection of content, and described method comprises: the scan image obtaining content to be detected, and wherein, described scan image meets predetermined space of a whole page distribution rule; Identify and extract the content of presumptive area in described scan image; Judge whether the content of described presumptive area meets corresponding pre-defined rule; If not, described presumptive area is defined as zone errors; Described zone errors are identified.
In some embodiments, the detection relevant information that described scan image is corresponding is determined in the operation based on user; Or based on the identification and analysis of described scan image being determined to corresponding detection relevant information; Wherein, described detection relevant information comprises the positional information of presumptive area in scan image and the pre-defined rule corresponding to described presumptive area.
In some embodiments, the described operation based on user is determined to comprise the detection relevant information that described scan image is corresponding: user is defined as detection relevant information corresponding to described scan image by the detection relevant information that operation interface is arranged; Or determine according to the contents attribute corresponding to described scan image that user is selected by operation interface the detection relevant information that described scan image is corresponding.
In some embodiments, described based on determining corresponding detection relevant information to the identification and analysis of described scan image, comprising: identify subregional content in the middle part of described scan image; The contents attribute of described scan image determined in the keyword comprised according to the content of described subregion; Detection relevant information corresponding for the contents attribute of described scan image is defined as detection relevant information corresponding to described scan image.
In some embodiments, described method also comprises: the image that is provided for searching problem search foundation, described problem image is the scan image comprising zone errors.
Second aspect, this application provides a kind of device of Detection of content, and described device comprises: acquiring unit, and for obtaining the scan image of content to be detected, wherein, described scan image meets predetermined space of a whole page distribution rule; Identify extraction unit, for identifying and extracting the content of presumptive area in the scan image that described acquiring unit obtains; Judging unit, for judging whether the content of the presumptive area that described identification extraction unit extracts meets corresponding pre-defined rule; Determining unit, for being defined as zone errors by the described presumptive area not meeting corresponding pre-defined rule; Identify unit, for identifying described zone errors.
In some embodiments, described device also comprises: detect relevant information determining unit; Described detection relevant information determining unit, for determining the detection relevant information that described scan image is corresponding based on the operation of user; Or based on the identification and analysis of described scan image being determined to corresponding detection relevant information; Wherein, described detection relevant information comprises the positional information of presumptive area in scan image and the pre-defined rule corresponding to described presumptive area.
In some embodiments, described detection relevant information determining unit is configured for: user is defined as detection relevant information corresponding to described scan image by the detection relevant information that operation interface is arranged; Or determine according to the contents attribute corresponding to described scan image that user is selected by operation interface the detection relevant information that described scan image is corresponding.
In some embodiments, described detection relevant information determining unit is configured for: identify subregional content in the middle part of described scan image; The contents attribute of described scan image determined in the keyword comprised according to the content of described subregion; Detection relevant information corresponding for the contents attribute of described scan image is defined as detection relevant information corresponding to described scan image.
In some embodiments, described device also comprises: search according to unit, searches foundation for the image that is provided for searching problem, and described problem image is the scan image comprising zone errors.
The third aspect, this application provides a kind of terminal, and described terminal comprises: processor, user interface, communication interface; Wherein, described processor controls the scan image that described communication interface obtains content to be detected, described processor identification also extracts the content of presumptive area in described scan image, judge whether the content of described presumptive area meets corresponding pre-defined rule, the described presumptive area not meeting corresponding pre-defined rule is defined as zone errors, and described zone errors are identified; Described user interface is used for user and described terminal is carried out alternately, and described user interface at least comprises display module, and described processor controls the scan image of the to be detected content of described display module display after identifying described zone errors; Wherein, described scan image meets predetermined space of a whole page distribution rule.
In some embodiments, described processor controls the operation information that described user interface obtains user, and described processor determines based on the operation information of described user the detection relevant information that described scan image is corresponding; Or described processor is based on the identification and analysis of described scan image being determined to corresponding detection relevant information; Wherein, described detection relevant information comprises the positional information of presumptive area in scan image and the pre-defined rule corresponding to described presumptive area.
The method of the Detection of content that the application provides, device and terminal, by obtaining the scan image of content to be detected, identify and extract the content of presumptive area in scan image, the presumptive area not meeting corresponding pre-defined rule being defined as zone errors, and zone errors are identified.Thus achieve the detection automatically a large amount of content to be detected being carried out to form or content, improve degree of accuracy and detection efficiency that content is detected.
Accompanying drawing explanation
By reading the detailed description done non-limiting example done with reference to the following drawings, the other features, objects and advantages of the application will become more obvious:
Fig. 1 is the process flow diagram of an embodiment of the method for the Detection of content that the embodiment of the present application provides;
Fig. 2 is the interface schematic diagram that detection relevant information that the embodiment of the present application provides arranges an embodiment at interface;
Fig. 3 is the interface schematic diagram that detection relevant information that the embodiment of the present application provides arranges another embodiment at interface;
Fig. 4 be the embodiment of the present application provide based on the process flow diagram of an embodiment of the method identification and analysis of scan image being determined to corresponding detection relevant information;
Fig. 5 is the structural representation of an embodiment of the device of the Detection of content that the embodiment of the present application provides;
Fig. 6 is the structural representation of an embodiment of the terminal that the embodiment of the present application provides.
Embodiment
Below in conjunction with drawings and Examples, the application is described in further detail.Be understandable that, specific embodiment described herein is only for explaining related invention, but not the restriction to this invention.It also should be noted that, for convenience of description, in accompanying drawing, illustrate only the part relevant to Invention.
It should be noted that, when not conflicting, the embodiment in the application and the feature in embodiment can combine mutually.Below with reference to the accompanying drawings and describe the application in detail in conjunction with the embodiments.
Terminal involved by the application can recognition image.For example, object and is for simplicity described, in ensuing discussion, in conjunction with the terminal of recognition image describing the exemplary embodiment of the application.Terminal can include but not limited to smart mobile phone, panel computer, pocket computer on knee and desktop computer etc.
Please refer to Fig. 1, it illustrates the flow process 100 of an embodiment of the method for the Detection of content according to the application.
As shown in Figure 1, in a step 101, obtain the scan image of content to be detected, wherein, above-mentioned scan image meets predetermined space of a whole page distribution rule.
In general, the such as partial content of some contracts and important official document, document, bill, Product labelling etc., all has the strict requirement of filling in rule and form, if in the process of filling in, not do not fill according to the requirement of regulation, then the information of filling in may be caused invalid.Therefore, after completing such as foregoing, the content completed is detected, whether effective with some information in the content determined.
In the present embodiment, content to be detected is need detected content, content to be detected can be full content or the partial content of a document or bill, also can be full content or the partial content of a file (contract or official document etc.), can also be content of Product labelling etc.Be appreciated that content to be detected can also be the content of other form, the application treats the concrete form of Detection of content and particular content does not limit.
In one implementation, the image scanning function that can be had by terminal itself, is treated Detection of content and directly scans, thus obtains the scan image of content to be detected.In another implementation, the scan image of the content to be detected that miscellaneous equipment scans can also be obtained, such as, the scan image of the content to be detected that miscellaneous equipment is uploaded is received by communication interface, or the scan image of the content to be detected stored is obtained from memory device, or the scan image of content to be detected is downloaded from assigned address, etc.Be appreciated that the scan image that can also obtain content to be detected by another way, the application does not limit the concrete mode obtaining above-mentioned scan image.
In the present embodiment, above-mentioned scan image will meet predetermined space of a whole page distribution rule, and wherein, space of a whole page distribution rule is the distribution rule of the content in scan image on the space of a whole page of scan image.Such as, predetermined space of a whole page distribution rule can be the center of content area in scan image and the center superposition of the scan image space of a whole page, also can be that the distance between the center of the content area in scan image and the coboundary of the scan image space of a whole page is predetermined value.Be appreciated that predetermined space of a whole page distribution rule can also be other content, the particular content of the application to space of a whole page distribution rule does not limit.
Then, in a step 102, identify and extract the content of presumptive area in above-mentioned scan image.
In the present embodiment, can comprise one or more presumptive area in a scan image, the content of presumptive area is in content to be detected the content needing to fill in strict accordance with the requirement of filling in rule and form.Such as, the signature region in certain file and date fill in region etc.Presumptive area can be preset in conjunction with content to be detected and predetermined space of a whole page distribution rule.
In the present embodiment, first obtain the positional information of presumptive area, then, according to the positional information determination presumptive area of presumptive area, extract the content of presumptive area in above-mentioned scan image by the identification of pictograph recognition technology.In one implementation, the content of presumptive area in OCR (Optical Character Recognition, optical character identification) technology identification scan image can be adopted.Be appreciated that the content of presumptive area in the mode identification scan image that can also adopt other, the application is to identifying that the mode of presumptive area in scan image does not limit.
Then, in step 103, judge whether the content of above-mentioned presumptive area meets corresponding pre-defined rule.
In the present embodiment, in a scan image, one or more presumptive area can be comprised, the pre-defined rule that each different presumptive area can be corresponding different.Wherein, the rule of content that the content that pre-defined rule is presumptive area will meet or the rule etc. of form.Such as, pre-defined rule can be that the content of presumptive area can not be blank, can be that character quantity in presumptive area can not more than one predetermined threshold, also can be must comprise certain key word/keyword in the content of presumptive area, or certain key word/keyword can not be comprised in the content of presumptive area, also can be that the content of presumptive area must meet certain canonical form, can also be that the numerical value of the numeral comprised in the content of presumptive area must in predetermined scope etc.Be appreciated that pre-defined rule can also be the rule of other content, the particular content of the application to pre-defined rule does not limit.
It should be noted that, can by the positional information of detection relevant information determination presumptive area in scan image and the pre-defined rule corresponding with this presumptive area.Have the detection relevant information that the scan image of different contents attributes is corresponding different, detection relevant information as corresponding in certain contract and the scan image of certain invoice is different.Wherein, detect relevant information and at least comprise the positional information of presumptive area in scan image and the pre-defined rule corresponding to this presumptive area.
In the one of the present embodiment realizes, can determine based on the operation of user the detection relevant information that scan image is corresponding.Specifically, user can be defined as detection relevant information corresponding to above-mentioned scan image by the detection relevant information that operation interface is arranged.Such as, as shown in Figure 2, on operation interface 201, multiple detection relevant information template 202 corresponding to the scan image of different content attribute is provided.Adopt presumptive area to identify 203 in each detection relevant information template 202 and identify the position of presumptive area in the space of a whole page, further, user can check by the operation (detecting relevant information template as being placed on by the cursor of mouse first-class) on operation interface the pre-defined rule that presumptive area is corresponding.User can select suitable detection relevant information template according to the contents attribute of scan image on operation interface, thus determines the detection relevant information that scan image is corresponding.
It should be noted that, the space of a whole page distribution of the to be detected content of contents attribute corresponding to scan image of scan image and the attribute of form, as the content of invoice and the corresponding different contents attribute of the content of label, the corresponding different contents attribute of content with the not same page of a contract, two spaces of a whole page are arranged contents attribute corresponding to the content of different invoices etc.
Again such as, as shown in Figure 3, on operation interface 301, demonstrate the scan image 302 of content to be detected, user based on the scan image 302 of content to be detected, operation interface 301 can mark the position of presumptive area 303 in the space of a whole page, and the pre-defined rule that input is corresponding with presumptive area.Thus obtain detection relevant information corresponding to scan image.
The contents attribute corresponding to above-mentioned scan image can also selected by operation interface according to user determines the detection relevant information that this scan image is corresponding.Specifically, there is the detection relevant information that the scan image of different contents attributes is corresponding different.On operation interface, provide the contents attribute of different scan images to user, user can select the contents attribute of the scan image corresponding to content to be detected on operation interface.According to the contents attribute of above-mentioned scan image, from the data stored, obtain the detection relevant information corresponding with this scan image.
In the another kind of the present embodiment realizes, can also based on the identification and analysis of above-mentioned scan image being determined to corresponding detection relevant information.With reference to figure 4, it illustrates the flow process 400 of an embodiment based on the method identification and analysis of above-mentioned scan image being determined to corresponding detection relevant information.
As shown in Figure 4, in step 401, subregional content in the middle part of scan image is identified.
In general, can according to the contents attribute of content determination scan image subregional in the middle part of scan image.Such as, according to the key word/keyword of the title of the content in scan image or a certain section of content, the contents attribute of scan image can be determined.Therefore, the content in the region of the contents attribute that can reflect arbitrarily scan image in scan image can be identified.Be appreciated that the application does not limit the particular location of subregion and scope.In the present embodiment, subregional content in the middle part of OCR technology identification scan image can be adopted.
In step 402, the contents attribute of this scan image determined in the keyword comprised according to the content of above-mentioned subregion.
In step 403, detection relevant information corresponding for the contents attribute of above-mentioned scan image is defined as detection relevant information corresponding to this scan image.
Then, at step 104, if the content of presumptive area does not meet corresponding pre-defined rule, this presumptive area is defined as zone errors.
In the present embodiment, if the content of presumptive area does not meet corresponding pre-defined rule, then illustrate that the content of presumptive area can not meet the rule of predetermined content or the rule etc. of form.Therefore, detect that the content of this presumptive area is invalid content, this presumptive area is defined as zone errors.
Finally, in step 105, above-mentioned zone errors are identified.
In the present embodiment, after detecting zone errors, these zone errors are identified.Such as, zone errors can be carried out highlighted display, designated color also can be adopted to identify zone errors, dotted line frame or bold box can also be adopted zone errors to be marked off, etc.Be appreciated that and other mode can also be adopted to identify above-mentioned zone errors, the application does not limit the mode that mark zone errors adopt.
The method that above-described embodiment of the application provides, by obtaining the scan image of content to be detected, identify and extract the content of presumptive area in scan image, the presumptive area not meeting corresponding pre-defined rule being defined as zone errors, and zone errors are identified.Thus achieve the detection automatically a large amount of content to be detected being carried out to form or content, improve degree of accuracy and detection efficiency that content is detected.
In some Alternate embodiments, said method also comprises: the image that is provided for searching problem search foundation, this problem image is the scan image comprising zone errors.
In the present embodiment, after detecting zone errors, except needs identify zone errors, what also need to be provided for searching to user the scan image (i.e. problem image) comprising zone errors searches foundation, finds out the image file of problem image for user.Wherein, for the identification information (file name etc. as problem image) searched according to being problem image of the image that searches problem, also can be the chained address of the file of problem image, be appreciated that, above-mentionedly search according to being other interior perhaps form, the particular content that the application searches foundation to above-mentioned and form do not limit.
Although it should be noted that the operation describing the inventive method in the accompanying drawings with particular order, this is not that requirement or hint must perform these operations according to this particular order, or must perform the result that all shown operation could realize expectation.On the contrary, the step described in process flow diagram can change execution sequence.Additionally or alternatively, some step can be omitted, multiple step be merged into a step and perform, and/or a step is decomposed into multiple step and perform.
With further reference to Fig. 5, it illustrates the structural representation of an embodiment of the device of the Detection of content according to the application.
As shown in Figure 5, the device 500 of the present embodiment comprises: acquiring unit 501, identifies extraction unit 502, judging unit 503, determining unit 504 and identify unit 505.Wherein, acquiring unit 501 is for obtaining the scan image of content to be detected, and wherein, this scan image meets predetermined space of a whole page distribution rule.Identify that extraction unit 502 is for identifying and extracting the content of presumptive area in the scan image that acquiring unit obtains.Judging unit 503 is for judging whether the content identifying the presumptive area that extraction unit extracts meets corresponding pre-defined rule.Determining unit 504 is for being defined as zone errors by the presumptive area not meeting corresponding pre-defined rule.Identify unit 505 is for identifying zone errors.
In some Alternate embodiments, device 500 also comprises: detect relevant information determining unit (not shown).Wherein, detect relevant information determining unit to be used for determining based on the operation of user the detection relevant information that scan image is corresponding.Or based on the identification and analysis of scan image being determined to corresponding detection relevant information.Wherein, detect relevant information and comprise the positional information of presumptive area in scan image and the pre-defined rule corresponding to presumptive area.
In some Alternate embodiments, detect relevant information determining unit and be configured for: user is defined as detection relevant information corresponding to scan image by the detection relevant information that operation interface is arranged; Or determine according to the contents attribute corresponding to scan image that user is selected by operation interface the detection relevant information that this scan image is corresponding.
In some Alternate embodiments, detect relevant information determining unit and be configured for: identify subregional content in the middle part of above-mentioned scan image.The contents attribute of this scan image determined in the keyword comprised according to the content of above-mentioned subregion.Detection relevant information corresponding for the contents attribute of this scan image is defined as detection relevant information corresponding to this scan image.
In some Alternate embodiments, the device 500 of the present embodiment also comprises: search according to unit (not shown), searches foundation for the image that is provided for searching problem, and this problem image is the scan image comprising zone errors.
Should be appreciated that all unit or the module of record in device 500 are corresponding with each step in the method described with reference to figure 1-4.Thus, above for the unit that operation and the feature of method description are equally applicable to device 500 and wherein comprise, do not repeat them here.Device 500 can pre-set in the terminal, also can be loaded in terminal by modes such as downloads.Corresponding units in device 500 can cooperatively interact the scheme realizing Detection of content with the unit in terminal.
With further reference to Fig. 6, it illustrates the structural representation of an embodiment of the terminal according to the application.Be appreciated that this terminal includes but not limited to smart mobile phone, panel computer, pocket computer on knee and desktop computer etc.
As shown in Figure 6, the terminal 600 of the present embodiment comprises: at least one processor 601, such as CPU (Central Processing Unit, central processing unit), at least one communication interface 602, at least one user interface 603, storer 604, at least one communication bus 605.Communication bus 605 is for realizing the connection communication between said modules.Terminal 600 optionally comprises user interface 603, as display module, and keyboard or pointing device (such as, mouse, trace ball (trackball), touch-sensitive plate or touch sensitive display screen) etc.Storer 604 may comprise high-speed RAM (Random Access Memory, random access memory), still may comprise nonvolatile memory (non-volatile memory), such as at least one magnetic disk memory.Storer 604 optionally can comprise at least one and be positioned at memory storage away from aforementioned processor 601.
In some embodiments, storer 604 stores following element, executable module or data structure, or their subset, or their superset:
Operating system 614, comprises various system program, for realizing various basic business and processing hardware based task.
Application program 624, comprises various application program, for realizing various applied business.
Concrete, can be, but not limited to comprise in application program 624:
Acquiring unit, for obtaining the scan image of content to be detected, wherein, described scan image meets predetermined space of a whole page distribution rule; Identify extraction unit, for identifying and extracting the content of presumptive area in the scan image that described acquiring unit obtains; Judging unit, for judging whether the content of the presumptive area that described identification extraction unit extracts meets corresponding pre-defined rule; Determining unit, for being defined as zone errors by the described presumptive area not meeting corresponding pre-defined rule; Identify unit, for identifying described zone errors.
Further, described device also comprises: detect relevant information determining unit; Described detection relevant information determining unit, for determining the detection relevant information that described scan image is corresponding based on the operation of user; Or based on the identification and analysis of described scan image being determined to corresponding detection relevant information; Wherein, described detection relevant information comprises the positional information of presumptive area in scan image and the pre-defined rule corresponding to described presumptive area.
Further, described detection relevant information determining unit is configured for: user is defined as detection relevant information corresponding to described scan image by the detection relevant information that operation interface is arranged; Or determine according to the contents attribute corresponding to described scan image that user is selected by operation interface the detection relevant information that described scan image is corresponding.
Further, described detection relevant information determining unit is configured for: identify subregional content in the middle part of described scan image; The contents attribute of described scan image determined in the keyword comprised according to the content of described subregion; Detection relevant information corresponding for the contents attribute of described scan image is defined as detection relevant information corresponding to described scan image.
Further, described device also comprises: search according to unit, searches foundation for the image that is provided for searching problem, and described problem image is the scan image comprising zone errors.
In the present embodiment, processor 601 is by calling the program or instruction execution corresponding steps that store in storer 604.Particularly, processor 601 controls the scan image that communication interface 602 obtains content to be detected, processor 601 identifies and extracts the content of presumptive area in above-mentioned scan image, judge whether the content of this presumptive area meets corresponding pre-defined rule, the presumptive area not meeting corresponding pre-defined rule is defined as zone errors, and zone errors are identified.User interface 603 carries out alternately for user and terminal, and user interface 603 at least comprises display module, the scan image of the to be detected content of processor 601 Control Items display after identifying zone errors.Wherein, above-mentioned scan image meets predetermined space of a whole page distribution rule.
Further, processor 601 controls the operation information that user interface 603 obtains user, and processor 601 is based on detection relevant information corresponding to the operation information determination scan image of user; Or processor 601 is based on the identification and analysis of scan image being determined to corresponding detection relevant information; Wherein, detect relevant information and comprise the positional information of presumptive area in scan image and the pre-defined rule corresponding to presumptive area.
Be described in unit module involved in the embodiment of the present application to be realized by the mode of software, also can be realized by the mode of hardware.Described unit module also can be arranged within a processor, such as, can be described as: a kind of processor comprises acquiring unit, identifies extraction unit, judging unit, determining unit, identify unit.Wherein, the title of these unit modules does not form the restriction to this unit module itself under certain conditions, and such as, acquiring unit can also be described to " for obtaining the unit of the scan image of content to be detected ".
As another aspect, present invention also provides a kind of computer-readable recording medium, this computer-readable recording medium can be the computer-readable recording medium comprised in device described in above-described embodiment; Also can be individualism, be unkitted the computer-readable recording medium allocated in terminal.Described computer-readable recording medium stores more than one or one program, and described program is used for performance description in the method for the Detection of content of the application by one or more than one processor.
More than describe and be only the preferred embodiment of the application and the explanation to institute's application technology principle.Those skilled in the art are to be understood that, invention scope involved in the application, be not limited to the technical scheme of the particular combination of above-mentioned technical characteristic, also should be encompassed in when not departing from described inventive concept, other technical scheme of being carried out combination in any by above-mentioned technical characteristic or its equivalent feature and being formed simultaneously.The technical characteristic that such as, disclosed in above-mentioned feature and the application (but being not limited to) has similar functions is replaced mutually and the technical scheme formed.

Claims (12)

1. a method for Detection of content, is characterized in that, described method comprises:
Obtain the scan image of content to be detected, wherein, described scan image meets predetermined space of a whole page distribution rule;
Identify and extract the content of presumptive area in described scan image;
Judge whether the content of described presumptive area meets corresponding pre-defined rule;
If not, described presumptive area is defined as zone errors;
Described zone errors are identified.
2. method according to claim 1, is characterized in that,
The detection relevant information that described scan image is corresponding is determined in operation based on user; Or
Based on the identification and analysis of described scan image being determined to corresponding detection relevant information;
Wherein, described detection relevant information comprises the positional information of presumptive area in scan image and the pre-defined rule corresponding to described presumptive area.
3. method according to claim 2, is characterized in that, the described operation based on user is determined to comprise the detection relevant information that described scan image is corresponding:
User is defined as detection relevant information corresponding to described scan image by the detection relevant information that operation interface is arranged; Or
The detection relevant information that described scan image is corresponding is determined according to the contents attribute corresponding to described scan image that user is selected by operation interface.
4. method according to claim 2, is characterized in that, described based on determining corresponding detection relevant information to the identification and analysis of described scan image, comprising:
Identify subregional content in the middle part of described scan image;
The contents attribute of described scan image determined in the keyword comprised according to the content of described subregion;
Detection relevant information corresponding for the contents attribute of described scan image is defined as detection relevant information corresponding to described scan image.
5. method according to claim 1, is characterized in that, described method also comprises:
What be provided for searching problem image searches foundation, and described problem image is the scan image comprising zone errors.
6. a device for Detection of content, is characterized in that, described device comprises:
Acquiring unit, for obtaining the scan image of content to be detected, wherein, described scan image meets predetermined space of a whole page distribution rule;
Identify extraction unit, for identifying and extracting the content of presumptive area in the scan image that described acquiring unit obtains;
Judging unit, for judging whether the content of the presumptive area that described identification extraction unit extracts meets corresponding pre-defined rule;
Determining unit, for being defined as zone errors by the described presumptive area not meeting corresponding pre-defined rule;
Identify unit, for identifying described zone errors.
7. device according to claim 6, is characterized in that, described device also comprises: detect relevant information determining unit;
Described detection relevant information determining unit, for determining the detection relevant information that described scan image is corresponding based on the operation of user; Or
Based on the identification and analysis of described scan image being determined to corresponding detection relevant information;
Wherein, described detection relevant information comprises the positional information of presumptive area in scan image and the pre-defined rule corresponding to described presumptive area.
8. device according to claim 7, is characterized in that, described detection relevant information determining unit is configured for:
User is defined as detection relevant information corresponding to described scan image by the detection relevant information that operation interface is arranged; Or
The detection relevant information that described scan image is corresponding is determined according to the contents attribute corresponding to described scan image that user is selected by operation interface.
9. device according to claim 7, is characterized in that, described detection relevant information determining unit is configured for:
Identify subregional content in the middle part of described scan image;
The contents attribute of described scan image determined in the keyword comprised according to the content of described subregion;
Detection relevant information corresponding for the contents attribute of described scan image is defined as detection relevant information corresponding to described scan image.
10. device according to claim 6, is characterized in that, described device also comprises:
Search according to unit, search foundation for the image that is provided for searching problem, described problem image is the scan image comprising zone errors.
11. 1 kinds of terminals, is characterized in that, comprising: processor, user interface, communication interface;
Wherein, described processor controls the scan image that described communication interface obtains content to be detected, described processor identification also extracts the content of presumptive area in described scan image, judge whether the content of described presumptive area meets corresponding pre-defined rule, the described presumptive area not meeting corresponding pre-defined rule is defined as zone errors, and described zone errors are identified; Described user interface is used for user and described terminal is carried out alternately, and described user interface at least comprises display module, and described processor controls the scan image of the to be detected content of described display module display after identifying described zone errors; Wherein, described scan image meets predetermined space of a whole page distribution rule.
12. terminals according to claim 11, is characterized in that, described processor controls the operation information that described user interface obtains user, and described processor determines based on the operation information of described user the detection relevant information that described scan image is corresponding; Or
Described processor is based on the identification and analysis of described scan image being determined to corresponding detection relevant information; Wherein, described detection relevant information comprises the positional information of presumptive area in scan image and the pre-defined rule corresponding to described presumptive area.
CN201510331116.8A 2015-06-15 2015-06-15 The method, apparatus and terminal of detection content Active CN104899572B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510331116.8A CN104899572B (en) 2015-06-15 2015-06-15 The method, apparatus and terminal of detection content

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510331116.8A CN104899572B (en) 2015-06-15 2015-06-15 The method, apparatus and terminal of detection content

Publications (2)

Publication Number Publication Date
CN104899572A true CN104899572A (en) 2015-09-09
CN104899572B CN104899572B (en) 2019-02-15

Family

ID=54032226

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510331116.8A Active CN104899572B (en) 2015-06-15 2015-06-15 The method, apparatus and terminal of detection content

Country Status (1)

Country Link
CN (1) CN104899572B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109993126A (en) * 2019-04-03 2019-07-09 腾讯科技(深圳)有限公司 The file information determines method, apparatus, equipment and readable storage medium storing program for executing
CN111062379A (en) * 2018-10-16 2020-04-24 珠海格力电器股份有限公司 Identification error-proofing recognition method, device, storage medium and system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102324004A (en) * 2011-05-26 2012-01-18 重庆猪八戒网络有限公司 Verification method for webpage form input information and device
EP2530579A1 (en) * 2010-01-29 2012-12-05 Shandong New Beiyang Information Technology Co., Ltd. Printing control method, printer and printing system
CN103258198A (en) * 2013-04-26 2013-08-21 四川大学 Extraction method for characters in form document image
CN103873436A (en) * 2012-12-11 2014-06-18 金蝶软件(中国)有限公司 Information verification method, and terminal

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2530579A1 (en) * 2010-01-29 2012-12-05 Shandong New Beiyang Information Technology Co., Ltd. Printing control method, printer and printing system
CN102324004A (en) * 2011-05-26 2012-01-18 重庆猪八戒网络有限公司 Verification method for webpage form input information and device
CN103873436A (en) * 2012-12-11 2014-06-18 金蝶软件(中国)有限公司 Information verification method, and terminal
CN103258198A (en) * 2013-04-26 2013-08-21 四川大学 Extraction method for characters in form document image

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111062379A (en) * 2018-10-16 2020-04-24 珠海格力电器股份有限公司 Identification error-proofing recognition method, device, storage medium and system
CN109993126A (en) * 2019-04-03 2019-07-09 腾讯科技(深圳)有限公司 The file information determines method, apparatus, equipment and readable storage medium storing program for executing
CN109993126B (en) * 2019-04-03 2023-10-24 腾讯科技(深圳)有限公司 File information determining method, device, equipment and readable storage medium

Also Published As

Publication number Publication date
CN104899572B (en) 2019-02-15

Similar Documents

Publication Publication Date Title
CN111476227B (en) Target field identification method and device based on OCR and storage medium
US10824801B2 (en) Interactively predicting fields in a form
US9430716B2 (en) Image processing method and image processing system
CN105631393A (en) Information recognition method and device
CN112597182B (en) Optimization method, device, terminal and storage medium of data query statement
US10740638B1 (en) Data element profiles and overrides for dynamic optical character recognition based data extraction
US20150134641A1 (en) Electronic device and method for processing clip of electronic document
CN104462232A (en) Data storage method and device
CN104102704A (en) System control displaying method and system control displaying device
CN112784720A (en) Key information extraction method, device, equipment and medium based on bank receipt
US20240143163A1 (en) Digital ink processing system, method, and program
WO2018228001A1 (en) Electronic device, information query control method, and computer-readable storage medium
CN104899572A (en) Content-detecting method and device, and terminal
CN107679222B (en) Picture processing method, mobile terminal and computer readable storage medium
CN107909054A (en) The method for evaluating similarity and device of picture text
CN116860747A (en) Training sample generation method and device, electronic equipment and storage medium
JP6252296B2 (en) Data identification method, data identification program, and data identification apparatus
CN110688995A (en) Map query processing method, computer-readable storage medium and mobile terminal
EP3156883A1 (en) Input device, document input system, document input method, and program
CN109409362A (en) The detection of picture sensitive word and localization method and device based on tesseract engine
CN115186240A (en) Social network user alignment method, device and medium based on relevance information
CN110018828B (en) Source code checking method and device and terminal equipment
CN110533556B (en) Method, apparatus, computer device and storage medium for processing arbitration information
KR101368610B1 (en) Method and system for selecting paragraph on electronic book environments
CN111667214A (en) Goods information acquisition method and device based on two-dimensional code and electronic equipment

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant