DE69425084D1 - Verfahren und Gerät zur Erkennung von Textzeilen, Wörtern und räumlichen Merkmalen von Zeichenzellen - Google Patents

Verfahren und Gerät zur Erkennung von Textzeilen, Wörtern und räumlichen Merkmalen von Zeichenzellen

Info

Publication number
DE69425084D1
DE69425084D1 DE69425084T DE69425084T DE69425084D1 DE 69425084 D1 DE69425084 D1 DE 69425084D1 DE 69425084 T DE69425084 T DE 69425084T DE 69425084 T DE69425084 T DE 69425084T DE 69425084 D1 DE69425084 D1 DE 69425084D1
Authority
DE
Germany
Prior art keywords
words
text lines
spatial features
recognizing text
character cells
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE69425084T
Other languages
English (en)
Other versions
DE69425084T2 (de
Inventor
A Lawrence Spitz
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xerox Corp
Original Assignee
Xerox Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xerox Corp filed Critical Xerox Corp
Application granted granted Critical
Publication of DE69425084D1 publication Critical patent/DE69425084D1/de
Publication of DE69425084T2 publication Critical patent/DE69425084T2/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/26Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/14Image acquisition
    • G06V30/148Segmentation of character regions
    • G06V30/158Segmentation of character regions using character size, text spacings or pitch estimation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Character Input (AREA)
  • Character Discrimination (AREA)
DE69425084T 1993-04-19 1994-04-18 Verfahren und Gerät zur Erkennung von Textzeilen, Wörtern und räumlichen Merkmalen von Zeichenzellen Expired - Lifetime DE69425084T2 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US08/047,514 US5384864A (en) 1993-04-19 1993-04-19 Method and apparatus for automatic determination of text line, word and character cell spatial features

Publications (2)

Publication Number Publication Date
DE69425084D1 true DE69425084D1 (de) 2000-08-10
DE69425084T2 DE69425084T2 (de) 2000-11-09

Family

ID=21949404

Family Applications (1)

Application Number Title Priority Date Filing Date
DE69425084T Expired - Lifetime DE69425084T2 (de) 1993-04-19 1994-04-18 Verfahren und Gerät zur Erkennung von Textzeilen, Wörtern und räumlichen Merkmalen von Zeichenzellen

Country Status (5)

Country Link
US (1) US5384864A (de)
EP (1) EP0621554B1 (de)
JP (1) JPH0713995A (de)
KR (1) KR970002420B1 (de)
DE (1) DE69425084T2 (de)

Families Citing this family (47)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0606780B1 (de) * 1993-01-11 2001-07-11 Canon Kabushiki Kaisha Gerät und Verfahren zur Bildverarbeitung
US5513304A (en) * 1993-04-19 1996-04-30 Xerox Corporation Method and apparatus for enhanced automatic determination of text line dependent parameters
US5517578A (en) * 1993-05-20 1996-05-14 Aha! Software Corporation Method and apparatus for grouping and manipulating electronic representations of handwriting, printing and drawings
US6587587B2 (en) 1993-05-20 2003-07-01 Microsoft Corporation System and methods for spacing, storing and recognizing electronic representations of handwriting, printing and drawings
JP3042945B2 (ja) * 1993-07-07 2000-05-22 富士通株式会社 画像抽出装置
EP0981243B1 (de) * 1993-07-16 2010-03-17 Sharp Kabushiki Kaisha Bilddatenprozessor
CA2154952A1 (en) * 1994-09-12 1996-03-13 Robert M. Ayers Method and apparatus for identifying words described in a page description language file
EP0702322B1 (de) * 1994-09-12 2002-02-13 Adobe Systems Inc. Verfahren und Gerät zur Identifikation von Wörtern, die in einem portablen elektronischen Dokument beschrieben sind
JP3805005B2 (ja) * 1994-11-09 2006-08-02 キヤノン株式会社 画像処理装置及び光学的文字認識装置及びそれらの方法
ATE185211T1 (de) * 1995-01-31 1999-10-15 United Parcel Service Inc Verfahren und gerät zum trennen des vordergrunds und hintergrunds in textenthaltenden bildern
US5999647A (en) * 1995-04-21 1999-12-07 Matsushita Electric Industrial Co., Ltd. Character extraction apparatus for extracting character data from a text image
MY121607A (en) * 1995-07-10 2006-02-28 Hyundai Curitel Inc Grid moving method of object image and apparatus using the same and compaction/motion estimation method using the same and apparatus thereof
US5867597A (en) * 1995-09-05 1999-02-02 Ricoh Corporation High-speed retrieval by example
US5737442A (en) * 1995-10-20 1998-04-07 Bcl Computers Processor based method for extracting tables from printed documents
US5848191A (en) * 1995-12-14 1998-12-08 Xerox Corporation Automatic method of generating thematic summaries from a document image without performing character recognition
US5892842A (en) * 1995-12-14 1999-04-06 Xerox Corporation Automatic method of identifying sentence boundaries in a document image
US5850476A (en) * 1995-12-14 1998-12-15 Xerox Corporation Automatic method of identifying drop words in a document image without performing character recognition
US5683586A (en) * 1996-02-05 1997-11-04 Harcourt; Gregory A. Method and apparatus for magnetically treating a fluid
US5909510A (en) * 1997-05-19 1999-06-01 Xerox Corporation Method and apparatus for document classification from degraded images
US6687404B1 (en) 1997-06-20 2004-02-03 Xerox Corporation Automatic training of layout parameters in a 2D image model
JP4320064B2 (ja) * 1998-07-10 2009-08-26 富士通株式会社 画像処理装置及び記録媒体
JP3897272B2 (ja) * 1999-09-28 2007-03-22 富士フイルム株式会社 画像解析装置
US8682077B1 (en) 2000-11-28 2014-03-25 Hand Held Products, Inc. Method for omnidirectional processing of 2D images including recognizable characters
WO2003063067A1 (en) * 2002-01-24 2003-07-31 Chatterbox Systems, Inc. Method and system for locating positions in printed texts and delivering multimedia information
US7164797B2 (en) 2002-04-25 2007-01-16 Microsoft Corporation Clustering
US7120297B2 (en) 2002-04-25 2006-10-10 Microsoft Corporation Segmented layered image system
US7024039B2 (en) 2002-04-25 2006-04-04 Microsoft Corporation Block retouching
US7392472B2 (en) * 2002-04-25 2008-06-24 Microsoft Corporation Layout analysis
US7043079B2 (en) 2002-04-25 2006-05-09 Microsoft Corporation “Don't care” pixel interpolation
US7263227B2 (en) 2002-04-25 2007-08-28 Microsoft Corporation Activity detector
US7110596B2 (en) 2002-04-25 2006-09-19 Microsoft Corporation System and method facilitating document image compression utilizing a mask
JP2004038321A (ja) * 2002-06-28 2004-02-05 Fujitsu Ltd 文書レイアウト解析プログラム、文書レイアウト解析装置および文書レイアウト解析方法
US7302098B2 (en) * 2004-12-03 2007-11-27 Motorola, Inc. Character segmentation method and apparatus
WO2006066325A1 (en) * 2004-12-21 2006-06-29 Canon Kabushiki Kaisha Segmenting digital image and producing compact representation
US7602972B1 (en) * 2005-04-25 2009-10-13 Adobe Systems, Incorporated Method and apparatus for identifying white space tables within a document
US7650041B2 (en) 2006-02-24 2010-01-19 Symbol Technologies, Inc. System and method for optical character recognition in an image
US8218890B2 (en) * 2008-01-22 2012-07-10 The Neat Company Method and apparatus for cropping images
US8620080B2 (en) * 2008-09-26 2013-12-31 Sharp Laboratories Of America, Inc. Methods and systems for locating text in a digital image
EP2275972B1 (de) * 2009-07-06 2018-11-28 AO Kaspersky Lab System und Verfahren zur Identifizierung von textbasiertem Spam in Bildern
US9003531B2 (en) 2009-10-01 2015-04-07 Kaspersky Lab Zao Comprehensive password management arrangment facilitating security
US8526732B2 (en) * 2010-03-10 2013-09-03 Microsoft Corporation Text enhancement of a textual image undergoing optical character recognition
US8571270B2 (en) * 2010-05-10 2013-10-29 Microsoft Corporation Segmentation of a word bitmap into individual characters or glyphs during an OCR process
US9237255B1 (en) 2014-08-25 2016-01-12 Xerox Corporation Methods and systems for processing documents
CN106446896B (zh) * 2015-08-04 2020-02-18 阿里巴巴集团控股有限公司 一种字符分割方法、装置及电子设备
US20170068868A1 (en) * 2015-09-09 2017-03-09 Google Inc. Enhancing handwriting recognition using pre-filter classification
US9842251B2 (en) * 2016-01-29 2017-12-12 Konica Minolta Laboratory U.S.A., Inc. Bulleted lists
KR101999549B1 (ko) 2017-07-25 2019-07-12 주식회사 한글과컴퓨터 셀 자동 분할 장치

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3613080A (en) * 1968-11-08 1971-10-12 Scan Data Corp Character recognition system utilizing feature extraction
US4206442A (en) * 1974-07-03 1980-06-03 Nippon Electric Co., Ltd. Letter segmenting apparatus for OCR comprising multi-level segmentor operable when binary segmenting fails
US4173015A (en) * 1978-08-16 1979-10-30 Recognition Equipment Incorporated System and method for character presence detection
JPS56129981A (en) * 1980-03-14 1981-10-12 Toshiba Corp Optical character reader
US4377803A (en) * 1980-07-02 1983-03-22 International Business Machines Corporation Algorithm for the segmentation of printed fixed pitch documents
EP0120334B1 (de) * 1983-03-01 1989-12-06 Nec Corporation System zum Bestimmen des Zeichenabstandes
US4918740A (en) * 1985-10-01 1990-04-17 Palantir Corporation Processing means for use in an optical character recognition system
US4899394A (en) * 1986-05-09 1990-02-06 Prodigy Systems Corporation Apparatus and method for image compression
US5001766A (en) * 1988-05-16 1991-03-19 At&T Bell Laboratories Apparatus and method for skew control of document images
US5062141A (en) * 1988-06-02 1991-10-29 Ricoh Company, Ltd. Method of segmenting characters in lines which may be skewed, for allowing improved optical character recognition
JPH0816918B2 (ja) * 1989-04-18 1996-02-21 シャープ株式会社 行抽出方法
US5253307A (en) * 1991-07-30 1993-10-12 Xerox Corporation Image analysis to obtain typeface information

Also Published As

Publication number Publication date
US5384864A (en) 1995-01-24
DE69425084T2 (de) 2000-11-09
EP0621554A3 (de) 1995-05-24
EP0621554A2 (de) 1994-10-26
JPH0713995A (ja) 1995-01-17
KR970002420B1 (ko) 1997-03-05
KR940024625A (ko) 1994-11-18
EP0621554B1 (de) 2000-07-05

Similar Documents

Publication Publication Date Title
DE69425084D1 (de) Verfahren und Gerät zur Erkennung von Textzeilen, Wörtern und räumlichen Merkmalen von Zeichenzellen
DE69424902D1 (de) Gerät und Verfahren zur anpassungsfähigen nicht-buchstäblichen Textsuche
DE69423760D1 (de) Verfahren und vorrichtung zur isolierung von mikrogefässzellen
DE69332459D1 (de) Verfahren und Vorrichtung zur Zeichenerkennung
DE69228973D1 (de) Verfahren und Gerät zur Zeichenerkennung
DE69232493D1 (de) Verfahren und Gerät zur Zeichenerkennung
DE69430082D1 (de) Verfahren und Vorrichtung zur Sprachdetektion
DE69730930D1 (de) Verfahren und Gerät zur Zeichenerkennung
DE69420400D1 (de) Verfahren und gerät zur sprechererkennung
DE69512567D1 (de) Verfahren und Vorrichtung zur Wiederherstellung von Buchstabenumrisslinien
DE69322894D1 (de) Lernverfahren und Gerät zur Spracherkennung
DE69425037D1 (de) Verfahren und Vorrichtung zur Generierung von Schriftzeichen
DE69421324D1 (de) Verfahren und Vorrichtung zur Sprachkommunikation
DE69328380D1 (de) Verfahren und vorrichtung zur vermittlung von mehreren verkehrsklassen
DE59605560D1 (de) Verfahren zum Spalten von Kohlenwasserstoffen und Vorrichtung
DE69417105D1 (de) Vorrichtung und Verfahren zum Erkennen handgeschriebener Symbole
DE69429901D1 (de) Verfahren und Vorrichtung zur Regelung von unterirdischen Speichern
DE69416360D1 (de) Verfahren und Vorrichtung zum Kontrollieren von Klassen von Dokumenten
DE69321569D1 (de) Verfahren und Vorrichtung zur Zeicheneingabe
DE69415739D1 (de) Verahren und vorrichtung zur schrittweisen bewegung von gegenstaenden
DE69732156D1 (de) Verfahren und Gerät zur Zeichenerkennung
DE69301342D1 (de) Verfahren und Vorrichtung zur Handhabung von trägerbandlosen Etiketten
DE69418776D1 (de) Verfahren und Vorrichtung zur Eingabe von Musikdaten
DE69837822D1 (de) Verfahren und Vorrichtung zur Dekodierung von Sprachsignalen
DE69332555D1 (de) Verfahren und Vorrichtung zur Anzeige von Zeichen

Legal Events

Date Code Title Description
8364 No opposition during term of opposition