Document Image Analysis Lecture 12: Word Segmentation

UC Berkeley CS294-9 Fall 2000 12- 1

Document Image AnalysisLecture 12: Word Segmentation

Richard J. FatemanHenry S. Baird

University of California – BerkeleyXerox Palo Alto Research Center

The course, recently….

• We studied symbol recognition, classifiers

and their combinations

• Word recognition as distinct from characters

A good segmentation method (or several) is handy

• We cannot rely on a lexicon to have all words (names, proper nouns, numbers, acronyms)

• Insisting that words be in the lexicon does not mean they are correct. Powerpoint tries to refuse misspell as mispell since the latter is not in the dictionary!

• Good segmentation means that the symbol based recognition has a better chance of success

Segmentation/ Naïve or clever

• Numerous papers on the subject• Some without strong models (e.g. cut at

thin parts)• Some with exhaustive search / template

matching• Some with learning/ internal

comparisons

Naïve connected component analysis can’t come close…

• Characters like “ij:; Ξ â% are separated• Ligatures are not separated: ffl, ŒÆœ ffi

• Vertical cuts between touching characters will not ordinarily work for italics

THIS IS ULTRA CONDENSED ..TZ this is times italic .

(other problems: X2 , )3 22 yx

Papers of interest on segmentation

• Tsujimoto and Asada• Bayer and Kressel• Tao Hong’s (1995) PhD on Degraded

Text Recognition

Segmentation + Clustering (Tao Hong)

Can lead to decoding!

Sometimes the image itself holds a key to decoding…

Visual inter-word relations

An example text block showing visual inter-word relationships

Pattern matching can lead to identifying a segment

Where this fits…

Example

Tsujimoto & Asada: Overview

Resolve the touching characters:

• New metric for finding breaks (find plausible breaks

• Use knowledge about “the usual suspects” rn/m k/lc d/cl … (limits search substantially)

Metric, pre-processing

ANDing columns for profile removing slant from italics

Choosing break candidates

Decision Tree for “The”

Tree search

• Depth first, looking for solution to the string matching, in sequence.

• Some partitions are penalized (but not eliminated) if the segmentation point is uncertain.

• Segments are matched to omnifont templates (“multiple similarity method..”)

Reexamined explanations

mm nun

ck dcEtc… 30 confusions

This might be mistaken for This

Some tough calls…

Unbelievable accuracy…

A different, perhaps more general method (Bayer, Kressel)

• Goal: find the column position(s) at which characters are touching– Treat as a systematic classification problem– Learn from a data base containing labelled merged

characters• Collect real life data; get human breakpoints [or could

be synthetic, I suppose]• Find appropriate feature set• Learn the features of touching characters

– Hypothesize column breaks– Application: postal addresses, other stuff too

Database of touching chars

….2158 patterns

Big ideaRather than represent the breaks as low points in the projection profile, represent the breaks in the natural context of touching characters by actual example, suitably normalized for size (15-30 pixels high).

These locations are manually marked.

Local feature set describing cut locations / measures of similarity

• Number of black pixels (= projection profile!)

• Number of white pixels counting from top/bottom

• Number of white-black transitions• Number of identical b or w pixels next to

this column (derivative of pp?)

Global feature set describing cut locations / measures of similarity

• Width to height ratio of full image (wider suggests touching characters)

• Width to height ratio of the image AFTER cutting(s)

• Number of white-black transitions• Number of identical b or w pixels next to

this column (derivative of pp?)

Illustration of the strategy

How accurate, how fast? (cut location)

• Finding cuts: 7.8% error in learning set, 7.2%(!) on test set

• 22% of the no-cut regions had errors• Best results used 50-feature classifier

using 9 column width• Cost for one image cut-analysis one

character analysis• Validates statistics > heuristics..

Document Image Analysis Lecture 12: Word Segmentation

Documents

Transcript of Document Image Analysis Lecture 12: Word Segmentation

Image Segmentation Longin Jan Latecki CIS 601. Image Segmentation Segmentation divides an image into its constituent regions or objects. Segmentation.

Lecture # 13 Image Segmentation & Hough Transform · 2017-04-20 · 1/14/2017 Image Segmentation 4 Image Segmentation Segmentation algorithms are based on one of two basic properties

LECTURE 7: Medical Image Segmentation (I) (Radiology ...bagci/teaching/mic16/lec7.pdf · LECTURE 7: Medical Image Segmentation (I) (Radiology Applications of Segmentation, and Thresholding)

IP-L8-Lecture - Segmentation · 2009-03-19 · 2 Image Segmentation • Group similar components (such as, pixels in an image, image frames in a video) to obtain a compact representation.

Lecture 11. Image Segmentation · 12/9/2010 3 3 Image Segmentation Segmentation is to subdivide an image into its component regions or objects. Segmentation should stop when the objects

LECTURE 10: Medical Image Segmentation as an …bagci/teaching/mic16/lec10.pdfMEDICAL IMAGE COMPUTING (CAP 5937) LECTURE 10: Medical Image Segmentation as an Energy Minimization Problem

Image Analysis Lecture 9.1 -Segmentation · Segmentation Image segmentation is the process of partitioning a digital image into multiple parts, i.e. find groups of pixels that belong

Lecture 9. Segmentation-thresholdingaalbu/computer vision 2009/Lecture 9. Segmentation... · 2 Context {Segmentation decomposes the image into parts for further analysis zExample:

Lecture 16 Image Segmentation 1.The basic concepts of segmentation 2.Point, line, edge detection 3.Thresh holding 4.Region-based segmentation 5.Segmentation.

Image Modeling & Segmentation Aly Farag and Asem Ali Lecture #3.

Lecture 8:Image Segmentation - media-lab.ccny.cuny.edumedia-lab.ccny.cuny.edu/wordpress/YLTCCNYHomepage/... · Image Analysis and Segmentation Low-level image processing: inputs and

Lecture 16 Image SegmentationImage Segmentation · 2016-08-25 · Lecture 16 Image SegmentationImage Segmentation 1. The basic concepts of segmentation 2. Pitli d dt tiPoint, line,

Digital Image Processing Lecture 6,7,8 – Image …fit.mta.edu.vn/files/FileMonHoc/Lecture 06,07,08...2006/07/08 · Image Segmentation 1 Digital Image Processing Lecture 6,7,8 –

Lecture 8 Image Segmentation - Tongji Universitysse.tongji.edu.cn/linzhang/DIP/slides/Lecture 08-Image Segmentation… · Lecture 8 Image Segmentation Lin ZHANG, PhD ... In Matlab,

Digital Image Processing Lecture Segmentation

CS654: Digital Image Analysis Lecture 26: Image segmentation.

Image Segmentation Image segmentation (segmentace obrazu)

Lecture #9: Image Resizing and Segmentationcs131.stanford.edu/files/09_notes.pdf · Lecture #9: Image Resizing and Segmentation Mason Swofford, Rachel Gardner, ... Figure 17: Improved

Image Segmentation Image Segmentation: Definitionspages.cs.wisc.edu/~dyer/cs766/slides/segmentation/segment-4up.pdf · “Segmentation is the process of partitioning an image into

Variational Approaches and Image Segmentation Lecture #7