Some Customers
OEM Direct Customers
Value Proposition
Content Server
TCP, IDM, VRD, C
360, P360, …
SharePoint
Collect documents from various sources … classify them, extract data … and feed them into business applications
Automatically with OCR, ICR, IDR
Transform Pixels into Actionable Information
Use Cases
Make Money
Save Money
Qualified Electronic Document
Enable Digital Workflow
Document and Data Capture
Reduce Manual Keying
Enable Process Automation
Digital Mail Room
Scanning Documents into Electronic Files
Backfile Conversion
Transaction and Process Management
Service Centers
Ad-Hoc capturing
Copyright © Open Text Corporation. All rights reserved.
Overview Components of OCC
IM EX
Fax, Email, FTP site, Network Folder
SharePoint
Enterprise Scan Client
Business Application
Archive
Configuration
Monitoring
Dispatch
Validation
Recognition
Open Text Capture Center
Recognition: Different Document Types
Structured Documents > Forms: Data at fixed positions.
Semi-Structured Documents > B2B Correspondence: Data is in logical groups but positions are unknown.
Unstructured Documents > C2B Correspondence: Data may be anywhere in the document.
Steps in Document Recognition
Separation: Splitting a batch of images into individual (multi-page) documents
Classification: Identification of document type
"Invoice", "Delivery Note", "Order", "Others"
Extraction: Searching for basic information on the document
Date: Jan 21. 2012
Amount: 332,29 $
Order-Nr: X-44277
Supplier: Mueller & Friends
Interpretation: Enhancing extracted data with context information
Date: 21012012
Amount: 332.29 USD
Order-Nr: X-44277
Supplier: K441258-3
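The interpretation step can be illustrated with a short sketch that turns the extracted sample values into their enhanced form. This is a minimal illustration under stated assumptions, not the OCC implementation: the `interpret` function, the `SUPPLIER_IDS` master-data table, and the `CURRENCY_CODES` mapping are hypothetical names, and reading "$" as USD is an assumption.

```python
import re
from decimal import Decimal

MONTHS = ["Jan", "Feb", "Mar", "Apr", "May", "Jun",
          "Jul", "Aug", "Sep", "Oct", "Nov", "Dec"]

# Hypothetical master data: supplier name -> supplier ID.
SUPPLIER_IDS = {"Mueller & Friends": "K441258-3"}
# Assumed mapping; "$" alone is ambiguous between several currencies.
CURRENCY_CODES = {"$": "USD"}


def interpret(fields):
    """Enhance raw extracted strings with context information."""
    # Date: "Jan 21. 2012" -> "21012012" (day, month, year as digits).
    m = re.fullmatch(r"([A-Za-z]{3}) (\d{1,2})\. (\d{4})", fields["Date"])
    if m is None:
        raise ValueError("unrecognized date: " + fields["Date"])
    month = MONTHS.index(m.group(1)) + 1
    date_text = f"{int(m.group(2)):02d}{month:02d}{m.group(3)}"

    # Amount: "332,29 $" -> "332.29 USD" (decimal comma, currency symbol).
    number, symbol = fields["Amount"].rsplit(" ", 1)
    amount = Decimal(number.replace(",", "."))

    return {
        "Date": date_text,
        "Amount": f"{amount} {CURRENCY_CODES[symbol]}",
        "Order-Nr": fields["Order-Nr"],
        # Supplier name is replaced by its ID from the master data.
        "Supplier": SUPPLIER_IDS[fields["Supplier"]],
    }


extracted = {
    "Date": "Jan 21. 2012",
    "Amount": "332,29 $",
    "Order-Nr": "X-44277",
    "Supplier": "Mueller & Friends",
}
result = interpret(extracted)
```

A supplier that is missing from the master data raises a `KeyError` here; a real system would more likely route such a document to manual validation.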
Free Forms Extraction
OCR: Turn pixels into characters w/ optical character recognition
CLERK: 12
DATE SHIPPED 10/30/02
ORDER DATE 10/24/02
COVER CODE PA 3
Analyze: Find meaningful entities and tag them (Date, Word, Number)
Extract: Find the correct date among all detected alternatives
Order_Date = "10/24/10"
Normalize: Decompose string into subunits and re-format as required
"10/24/10", Format (US): Day = 24, Month = Oct, Year = 2010 → 24.10.2010
Verify: Check against business and plausibility rules
Valid period? 24.10.2010: yes → Export, no → Manual keying
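The free-forms steps on this slide (analyze, extract, normalize, verify) can be sketched end to end, starting from text that OCR has already produced. This is a hedged illustration only: every function name, the `OCR_TOKENS` sample, the two-digit-year rule, and the valid period are assumptions made for the example. The sample dates follow the slide's `Order_Date = "10/24/10"` value.

```python
import re
from datetime import date

# Hypothetical OCR result: (label found next to the value, raw text).
OCR_TOKENS = [
    ("CLERK", "12"),
    ("DATE SHIPPED", "10/30/10"),
    ("ORDER DATE", "10/24/10"),
    ("COVER CODE", "PA"),
]

DATE_RE = re.compile(r"^(\d{1,2})/(\d{1,2})/(\d{2})$")


def analyze(tokens):
    """Tag each token as Date, Number, or Word."""
    tagged = []
    for label, text in tokens:
        if DATE_RE.match(text):
            kind = "Date"
        elif text.isdigit():
            kind = "Number"
        else:
            kind = "Word"
        tagged.append((label, text, kind))
    return tagged


def extract_order_date(tagged):
    """Pick, among all detected dates, the one labelled as the order date."""
    for label, text, kind in tagged:
        if kind == "Date" and "ORDER" in label:
            return text
    return None


def normalize_us_date(text, century=2000):
    """Decompose a US-format MM/DD/YY string into day, month, and year."""
    m = DATE_RE.match(text)
    if m is None:
        raise ValueError("not a US-format date: " + text)
    month, day, year = (int(g) for g in m.groups())
    return date(century + year, month, day)  # assumes years 2000 to 2099


def verify(value, start, end):
    """Plausibility rule: is the date inside the valid period?"""
    return "export" if start <= value <= end else "manual keying"


raw = extract_order_date(analyze(OCR_TOKENS))
order_date = normalize_us_date(raw)
formatted = order_date.strftime("%d.%m.%Y")
decision = verify(order_date, date(2010, 1, 1), date(2010, 12, 31))
```

Choosing the date by its label is the simplest possible rule. The slide only states that the correct date is found among all detected alternatives, not how.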
Long Term Effect of Free Forms Techniques
Free Form Recognition
Adaptive (=learning) technology
OCC Approach: Combining Free Forms and adaptive Form (template) based Recognition
Increase of recognition rate during production time
Invoice Data Extraction
Supplier
Invoice Number
Line items
Net Amount
Total amount
Invoice Date
Order number
Currency
Delivery note
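As an illustration, the fields listed above could be carried through validation and export in a record like the one below. The class names, field types, and the plausibility check are assumptions made for this sketch, not the OCC data model, and the sample values are invented.

```python
from dataclasses import dataclass, field
from datetime import date
from decimal import Decimal
from typing import List, Optional


@dataclass
class LineItem:
    description: str
    quantity: Decimal
    unit_price: Decimal

    @property
    def amount(self) -> Decimal:
        return self.quantity * self.unit_price


@dataclass
class InvoiceData:
    # Header data
    supplier: str
    invoice_number: str
    invoice_date: date
    currency: str
    net_amount: Decimal
    total_amount: Decimal
    order_number: Optional[str] = None
    delivery_note: Optional[str] = None
    # Line items may be absent when only header data is extracted.
    line_items: List[LineItem] = field(default_factory=list)

    def line_items_match_net(self) -> bool:
        """Plausibility rule: line item amounts add up to the net amount."""
        total = sum((item.amount for item in self.line_items), Decimal("0"))
        return total == self.net_amount


# Invented sample values, for illustration only.
invoice = InvoiceData(
    supplier="Example Supplier",
    invoice_number="INV-0001",
    invoice_date=date(2012, 1, 21),
    currency="USD",
    net_amount=Decimal("250.00"),
    total_amount=Decimal("297.50"),
    line_items=[
        LineItem("Item A", Decimal("2"), Decimal("100.00")),
        LineItem("Item B", Decimal("1"), Decimal("50.00")),
    ],
)
```

Using `Decimal` rather than `float` keeps the amount comparison exact.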
Supported Countries in Knowledge Base
Germany
United States
Austria
Australia
Belgium
Bulgaria*
Denmark
Finland
France
United Kingdom
Italy
Canada
Netherlands
Norway
Portugal
Romania*
Sweden
Switzerland
Singapore
Slovakia*
Slovenia
Spain
Czech Republic
Hungary
Poland
Russia
New Zealand
* Header data only
Purchase Order Processing with OCC
Capture Center
Enterprise Scan
Recognition
Validation
Case360
Tempo
Rendition Server
Content Server
Knowledge Base
Master Data
Web form
OCC – Your Benefit
Improve Information Quality:
Improving information sharing
Leveraging a common set of business rules
Reducing errors
Reduce Operating Costs:
Automating manual tasks
Deploying a single input management platform
Reducing paper filing/storage
Accelerate Business Processes:
Shortening cycle times
Reducing exception processing
Enhancing customer relationships
Improving knowledge worker productivity
Compliance:
Ensuring compliance / auditability
Improving visibility into business processes
Improving litigation preparedness