December 2007 Document Recognition Technology Overview Presentation

44
Document Recognition a technology overview Presented by: Chris Riley of Artsyl Technologies, Inc.
  • date post

    20-Oct-2014
  • Category

    Technology

  • view

    1.711
  • download

    2

description

 

Transcript of December 2007 Document Recognition Technology Overview Presentation

Page 1: December 2007 Document Recognition Technology Overview Presentation

Document Recognitiona technology overview

Presented by: Chris Riley of Artsyl Technologies, Inc.

Page 2: December 2007 Document Recognition Technology Overview Presentation

But FirstYour new AIIM Board!

Exciting new eventsGolfNetworkingMore Education Sessions

Page 3: December 2007 Document Recognition Technology Overview Presentation

What we will cover:Why Chris?

What Are the Document Recognition Technologies

Who Makes Them

Buyer Beware

The future

Q & A

Free Stuff!

Page 4: December 2007 Document Recognition Technology Overview Presentation

Why Chris?

Who is Artsyl?

What qualifies Chris to talk to me?

When a developer turns to sales

Page 5: December 2007 Document Recognition Technology Overview Presentation

What we will cover:Why Chris?

What Are the Document Recognition Technologies

Who Makes Them

Buyer Beware

The future

Q & A

Free Stuff!

Page 6: December 2007 Document Recognition Technology Overview Presentation

Who knows what OCR is?

Page 7: December 2007 Document Recognition Technology Overview Presentation

The TechnologiesOCR – Optical Character RecognitionICR – Intelligent Character RecognitionOMR – Optical Mark RecognitionBarcodeHandwritingAll the other ones made up for marketing purposes

CAR/LAR ( Check21 ) – Courtesy and Legal Amount RecognitionAssisted CaptureFixed Form ProcessSemi-Structured Forms ProcessingUnstructured Document Processing

Page 8: December 2007 Document Recognition Technology Overview Presentation

The Technologies: OCROCR – Optical Character RecognitionICR – Intelligent Character RecognitionOMR – Optical Mark RecognitionBarcodeHandwritingAll the other ones made up for marketing purposes

CAR/LAR ( Check21 ) – Courtesy and Legal Amount RecognitionAssisted CaptureFixed Form ProcessSemi-Structured Forms ProcessingUnstructured Document Processing

Ship To:

Page 9: December 2007 Document Recognition Technology Overview Presentation

The Technologies: ICROCR – Optical Character RecognitionICR – Intelligent Character RecognitionOMR – Optical Mark RecognitionBarcodeHandwritingAll the other ones made up for marketing purposes

CAR/LAR ( Check21 ) – Courtesy and Legal Amount RecognitionAssisted CaptureFixed Form ProcessSemi-Structured Forms ProcessingUnstructured Document Processing

Ilya

Page 10: December 2007 Document Recognition Technology Overview Presentation

The Technologies: OMROCR – Optical Character RecognitionICR – Intelligent Character RecognitionOMR – Optical Mark RecognitionBarcodeHandwritingAll the other ones made up for marketing purposes

CAR/LAR ( Check21 ) – Courtesy and Legal Amount RecognitionAssisted CaptureFixed Form ProcessSemi-Structured Forms ProcessingUnstructured Document Processing

Card Account

Page 11: December 2007 Document Recognition Technology Overview Presentation

The Technologies: BarcodeOCR – Optical Character RecognitionICR – Intelligent Character RecognitionOMR – Optical Mark RecognitionBarcodeHandwritingAll the other ones made up for marketing purposes

CAR/LAR ( Check21 ) – Courtesy and Legal Amount RecognitionAssisted CaptureFixed Form ProcessSemi-Structured Forms ProcessingUnstructured Document Processing

1889094476620

Page 12: December 2007 Document Recognition Technology Overview Presentation

The Technologies: HandwritingOCR – Optical Character RecognitionICR – Intelligent Character RecognitionOMR – Optical Mark RecognitionBarcodeHandwritingAll the other ones made up for marketing purposes

CAR/LAR ( Check21 ) – Courtesy and Legal Amount RecognitionAssisted CaptureFixed Form ProcessSemi-Structured Forms ProcessingUnstructured Document Processing

* Critical *

Page 13: December 2007 Document Recognition Technology Overview Presentation

The Technologies: Acronym Heaven

OCR – Optical Character RecognitionICR – Intelligent Character RecognitionOMR – Optical Mark RecognitionBarcodeHandwritingAll the other ones made up for marketing purposes

CAR/LAR ( Check21 ) – Courtesy and Legal Amount RecognitionAssisted CaptureFixed Form ProcessSemi-Structured Forms ProcessingUnstructured Document Processing

Page 14: December 2007 Document Recognition Technology Overview Presentation

The Technologies: CAR/LAROCR – Optical Character RecognitionICR – Intelligent Character RecognitionOMR – Optical Mark RecognitionBarcodeHandwritingAll the other ones made up for marketing purposes

CAR/LAR ( Check21 ) – Courtesy and Legal Amount RecognitionAssisted CaptureFixed Form ProcessSemi-Structured Forms ProcessingUnstructured Document Processing

2 hundred dollars & no cents

Page 15: December 2007 Document Recognition Technology Overview Presentation

The Technologies: Assisted Capture

OCR – Optical Character RecognitionICR – Intelligent Character RecognitionOMR – Optical Mark RecognitionBarcodeHandwritingAll the other ones made up for marketing purposes

CAR/LAR ( Check21 ) – Courtesy and Legal Amount RecognitionAssisted CaptureFixed Form ProcessSemi-Structured Forms ProcessingUnstructured Document Processing

Page 16: December 2007 Document Recognition Technology Overview Presentation

The Technologies: Fixed Form Processing

OCR – Optical Character RecognitionICR – Intelligent Character RecognitionOMR – Optical Mark RecognitionBarcodeHandwritingAll the other ones made up for marketing purposes

CAR/LAR ( Check21 ) – Courtesy and Legal Amount RecognitionAssisted CaptureFixed Form ProcessSemi-Structured Forms ProcessingUnstructured Document Processing

Name: Ilya

Date: 12/21/2982

Page 17: December 2007 Document Recognition Technology Overview Presentation

The Technologies: Fixed Form Processing

Name: IlyaDate: 12/21/2982

Page 18: December 2007 Document Recognition Technology Overview Presentation

80% of business end-user documents are semi-structured

Page 19: December 2007 Document Recognition Technology Overview Presentation

The Technologies: Semi-Structured Forms

OCR – Optical Character RecognitionICR – Intelligent Character RecognitionOMR – Optical Mark RecognitionBarcodeHandwritingAll the other ones made up for marketing purposes

CAR/LAR ( Check21 ) – Courtesy and Legal Amount RecognitionAssisted CaptureFixed Form ProcessSemi-Structured Forms ProcessingUnstructured Document Processing

Invoice No: 99044

Date: 06/09/04

Invoice No: 24567

Date: 06/09/04

Page 20: December 2007 Document Recognition Technology Overview Presentation

Invoice No: 99044Date: 06/09/04

Invoice No: 24567Date: 06/09/04 (06/09/2004)

The Technologies: Semi-Structured Forms

Page 21: December 2007 Document Recognition Technology Overview Presentation

The Technologies: Semi-Structured Forms

OCR – Optical Character RecognitionICR – Intelligent Character RecognitionOMR – Optical Mark RecognitionBarcodeHandwritingAll the other ones made up for marketing purposes

CAR/LAR ( Check21 ) – Courtesy and Legal Amount RecognitionAssisted CaptureFixed Form ProcessSemi-Structured Forms ProcessingUnstructured Document Processing

Consignee

Consignor

Date

Term

Page 22: December 2007 Document Recognition Technology Overview Presentation

The Technologies: Common Processes

Full page conversionClassificationIndex level extraction

RedactionRoutingAuto FilingRe-PurposingImage Rotation

Page 23: December 2007 Document Recognition Technology Overview Presentation

The Technologies: Full page conversion

Image file to electronic data fileALL text on the pageIncludes:

Image Pre-processingDocument Analysis/ZoningExtractionExport ( Commonly PDF, DOC )

Page 24: December 2007 Document Recognition Technology Overview Presentation

The Technologies: Classification

Software tells you the document typeScan batches of mixed documents

Bill of L

ading

Invoice

Check

PO

Page 25: December 2007 Document Recognition Technology Overview Presentation

The Technologies: Index Level Extraction

Just certain required fields extractedNormalization of dataExport usually to a database

Invoice NumberInvoice Date

Total Amt DueTerm

Page 26: December 2007 Document Recognition Technology Overview Presentation

The Technologies: How Accurate

Better question is how do you determine accuracy

Document Type AccuracyField/Zone Location AccuracyData Type AccuracyCharacter Accuracy

Page 27: December 2007 Document Recognition Technology Overview Presentation

The Technologies: Common usage scenarios

Document Conversion

Document Archival / Retrieval

Invoice Processing

Insurance Processing( medical, mortgage )

Waybill processing

Survey processing

Page 28: December 2007 Document Recognition Technology Overview Presentation

What we will cover:Why Chris?

What Are the Document Recognition Technologies

Who Makes Them

Buyer Beware

The future

Q & A

Free Stuff!

Page 29: December 2007 Document Recognition Technology Overview Presentation

There Really are only 3 core technology providers

It takes 50 man-years to develop OCR using current computing abilities

Page 30: December 2007 Document Recognition Technology Overview Presentation

Who Makes Them: Core Engines

ABBYYNuance ( formally ScanSoft )ReadI.R.I.S

OcéCharacTellParaScriptA2iA

Handful of Open SourceHandful of Other VendorsTwo handfuls of OLD engines

Page 31: December 2007 Document Recognition Technology Overview Presentation

Who Makes Them: Who Licenses ThemEVERYONE ELSE!AnaCompAnydocBancTecBrainWareCaptarisCaptivationCardiffCVisionDataCapDigiTecheCopyEMC DocumentumKofaxLaserFicheLeadToolsMicrosoftNSi AutoStoreOnBasePerceptive ImagingReadSoftSERTop Image SystemsTowerWestbrookXerox

Hundreds More

Page 32: December 2007 Document Recognition Technology Overview Presentation

What we will cover:Why Chris?

What Are the Document Recognition Technologies

Who Makes Them

Buyer Beware

The future

Q & A

Free Stuff!

Page 33: December 2007 Document Recognition Technology Overview Presentation

30% of organizations that purchase, purchase the wrong thing

Over 50 % of organizations that purchase never use it properly

Page 34: December 2007 Document Recognition Technology Overview Presentation

Buyer Beware

If OCR is the reason for buying a solution know what Engine it is!

Talk about the WHOLE solution not the pieces

Get past marketing gimmicks

Trust, Love, Be Certain of your reseller / vendor

Page 35: December 2007 Document Recognition Technology Overview Presentation

Buyer Beware: Know your engine

What version?Will they upgrade?

Page 36: December 2007 Document Recognition Technology Overview Presentation

Buyer Beware: Talk about Whole Solution

Scanner / InputCaptureStorage

Have Requirements List Before

Page 37: December 2007 Document Recognition Technology Overview Presentation

Buyer Beware: Get past Gimmicks

NOTHING! Is 100%

All canned demos work perfect

Always see test on your documents

Version numbers are really arbitrary

Page 38: December 2007 Document Recognition Technology Overview Presentation

Buyer Beware: Trust your vendor / reseller

Support after sale ( test them )

Where to get professional services

Do they understand the solution and not just the pieces?

Page 39: December 2007 Document Recognition Technology Overview Presentation

What we will cover:Why Chris?

What Are the Document Recognition Technologies

Who Makes Them

Buyer Beware

The future

Q & A

Free Stuff!

Page 40: December 2007 Document Recognition Technology Overview Presentation

The FutureFull-page OCR will be a commodity

Advance Document Processing will become main-stream but less required

Think about what to do now that you will be gathering data rapidly

There will be a new approach to OCR

Page 41: December 2007 Document Recognition Technology Overview Presentation

What we will cover:Why Chris?

What Are the Document Recognition Technologies

Who Makes Them

Buyer Beware

The future

Q & A

Free Stuff!

Page 42: December 2007 Document Recognition Technology Overview Presentation

Questions and Answers

Before you ask

Page 43: December 2007 Document Recognition Technology Overview Presentation

What we will cover:Why Chris?

What Are the Document Recognition Technologies

Who Makes Them

Buyer Beware

The future

Q & A

Free Stuff!

Page 44: December 2007 Document Recognition Technology Overview Presentation

Free Stuff

Copy of ABBYY FineReader Pro 9.0Copy of Nuance OmniPage 16Copy of ReadI.R.I.S Pro 11

4 Hour Consulting Session with ME!