Processing PDF: How to Go from PDF to E-text to Audio
description
Transcript of Processing PDF: How to Go from PDF to E-text to Audio
![Page 1: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/1.jpg)
Processing PDF:Processing PDF:How to Go from PDF toHow to Go from PDF toE-text to AudioE-text to Audio
Gaeir DietrichDirectorHigh Tech Center Training Unitof the California Community CollegesFoothill Community College District
![Page 2: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/2.jpg)
PDF from PublishersPDF from Publishers
Portable document format (PDF) Reads the same on any computer Looks like the book Smaller than TIFFs Contains all the text
Always check to make sure the book is the right one!
Easy for publishers
![Page 3: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/3.jpg)
Requesting through ATNRequesting through ATN
Access Text Network Now free for requesting files from ATN-
member publishers Paid membership to exchange files www.accesstext.org
Not all publishers But ATN does have the largest ones
![Page 4: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/4.jpg)
Other Resources at ATNOther Resources at ATN
Accessible Textbook Finder http://www.accesstext.org/atf.php
Link to Publisher Lookup http://www.publisherlookup.org/ Will have to contact non-ATN member
publishers directly
![Page 5: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/5.jpg)
Using Publisher PDFsUsing Publisher PDFs
Sometimes students can use files directly
Often files will need further processing for student use
At the very least, large files may need to be broken into chapters
![Page 6: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/6.jpg)
PDF StrengthsPDF Strengths
Good format for large print Cropping Fit to page on large pages Print sections on large pages (tiling)
Adobe Reader has some nice features Change colors Reflow Limited voicing
Works on both Mac and PC Easy for most publishers to create
![Page 7: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/7.jpg)
PDF WeaknessesPDF Weaknesses
Not always fully accessible Screen readers do not always like them—
even when they are text-based Reading order can be problematic
May be graphics (pictures of text) May have too much security
![Page 8: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/8.jpg)
As an Aside…As an Aside…
When faculty create PDFs… The PDF always started as something
else…usually a Word file Try to get the starting document if the
student prefers audio Security concerns?
Word files can be password protected Button > Prepare > Encrypt
![Page 9: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/9.jpg)
Types of PDF DocumentsTypes of PDF Documents
Text-based Text can be selected
Graphical Picture of text (i.e., a graphic) Text cannot be selected
Use text-select tool to tell the difference Files may be “locked”
![Page 10: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/10.jpg)
Processing PDFsProcessing PDFs
Adobe Acrobat Professional Check on College Buys for discount
Good OCR program Abbyy FineReader Nuance OmniPage
IF you are a Kurzweil campus, you will also need Kurzweil
![Page 11: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/11.jpg)
Adobe ToolsAdobe Tools
Adobe Reader Free Useful for students who need minimal
accessibility features http://www.adobe.com/products/reader/
Adobe Acrobat Professional Essential for alt media specialists Extract text, create accessible PDFs, enabled
Adobe Reader features www.uscollegebuy.com Discounted Price
![Page 12: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/12.jpg)
Acrobat ReaderAcrobat Reader
Reads aloud But does not highlight or track
Enlarges text Nice reflow feature
Changes text/background colors Text highlighting, sticky notes, and
comments Access for text-based PDFs
![Page 13: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/13.jpg)
Production Features in Reader
Really designed for reading, not reformatting
Export PDF Subscription service (about $20/year) Upload PDF file, service auto-converts to
Word, download
![Page 14: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/14.jpg)
Process with Acrobat ProProcess with Acrobat Pro
Cropping Enlargement for printing Tiling Extracting/deleting pages Combining/inserting pages Text extraction
Works best with text-based PDF Does have built-in OCR capability
![Page 15: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/15.jpg)
Customize Quick Tools
Click on the “gear”
View > Show/hide > Toolbar Items > Quick Tools
![Page 16: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/16.jpg)
Quick Tools Menu
![Page 17: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/17.jpg)
Customize
![Page 18: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/18.jpg)
Please Note
To enable single-key shortcuts Open Preferences dialog box Ctrl + K Under General > select Use Single-Key
Accelerators To Access Tools (first checkbox under Basic Tools)
![Page 19: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/19.jpg)
Cropping
Tools > Pages > Crop
Shortcut: C (Please note: This shortcut brings up the
mouse-driven cropping tool—must double click to open the dialog box!)
![Page 20: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/20.jpg)
Crop Tool
![Page 21: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/21.jpg)
Crop Toolbox
![Page 22: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/22.jpg)
Enlarging
Choose paper size/printer File > Print > Size…to Fit
Shortcut: Ctrl + P (tab through)
Tip: Crop document before enlarging
![Page 23: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/23.jpg)
Print to Fit
![Page 24: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/24.jpg)
Tiling
Choose paper size/printer File > Print > Poster > Tile Scale and
Overlap
Shortcut: Ctrl + P (tab through)
Tip: Crop document before tiling
![Page 25: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/25.jpg)
Enlarge with Tiling
![Page 26: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/26.jpg)
Extracting Pages
Tools > Pages > Extract
Delete Shortcut: Ctrl + Shift + D Extract Pages Shortcut: Alt V + T + P
(opens Pages pane; F6 focuses in pane and can arrow down)
![Page 27: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/27.jpg)
Extraction Tool
![Page 28: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/28.jpg)
Tips for Extracting Chapters
Crop on complete file before extracting Work on a copy!!!!! Extract from end toward front! Use table of contents to help Place focus on first page of chapter to
extract (beginning with last)
![Page 29: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/29.jpg)
Starting from the Back
![Page 30: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/30.jpg)
Combining
File > Pages > Insert
OR
Create > Combine files
![Page 31: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/31.jpg)
Inserting Pages
![Page 32: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/32.jpg)
Combining Pages
![Page 33: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/33.jpg)
Auto Extracting Text
File > Save As > MS Word Retains styles and paragraphs
File > Save As > More options… Text (Accessible)
Lose styles, places hard returns at end of line Text (Plain)
Lose styles, keeps paragraphs
Shortcut: Alt F + A
![Page 34: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/34.jpg)
Save As Options
![Page 35: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/35.jpg)
Better Text Extraction
OCR programs analyze text and structure Acrobat Pro has built-in OCR, but other
programs provide more control Can control which text to include
![Page 36: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/36.jpg)
More Control over Text
For graphical PDFs Or To maintain more control over extracting
text from text-based PDFs Use an OCR program!
![Page 37: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/37.jpg)
Processing Graphical PDFsProcessing Graphical PDFs
Must run optical character recognition (OCR) Computers cannot read pictures OCR programs recognize the “characters” in the
picture
How you process the file depends on the end format the student wants!
![Page 38: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/38.jpg)
Want to Stay in PDF?
Sometimes students do want a text-based PDF
Can OCR in Adobe Pro Tools> Recognize Text
![Page 39: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/39.jpg)
Under Tools
![Page 40: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/40.jpg)
Want Text OutWant Text Out
OmniPage or FineReader FineReader generally easier to learn Save to Word or HTML or Text based on student
preference
Use virtual printer with Kurzweil Create KESI files
R&W Save as Word
![Page 41: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/41.jpg)
Which One When?Which One When?
Want a Word file? Best choice is OmniPage or FineReader
Want a Kurzweil document? Use Kurzweil to process the PDF
For students to do themselves? Whichever program they prefer
![Page 42: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/42.jpg)
Why?Why?
OCR programs are designed to make extraction and editing easy
Document readers (R&W, Kurzweil, etc.) are designed to make reading easy…NOT editing.
![Page 43: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/43.jpg)
NEVER!!!NEVER!!!
Do NOT run OCR with FineReader or OmniPage…save to PDF…and then take into Kurzweil, R&W, etc.
Kurzweil, R&W, WYNN will run their own OCR on the PDF! Wastes time, adds error to do OCR twice
![Page 44: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/44.jpg)
OCR ProgramsOCR Programs
Treat PDFs the same as a TIFF If you OCR scanned documents, use the
same process
Load image file Select zones Create templates as needed
![Page 45: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/45.jpg)
OCR Process Details
Crop before loading into OCR engine Turn on multiple languages as needed
If doing math, turn on Greek Only turn on the languages you need
Edit in the OCR program Some OCR programs have font matching features
Save to Word
![Page 46: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/46.jpg)
Captions and Such
For students who want audio or who are using screen readers Separate the main body of the text and the
“ancillary text” (captions, sidebars, footnotes)
Create two documents 00 Chapter and 00A Chapter
Allows the student to hear main text uninterrupted
![Page 47: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/47.jpg)
Two Doc Workflow
Open PDF in OCR Program Analyze layout for entire document
Save a copy On one copy…delete all ancillary text
Save to Word as 00 Chapter On other copy…delete all main body text
Save as 00A Chapter Keep page numbers in both documents!
![Page 48: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/48.jpg)
Once in Word
Learn to use “show hidden” Ctrl + Shift + 8
Beware of the optional hyphen Search and replace to delete Search for ^- replace with nothing Run spell check
Use styles to structure files for braille program
![Page 49: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/49.jpg)
Converting Files
![Page 50: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/50.jpg)
Mobile Readers?
Check formats that device can handle Some handle PDF and DOC, some do not
All readers handle TXT Also called text, ASCII Can save from Word as plain text
![Page 51: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/51.jpg)
Magic Conversion Tool
Calibre Converts to and from many formats Fairly intuitive Free!
http://calibre-ebook.com/
![Page 52: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/52.jpg)
Another Conversion Tool
TechAdapt http://www.techadapt.com/
TechAdapt Accessible Media Center (TAMC) For converting NIMAS and DAISY
DAISY to… RTF HTML
![Page 53: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/53.jpg)
File Transfer
Can use DropBox or Box to transfer files for most readers
Kindle and iPad can often use e-mail
![Page 54: Processing PDF: How to Go from PDF to E-text to Audio](https://reader035.fdocuments.net/reader035/viewer/2022062314/5681437c550346895daffbfb/html5/thumbnails/54.jpg)
Resource InfoResource Info
Gaeir Dietrich [email protected] 408-996-6047
www.htctu.net Alt media listserv Manuals online