Accessibility: PDFs
- Ensures screen readers can navigate and interpret content.
- Makes scanned documents searchable and readable.
- Preserves structure and accessibility features from source documents.
Use accessible PDFs when sharing finalized documents, forms, reports or resources that need to be preserved in a fixed format.
Use Adobe Acrobat or Microsoft Word to tag headings, lists and tables before exporting. Convert scanned PDFs into searchable text using optical character recognition.
Example
A scanned flyer should be run through optical character recognition and tagged with headings and alternative text before being uploaded to a website or emailed to users.
PDF tags are hidden markers that define the logical structure and reading order of content within a PDF document. They work like HTML tags, identifying different elements like headings, paragraphs and lists.
Document Structure Tags
Text-Level Tags
List and Table Tags
Specalized Tags
Form Tags
Optical Character Recognition
Optical character recognition is a technology that converts different types of documents — such as scanned paper documents, PDFs or images captured by a camera — into machine-readable text. It’s widely used for digitizing printed materials so they can be searched, edited and processed electronically.
How OCR Works
- Image Preprocessing
- Improves image quality (deskewing, noise removal, binarization).
- Character Detection
- Identifies shapes that resemble letters or numbers.
- Pattern Recognition
- Matches detected shapes against known character patterns.
- Postprocessing
- Applies language models or dictionaries to correct errors.
Common Uses
- Digitizing books and archives.
- Automating data entry from printed forms.
- Extracting text from invoices, receipts or ID cards.
- Making scanned PDFs accessible (tagging + searchable text).
Use Adobe Acrobat’s Accessibility Checker (All Tools > Prepare for Accessibility > Check for Accessibility) to scan for missing tags, reading order issues and contrast problems. Manually verify that headings, lists and tables are tagged correctly.