Back to all postsApril 28, 2026

How to Get Text Out of a Scanned PDF

Why you cannot copy from a scanned PDF

A scanned PDF is a stack of photos in a PDF wrapper. The pages look like documents, but the text is an image, so you cannot select or search it. To get usable text, you run each page through OCR. Save or screenshot a page as an image, then drop it into the image to text converter.

Step by step

1. Turn the PDF pages into images

Export the pages as JPG or PNG from your PDF viewer, or take a clear, full-size screenshot of each page.

2. Upload to ocrX

Add the page image, pick the language, and extract.

3. Work through the pages

Do them in order and keep the text together as you go.

4. Save the result

Download as TXT, or as a PDF or Word file if you want a tidy document.

Why bother

Once the text is real text, you can search a contract for a clause, copy an address out of an old letter, or quote a paragraph without retyping it. A searchable archive beats a folder of flat images.

Tips

Export pages at a decent size. Tiny thumbnails read poorly.
Pick the language that matches the document.
For long PDFs, work in small batches so you do not lose your place.

Wrapping up

A scanned PDF is just pictures until you read it. ocrX turns those pages back into text you can search, copy, and edit.

How to Get Text Out of a Scanned PDF

Why you cannot copy from a scanned PDF

Step by step

1. Turn the PDF pages into images

2. Upload to ocrX

3. Work through the pages

4. Save the result

Why bother

Tips

Wrapping up

Try ocrX

More from the blog

How to Extract Text From an Image (No Typing Required)

How to Convert a Photo to Text on iPhone and Android

How to Copy Text From a Screenshot