PDF Tools

How to Extract Images from a PDF Free Online

Pulling images out of a PDF is simpler than it looks — if the images are actually embedded in the file. The key distinction most guides skip: a PDF can display images in two fundamentally different ways, and only one of them can be extracted as individual files. This guide explains exactly what FusionPDF's extract-images tool finds, how to use it in three steps, and what to do when the result isn't what you expected.

By FusionPDF Team · July 1, 2026 · 6 min read · Updated July 2026

Key Takeaways

FusionPDF uses PDF.js to locate embedded raster image objects (XObjects) in the PDF content stream and exports them as PNG files in a ZIP
Works on PDFs that contain embedded JPEG or PNG photos, diagrams, and scanned images
Vector graphics, text-as-paths, and solid backgrounds are not extractable — they have no pixel data
Need the whole page as an image? Use PDF to Image instead — it renders every page, including vectors
100% browser-based: your PDF never leaves your device

Most people reach for an image extractor when they open a PDF report, textbook, or scanned document and want to save the photos or diagrams inside it. Whether that works — and how well — depends entirely on how those images are stored in the PDF file. Let's look at exactly what's happening under the hood.

How the FusionPDF Extract-Images Tool Works

FusionPDF's extract-images tool uses PDF.js — Mozilla's open-source PDF rendering library — to parse the PDF content stream and locate embedded image objects. In the PDF specification, raster images are stored as XObjects with a subtype of Image. PDF.js identifies each one, decodes the compressed pixel data (whether the original was JPEG, PNG, or another compressed format), and saves it as a PNG. All extracted images are bundled into a single ZIP file that your browser downloads automatically.

This approach — reading image objects directly from the content stream rather than rendering pages — has a practical advantage: images are extracted at their original embedded resolution, not at screen resolution. A high-resolution photo embedded in a PDF at 300 DPI comes out at 300 DPI, not downsampled to whatever the screen renders.

In the PDF specification (ISO 32000-2), raster images are stored as Image XObjects in the file's content stream. Each XObject contains the image dimensions, color space, compression filter (typically DCTDecode for JPEG or FlateDecode for PNG), and the raw compressed pixel data. PDF.js can access these objects directly, making it possible to extract pixel-perfect copies of embedded images without rendering the page. PDF specification ISO 32000-2; Mozilla PDF.js project (github.com/mozilla/pdf.js)

How to Extract Images from a PDF in 3 Steps

The process is straightforward: open the tool, load the PDF, and download the ZIP. No account, no file upload, no software to install. Everything runs in your browser window.

Open the Extract Images tool. Go to fusionpdf.pro/extract-images. No sign-up or account is required. The tool loads directly in your browser.

Load your PDF. Click "Select PDF" or drag your file onto the drop area. The PDF is read into your browser's memory using the FileReader API. Nothing is sent to any server — the file stays on your device throughout the entire process.

Download the ZIP. Click "Extract Images". PDF.js scans every page's content stream for Image XObjects, decodes each one, and saves it as a PNG. Once all images have been collected, your browser downloads a ZIP archive containing all the extracted files, numbered sequentially (image-1.png, image-2.png, and so on).

Tip: If the downloaded ZIP is empty or contains fewer images than you expected, the missing content is likely vector graphics rather than embedded raster images. See the section below on what cannot be extracted — and when to use PDF to Image instead.

What Can Be Extracted: Embedded Raster Images

The tool extracts content that exists in the PDF as a discrete image object with actual pixel data. If you can point to a photo, diagram, or illustration in the PDF that was originally a JPEG or PNG before the PDF was created, it is almost certainly extractable.

Content that can be extracted includes:

Embedded JPEG photos. Product photos, portraits, documentary images — any photograph embedded in the PDF content stream. These are typically compressed with the DCTDecode filter internally and come out as high-quality PNGs.
Embedded PNG images. Screenshots, UI mockups, diagrams created as PNG files before PDF creation. These are stored with lossless compression (FlateDecode or LZWDecode) and extract without quality loss.
Scanned page images inside a PDF. When a scanned document is saved as a PDF, each page scan is stored as a single large image object. The tool will extract these — one PNG per scanned page image found in the content stream.
Embedded illustrations in raster format. Charts exported as PNG from Excel or Google Sheets, technical diagrams saved as JPEG — anything that entered the PDF as a raster image object.

300 DPI

Original embedded resolution is preserved Because the tool reads image data directly from the PDF content stream rather than rendering the page at screen resolution, extracted images come out at the resolution they were embedded — typically 150–300 DPI for press-quality documents.

What Cannot Be Extracted: Vectors, Paths, and Backgrounds

A PDF is not simply a container of image files. Most of what appears visually in a PDF — shapes, lines, borders, backgrounds, even many "drawings" — is described by drawing commands, not stored as pixel data. These elements have no image bytes to extract.

Content that cannot be extracted as individual images includes:

Vector graphics. Logos, icons, infographics, and illustrations created in Illustrator, Inkscape, or exported as SVG or EDF before PDF creation are stored as sequences of path-drawing instructions (m, l, c operators in the content stream). There is no pixel grid to read — only mathematical descriptions of curves and fills.
Text rendered as outlined paths. Some PDFs convert text to paths to embed custom fonts portably. This text "looks like" drawing on screen but contains no image data.
Page backgrounds and color fills. A colored background drawn with a rectangle fill command is a drawing instruction, not an embedded image. It will not appear in the extracted ZIP.
Drawings and diagrams created natively in PDF. Charts produced by PDF-native tools (some LaTeX packages, for example) may draw everything with lines and curves directly in the content stream, with no raster images present at all.

Empty ZIP? If the extracted ZIP is empty or missing elements you expected, the visual content in your PDF is likely vector-based. The extract-images tool can only recover what was embedded as a raster image. For full-page visual capture of any PDF — including vector content — use PDF to Image, which renders the complete page at your chosen resolution.

Content type	Stored as	Extract Images tool
Photo (JPEG/PNG embedded)	Image XObject with pixel data	Extracted
Scanned page image	Image XObject (full-page raster)	Extracted
Vector illustration / logo	Path drawing operators	Not extractable
Solid color background	Rectangle fill command	Not extractable
Text (even styled)	Text operators or outlined paths	Not extractable
Chart drawn natively in PDF	Line and curve operators	Not extractable

Extract Images vs. Convert Pages to Images — Key Difference

These are two different operations that are easy to confuse. Extracting images pulls out the discrete raster image objects that were embedded in the PDF. Converting pages to images renders each full PDF page — every element on it, including text, vector art, and images — into a single flat raster file. Which one you need depends on what you actually want.

Extract Images

Pulls individual embedded images

Reads Image XObjects from the PDF content stream. Gives you each embedded photo or diagram as its own file at original resolution.

One file per embedded image
Original embedded resolution preserved
Only works on raster images in the file
Vector art and page layout are not included

PDF to Image

Renders complete pages as images

Uses PDF.js to render the entire page — text, vectors, photos, backgrounds — at a chosen DPI. Gives you one image per page, showing everything.

One file per page
Resolution is set by the render DPI you choose
Captures vectors, text, and all page content
Ideal for thumbnails, previews, or archiving

Use Extract Images when you want to recover the original photos or diagrams that were placed into the document — for example, pulling product photos out of a catalog PDF, or recovering scanned images from a PDF archive. Use PDF to Image when you want every page of the PDF as a flat image, regardless of what the page contains — for example, creating page thumbnails, preparing pages for printing, or converting a PDF to a format a non-PDF tool can process.

When to use PDF to Image instead: if your PDF contains diagrams, charts, logos, or illustrations that are vector-based, PDF to Image (fusionpdf.pro/pdf-to-img) is the right tool. It renders the complete page at up to 300 DPI, capturing all visual content regardless of how it is stored internally.

Privacy: 100% Browser-Based, No Upload

FusionPDF's extract-images tool never uploads your PDF to a server. The entire operation — parsing, image extraction, ZIP creation, and download — happens inside your browser window using JavaScript and WebAssembly. Your files are not transmitted over the network at any point.

Here is what happens technically when you use the tool:

Your PDF is read into browser memory using the FileReader API — a standard browser capability that gives JavaScript access to local files without a server round-trip.
PDF.js parses the file structure entirely in the browser's JavaScript engine. PDF.js is open-source and maintained by Mozilla — the same library used in Firefox's built-in PDF viewer.
Extracted image data is assembled into a ZIP archive in memory using a client-side JavaScript ZIP library.
The final ZIP is offered as a download via a Blob URL — a temporary in-browser URL that points to data already in memory. No file ever touches FusionPDF's servers.

This means your PDF content — including any sensitive photos, confidential diagrams, or personal documents — never leaves your machine. You do not need to trust FusionPDF's servers with your data, because your data never reaches them.

bytes transmitted to any server Every step of the image extraction process runs locally in your browser. No upload, no cloud processing, no server storage. Your PDF and its contents stay on your device from start to finish.

If you are extracting images from documents that contain personal data, medical records, legal contracts, or proprietary business content, the browser-based model means there is no third-party server exposure to worry about.

Frequently asked questions

Why are some images missing from the extracted ZIP?

The tool extracts embedded raster image objects (XObjects) from the PDF content stream. If an image appears to be missing, the most likely reason is that the content is actually vector graphics — paths, shapes, and lines drawn with PDF drawing operators, not embedded image data. Vector content has no pixel data to extract. The same applies to backgrounds drawn with solid color fills and text rendered as outlined paths. Only JPEG and PNG raster images that were genuinely embedded in the PDF can be extracted. If you need the complete page including all vector content, use PDF to Image to render the full page at your chosen resolution.

What format are the extracted images?

All extracted images are saved as PNG files, regardless of whether the original embedded image was JPEG or PNG internally. Converting to PNG during extraction produces lossless output and simplifies the ZIP file format — you get a consistent set of PNG files rather than a mix of formats. The ZIP contains one PNG per extracted image, numbered sequentially (image-1.png, image-2.png, and so on).

Is there a limit to how many images I can extract?

There is no hard limit imposed by FusionPDF. The tool extracts every embedded raster image it finds across all pages of the PDF. The practical ceiling is your device's available memory — a PDF with hundreds of large high-resolution images may slow down on devices with limited RAM. Most documents with embedded photos or diagrams complete extraction in a few seconds. There is no cap on the number of pages or images processed.

Can I extract images from a password-protected PDF?

If the PDF is encrypted and requires a password to open, the tool cannot extract images from it — the content stream is encrypted and unreadable without the decryption key. If you own the PDF and know the password, open it in any PDF reader first, then print or re-export it as an unencrypted PDF. You can then use that unencrypted copy with the extract-images tool. Note that some PDFs have permissions-only restrictions (printing allowed, copying restricted) rather than full encryption — PDF.js can read and extract images from those without any issue.

What if my PDF only has vector art and no raster images?

If the PDF contains only vector graphics — illustrations drawn with paths, curves, and fills rather than pixel data — the extract-images tool will find zero embedded image objects and the downloaded ZIP will be empty. This is common with PDFs created from Illustrator, Inkscape, or engineering software where everything is drawn as geometry. For vector content, the right tool is PDF to Image at fusionpdf.pro/pdf-to-img, which renders the entire page (including all vector art, text, and backgrounds) as a flat PNG or JPEG at your chosen resolution. You get one image per page — not the individual objects, but a faithful rendering of what the page looks like.

Extract Images from Your PDF — Free, No Upload

Load your PDF in the browser, let PDF.js find every embedded image, and download a ZIP of PNG files. Your document never leaves your device.

Extract Images → PDF to Image (full pages) →