How to Extract Images from a PDF Free Online
Pulling images out of a PDF is simpler than it looks — if the images are actually embedded in the file. The key distinction most guides skip: a PDF can display images in two fundamentally different ways, and only one of them can be extracted as individual files. This guide explains exactly what FusionPDF's extract-images tool finds, how to use it in three steps, and what to do when the result isn't what you expected.
- FusionPDF uses PDF.js to locate embedded raster image objects (XObjects) in the PDF content stream and exports them as PNG files in a ZIP
- Works on PDFs that contain embedded JPEG or PNG photos, diagrams, and scanned images
- Vector graphics, text-as-paths, and solid backgrounds are not extractable — they have no pixel data
- Need the whole page as an image? Use PDF to Image instead — it renders every page, including vectors
- 100% browser-based: your PDF never leaves your device
Most people reach for an image extractor when they open a PDF report, textbook, or scanned document and want to save the photos or diagrams inside it. Whether that works — and how well — depends entirely on how those images are stored in the PDF file. Let's look at exactly what's happening under the hood.
How the FusionPDF Extract-Images Tool Works
FusionPDF's extract-images tool uses PDF.js — Mozilla's open-source PDF rendering library — to parse the PDF content stream and locate embedded image objects. In the PDF specification, raster images are stored as XObjects with a subtype of Image. PDF.js identifies each one, decodes the compressed pixel data (whether the original was JPEG, PNG, or another compressed format), and saves it as a PNG. All extracted images are bundled into a single ZIP file that your browser downloads automatically.
This approach — reading image objects directly from the content stream rather than rendering pages — has a practical advantage: images are extracted at their original embedded resolution, not at screen resolution. A high-resolution photo embedded in a PDF at 300 DPI comes out at 300 DPI, not downsampled to whatever the screen renders.
How to Extract Images from a PDF in 3 Steps
The process is straightforward: open the tool, load the PDF, and download the ZIP. No account, no file upload, no software to install. Everything runs in your browser window.
Open the Extract Images tool. Go to fusionpdf.pro/extract-images. No sign-up or account is required. The tool loads directly in your browser.
Load your PDF. Click "Select PDF" or drag your file onto the drop area. The PDF is read into your browser's memory using the FileReader API. Nothing is sent to any server — the file stays on your device throughout the entire process.
Download the ZIP. Click "Extract Images". PDF.js scans every page's content stream for Image XObjects, decodes each one, and saves it as a PNG. Once all images have been collected, your browser downloads a ZIP archive containing all the extracted files, numbered sequentially (image-1.png, image-2.png, and so on).
Tip: If the downloaded ZIP is empty or contains fewer images than you expected, the missing content is likely vector graphics rather than embedded raster images. See the section below on what cannot be extracted — and when to use PDF to Image instead.
What Can Be Extracted: Embedded Raster Images
The tool extracts content that exists in the PDF as a discrete image object with actual pixel data. If you can point to a photo, diagram, or illustration in the PDF that was originally a JPEG or PNG before the PDF was created, it is almost certainly extractable.
Content that can be extracted includes:
- Embedded JPEG photos. Product photos, portraits, documentary images — any photograph embedded in the PDF content stream. These are typically compressed with the DCTDecode filter internally and come out as high-quality PNGs.
- Embedded PNG images. Screenshots, UI mockups, diagrams created as PNG files before PDF creation. These are stored with lossless compression (FlateDecode or LZWDecode) and extract without quality loss.
- Scanned page images inside a PDF. When a scanned document is saved as a PDF, each page scan is stored as a single large image object. The tool will extract these — one PNG per scanned page image found in the content stream.
- Embedded illustrations in raster format. Charts exported as PNG from Excel or Google Sheets, technical diagrams saved as JPEG — anything that entered the PDF as a raster image object.
What Cannot Be Extracted: Vectors, Paths, and Backgrounds
A PDF is not simply a container of image files. Most of what appears visually in a PDF — shapes, lines, borders, backgrounds, even many "drawings" — is described by drawing commands, not stored as pixel data. These elements have no image bytes to extract.
Content that cannot be extracted as individual images includes:
- Vector graphics. Logos, icons, infographics, and illustrations created in Illustrator, Inkscape, or exported as SVG or EDF before PDF creation are stored as sequences of path-drawing instructions (
m,l,coperators in the content stream). There is no pixel grid to read — only mathematical descriptions of curves and fills. - Text rendered as outlined paths. Some PDFs convert text to paths to embed custom fonts portably. This text "looks like" drawing on screen but contains no image data.
- Page backgrounds and color fills. A colored background drawn with a rectangle fill command is a drawing instruction, not an embedded image. It will not appear in the extracted ZIP.
- Drawings and diagrams created natively in PDF. Charts produced by PDF-native tools (some LaTeX packages, for example) may draw everything with lines and curves directly in the content stream, with no raster images present at all.
Empty ZIP? If the extracted ZIP is empty or missing elements you expected, the visual content in your PDF is likely vector-based. The extract-images tool can only recover what was embedded as a raster image. For full-page visual capture of any PDF — including vector content — use PDF to Image, which renders the complete page at your chosen resolution.
| Content type | Stored as | Extract Images tool |
|---|---|---|
| Photo (JPEG/PNG embedded) | Image XObject with pixel data | Extracted |
| Scanned page image | Image XObject (full-page raster) | Extracted |
| Vector illustration / logo | Path drawing operators | Not extractable |
| Solid color background | Rectangle fill command | Not extractable |
| Text (even styled) | Text operators or outlined paths | Not extractable |
| Chart drawn natively in PDF | Line and curve operators | Not extractable |
Extract Images vs. Convert Pages to Images — Key Difference
These are two different operations that are easy to confuse. Extracting images pulls out the discrete raster image objects that were embedded in the PDF. Converting pages to images renders each full PDF page — every element on it, including text, vector art, and images — into a single flat raster file. Which one you need depends on what you actually want.
Pulls individual embedded images
Reads Image XObjects from the PDF content stream. Gives you each embedded photo or diagram as its own file at original resolution.
- One file per embedded image
- Original embedded resolution preserved
- Only works on raster images in the file
- Vector art and page layout are not included
Renders complete pages as images
Uses PDF.js to render the entire page — text, vectors, photos, backgrounds — at a chosen DPI. Gives you one image per page, showing everything.
- One file per page
- Resolution is set by the render DPI you choose
- Captures vectors, text, and all page content
- Ideal for thumbnails, previews, or archiving
Use Extract Images when you want to recover the original photos or diagrams that were placed into the document — for example, pulling product photos out of a catalog PDF, or recovering scanned images from a PDF archive. Use PDF to Image when you want every page of the PDF as a flat image, regardless of what the page contains — for example, creating page thumbnails, preparing pages for printing, or converting a PDF to a format a non-PDF tool can process.
When to use PDF to Image instead: if your PDF contains diagrams, charts, logos, or illustrations that are vector-based, PDF to Image (fusionpdf.pro/pdf-to-img) is the right tool. It renders the complete page at up to 300 DPI, capturing all visual content regardless of how it is stored internally.
Privacy: 100% Browser-Based, No Upload
FusionPDF's extract-images tool never uploads your PDF to a server. The entire operation — parsing, image extraction, ZIP creation, and download — happens inside your browser window using JavaScript and WebAssembly. Your files are not transmitted over the network at any point.
Here is what happens technically when you use the tool:
- Your PDF is read into browser memory using the FileReader API — a standard browser capability that gives JavaScript access to local files without a server round-trip.
- PDF.js parses the file structure entirely in the browser's JavaScript engine. PDF.js is open-source and maintained by Mozilla — the same library used in Firefox's built-in PDF viewer.
- Extracted image data is assembled into a ZIP archive in memory using a client-side JavaScript ZIP library.
- The final ZIP is offered as a download via a Blob URL — a temporary in-browser URL that points to data already in memory. No file ever touches FusionPDF's servers.
This means your PDF content — including any sensitive photos, confidential diagrams, or personal documents — never leaves your machine. You do not need to trust FusionPDF's servers with your data, because your data never reaches them.
If you are extracting images from documents that contain personal data, medical records, legal contracts, or proprietary business content, the browser-based model means there is no third-party server exposure to worry about.
Why are some images missing from the extracted ZIP?
The tool extracts embedded raster image objects (XObjects) from the PDF content stream. If an image appears to be missing, the most likely reason is that the content is actually vector graphics — paths, shapes, and lines drawn with PDF drawing operators, not embedded image data. Vector content has no pixel data to extract. The same applies to backgrounds drawn with solid color fills and text rendered as outlined paths. Only JPEG and PNG raster images that were genuinely embedded in the PDF can be extracted. If you need the complete page including all vector content, use PDF to Image to render the full page at your chosen resolution.
What format are the extracted images?
All extracted images are saved as PNG files, regardless of whether the original embedded image was JPEG or PNG internally. Converting to PNG during extraction produces lossless output and simplifies the ZIP file format — you get a consistent set of PNG files rather than a mix of formats. The ZIP contains one PNG per extracted image, numbered sequentially (image-1.png, image-2.png, and so on).
Is there a limit to how many images I can extract?
There is no hard limit imposed by FusionPDF. The tool extracts every embedded raster image it finds across all pages of the PDF. The practical ceiling is your device's available memory — a PDF with hundreds of large high-resolution images may slow down on devices with limited RAM. Most documents with embedded photos or diagrams complete extraction in a few seconds. There is no cap on the number of pages or images processed.
Can I extract images from a password-protected PDF?
If the PDF is encrypted and requires a password to open, the tool cannot extract images from it — the content stream is encrypted and unreadable without the decryption key. If you own the PDF and know the password, open it in any PDF reader first, then print or re-export it as an unencrypted PDF. You can then use that unencrypted copy with the extract-images tool. Note that some PDFs have permissions-only restrictions (printing allowed, copying restricted) rather than full encryption — PDF.js can read and extract images from those without any issue.
What if my PDF only has vector art and no raster images?
If the PDF contains only vector graphics — illustrations drawn with paths, curves, and fills rather than pixel data — the extract-images tool will find zero embedded image objects and the downloaded ZIP will be empty. This is common with PDFs created from Illustrator, Inkscape, or engineering software where everything is drawn as geometry. For vector content, the right tool is PDF to Image at fusionpdf.pro/pdf-to-img, which renders the entire page (including all vector art, text, and backgrounds) as a flat PNG or JPEG at your chosen resolution. You get one image per page — not the individual objects, but a faithful rendering of what the page looks like.
Extract Images from Your PDF — Free, No Upload
Load your PDF in the browser, let PDF.js find every embedded image, and download a ZIP of PNG files. Your document never leaves your device.