Image & Document Privacy Guide

Metadata Removal Guide: Protecting Privacy in Photos and Documents

Every file you create or capture contains hidden metadata that can reveal far more than the visible content suggests. This guide explains what metadata exists in images and documents, what it discloses, and how to remove it before sharing files.

What Is Metadata and Why Does It Matter?

Metadata is "data about data" — structured information embedded within a file that describes its content, origin, and history. Unlike the visible content of a photo or document, metadata is invisible in normal viewing but is immediately accessible to anyone who examines the file with appropriate tools — and these tools are freely available, simple to use, and often built into operating systems.

The privacy implications are substantial. A photo shared on social media, a document submitted in a legal proceeding, a PDF sent to a journalist, or an image attached to an email can all carry metadata that discloses the creator's identity, location, the device they used, and when and where the file was created or edited — information that was never intended to be shared.

Several high-profile cases have involved metadata revealing the identities of anonymous whistleblowers, the locations of journalists reporting from sensitive areas, and the identity of document authors in legal disputes. Metadata removal is not a niche concern — it is a basic hygiene practice for anyone who shares files professionally or wishes to maintain privacy online.

EXIF Metadata in Photos: What Your Camera Reveals

EXIF (Exchangeable Image File Format) is a standard for storing metadata in JPEG, TIFF, and some PNG files. Defined by the JEITA (Japan Electronics and Information Technology Industries Association), EXIF was designed to preserve camera settings alongside images for photographic purposes. Modern smartphones extend this standard significantly.

Location Data (GPS Tags)

When location services are enabled during photography, EXIF stores GPS coordinates with precision typically between 1 and 10 meters. The stored data includes latitude, longitude, altitude, and GPS timestamp — sufficient to identify a specific room in a building, let alone a home address. On iOS and Android, location embedding is controlled by settings and camera permissions, but is enabled by default on most devices.

Device and Camera Information

EXIF records the camera make (e.g., Apple), model (e.g., iPhone 15 Pro), lens model, and sometimes a unique device identifier. This allows correlation of images taken by the same device across different photographers or pseudonymous accounts — a powerful de-anonymization technique for investigating actors who maintain multiple online identities.

Timestamps and Edit History

EXIF contains multiple timestamps: the original capture time, the digitization time, and the modification time if the image was edited. These can be used to verify (or contradict) claimed timelines, locate a person during a specific time period based on where they were photographed, or reveal editing software versions.

PDF Metadata: What Documents Reveal About Their Authors

PDF files maintain several layers of metadata defined by the PDF specification (ISO 32000). The core document properties — accessible via File > Properties in any PDF reader — include the author's name, the title, subject, keywords, the application used to create the file, and the PDF producer (often revealing the version of Word, LibreOffice, or Adobe Acrobat used).

Beyond these standard fields, PDFs can contain XMP (Extensible Metadata Platform) packets — an XML-based metadata format that provides additional fields including the original author's full name, organisation, copyright information, and a history of which software has opened or modified the document.

Before sharing a PDF professionally — in a legal submission, a public consultation response, a press release, or an academic submission — use our PDF Metadata Remover to strip all author fields, software information, timestamps, and XMP data. The cleaned PDF retains its visual content and formatting while revealing nothing about its creator or editing history.

Remove PDF Metadata →

Bulk Metadata Removal for Multiple Photos

When preparing a set of photos for publication — a portfolio, a blog post, a press release, or social media content — removing EXIF data one image at a time is impractical. Our Bulk Metadata Remover processes multiple images simultaneously in the browser, with a real-time progress indicator and a ZIP download containing all cleaned images.

The bulk tool strips EXIF by redrawing each image on an HTML Canvas element and exporting it as a new JPEG. The canvas output contains no EXIF data by specification — the image data is preserved at the specified quality, but all embedded metadata is discarded at the rendering stage.

This approach works for JPEG images. PNG files typically contain metadata in tEXt and iTXt chunks rather than EXIF — these are handled by the same canvas redraw process, which discards all non-pixel data during export.

Metadata Removal Checklist

Before sharing any file that may contain sensitive metadata, work through this checklist:

  1. Inspect first. Use the Image Metadata Viewer to see exactly what EXIF tags are present before deciding what to remove.
  2. Remove EXIF from all photos. Process through the Remove EXIF Metadata tool. For multiple images, use the Bulk Metadata Remover.
  3. Strip PDF metadata. Run all PDFs through the PDF Metadata Remover before external distribution.
  4. Redact visible sensitive content. Use Screenshot Redaction or Face Blur for visual content that needs obscuring.
  5. Scan document text for PII. Use the PII Pattern Detector on the document's text content to catch embedded personal data that metadata removal won't address.

Frequently Asked Questions

What metadata do photos contain?

EXIF metadata commonly includes GPS coordinates precise to within a few meters, camera make and model, shooting timestamp, exposure settings, lens information, and sometimes a device identifier. Use the Image Metadata Viewer to inspect any photo's full tag set.

How do I remove EXIF data from a photo?

Use our Remove EXIF Metadata tool — it strips the EXIF segment from JPEG files at the binary level, entirely in your browser. For multiple photos, the Bulk Metadata Remover processes entire folders.

What metadata does a PDF contain?

PDF files embed author name, creator software, producer, creation and modification timestamps, keywords, and XMP packets. These can reveal the document creator's identity and editing history to anyone who examines the file properties.

Does sharing photos on social media remove EXIF data automatically?

Most platforms strip EXIF during upload, but behaviour varies and is not guaranteed. Some platforms retain location data internally even when not displaying it. Always remove EXIF locally before uploading to guarantee no location data is shared.