Metadata Removal

DOCX

DOCX files are essentially a ZIP archive containing XML files

  1. Make a backup of your file first.

  2. Rename the file extension from .docx to .zip

  3. Unzip it.

Files containing metadata

  • core.xml --> contains author, creation date, etc.

  • app.xml --> contains application-specific metadata

  • custom.xml --> may contain custom properties

  1. Delete the sensitive content from these files.

  • Re-zip the contents: zip -r ../clean_document.docx * from inside the folder.

  • Rename back to .docx

Last updated