How is this different from drawing a black box?

Drawing a black rectangle only hides the text visually; the text underneath can still be copied or extracted. This tool removes the matched characters from the PDF's content streams, so a copy-paste from the output does not reveal them.

Does the PDF leave my computer?

No. The file is read with the browser File API and rewritten using the native CompressionStream and DecompressionStream APIs. Nothing is uploaded, which is exactly what you want for sensitive documents.

Why did it say no matching text was found?

The PDF may be a scanned image with no real text layer, or it may use subset-encoded fonts that store glyph codes rather than literal characters. In those cases the text is not present as searchable strings and cannot be redacted this way.

Will the layout shift after redaction?

No. Each matched character is replaced with a single space inside the same text string, so the string keeps its length, and word positions and page layout stay intact while the sensitive content is gone.

Should I verify the result?

Always. Re-open the downloaded PDF and try to select and copy the area you redacted to confirm the text is truly gone before you share the document.

What is the PDF Text Redactor?

Free in-browser PDF text redactor from Gera Tools. Search for a name, number or phrase and remove it from a text-based PDF's content streams so it cannot be copied back out. It uses native browser compression APIs, so there is no upload and no server.

PDF Text Redactor — Gera Tools

Name: PDF Text Redactor
Creator: Gera Tools
License: https://creativecommons.org/licenses/by/4.0/

Redacting a PDF properly means the sensitive text must be gone, not just hidden behind a black box that anyone can remove or copy through. This tool searches a text-based PDF for a phrase you specify and deletes those characters from the file’s content streams, all inside your browser so the document is never uploaded.

How it works

A text-based PDF stores its visible text as literal strings inside content streams, usually compressed with FlateDecode. The redactor works directly on those streams rather than rendering the page and drawing over it:

Parses the raw PDF bytes and locates each stream … endstream object.
For FlateDecode streams, inflates the data using the browser’s native DecompressionStream("deflate") — no external library is needed.
Inside the decoded text-showing operators (Tj and TJ), finds your phrase within the parenthesized PDF strings and replaces each matched character with a space, so the glyphs vanish from the text layer.
Re-compresses the stream with CompressionStream, fixes the stream’s /Length, and reassembles a valid PDF for download.
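The parsing, inflating and re-compressing steps above can be sketched roughly as follows. This is a minimal illustration, not the tool's actual source: the helper names (indexOfBytes, findStreams, inflate, deflate) are hypothetical, and the sketch assumes every stream is followed by an end-of-line marker before endstream and that the compressed bytes never happen to contain the word "endstream".

```javascript
// Sketch only: helper names are hypothetical, not the tool's real code.
const enc = new TextEncoder();

// Find `needle` (bytes) inside `haystack` (bytes), starting at `from`.
function indexOfBytes(haystack, needle, from = 0) {
  outer: for (let i = from; i <= haystack.length - needle.length; i++) {
    for (let j = 0; j < needle.length; j++) {
      if (haystack[i + j] !== needle[j]) continue outer;
    }
    return i;
  }
  return -1;
}

// Return the byte range of the data in each `stream ... endstream` pair.
function findStreams(pdf) {
  const START = enc.encode("stream");
  const END = enc.encode("endstream");
  const spans = [];
  let pos = 0;
  while ((pos = indexOfBytes(pdf, START, pos)) !== -1) {
    let start = pos + START.length;
    if (pdf[start] === 0x0d) start++;                 // optional CR
    if (pdf[start] !== 0x0a) { pos++; continue; }     // keyword must end with LF
    start++;
    let end = indexOfBytes(pdf, END, start);
    if (end === -1) break;
    pos = end + END.length;
    // Assumption: one EOL separates the data from `endstream`; drop it.
    if (pdf[end - 1] === 0x0a) end--;
    if (pdf[end - 1] === 0x0d) end--;
    spans.push({ start, end });
  }
  return spans;
}

// Pipe bytes through a native (De)CompressionStream and collect the output.
async function pipeBytes(bytes, transform) {
  const stream = new Blob([bytes]).stream().pipeThrough(transform);
  return new Uint8Array(await new Response(stream).arrayBuffer());
}

// FlateDecode data is zlib-wrapped, which the Compression Streams API names "deflate".
const inflate = (bytes) => pipeBytes(bytes, new DecompressionStream("deflate"));
const deflate = (bytes) => pipeBytes(bytes, new CompressionStream("deflate"));
```

A production parser would more likely read the stream size from /Length than scan for endstream. Re-compressed data is rarely the same size as the original, which is why /Length has to be rewritten; any byte offsets recorded elsewhere in the file move as well.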

Because the characters are removed rather than covered, selecting and copying the redacted region in the output yields only blank spaces. The text is structurally absent, not merely hidden visually.
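As a rough illustration of the replacement step, the sketch below blanks a phrase inside parenthesized PDF strings in an already-inflated content stream. The function name redactLiteralStrings is hypothetical, and the sketch makes simplifying assumptions: it does not handle nested unescaped parentheses, and it only finds a phrase that sits whole inside one string.

```javascript
// Hypothetical helper: replace `phrase` with spaces, but only inside
// parenthesized PDF literal strings, leaving operators and operands alone.
function redactLiteralStrings(content, phrase) {
  if (!phrase) return content;
  const blank = " ".repeat(phrase.length);
  // Matches ( ... ) while allowing backslash escapes such as \( and \).
  // Assumption: no nested, unescaped parentheses inside the string.
  const literalString = /\((?:\\[\s\S]|[^\\()])*\)/g;
  return content.replace(literalString, (str) => str.split(phrase).join(blank));
}

const before = "BT /F1 12 Tf (Invoice for Jane Doe) Tj ET";
const after = redactLiteralStrings(before, "Jane Doe");

console.log(after.includes("Jane Doe"));      // false
console.log(after.length === before.length);  // true
```

In this sketch, a phrase split across the elements of a TJ array, for example [(Jane) -250 (Doe)] TJ, would not be matched, because no single string contains the whole phrase.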

Why black boxes are not enough

This is the most important concept in document redaction. A black rectangle drawn on a PDF is either an annotation or a drawing instruction painted over the text. The text underneath is still there, still selectable, and still extractable by any PDF parser, copy-paste operation, or command-line tool. Numerous real-world disclosures of sensitive information have happened because organisations used visual overlay redaction rather than content-stream deletion. This tool removes the characters from the stream itself.
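To make this concrete, here is a small sketch with an invented page content stream: it shows some text and then fills a black rectangle over it. The account number, coordinates and the extractShownText helper are made up for illustration; the point is only that a trivial extractor still reads the covered text.

```javascript
// An invented content stream: text is shown first, then a filled black
// rectangle is painted over the same area.
const overlayRedacted = [
  "BT /F1 12 Tf 72 700 Td (Account 4111-1111) Tj ET",
  "0 0 0 rg 70 695 150 16 re f", // black fill colour, rectangle path, fill
].join("\n");

// Hypothetical naive extractor: collect every literal string shown with Tj.
function extractShownText(content) {
  const shown = [];
  for (const m of content.matchAll(/\(((?:\\[\s\S]|[^\\()])*)\)\s*Tj/g)) {
    shown.push(m[1]);
  }
  return shown;
}

console.log(extractShownText(overlayRedacted)); // [ 'Account 4111-1111' ]
```

The rectangle changes what is painted on the page, but the Tj operand is untouched, so anything that reads the stream still sees it.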

What this tool can and cannot redact

Works on: digitally created PDFs exported from Word, LibreOffice, InDesign, LaTeX, and most modern document tools. These store real text in content streams and the characters are literal ASCII or Unicode strings.

Does not work on: scanned PDFs where each page is a photograph with no text layer — the text is pixels, not characters, so there is nothing to delete from a content stream. Run OCR to generate a text layer first if needed.

May not work reliably on: PDFs using heavily subset-encoded fonts that store glyph indices instead of readable characters. In these cases the tool will report no match found even if the text is visible on screen, because the stored character codes do not correspond to the literal characters you typed.
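The sketch below illustrates why a literal search can miss visible text. The second stream uses a hex string whose values are invented glyph codes (they are placeholders, not a real font's encoding), so the typed phrase never appears in the bytes. The countStringKinds helper is hypothetical and is only a rough diagnostic; glyph codes can also be stored in parenthesized strings, which this check would not detect.

```javascript
// Hypothetical diagnostic: count literal (...) strings and hex <...> strings
// in an inflated content stream.
function countStringKinds(content) {
  const literal = (content.match(/\((?:\\[\s\S]|[^\\()])*\)/g) || []).length;
  // `<<` dictionary delimiters are not counted: they are not followed by hex digits.
  const hex = (content.match(/<[0-9A-Fa-f\s]+>/g) || []).length;
  return { literal, hex };
}

const plainStream = "BT /F1 12 Tf (Jane) Tj ET";
const subsetStream = "BT /F2 11 Tf <002D00440051> Tj ET"; // invented glyph codes

console.log(plainStream.includes("Jane"));   // true
console.log(subsetStream.includes("Jane"));  // false
console.log(countStringKinds(subsetStream)); // { literal: 0, hex: 1 }
```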

Verification after redaction

Always re-open the downloaded PDF and:

Try to select and copy the redacted area — paste into a text editor and confirm nothing appears.
Use a PDF inspector or command-line tool to check the content stream if you need higher confidence.
Never share the document without verifying — assume the redaction failed until you have confirmed it succeeded.
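For the higher-confidence check, one possible approach is to inflate every stream in the downloaded file and search for the phrase again. The sketch below is an assumption-laden illustration, not part of the tool: phraseSurvives and bytesToBinaryString are hypothetical names, the phrase is assumed to be plain ASCII, and streams that do not inflate as FlateDecode are simply skipped.

```javascript
// Map each byte to one character so string offsets equal byte offsets.
function bytesToBinaryString(bytes) {
  let s = "";
  for (let i = 0; i < bytes.length; i += 8192) {
    s += String.fromCharCode(...bytes.subarray(i, i + 8192));
  }
  return s;
}

// Inflate zlib-wrapped (FlateDecode) data with the native DecompressionStream.
async function inflate(bytes) {
  const stream = new Blob([bytes]).stream().pipeThrough(new DecompressionStream("deflate"));
  return new Uint8Array(await new Response(stream).arrayBuffer());
}

// Hypothetical check: true if `phrase` is still present in the raw bytes
// or inside any stream that can be inflated.
async function phraseSurvives(pdfBytes, phrase) {
  const raw = bytesToBinaryString(pdfBytes);
  if (raw.includes(phrase)) return true; // uncompressed occurrence

  const streamKeyword = /stream\r?\n/g;
  let m;
  while ((m = streamKeyword.exec(raw)) !== null) {
    const start = m.index + m[0].length;
    let end = raw.indexOf("endstream", start);
    if (end === -1) break;
    streamKeyword.lastIndex = end + "endstream".length;
    // Assumption: one EOL separates the data from `endstream`; drop it.
    if (raw[end - 1] === "\n") end--;
    if (raw[end - 1] === "\r") end--;
    try {
      const text = bytesToBinaryString(await inflate(pdfBytes.subarray(start, end)));
      if (text.includes(phrase)) return true;
    } catch {
      // Not FlateDecode, or not decodable this way: skip this stream.
    }
  }
  return false;
}
```

In a browser this could be called with new Uint8Array(await file.arrayBuffer()) for a file chosen by the user. A false result only means this particular search found nothing; it says nothing about text stored as glyph codes, in metadata, or in images.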

Everything runs locally in your browser; the PDF is never transmitted.