Accent Fold (Search Normaliser)

Fold accented letters to their ASCII base letters for search matching


Accent folding normalises text so that searches match regardless of accents and special letters. It is the standard preprocessing step behind diacritic-insensitive search, where café and cafe are treated as identical.

How it works

Two stages reduce Latin-script text to an ASCII search key:

1. Expand special letters and ligatures that have no combining mark:
   ß → ss   æ → ae   œ → oe   ø → o   ð → d   þ → th   ł → l
2. Normalise to NFD and delete combining marks (U+0300–U+036F):
   é → e   ñ → n   ü → u
   Optionally, lowercase the whole string as well so that matching also ignores case.

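The expansion table is needed because letters such as ø and ß have no canonical decomposition: NFD leaves them untouched, so stripping combining marks alone would pass them through unchanged. If you also lowercase, either lowercase before the expansion or include the uppercase forms (Æ, Ø, Ł and so on) in the table, otherwise capitals will slip past it. Apply the same pipeline to both your search index and the incoming query.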
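The two stages can be sketched in Python as follows. This is a minimal illustration, not a reference implementation: the names SPECIAL and fold_accents are made up for this example, and the uppercase table entries are an added assumption so that the expansion also works when lowercasing is switched off.

```python
import unicodedata

# Stage 1 table: letters and ligatures with no canonical decomposition.
# Lowercase entries come from the list above; the uppercase entries are
# an assumption added for this sketch.
SPECIAL = str.maketrans({
    "ß": "ss", "æ": "ae", "œ": "oe", "ø": "o", "ð": "d", "þ": "th", "ł": "l",
    "ẞ": "SS", "Æ": "AE", "Œ": "OE", "Ø": "O", "Ð": "D", "Þ": "TH", "Ł": "L",
})


def fold_accents(text: str, lowercase: bool = True) -> str:
    """Return a search key with special letters expanded and accents removed."""
    # Stage 1: expand special letters and ligatures.
    expanded = text.translate(SPECIAL)
    # Stage 2: decompose to NFD, then drop combining marks U+0300 to U+036F.
    decomposed = unicodedata.normalize("NFD", expanded)
    stripped = "".join(
        ch for ch in decomposed if not ("\u0300" <= ch <= "\u036f")
    )
    # Optional: lowercase so that matching ignores case as well.
    return stripped.lower() if lowercase else stripped


print(fold_accents("Mötley Crüe"))            # motley crue
print(fold_accents("Café", lowercase=False))  # Cafe
```

Characters that are neither in the table nor decomposable by NFD are returned unchanged, so extend the table if your data contains other special letters.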

Example and tips

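With lowercasing enabled, Mötley Crüe folds to motley crue and Straße folds to strasse. Build your search index by folding every stored term with this exact pipeline, then fold each query the same way before comparing. Accented and unaccented spellings then match for every letter the pipeline covers; a letter that is missing from the expansion table and has no decomposition (for example đ or ı) passes through unchanged, so add it to the table if your data needs it.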
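As a rough sketch of the index-and-query pattern described here, the Python below folds every stored term into a lookup key and folds the query the same way before the lookup. The names fold, build_index and search are hypothetical, the in-memory dictionary stands in for whatever index you actually use, and this version always lowercases first so that a lowercase-only table is enough.

```python
import unicodedata
from collections import defaultdict

# Lowercase-only expansion table; input is lowercased before it is applied.
_SPECIAL = str.maketrans(
    {"ß": "ss", "æ": "ae", "œ": "oe", "ø": "o", "ð": "d", "þ": "th", "ł": "l"}
)


def fold(text: str) -> str:
    """Lowercase, expand special letters, then strip combining marks."""
    expanded = text.lower().translate(_SPECIAL)
    decomposed = unicodedata.normalize("NFD", expanded)
    return "".join(ch for ch in decomposed if not ("\u0300" <= ch <= "\u036f"))


def build_index(terms):
    """Map each folded key to the set of original terms that produce it."""
    index = defaultdict(set)
    for term in terms:
        index[fold(term)].add(term)
    return index


def search(index, query):
    """Fold the query with the same pipeline and return the matching terms."""
    return sorted(index.get(fold(query), ()))


index = build_index(["Café", "cafe", "Straße", "Łódź"])
print(search(index, "CAFE"))     # ['Café', 'cafe']
print(search(index, "strasse"))  # ['Straße']
```

Keeping the original terms as values means results can still be displayed with their accents, while only the folded key is used for comparison.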
