CSV Column Type Inferrer

Detect the probable data type of each CSV column — date, int, float, bool

Ad placeholder (leaderboard)

The CSV Column Type Inferrer reads pasted CSV data and works out the most likely data type of each column — integer, float, boolean, ISO date, email, phone or free-text — with a confidence score. It is a quick way to understand an unknown dataset or bootstrap a database schema, all in your browser.

How it works

First the tool parses the CSV using an RFC-4180-style reader that correctly handles quoted fields containing commas, escaped quotes ("") and embedded newlines. It then samples up to 1000 data rows and classifies each non-empty cell with an ordered set of tests:

  1. boolean — values like true, false, yes, no, t, f
  2. integer — optional sign followed by digits
  3. float — a decimal point with digits
  4. ISO dateYYYY-MM-DD with an optional time component that also parses as a real date
  5. email — a local part, @, and a dotted domain
  6. phone — an optional + and at least seven digits with common separators
  7. free-text — anything else

A column is assigned a type only when at least 80% of its non-empty cells match a single type; otherwise it is reported as free-text. If a numeric column mixes whole numbers and decimals, it is reported as float so every value fits.

Example

A price column containing 19.99, 5 and 12.50 is reported as float (because of the decimals), while an id column of 1, 2, 3 is reported as integer. A signup_date column of 2026-01-15-style values is reported as date.

Notes

Confidence is the fraction of non-empty cells matching the chosen type, so it also hints at data quality — a high-but-not-100% score usually points to a few malformed rows worth investigating. All parsing runs locally, so the tool is safe for sensitive datasets.

Ad placeholder (rectangle)