Geospatial
Country name/abbreviation actually exists | This cell doesn't contain a valid country name |
State name actually exists | This cell doesn't contain a valid state name |
State abbreviation actually exists | This cell doesn't contain a valid state abbreviation |
Noise
Mostly numeric columns with rare (non-N/A) text | This value doesn't look like a number |
Occasional percentage or currency numbers | This value doesn't look like a number |
Numeric
Numeric outlier | This number looks way too [big or small] |
Likely noise numbers | This number looks like a placeholder |
Numeric truncation | Possible Numeric Truncation |
Security / PII
Social security numbers | This column may contain social security numbers |
Credit card numbers | This column may contain credit card numbers |
Phone numbers | This column may contain phone numbers |
Email addresses | This column may contain email addresses |
Structural Warnings
Empty Columns | This column is blank |
Duplicate Rows | This row is a duplicate of the one above it. |
Likely row truncation | Possible row truncation. The number of characters in this row could indicate that some amount of data was clipped. |
Likely column truncation | Possible column truncation. The number of characters in a given field could indicate that some amount of data was clipped. |
Suspiciously round numbers of rows | Suspiciously round number of rows. I.e. exactly 1000 rows. Perhaps this isn’t the full data, but rather a subset. |
Rare blank cells in columns | A column which contains mostly filled values has some small number left blank. |
Text
String truncation | Possible string truncation. |
Likely noise text | This text looks like a placeholder (i.e. qwerty, asdf) |
String length outlier | This text looks [longer or shorter] than the rest |