DataQualityFunctions

Classes

DataQualityFunctions()

Exposes functions to analyze, validate, and improve the quality of your data.

Methods

DataQualityFunctions.get_country_iso_values(...)

Reads a country name as input and returns the full ISO country name, the ISO three-letter country code, and the ISO two-letter country code.

DataQualityFunctions.validate_email_address(...)

Validates the email format.

DataQualityFunctions.find_invalid_names(...)

Flags suspicious or fake names in the input data.

DataQualityFunctions.remove_control_characters(...)

Identifies and removes non-printable characters in an input field.

DataQualityFunctions.convert_diacritic_english_chars(...)

Translates diacritical characters to regular ASCII text characters.

DataQualityFunctions.validate_usa_zipcode(...)

Validates if the input column data is a five-digit United States zip code.

DataQualityFunctions.validate_usa_state(...)

Validates if the input column data is a state name in the United States.

DataQualityFunctions.validate_ssn(...)

Parses a United States SSN pattern from a larger string of text.

DataQualityFunctions.validate_usa_phone_number(...)

Parses the United States phone number from a string of text and validates if the area code is valid for the United States.

DataQualityFunctions.parse_name(...)

Determines the gender status.If you know the gender for a specified name, the rule determines the status based on the gender-specific score.

DataQualityFunctions.validate_us_county(...)

Checks if the input column data is a valid county name in any state in the Unites States.

DataQualityFunctions.standardize_us_companyname(...)

Standardizes a company name and provides an acronym if possible.