informatica.infacore.functions.dq_functions.DataQualityFunctions
- class informatica.infacore.functions.dq_functions.DataQualityFunctions
Bases:
object
Exposes functions to analyze, validate, and improve the quality of your data.
You can use this class to eliminate duplicate and redundant records, verify addresses, and standardize operations on your business data.
Example
>>> import informatica.infacore as ic >>> dq_obj = ic.DataQualityFunctions() >>> dq_obj.get_country_iso_values(src_data_obj, country_col_name)
Methods
Translates diacritical characters to regular ASCII text characters.
find_invalid_names
(src_data_object, ...)Flags suspicious or fake names in the input data.
get_country_iso_values
(src_data_object, ...)Reads a country name as input and returns the full ISO country name, the ISO three-letter country code, and the ISO two-letter country code.
parse_name
(src_data_object, ...)Determines the gender status.If you know the gender for a specified name, the rule determines the status based on the gender-specific score.
remove_control_characters
(src_data_object, ...)Identifies and removes non-printable characters in an input field.
standardize_us_companyname
(src_data_object, ...)Standardizes a company name and provides an acronym if possible.
validate_email_address
(src_data_object, ...)Validates the email format.
validate_ssn
(src_data_object, ssn_col_name)Parses a United States SSN pattern from a larger string of text.
validate_us_county
(src_data_object, ...)Checks if the input column data is a valid county name in any state in the Unites States.
validate_usa_phone_number
(src_data_object, ...)Parses the United States phone number from a string of text and validates if the area code is valid for the United States.
validate_usa_state
(src_data_object, ...)Validates if the input column data is a state name in the United States.
validate_usa_zipcode
(src_data_object, ...)Validates if the input column data is a five-digit United States zip code.