Data Quality bundle for India

Use the Data Quality bundle for India to accelerate the configuration and deployment of data quality solutions in your organization. The bundle includes mapplets and other assets that define cleansing, labeling, and parsing operations for India data.
The following table lists the assets contained in the Data Quality bundle for India:
Name
Asset Type
Description
c_ind_city_longitude
Cleanse
Returns the longitude coordinate of a city.
c_India_State
Cleanse
Returns valid if the input is an Indian state.
c_ind_district_std
Cleanse
Returns a standardized district name.
c_ind_state_std
Cleanse
Returns a standardized state name.
c_ind_state_for_city
Cleanse
Returns the state corresponding to a city.
c_ind_city_latitude
Cleanse
Returns the latitude coordinate of a city.
ind_pincode_prefix_3digits
Dictionary
Contains the first three digits of the Postal Index Number (PIN) code and the corresponding state name.
indian_states
Dictionary
Contains a list of Indian states and union territories.
ind_cities_w_lat_long
Dictionary
Contains a list of Indian cities with geocoordinates.
ind_middlename
Dictionary
Contains a list of middle names based on gender for India.
ind_pincode_prefix_2digits
Dictionary
Contains the first two digits of the Postal Index Number (PIN) code and the corresponding state name.
ind_names
Dictionary
Contains a list of given names for India.
ind_prename_gender
Dictionary
Contains a list of genders on the basis of Indian prenames.
india_districts
Dictionary
Contains a list of district names in India.
ind_puducherry_pincode
Dictionary
Contains Puducherry Postal Index Number (PIN) codes.
ind_lakshadweep_pincode
Dictionary
Contains Lakshadweep Postal Index Number (PIN) codes.
ind_vehicle_registration_code
Dictionary
Contains a list of Indian vehicle registration codes for each state.
ind_lastname
Dictionary
Contains a list of common Indian surnames.
ind_gst_state_code
Dictionary
Validates state code and returns corresponding state name.
ind_DadraNagarHaveli_DamanDiu_pincode
Dictionary
Contains Dadra and Nagar Haveli, Daman and Diu Postal Index Number (PIN) codes.
mplt_IND_District_Parse
Mapplet
Parses an Indian district from a string of text.
mplt_IND_Corporate_Identification_Number_Validation
Mapplet
Validates a Corporate Identification Number (CIN), which is a 21-character alphanumeric code.
The CIN is unique to each company registered in India.
The number is issued by the Registrar of Companies (ROC) and is used to track a company's activities and other details.
mplt_IND_Address_Validation_Multiline
Mapplet
Validates the deliverability of India addresses. Use the mapplet when you can connect the input address fields to fields on the Multiline model in the verifier asset.
mplt_IND_GSTIN_Standardize
Mapplet
Standardizes an Indian Goods and Services Tax Identification Number (GSTIN).
mplt_IND_Aadhaar_Number_Parse
Mapplet
Parses a 12-digit Indian Aadhaar number from a string of text.
mplt_IND_Driving_License_Master
Mapplet
Parses, validates, and standardizes an Indian driving license number.
mplt_IND_Driving_License_Validation
Mapplet
Validates Indian driving license numbers.
mplt_IND_PAN_Standardization
Mapplet
Standardizes an Indian Permanent Account Number (PAN).
The mapplet returns strings in the following format:
  • - no punctuation
mplt_IND_Pincode_Parse
Mapplet
Parses an Indian Postal Index Number (PIN) code from a string of text.
mplt_IND_Corporate_Identification_Number_Standardization
Mapplet
Standardizes a 21-character alphanumeric string as an Indian Corporate Identification Number (CIN).
The mapplet returns strings in the following format:
  • - LNNNNNAAYYYYAAANNNNNN
mplt_IND_GSTIN_Parse
Mapplet
Parses an Indian Goods and Services Tax Identification Number (GSTIN) from a string of text.
mplt_IND_District_Validate
Mapplet
Validates Indian district names.
mplt_IND_EPIC_Master
Mapplet
Parses, validates, and standardizes an Indian Electors Photo Identity Card (EPIC) number.
mplt_IND_District_Standardize
Mapplet
Standardizes Indian district names.
The mapplet returns strings in the following format:
  • - No_Punctuation, except hyphen
mplt_IND_Driving_License_Parse
Mapplet
Parses 15-character strings that conform to the Indian driving license number format.
mplt_IND_Pincode_Master
Mapplet
Parses, validates, and standardizes an Indian Postal Index Number (PIN) code.
mplt_IND_State_Validation
Mapplet
Validates an input string as an Indian state name.
mplt_India_PAN_Master
Mapplet
Parses, validates, and standardizes an Indian Permanent Account Number (PAN).
mplt_IND_EPIC_Validation
Mapplet
Validates an Electors Photo Identity Card (EPIC) number.
mplt_IND_Gender_Assignment
Mapplet
Assigns gender according to first names. The mapplet returns M for male names, F for female names, and U if the gender is unknown. For example, the mapplet assigns the name Raj Kumar a gender of M for male.
mplt_IND_Passport_Standardize
Mapplet
Standardizes an eight-character string as an Indian passport number.
The mapplet returns strings in the following format:
  • - ann nnnnn
mplt_IND_PAN_Validation
Mapplet
Validates an Indian Permanent Account Number (PAN).
mplt_IND_Driving_License_Standardization
Mapplet
Standardizes an Indian driving license number.
The mapplet returns strings in the following format:
  • - AA-NNNNNNNNNNNNN
mplt_IND_Pincode_Standardize
Mapplet
Standardizes an Indian Postal Index Number (PIN) code.
The mapplet returns strings of numbers in the following format:
  • - no punctuation
mplt_IND_UPI_ID_Validate
Mapplet
Validates a Unified Payments Interface (UPI) ID. UPI is a banking system for money transfers on payment applications.
mplt_IND_GSTIN_Validate
Mapplet
Validates an Indian Goods and Services Tax Identification Number (GSTIN).
Returns a valid or invalid status along with the state name for the GST state code.
mplt_IND_EPIC_Standardize
Mapplet
Standardizes a 10-character string as an Indian Electors Photo Identity Card (EPIC) number.
The mapplet returns strings in the following format:
  • - aaannnnnnn
mplt_IND_Aadhaar_Validation
Mapplet
Validates an Indian Aadhaar number.
mplt_IND_Aadhaar_Number_Standardize
Mapplet
Standardizes a 12-digit number as an Indian Aadhaar number.
The mapplet returns strings of numbers in the following format:
  • - 1111 1111 1111
mplt_IND_Pincode_Validate
Mapplet
Validates an Indian Postal Index Number (PIN) code.
mplt_IND_Corporate_Identification_Number_Parse
Mapplet
Parses a 21-character alphanumeric string that conforms to the format of the Indian Corporate Identification Number (CIN) registered with the Registrar of Companies (ROC) from a string of text.
mplt_IND_City_n_Location_Data
Mapplet
Returns the geographic location, including the state, latitude, and longitude for an Indian city.
mplt_IND_Vehicle_Registration_Number_Parse
Mapplet
Parses an Indian vehicle registration number from a string.
mplt_India_Passport_Parse
Mapplet
Parses eight-character strings that conform to the Indian passport number format that the Government Data Standards Catalogue specifies.
mplt_India_EPIC_Parse
Mapplet
Parses 10-character strings that conform to the Electors Photo Identity Card (EPIC) number format that the Government Data Standards Catalogue specifies.
mplt_IND_Address_Validation_Discrete
Mapplet
Validates the deliverability of India addresses. Use the mapplet when you can connect the input address fields to fields on the Discrete model in the verifier asset.
mplt_IND_State_Standardization
Mapplet
Standardizes Indian state names.
The mapplet returns strings in the following format:
  • - No_Punctuation
mplt_IND_PAN_Parse
Mapplet
Parses 10-character strings that conform to the Indian Permanent Account Number (PAN) format that the Government Data Standards Catalogue specifies.
mplt_IND_Address_Validation_Hybrid
Mapplet
Validates the deliverability of Indian addresses. Use the mapplet when you can connect the input address fields to fields on the Hybrid model in the verifier asset.
mplt_IND_State_Parse
Mapplet
Parses an Indian state from a string of text.
mplt_IND_State_Master
Mapplet
Parses, validates, and standardizes an Indian state name.
mplt_IND_Passport_Master
Mapplet
Parses, validates, and standardizes an Indian passport number.
mplt_IND_Vehicle_Registration_No_Standardize
Mapplet
Standardizes an Indian vehicle registration number to the AA NN AA NNNN format.
mplt_IND_Passport_Validation
Mapplet
Validates an Indian passport number.
lbl_ind_gst_state_code
Labeler
Labels Goods and Services Tax (GST) state codes.
lbl_ind_epic
Labeler
Labels Indian Electors Photo Identity Card (EPIC) numbers as valid.
lbl_ind_dl
Labeler
Labels a valid Indian driving license number as valid.
lbl_IND_PAN
Labeler
Labels an Indian Permanent Account Number (PAN) as valid.
lbl_IND_GSTIN
Labeler
Returns valid if the input value matches the regular expression.
lbl_ind_district
Labeler
Labels Indian district names as valid.
lbl_ind_state
Labeler
Labels Indian state names as valid.
lbl_ind_pspt
Labeler
Labels Indian passport numbers as valid.
lbl_ind_reg_code
Labeler
Labels Indian state codes as valid.
lbl_ind_upi
Labeler
Labels valid Unified Payments Interface (UPI) IDs.
lbl_ind_aadhaar
Labeler
Labels Indian Aadhaar numbers as valid.
p_IND_District
Parse
Parses Indian district names.
p_ind_epic
Parse
Parses an Electors Photo Identity Card (EPIC) number.
p_pan
Parse
Parses an Indian Permanent Account Number (PAN) from a string.
p_ind_aadhaar
Parse
Parses an Indian Aadhaar number from a string.
p_ind_middlenames
Parse
Assigns a gender value based on the middle name.
p_IND_Vehicle_Number
Parse
Parses a vehicle registration number.
p_ind_passport
Parse
Parses an Indian passport number.
p_ind_gender_assignment_prename
Parse
Assigns a gender value based on the name prefix.
p_ind_cin
Parse
Parses an Indian Corporate Identification Number (CIN).
p_IND_GSTIN
Parse
Parses a Goods and Services Tax Identification Number (GSTIN) from a string of text.
p_IND_Phone_Number
Parse
Parses an Indian phone number from a string of text.
p_ind_dl
Parse
Parses a valid Indian driving license number.
p_ind_givenname_gender
Parse
Parses the gender for a given name.
p_IND_State
Parse
Parses an Indian state name.
p_IND_Pincode
Parse
Parses six-digit Indian Postal Index Number (PIN) codes.
rs_pincode_stateName
Rule Specification
Identifies state names based on Postal Index Number (PIN) code prefix values.
v_IND_AddressValidation_Discrete_w_Geo
Verifier
Verifies Indian addresses with input fields that map to the Discrete model, and adds geocodes to the output.
v_IND_AddressValidation_Multiline_w_Geo
Verifier
Verifies Indian addresses with input fields that map to the Multiline model, and adds geocodes to the output.
v_IND_AddressValidation_Discrete
Verifier
Verifies Indian addresses with input fields that map to the Discrete model.
v_IND_AddressValidation_Hybrid
Verifier
Verifies Indian addresses with input fields that map to the Hybrid model.
v_IND_AddressValidation_Multiline
Verifier
Verifies Indian addresses with input fields that map to the Multiline model.
v_IND_AddressValidation_Hyb_w_Geo
Verifier
Verifies Indian addresses with input fields that map to the Hybrid model, and adds geocodes to the output.
mplt_IND_Phone_Number_Parse
Mapplet
Parses a ten-digit Indian telephone number from a string of text.
p_IND_Phone_Number
Parse
Parses an Indian telephone number from a string of text.
mplt_IND_Phone_Number_Validation
Mapplet
Validates an Indian telephone number and identifies whether it is a landline or mobile number.
Returns a valid or invalid status. For landline numbers, also returns the associated region based on the Subscriber Trunk Dialling (STD) code.
c_remove_dialling_prefix
Cleanse
Removes the dialling prefix from Indian telephone numbers.
rs_ind_std_codes
Rule Specification
Returns the Subscriber Trunk Dialling (STD) code from an Indian telephone number.
lbl_ind_std_codes
Labeler
Labels an input value as valid if the value is present in the dictionary and returns the associated Indian city, town, or village name.
mplt_IND_Phone_Number_Standardize
Mapplet
Standardizes the format of Indian telephone numbers.
The mapplet standardizes the numbers in the following formats:
  • - Landline number format: CountryCode-StdCode LandlineNumber. For example, +91 44-35279123
  • - Mobile number format: CountryCode MobileNumber. For example, +91 87654 32456
mplt_Remove_Punctuation_and_Space
Mapplet
Removes all punctuation symbols and character spaces from an input field, and returns the remaining digits and letters.
c_merge_remaining_text
Cleanse
Merges multiple input fields into a single field.
mplt_UpperCase
Mapplet
Changes alphabetic characters to uppercase.
c_Remove_Numbers
Cleanse
Removes numbers from a string.
mplt_Remove_Non_Numbers
Mapplet
Removes all characters that are not digits from an input string.
mplt_Remove_All_Leading_Zeros
Mapplet
Removes any leading zero from number data.
c_Uppercase
Cleanse
Converts the input text to uppercase.
c_Remove_Punctuation
Cleanse
Removes all occurrences of punctuation from the input field.
c_Replace_Limited_Punctuation_w_Space
Cleanse
Replaces any forward slash, backslash, exclamation point, period, or underscore character with a character space. Also replaces instances of multiple character spaces with a single space.
c_IFSC_Bank_Code
Cleanse
Replaces an Indian Financial System Code (IFSC) with a valid full bank name.
An IFSC is an eleven-character alphanumeric code that the Reserve Bank of India uses to identify bank branches within the NEFT (National Electronic Funds Transfer) network.
c_av6_verification_status_code_desc
Cleanse
Returns descriptions for verification status codes.
c_Remove_Space
Cleanse
Removes all occurrences of a character space from the input field.
c_Remove_Extra_Spaces
Cleanse
Replaces multiple consecutive spaces with a single space and trims leading and trailing spaces.
c_av6_address_type_desc
Cleanse
Returns descriptions for address type codes.
c_av6_result_quality_desc
Cleanse
Returns descriptions for Result Quality.
Full_IFSC_Code_List
Dictionary
Contains a list of consolidated Indian Financial System Codes that identify NEFT-enabled bank branches as of December 31, 2022. NEFT is the abbreviation for National Electronic Funds Transfer.
dq_av6_address_type_desc
Dictionary
Contains a list of address type codes with descriptions.
dq_av6_result_quality_descriptions
Dictionary
Contains a list of the result quality codes that the Address Verification version 6 engine returns, with descriptions.
dq_av6_status_code_descriptions_infa
Dictionary
Contains alpha status values that indicate the outcome of the verification operation for an address. Each status value has a corresponding text description.
ind_std_codes
Dictionary
Contains a list of Indian Subscriber Trunk Dialling (STD) codes along with the city, town, or village names.
mplt_Remove_Extra_Spaces
Mapplet
Replaces multiple consecutive spaces with a single character space and trims leading and trailing spaces.
mplt_Remove_Limited_Punctuation
Mapplet
Removes a limited number of punctuation symbols, including a forward slash, backslash, exclamation point, period, or underscore character. Also replaces instances of two, three, or four consecutive character spaces with a single space.
mplt_IND_Vehicle_Registration_No_Validation
Mapplet
Validates the input string as a vehicle registration number in India.
mplt_Remove_Space
Mapplet
Removes all occurrences of a character space from an input field.
mplt_Replace_Punctuation_with_Space
Mapplet
Replaces punctuation symbols with a character space.
mplt_Replace_Limited_Punct_with_Space
Mapplet
Replaces punctuation symbols including a forward slash, backslash, exclamation point, period, or underscore character with a character space. Also replaces instances of two, three, or four consecutive character spaces with a single space.
mplt_IND_Bank_Name_From_IFSC_Code
Mapplet
Checks an Indian Financial System Code (IFSC) and returns the corresponding bank name.
mplt_AV6_Assign_DQ_Status_Code_Description
Mapplet
Describes the Match Score value that a verifier asset returns for the Address Verification version 6 engine.
mplt_Remove_Punctuation
Mapplet
Removes all punctuation symbols from the input.
mplt_IND_Bank_IFSC_Code_Validation
Mapplet
Validates an Indian Financial System Code (IFSC).
An IFSC is an eleven-character alphanumeric code that the Reserve Bank of India uses to identify bank branches within the NEFT (National Electronic Funds Transfer) network.
lbl_ifsc_code
Labeler
Labels the numeric part of an Indian Financial System Code (IFSC) as valid.
lbl_IFSC_Bank_Code
Labeler
Labels the alphabetic characters of the input based on the contents of the Indian Financial System Code (IFSC) bank codes.
rs_remove_all_leading_zeros
Rule Specification
Removes all the zeros at the beginning of an input string.
mplt_IND_Address_Validation_Hybrid_w_Geocoding
Mapplet
Validates the deliverability of India addresses and adds latitude and longitude coordinates to each valid address. Use the mapplet when you can connect the input address fields to fields on the Hybrid model in the verifier asset.
mplt_IND_Address_Validation_Discrete_w_Geocoding
Mapplet
Validates the deliverability of India addresses and adds latitude and longitude coordinates to each valid address. Use the mapplet when you can connect the input address fields to fields on the Discrete model in the verifier asset.
mplt_IND_Address_Validation_Multiline_w_Geocoding
Mapplet
Validates the deliverability of India addresses and adds latitude and longitude coordinates to each valid address. Use the mapplet when you can connect the input address fields to fields on the Multiline model in the verifier asset.
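
Several descriptions in the table give an output format as a character template, such as LNNNNNAAYYYYAAANNNNNN for a CIN, AA NN AA NNNN for a vehicle registration number, and 1111 1111 1111 for an Aadhaar number. The following Python sketch is not part of the bundle and does not reproduce the logic of any mapplet. It only illustrates how such templates could be checked outside the product, under the assumption that L, A, and a each stand for one letter and that N, n, and Y each stand for one digit (the table does not define these symbols). All function names and sample values are hypothetical.

```python
import re

# Assumed meaning of the template symbols (not defined in the table):
# L, A, a = one letter; N, n, Y = one digit; any other character is literal.
_SYMBOLS = {
    "L": "[A-Za-z]", "A": "[A-Za-z]", "a": "[A-Za-z]",
    "N": "[0-9]", "n": "[0-9]", "Y": "[0-9]",
}


def template_to_regex(template: str) -> re.Pattern:
    """Translate a format template such as 'AA NN AA NNNN' into a regex."""
    parts = [_SYMBOLS.get(ch, re.escape(ch)) for ch in template]
    return re.compile("".join(parts))


def matches_template(value: str, template: str) -> bool:
    """Return True if the whole value has the shape that the template describes."""
    return template_to_regex(template).fullmatch(value) is not None


def group_aadhaar(digits: str) -> str:
    """Group a 12-digit string into three blocks of four digits."""
    if not re.fullmatch(r"[0-9]{12}", digits):
        raise ValueError("expected exactly 12 digits")
    return " ".join(digits[i:i + 4] for i in range(0, 12, 4))


def remove_leading_zeros(value: str) -> str:
    """Strip every zero at the start of the string."""
    return value.lstrip("0")


if __name__ == "__main__":
    # Made-up sample values that only exercise the shape checks.
    print(matches_template("L12345MH2020PLC123456", "LNNNNNAAYYYYAAANNNNNN"))
    print(matches_template("KA 01 AB 1234", "AA NN AA NNNN"))
    print(group_aadhaar("123412341234"))
    print(remove_leading_zeros("000123"))
```

A shape check of this kind confirms only that a value has the expected layout of letters, digits, and separators. It does not confirm that the identifier was issued or that any embedded code, such as a state code, is valid, which is what the dictionary-backed validation assets in the bundle address.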