•When the connection is used as a target, the last batch of the insert load is not reflected in the record count. Refer the session logs for the record count of the last batch inserted.
•For example, if the commit interval is set to 1 million and the actual rows inserted are 1.1 million, the record count in the UI shows 1 million and the session logs reveal the row count of the reminder 100k records.
•Set the commit interval to the highest value possible before java.lang.OutofMemoryError is encountered.
•When the connection is used as a target to load data into Hadoop, ensure that all the fields are mapped.
•After a data load in Hive, Impala needs to be refreshed manually for the latest changes to the table to be reflected in Impala. In the current version, the connector does not automatically refresh Impala upon a Hive dataset insert.