Intelligent Data Lake
This section describes new Intelligent Data Lake features in version 10.1.1.
Data Preview for Tables in External Sources
Effective in version 10.1.1, you can preview sample data from tables in external sources (sources outside the Hadoop data lake) if those sources are cataloged. The administrator must configure JDBC connections with Sqoop and grant analysts the required permissions. Analysts can then use these connections to connect to the data source and view data from assets that are not in the data lake.
For more information, see the "Discover Data" chapter in the 10.1.1 Intelligent Data Lake User Guide.
Importing Data From Tables in External Sources
Effective in version 10.1.1, you can import data from tables in external sources (sources outside the Hadoop data lake), such as Oracle and Teradata, into the data lake if those sources are already cataloged. The administrator must configure JDBC connections with Sqoop to the external sources and grant access to the analysts. Analysts can then use these connections to preview a data asset and import it into the data lake as needed.
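As a rough illustration of what a Sqoop import over JDBC involves, the following Python sketch assembles the arguments for a standard `sqoop import` command. It is not how Intelligent Data Lake invokes Sqoop internally, which this document does not describe. The helper name `build_sqoop_import_args`, the host name, the table name, and all paths are hypothetical.

```python
from typing import List


def build_sqoop_import_args(
    jdbc_url: str,
    username: str,
    password_file: str,
    table: str,
    target_dir: str,
    num_mappers: int = 1,
) -> List[str]:
    """Hypothetical helper: build the argument list for a `sqoop import` run.

    Only standard Sqoop 1 options are used. The values passed in are
    placeholders chosen for illustration, not product defaults.
    """
    return [
        "sqoop", "import",
        "--connect", jdbc_url,             # JDBC URL of the external source
        "--username", username,
        "--password-file", password_file,  # file that holds the password
        "--table", table,                  # source table to import
        "--target-dir", target_dir,        # HDFS directory that receives the data
        "--num-mappers", str(num_mappers),
    ]


# Example values are assumptions, not settings taken from the product.
example_args = build_sqoop_import_args(
    jdbc_url="jdbc:oracle:thin:@//dbhost.example.com:1521/ORCLPDB1",
    username="analyst",
    password_file="/user/analyst/.sqoop_password",
    table="SALES.ORDERS",
    target_dir="/datalake/staging/orders",
)
print(" ".join(example_args))
```

In the product, the administrator supplies the equivalent connection details when configuring the JDBC connection, so analysts do not build such commands themselves.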
For more information, see the "Discover Data" chapter in the 10.1.1 Intelligent Data Lake User Guide.
Exporting Data to External Targets
Effective in version 10.1.1, you can export a data asset or a publication to external targets (targets outside the Hadoop data lake), such as Oracle and Teradata. The administrator must configure JDBC connections with Sqoop to the external targets and grant access to the analysts. Analysts can then use these connections to export the data asset to the external database.
For more information, see the "Discover Data" chapter in the 10.1.1 Intelligent Data Lake User Guide.
Configuring Sampling Criteria for Data Preparation
Effective in version 10.1.1, you can specify the sampling criteria that best suit your data preparation needs for a given data asset. You can include only selected columns, filter the data, choose the number of rows to sample, and select either Random or First N rows as the sampling method.
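The sampling options described above can be sketched in plain Python. This is a minimal illustration of the idea, not the product's implementation: the function `sample_rows`, its parameters, and the order in which the filter and the sample are applied are all assumptions.

```python
import random
from typing import Callable, Dict, List, Optional, Sequence

Row = Dict[str, object]


def sample_rows(
    rows: Sequence[Row],
    columns: Sequence[str],
    row_filter: Optional[Callable[[Row], bool]] = None,
    n: int = 1000,
    method: str = "first",
    seed: Optional[int] = None,
) -> List[Row]:
    """Hypothetical sketch: filter rows, sample them, then keep chosen columns."""
    if method not in ("first", "random"):
        raise ValueError("method must be 'first' or 'random'")

    # Assumption: the filter is applied before the sample is drawn.
    kept = [r for r in rows if row_filter is None or row_filter(r)]

    if method == "first":
        chosen = kept[:n]  # First N rows
    else:
        rng = random.Random(seed)
        chosen = rng.sample(kept, min(n, len(kept)))  # Random rows

    # Keep only the columns selected for preparation.
    return [{c: r[c] for c in columns} for r in chosen]


# Illustrative data only.
data = [
    {"id": i, "region": "EU" if i % 2 == 0 else "US", "amount": i * 10}
    for i in range(10)
]
first_three = sample_rows(
    data, ["id", "amount"], lambda r: r["region"] == "EU", n=3, method="first"
)
print(first_three)
```

A First N sample is reproducible and cheap, while a Random sample is more likely to reflect the overall distribution of values in the asset.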
For more information, see the "Prepare Data" chapter in the 10.1.1 Intelligent Data Lake User Guide.
Performing a Lookup on Worksheets
Effective in version 10.1.1, you can perform a lookup on worksheets. Use the lookup function to look up a key column in another worksheet and fetch values from the corresponding columns in that worksheet.
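Conceptually, a lookup of this kind behaves like a keyed join between two worksheets. The Python sketch below illustrates the idea under stated assumptions: the function `lookup` and its parameters are hypothetical, unmatched keys yield `None`, and the first matching row wins when the lookup worksheet contains duplicate keys. The product's actual behavior in those cases is not described here.

```python
from typing import Dict, List, Sequence

Row = Dict[str, object]


def lookup(
    sheet: Sequence[Row],
    lookup_sheet: Sequence[Row],
    key: str,
    lookup_key: str,
    fetch_columns: Sequence[str],
) -> List[Row]:
    """Hypothetical sketch: fetch columns from lookup_sheet by matching keys."""
    # Index the lookup worksheet by its key column.
    # Assumption: the first row seen for a key is the one that is used.
    index: Dict[object, Row] = {}
    for row in lookup_sheet:
        index.setdefault(row[lookup_key], row)

    result: List[Row] = []
    for row in sheet:
        match = index.get(row[key])
        enriched = dict(row)  # copy so the input worksheet is not modified
        for col in fetch_columns:
            # Assumption: a missing match produces None for fetched columns.
            enriched[col] = match[col] if match is not None else None
        result.append(enriched)
    return result


# Illustrative worksheets only.
orders = [{"order_id": 1, "cust_id": "C1"}, {"order_id": 2, "cust_id": "C9"}]
customers = [{"id": "C1", "name": "Acme", "country": "DE"}]
print(lookup(orders, customers, key="cust_id", lookup_key="id", fetch_columns=["name"]))
```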
For more information, see the "Prepare Data" chapter in the 10.1.1 Intelligent Data Lake User Guide.
Downloading as a TDE File
Effective in version 10.1.1, you can download the data in a data lake asset as a TDE (Tableau Data Extract) file. You can search for any data asset and download it as a CSV file or a TDE file. You can open a downloaded TDE file directly in Tableau.
For more information, see the "Discover Data" chapter in the 10.1.1 Intelligent Data Lake User Guide.
Sentry and Ranger Support
Effective in version 10.1.1, Intelligent Data Lake supports Sentry and Ranger on Cloudera and Hortonworks. Sentry and Ranger provide a centralized security framework for managing fine-grained access control on these distributions. You can create authorization rules or policies to control access to data. Sentry and Ranger support SQL-based authorization for data lake assets.