Connectivity Summary
An out-of-the-box connector is available for the AWS Glue Catalog. It supports crawling cataloged objects.
Connectivity to AWS Glue is established through the AWS SDK for AWS Glue, which is included in the platform.
The connector supports any version of AWS Glue.
The drivers used by the connector are given below:
Driver/API: AWS SDK for AWS Glue
Details: SDK is provided by AWS to communicate with AWS Glue
Technical Specifications
The core components of the Java binding are the AWS SDK for Glue files, which contain the Java classes that provide the connection and caching mechanisms for communicating with AWS Glue.
The connector capabilities are shown below:
Crawling
Feature | Supported Objects | Remarks |
Crawling | Tables | |
Crawling | Table Columns | Supports all the standard data types of AWS Glue. |
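To illustrate what crawling tables and table columns from a Glue catalog involves, here is a minimal Python sketch that walks databases and tables through a boto3-style Glue client. It is not the connector's implementation (the connector uses the Java SDK). The function name `crawl_catalog` and the record layout are assumptions made for this example.

```python
from typing import Any, Dict, List


def crawl_catalog(glue_client: Any) -> List[Dict[str, Any]]:
    """Return one record per table with its columns and Glue data types.

    `glue_client` is expected to behave like boto3.client("glue").
    The function name and the record layout are hypothetical.
    """
    records: List[Dict[str, Any]] = []
    db_pages = glue_client.get_paginator("get_databases").paginate()
    for db_page in db_pages:
        for database in db_page["DatabaseList"]:
            table_pages = glue_client.get_paginator("get_tables").paginate(
                DatabaseName=database["Name"]
            )
            for table_page in table_pages:
                for table in table_page["TableList"]:
                    # Columns live under StorageDescriptor, which may be absent.
                    columns = table.get("StorageDescriptor", {}).get("Columns", [])
                    records.append(
                        {
                            "database": database["Name"],
                            "table": table["Name"],
                            "columns": [(c["Name"], c.get("Type")) for c in columns],
                        }
                    )
    return records
```

Assuming boto3 is installed and credentials are configured, the function could be called as `crawl_catalog(boto3.client("glue", region_name="us-east-1"))`. The region shown is only a placeholder.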
Profiling
Feature | Supported Objects | Remarks |
Table Profiling | Not Supported | |
View Profiling | Not Supported | |
Column Profiling | Not Supported | |
Full Profiling | Not Supported | |
Sample Profiling | Not Supported | |
Lineage Building
Feature | Supported Objects | Remarks |
Table Lineage | Not Supported | |
Column Lineage | Not Supported | |
Lineage Sources | Not Supported | |
Querying
Feature | Supported Objects | Remarks |
Select | Not Supported | |
Insert | Not Supported | |
Update | Not Supported | |
Delete | Not Supported | |
Join within database | Not Supported | |
Join outside database | Not Supported | |
Aggregations | Not Supported | |
Group By | Not Supported | |
Order By | Not Supported | |
Note: By default, the service account provided for the connector will be used for any query operations.
Pre-requisites
To use the connector, the following must be available:
The connection details specified in the following section.
An admin/service account for crawling and profiling. If a role ARN is used, the role must have access to the entities to be crawled, such as databases and tables.
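As an illustration of the access requirement, the sketch below builds an IAM policy document that grants read access to Glue databases and tables. The four actions listed are real Glue IAM actions, but whether this set is sufficient for the connector is an assumption. The names `READ_ONLY_GLUE_ACTIONS` and `build_read_policy` are hypothetical.

```python
import json
from typing import Any, Dict

# Read-only Glue catalog actions. Whether this set is sufficient for the
# connector is an assumption, not something stated in this document.
READ_ONLY_GLUE_ACTIONS = [
    "glue:GetDatabase",
    "glue:GetDatabases",
    "glue:GetTable",
    "glue:GetTables",
]


def build_read_policy(resource: str = "*") -> Dict[str, Any]:
    """Build an IAM policy document allowing read access to catalog metadata."""
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Effect": "Allow",
                "Action": READ_ONLY_GLUE_ACTIONS,
                "Resource": resource,
            }
        ],
    }


# Serialize the policy so it can be pasted into the IAM console or a template.
policy_json = json.dumps(build_read_policy(), indent=2)
```

In practice the `Resource` value would usually be narrowed from `"*"` to the ARNs of the specific catalog, databases, and tables that should be crawled.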
Connection Details
The following are the connection settings that should be added for connecting to an AWS Glue Catalog:
General Authentication
Property | Details |
Database Type | AWS Glue |
Connection Name | Enter a connection name for the AWS Glue Catalog. The name is a reference used to identify this AWS Glue Catalog connection in OvalEdge. Example: AWS Glue Connection |
Access key | The AWS access key used to authenticate. |
Secret key | The AWS secret key paired with the access key. |
Database region | The AWS region of the AWS Glue Catalog. |
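The three credential fields above correspond to the standard keyword arguments of an AWS SDK client. A minimal Python sketch of that mapping follows, assuming boto3 is used instead of the Java SDK bundled with the platform. The helper name `access_key_client_kwargs` is hypothetical.

```python
from typing import Dict


def access_key_client_kwargs(access_key: str, secret_key: str, region: str) -> Dict[str, str]:
    """Map the Access key, Secret key and Database region fields to the
    keyword arguments accepted by boto3.client. Hypothetical helper."""
    fields = (
        ("Access key", access_key),
        ("Secret key", secret_key),
        ("Database region", region),
    )
    for label, value in fields:
        # Reject empty or whitespace-only values before any AWS call is made.
        if not value or not value.strip():
            raise ValueError(f"{label} is required")
    return {
        "aws_access_key_id": access_key.strip(),
        "aws_secret_access_key": secret_key.strip(),
        "region_name": region.strip(),
    }
```

With boto3 installed, a Glue client could then be created with `boto3.client("glue", **access_key_client_kwargs(access_key, secret_key, region))`.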
Role Based Authentication
Property | Details |
Database Type | AWS Glue |
Connection Name | Enter a connection name for the AWS Glue Catalog. The name is a reference used to identify this AWS Glue Catalog connection in OvalEdge. Example: AWS Glue Connection |
Cross account Role ARN | The ARN of the role to be used for access. |
Database region | The AWS region of the AWS Glue Catalog. |
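Role-based access to AWS services typically works by exchanging a role ARN for temporary credentials through AWS STS. The sketch below shows that exchange with a boto3-style STS client. It is an assumption that the connector follows this pattern internally. The helper name `assume_role_client_kwargs` and the default session name are hypothetical.

```python
from typing import Any, Dict


def assume_role_client_kwargs(
    sts_client: Any,
    role_arn: str,
    region: str,
    session_name: str = "glue-catalog-crawl",  # hypothetical session name
) -> Dict[str, str]:
    """Assume the given role and return keyword arguments for boto3.client.

    `sts_client` is expected to behave like boto3.client("sts").
    """
    response = sts_client.assume_role(
        RoleArn=role_arn,
        RoleSessionName=session_name,
    )
    creds = response["Credentials"]
    # Temporary credentials always include a session token.
    return {
        "aws_access_key_id": creds["AccessKeyId"],
        "aws_secret_access_key": creds["SecretAccessKey"],
        "aws_session_token": creds["SessionToken"],
        "region_name": region,
    }
```

With boto3 installed, the result could be passed on as `boto3.client("glue", **assume_role_client_kwargs(boto3.client("sts"), role_arn, region))`. The temporary credentials expire, so a long-running crawl would need to refresh them.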
Once connectivity is established, additional configurations for crawling and profiling can be specified:
Property | Details |
Settings > Crawler Options | |
Tables, Views and Columns | Selected by default. |
Relationships | No need to select. |
Procedures, Functions and View Source Code | No need to select. |
Settings > Profiler | |
N/A | |
FAQs
- How much does the driver cost?
No additional driver is required for communication between OvalEdge and AWS Glue.
Copyright © 2019, OvalEdge LLC, Peachtree Corners GA USA