OvalEdge uses API to connect to the Airflow source, allowing users to crawl DAGs and Tasks and build Lineage.
Overview
Connector Details
Connector Category |
ETL |
OvalEdge Release Current Connector Version |
6.3.4 |
Connectivity [How OvalEdge connects to Airflow] |
API |
OvalEdge Releases Supported (Available from) |
Release5.0 onwards |
Connector Features
Crawling of Metadata Objects |
Supported |
|
Profiling |
Not Supported |
|
Query Sheet |
Not Supported |
|
Metadata Preview |
Supported |
|
Lineage |
Supported |
|
Lineage Levels Supported |
||
Table Lineage Column Lineage |
||
Authentication via Credential Manager |
Supported |
|
Data Quality |
Not Supported |
|
DAM (Data Access Management) |
Not Supported |
|
Bridge |
Supported |
Getting Ready to Establish a Connection
Prerequisites
The following are the prerequisites required for establishing a connection:
Service Account User Permissions
Important: We recommend having a separate service account to establish a connection from OvalEdge to the data source with minimal permissions.
Operations |
Minimum Permissions |
Connection validation |
Read Only |
Crawling |
Read Only |
Setup a Connection
Important: You must have the Connector Creator role to set up a connection in OvalEdge.
Log into OvalEdge, go to Administration > Connectors, click + (New Connector), search for Airflow, and complete the specific parameters.
Note: Fields marked with an asterisk (*) are mandatory for establishing a connection.
Field Name |
Description |
Connector Type |
By default, "Airflow" is displayed as the selected connector type. |
Connector Settings |
|
Credential Manager* |
Select the desired credentials manager from the dropdown list. Relevant parameters will be displayed based on your selection. Supported Credential Managers:
|
License Add Ons |
OvalEdge connectors have a default license add-on for data crawling and profiling.
|
Connector Name* |
Enter a unique name for the Airflow connection. (Example: "Airflow_Prod"). |
Connector Environment |
Select the environment (Example: PROD, STG) configured for the connector. |
Server* |
Enter the Airflow database Server name or IP address (Example: Airflow.example.com or 192.168.1.10). |
Remote DAG Path |
Enter the path of the location where all DAGs (Python files) are located in the Airflow server |
Local DAG Path* |
Enter the location path where all the DAGs (Python files) are present in the Local/OvalEdge server. Here, both (Remote DAGs and Local DAGs) must have the exact count. |
Username* |
Enter the service account username set up to access the Airflow database (Example: "oesauser"). |
Password* |
Enter the password associated with the service account user (Example: "password"). |
Default Governance Roles |
|
Default Governance Roles* |
Select the appropriate users or teams for each governance role from the dropdown list. All users and teams configured in OvalEdge Security are displayed for selection. |
Admin Roles |
|
Admin Roles* |
Select one or more users from the dropdown list for Integration Admin and Security and Governance Admin. All users configured in OvalEdge Security are available for selection. |
No Of Archive Objects* |
It indicates the number of recent metadata changes to a dataset at the source. By default, it is off. You can enable it by toggling the Archive button and specifying the number of objects to archive. Example: Setting it to 4 retrieves the last four changes, shown in the 'version' column of the 'Metadata Changes' module. |
Bridge |
|
Select Bridge* |
If applicable, select the bridge from the drop-down list. The drop-down list displays all active bridges configured in OvalEdge. These bridges enable communication between data sources and OvalEdge without altering firewall rules. |
After entering all connection details, you can perform the following actions:
- Click Validate to verify the connection.
- Click Save to store the connection for future use.
- Click Save & Configure to apply additional settings before saving.
- The saved connection will appear on the Connectors home page.
Connectivity Troubleshooting
If incorrect parameters are provided, you may encounter error messages. To resolve these issues, ensure all input is correct. If problems persist, contact your assigned OvalEdge support team.
S.No. |
Error Description |
Resolution |
1 |
Failed to establish a connection, Please check the credentials |
Error Description: It indicates an issue with given credentials or permission on the given credentials. Resolution: Please verify the credentials and permissions for the connector. |
2 |
Local DAG Path doesn't exist |
Error Description: It indicates the local DAG path is invalid. Resolution: Please check the given local DAG path and verify its permissions. |
Manage Connector Operations
Configure Settings for Connector Operations
The Airflow connector offers various settings to customize data crawling, profiling, and access. These include:
- Lineage: Automatically build data lineage using source code parsing.
Crawl/Profile
Important: You must have the Integration Admin role in OvalEdge for crawl/profile operations.
Crawl and Profile operations enable you to select one or more schemas from a list of all available schemas within a specific server. This allows you to customize the crawling operation selection according to your requirements. Furthermore, it provides the option to schedule crawling and enable anomaly detection to identify any irregularities in the data objects.
Other Operations
The Connectors page in OvalEdge provides a centralized view of all configured connectors, including their health status. You can view, edit, validate, build lineage, and delete connectors using the Nine Dots menu.
Managing connectors includes:
- Connectors Health: Displays performance with a green (active) or red (inactive) icon, helping monitor data flow and address issues early.
- Viewing: Shows connector details (e.g., Databases, Tables, Table Columns, and Codes) via the View icon.
Nine Dots Menu Options:
- Edit Connector: Update and revalidate the data source.
- Validate Connector: Check the connection's integrity.
- Settings: Modify connector settings.
- Build Lineage: Automatically build data lineage using SQL logs and source code parsing.
- Delete Connector: Remove connectors or schemas with confirmation.
Limitations
Description |
The Auto-lineage in the Airflow connection is supported only for SQL commands. |
Copyright © 2025, OvalEdge LLC, Peachtree Corners GA USA