ETLs

Airflow

Connectivity Summary

An out-of-the-box connector is available for Airflow to support crawling of datasets (i.e., Airflow DAGs and tasks) and lineage building.

The connectivity to Airflow is via JDBC, which is included in the platform.

Connector Capabilities

The connector capabilities are listed below:

Crawling

Supported Objects: Jobs

Remarks: Fetches all Airflow DAGs and tasks from AirflowDB.

Please see the Crawling Data article for more details on crawling.
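
The crawler reads DAG and task metadata from the Airflow metadata database (AirflowDB). The sketch below is only a minimal illustration of that idea, assuming a reachable metadata database and SQLAlchemy installed; the connection string is a placeholder, and the table and column names (dag, task_instance) depend on the Airflow version. It is not the connector's actual implementation.

    # Minimal sketch: list DAGs and their tasks straight from the Airflow
    # metadata database. Replace the connection string with your own;
    # table/column names (dag, task_instance) vary by Airflow version.
    from sqlalchemy import create_engine, text

    # Hypothetical connection string -- adjust host, credentials, and database.
    engine = create_engine("postgresql://airflow:airflow@airflow-host:5432/airflow")

    with engine.connect() as conn:
        dags = conn.execute(
            text("SELECT dag_id, fileloc FROM dag WHERE is_paused = false")
        ).fetchall()
        for dag_id, fileloc in dags:
            print(f"DAG (dataset): {dag_id}  file: {fileloc}")
            tasks = conn.execute(
                text("SELECT DISTINCT task_id FROM task_instance WHERE dag_id = :d"),
                {"d": dag_id},
            ).fetchall()
            for (task_id,) in tasks:
                print(f"  task (child dataset): {task_id}")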

Lineage Building

Operation                               Details
Table - Table Lineage                   Supported
Table - File Lineage                    Supported
File - Table Lineage                    Supported
Column Lineage - File Column Lineage    Supported

Querying

Operation                  Details
Select                     Supported
Insert                     Not supported by default
Update                     Not supported by default
Delete                     Not supported by default
Joins within database      Supported
Joins outside database     Supported
Aggregations               Supported
Group By                   Supported

Pre-requisites

To use the connector, the following must be available:

  1. Connection details as specified in the following section.

  2. A service account for crawling. The minimum privileges required are:

     Operation              Access Permission
     Connection validate    Should have permission for the specified path

Connection Details

The following connection settings should be added to connect to Airflow:

Airflow Connector

  • Connection (Database) Type: AirflowDB
  • License Type: Standard or Auto Lineage
  • Connection Name: Select a connection name for Airflow. The name you specify is a reference name used to easily identify the Airflow connection in OvalEdge.
    Example: Airflow1
  • Server: IP address of the Airflow server
  • Remote Dag Path: Enter the path on the Airflow server where all the DAGs (Python files) are located.
  • Local Dag Path: Enter the path on the local/OvalEdge server where the same DAGs (Python files) are present. Both paths must contain the same number of DAG files (see the sketch after this list).
  • Username: Provide a valid username
  • Password: Provide a valid password
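
A quick way to confirm that the Remote Dag Path and Local Dag Path hold the same DAG files is to count the Python files in each. The sketch below is a minimal, assumed example for the local (OvalEdge) side only; the path is illustrative, and the remote count must be checked on the Airflow server itself.

    # Minimal sketch: count the DAG (.py) files under the local DAG path so the
    # number can be compared with the count on the Airflow server. The path is
    # an example and should be replaced with your Local Dag Path.
    from pathlib import Path

    local_dag_path = Path("/home/ovaledge/dags")   # example local (OvalEdge) path
    local_dags = sorted(p.name for p in local_dag_path.glob("*.py"))

    print(f"Local DAG files: {len(local_dags)}")
    for name in local_dags:
        print(" ", name)
    # Compare this count with the number of DAG files in the Remote Dag Path
    # on the Airflow server; both must match for the connection to validate.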

Once connectivity is established, additional configurations for Crawling and Profiling can be specified.

Points to be noted
    1. Airflow requires its DAG path on the remote (Airflow) server, its DAG path on the local (OvalEdge) server, the IP address, the username, and the password. All fields are mandatory. The connection will succeed only if the connection details are correct and the Local DAG Path is valid.

    2. All the DAGs must be copied from the remote server (Airflow) to the local server (OvalEdge).

    3. Airflow DAGs are considered datasets, and the tasks of each DAG are considered child datasets.

    4. Each DAG has Python code associated with it, which must be copied from Airflow to OvalEdge. The connector reads this Python code to create the dataset; it can do so successfully only if the remote DAG path is correct and the corresponding local DAG file exists. A minimal example DAG is sketched after this list.

    5. The Airflow web UI can be accessed using the URL http://{host:port}/admin/airflow/login
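
A minimal DAG file of the kind the connector reads might look like the following sketch. All names, the schedule, and the commands are illustrative (Airflow 2.x style imports are assumed); the DAG itself would surface as a dataset and each of its tasks as a child dataset.

    # Illustrative DAG: the DAG (example_etl) would be crawled as a dataset,
    # and each task below (extract, load) as a child dataset.
    from datetime import datetime

    from airflow import DAG
    from airflow.operators.bash import BashOperator

    with DAG(
        dag_id="example_etl",
        start_date=datetime(2023, 1, 1),
        schedule_interval="@daily",
        catchup=False,
    ) as dag:
        extract = BashOperator(task_id="extract", bash_command="echo extract")
        load = BashOperator(task_id="load", bash_command="echo load")
        extract >> load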