ETLs

Airflow

OvalEdge uses API to connect to the Airflow source, allowing users to crawl DAGs and Tasks and build Lineage.

 

Overview

Connector Details

Connector Category

ETL

OvalEdge Release Current Connector Version

6.3.4

Connectivity

[How OvalEdge connects to Airflow]

API

OvalEdge Releases Supported (Available from)

Release5.0 onwards

Connector Features

Crawling of Metadata Objects 

Supported

 

Profiling

Not Supported

 

Query Sheet

Not Supported

 

Metadata Preview

Supported

 

Lineage 

Supported

Lineage Levels Supported

Table Lineage

Column Lineage

Authentication via Credential Manager

Supported

Data Quality

Not Supported

DAM (Data Access Management)

Not Supported

Bridge

Supported 

Getting Ready to Establish a Connection

Prerequisites

The following are the prerequisites required for establishing a connection:

Service Account User Permissions

Important: We recommend having a separate service account to establish a connection from OvalEdge to the data source with minimal permissions.

Operations

Minimum Permissions

Connection validation

Read Only

Crawling

Read Only

Setup a Connection

Important: You must have the Connector Creator role to set up a connection in OvalEdge.

Log into OvalEdge, go to Administration > Connectors, click + (New Connector), search for Airflow, and complete the specific parameters.

Note: Fields marked with an asterisk (*) are mandatory for establishing a connection.

Field Name

Description

Connector Type

By default, "Airflow" is displayed as the selected connector type.

Connector Settings

Credential Manager*

Select the desired credentials manager from the dropdown list. Relevant parameters will be displayed based on your selection.

Supported Credential Managers:

  • OE Credential Manager
  • HashiCorp Vault
  • AWS Secrets Manager
  • Azure Key Vault

License Add Ons

OvalEdge connectors have a default license add-on for data crawling and profiling.

  • Select the checkbox for Auto Lineage Add-On to build data lineage automatically.

Connector Name*

Enter a unique name for the Airflow connection.             

(Example: "Airflow_Prod").

Connector Environment

Select the environment (Example: PROD, STG) configured for the connector.

Server*

Enter the Airflow database Server name or IP address (Example: Airflow.example.com or 192.168.1.10).

Remote DAG Path

Enter the path of the location where all DAGs (Python files) are located in the Airflow server

Local DAG Path*

Enter the location path where all the DAGs (Python files) are present in the Local/OvalEdge server. Here, both (Remote DAGs and Local DAGs) must have the exact count.

Username*

Enter the service account username set up to access the Airflow database (Example: "oesauser").

Password*

Enter the password associated with the service account user (Example: "password").

Default Governance Roles

Default Governance Roles*

Select the appropriate users or teams for each governance role from the dropdown list. All users and teams configured in OvalEdge Security are displayed for selection.

Admin Roles

Admin Roles*

Select one or more users from the dropdown list for Integration Admin and Security and Governance Admin. All users configured in OvalEdge Security are available for selection.

No Of Archive Objects*

It indicates the number of recent metadata changes to a dataset at the source. By default, it is off. You can enable it by toggling the Archive button and specifying the number of objects to archive.

Example: Setting it to 4 retrieves the last four changes, shown in the 'version' column of the 'Metadata Changes' module.

Bridge

Select Bridge*

If applicable, select the bridge from the drop-down list.

The drop-down list displays all active bridges configured in OvalEdge. These bridges enable communication between data sources and OvalEdge without altering firewall rules.

After entering all connection details, you can perform the following actions:

  • Click Validate to verify the connection.
  • Click Save to store the connection for future use.
  • Click Save & Configure to apply additional settings before saving.
  • The saved connection will appear on the Connectors home page.

Connectivity Troubleshooting

If incorrect parameters are provided, you may encounter error messages. To resolve these issues, ensure all input is correct. If problems persist, contact your assigned OvalEdge support team.

S.No.

Error Description

Resolution

1

Failed to establish a connection, Please check the credentials

Error Description: 

It indicates an issue with given credentials or permission on the given credentials.


Resolution: 

Please verify the credentials and permissions for the connector.

2

Local DAG Path doesn't exist

Error Description:

It indicates the local DAG path is invalid.


Resolution:

Please check the given local DAG path and verify its permissions.

Manage Connector Operations

Configure Settings for Connector Operations

The Airflow connector offers various settings to customize data crawling, profiling, and access. These include:

  • Lineage: Automatically build data lineage using source code parsing.

Crawl/Profile

Important: You must have the Integration Admin role in OvalEdge for crawl/profile operations.

Crawl and Profile operations enable you to select one or more schemas from a list of all available schemas within a specific server. This allows you to customize the crawling operation selection according to your requirements. Furthermore, it provides the option to schedule crawling and enable anomaly detection to identify any irregularities in the data objects.

Other Operations

The Connectors page in OvalEdge provides a centralized view of all configured connectors, including their health status. You can view, edit, validate, build lineage, and delete connectors using the Nine Dots menu.

Managing connectors includes:

  • Connectors Health: Displays performance with a green (active) or red (inactive) icon, helping monitor data flow and address issues early.
  • Viewing: Shows connector details (e.g., Databases, Tables, Table Columns, and Codes) via the View icon.

Nine Dots Menu Options:

  • Edit Connector: Update and revalidate the data source.
  • Validate Connector: Check the connection's integrity.
  • Settings: Modify connector settings.
  • Build Lineage: Automatically build data lineage using SQL logs and source code parsing.
  • Delete Connector: Remove connectors or schemas with confirmation. 

Limitations

 Description

The Auto-lineage in the Airflow connection is supported only for SQL commands.


Copyright © 2025, OvalEdge LLC, Peachtree Corners GA USA