File

SFTP Connector

This document provides information about the SFTP connector with the OvalEdge application.

SSH File Transfer Protocol (SFTP) is a secure file protocol that is used to access, manage and transfer files over an encrypted SSH protocol. When compared with the traditional FTP protocol, SFTP offers all the functionality of FTP, but it is more secure and easier to configure.

SFTP connector

Connector Capabilities

The following is the list of objects and data types the SFTP connector supports.

Functionality Support Data Objects

Crawler 

  • File Folders
  • File Objects

Profiler

  • File Profiling: Row count, Columns count, and View sample data
  • Sample Profiling 

Prerequisites

The following are prerequisites for connecting to the SFTP:

The Drivers/APIs used by the connector are given below:

Driver/API

Version

Details

Jsch

0.1.55

https://mvnrepository.com/artifact/com.jcraft/jsch

Note: Latest version is 0.1.55

Server User Permission

By default, the service account provided for the connector will be used for any user operations. The minimum privileges required are:

Operation

Access Permission

Connection Validation

Read

Catalog Files/Folders

Read

Profile Files/Folders 

Read

Technical Specification

The connector capabilities are shown below:

Crawling

Feature

Supported Objects

Remarks

Crawling

Files/Folders

While crawling root Files/Folders, by default all the folder and files existing in that specific root path will be cataloged 

Profiling

Features

Supported Objects

Details

File Profiling

Row Count, Columns Count, View Sample Data

Supported File Types: CSV, XLS, XLSX, JSON, AVRO, PARQUET, ORC

Sample Profiling

Supported

-

Connection Details

To connect to the SFTP using the OvalEdge application, complete the following steps:

  1. Log in to the OvalEdge application.
  2. Navigate to Administration > Connectors module.
  3. Click on the + icon. The Add Connector pop-up window is displayed where you can search for the required connector.
  4. Search and select the SFTP connector. The Add Connector with SFTP specific details pop-up window is displayed.

Field Name

Description

Connector Type

By default, the selected connection type is displayed as the SFTP.

Protocol

The Protocol drop-down menu allows you to select SFTP or FTP. 

SFTP: It is a secured and fully encrypted layer. Default port no is 22.

FTP: It is an unencrypted protocol. It adds a layer of encryption using SSL or TLS. Default port no is 21.

License Add Ons

All the connectors will have a Base Connector License by default that allows you to crawl and profile to obtain the metadata and statistical information from a datasource. 

OvalEdge supports various License Add-Ons based on the connector’s functionality requirements.

  • Select the Auto Lineage Add-On license that enables the automatic construction of the Lineage of data objects for a connector with the Lineage feature. 
  • Select the Data Quality Add-On license to identify, report, and resolve the data quality issues for a connector whose data supports data quality, using DQ Rules/functions, Anomaly detection, Reports, and more.
  • Select the Data Access Add-On license that will enforce connector access via OvalEdge with Remote Data Access Management (RDAM) feature-enabled.

Connector Environment

The connector environment drop-down menu allows you to select the environment configured for the connector from the drop-down list. For example, PROD, or STG (based on the configured items in the OvalEdge configuration for the connector.environment).

The purpose of the environment field is to help you identify which connector is connecting what type of system environment (Production, STG, or QA).

Note: The steps to set up environment variables are explained in the prerequisite section.

Connector Name*

Enter the connector name specified in the Connector Name text box. It will be a reference to the SFTP connection in the OvalEdge application.

Server*

Enter the fully qualified server name or IP address related to the SFTP server. 

Example: 12.34.56.78

Username*

A username is required to connect to the SFTP server. Enter the Service Account Name established to access the SFTP server environment. 

Note: Sometimes, this field is auto filled by the web browser with the current OvalEdge user login. Please enter the SFTP Server Service Account name.

Password*

Enter the Service Account password

Port

Port number of the protocol

Example: SFTP: 22

FTP: 21

Path

Path of the location where files or folders located

Ex: \ovaledge

Default Governance Roles*

Users can select a specific user or a  team from the governance roles (Steward, Custodian, Owner) that get assigned for managing the data asset.

Note: The drop-down list displays all the configurable roles (single user or a team) as per the configurations made in the OvalEdge Security | Governance Roles section.

Admin Roles*

Select the required admin roles for this connector.

  • To add Integration Admin Roles, search for or select one or more roles from the Integration Admin options, and then click on the Apply button.
    The responsibility of the Integration Admin includes configuring crawling and profiling settings for the connector, as well as deleting connectors, schemas, or data objects.
  • To add Security and Governance Admin roles, search for or select one or more roles from the list, and then click on the Apply button.
    The security and Governance Admin is responsible for:
    • Configure role permissions for the connector and its associated data objects.
    • Add admins to set permissions for roles on the connector and its associated data objects.
    • Update governance roles.
    • Create custom fields.
    • Develop Service Request templates for the connector.
    • Create Approval workflows for the templates.

No of Archive Objects*

The number of archive objects indicates the number of recent metadata modifications made to a dataset at a remote/source location. By default, the archive objects feature is deactivated. However, users may enable it by clicking the Archive toggle button and specify the number of objects they wish to archive.

Select Bridge

Select option NO Bridge if no bridge is available for the connector.

Connection Settings

Crawler

S.No

Property

Description

1

Crawler Options

FileFolders / Buckets by default enabled

2

Crawler Rules

Include and exclude regex for FileFolders and Buckets only but not for files

Profiler

S.No

Property

Description

1

Profile Options

No existence for profile

2

Profile Rules

No Profile rule exist

Points to note

  1. Supported File Types: CSV, XLS, XLSX, JSON, AVRO, PARQUET, ORC.