File

Google Drive

Google Drive is a data storage service provided by Google that enables users to store data and access any amount of data, anytime, from anywhere on the web.

OvalEdge uses SDK (V2 1.32.1) to connect to the data source, which allows the user to crawl data objects such as Files & Folders.

Note: This document focuses on specific information relevant to this particular connector. Details about Establishing Connections, Connector Parameters, and Connector Settings will be found in the "Generic Features of Connectors" document.

Connector Characteristics

Connector Category

File System

Connectivity

Java SDK

Connector Version

[OvalEdge Connector Version Information]

6.3.1

Google Drive source Versions Supported 

[Data Source Version from which OvalEdge started providing support]

V2

OvalEdge Releases Supported (Available from)

[OvalEdge Release Version in which the connector was included in the OvalEdge Application]

6.1 onwards

Supported Features

Crawling of Metadata Objects 

Folders

Files

Metadata Source

Using Java SDK

Supported File Types:

Cataloged file types: configure in system settings

CSV, XLS, XLSX, JSON, AVRO, PARQUET, ORC, GZ

Will be included in System Settings

Add System Settings link

Profiling

Not Supported

Data Preview

Not Supported

Data Quality Module

Not Supported

RDAM (Remote Data Access Management) Support

Not Supported

Bridge Support

[The Bridge component is used as a Tunnel to Transfer Data from the Client-Hosted Cloud or Client-on-Premises Data Center. The handshake between the Bridge Server and Client is established with a Security Tool Kit (these differ from client to client and are generated based on the IP address/DNS, etc.).]

Supported

Note: Need to add Bridge configurations in detailed steps

Crawl of Usage Statistics (Source System)

[This involves systematically collecting, analyzing, and reporting on data related to user activities, including details on which users are accessing the system, their actions, and the frequency of their access.]

Not Supported

Certifications at Source (Source System)

[This refers to endorsing metadata objects applied within the source system. These certifications are then fetched and displayed in OvalEdge, enabling more in-depth analysis.]

Not Supported

Prerequisites (Prepare Google Drive Environment)

The following are the prerequisites required for establishing a connection: 

  1. SDK Details
    SDK is provided by default over the OvalEdge installation.

    Type

    Version

    Details

    SDK

    1.32.1

    https://mvnrepository.com/artifact/com.google.apis/google-api-services-drive/v2-rev20210707-1.32.1

  2. Google Drive User Account and Permissions

    Create a Google Drive Connector service account to connect to OvalEdge with the following permissions to crawl metadata into the application.

    Operation 

    Minimum Access Permission 

    Connection Validation

    LIST, GET permission on Crawling Folders

    Crawling

    LIST, GET permission on Crawling Folders & File Objects

  3. Configure Environment Variables (Optional).
    This section describes the settings or instructions you should know before establishing a connection. If your environments have been configured, skip this step.
    For more information, refer to the "Generic Features of Connectors" document.

Establish Connection

In the OvalEdge application, the Google Drive connector allows you to crawl the data objects using Credential Manager Authentication.
The "Generic Features of Connectors" document covers general settings and parameters common to most OvalEdge connectors. Here, we'll focus on the specific details required to configure the Google Drive Connector.

Google Drive Connector-Specific Parameters

Note: *(asterisk) indicates the mandatory field to create a connection.

Fields

Details

Google Service Account*

It is a special type of Google account intended for server-to-server interactions. It allows your application to authenticate and access Google APIs programmatically.

File Path*

The file path of Google Console access file  (.P12).

After entering the required parameters, you can either save the connection details first or validate the connection and then save it.

Errors & Resolution

You may encounter the following errors if the correct parameters are not provided while establishing the connection. Below are the most common error messages and their resolutions for your reference. 

If you continue to experience issues with establishing the connection, please contact your assigned OvalEdge GCS team

S.No.

Error Message(s)

Error Description/Resolution

1

Invalid Credentials

Error Description:

The service account credentials JSON file is incorrect or missing.

Resolution:

  • Ensure the correct JSON file is used and that it hasn't been modified. 
  • Verify that the credentials file corresponds to the service account created in the Google Cloud Console.

2

Insufficient Permissions

Error Description:

The service account does not have the necessary permissions to access the specified file or folder.

Resolution:

  • Share the required files or folders with the service account's email address and ensure it has the necessary permissions (e.g., Viewer or Editor).

3

File Not Found

Error Description:

The specified file ID or URL is incorrect, or the file does not exist in Google Drive.

Resolution:

  • Verify the file ID or URL and ensure the file exists in the shared Google Drive.

4

Access Denied

Error Description: 

The Google Drive API is not enabled for the project.

Resolution:

  • Enable the Google Drive API in the Google Cloud Console for the project associated with the service account.

Connector Settings

After successfully validating the connection, you can access various settings to retrieve specific information from the data source.

The following are settings specific to Google Drive

  • Crawler
  • Access Instruction
  • Business Glossary

For more information, refer to the "Generic Features of Connectors" document.

Limitations

S.No.

Description

1

Profiling is not supported

FAQs

  • How do you configure a Google Drive Connector?
    Answer: Configuring a Google Drive Connector typically involves several steps, including setting up a Google Service Account, enabling the Google Drive API, and configuring your application to use the connector. 

Additional Information

Step-by-Step Guide to Configuring a Google Drive Connector

Create a Google Cloud Project

  1. Go to Google Cloud Console: Visit the Google Cloud Console.
  2. Create a New Project: Click on the project dropdown menu at the top and select "New Project." Fill in the project name and other required details, then click "Create."

Enable the Google Drive API

  1. Enable API:
    1. Navigate to API & Services > Library.
    2. Search for "Google Drive API."
    3. Click on it and then click "Enable."

Create a Service Account

  1. Navigate to Service Accounts:
    1. Go to API & Services > Credentials.
    2. Click on "Create Credentials" and select "Service Account."
  2. Configure Service Account:
    1. Fill in the details such as name, ID, and description.
    2. Click "Create" and then "Done."
  3. Grant Permissions to Service Account:
    1. Go to IAM & Admin > IAM.
    2. Click on "Add" and enter the service account email.
    3. Assign necessary roles (e.g., "Viewer" or "Editor").
  4. Create and Download the Key:
    1. Navigate back to API & Services > Credentials.
    2. Click on your service account.
    3. Go to "Keys" and click "Add Key" > "Create New Key."
    4. Choose JSON format and download the key file. Store it securely as it contains sensitive information.

Share Google Drive Files with the Service Account

Go to Google Drive and share the required files or folders with the service account's email address (e.g., your-service-account@your-project-id.iam.gserviceaccount.com).