The OvalEdge Upload File or Folder is recommended to upload the data from your storage unit to OvalEdge to process the data and bring the reporting result. It consists of
- Select data Lake: The Data Lake is a central repository where you can store all your structured and unstructured data. You can view the available File connections and the connection type on the Select Your Data lake page, search the File connection name, and filter the connection type using the search and filter icon configured on the respective column.
- Select Your Directory: A data lake has several directories to upload data files. You can also upload files directly to data lakes.
- Upload File or Folder: This will allow you to select a file from your data lake.
Steps to upload file or folder
- Go to Advanced Tools > Upload File or Folder
- Click the Upload File or Folder. By default, Select Data Lake is displayed.
- Select a connection name (NFS - A) from the existing list where you wish to store your data.
- Click on the Next button. It will display the Select Your Directory tab., where you want to import your file from the list of existing directories. Alternatively, you can create a new directory through Nine dots. Give a unique name for the new directory. Click Next.
- Set the toggle key to File or Folder to upload data content.
When the toggle key is enabled, it will upload the file. When it is disabled, it will upload the folder. - Click the Select from your computer button to locate your File (TestFile.txt)data to upload.
- Click on the Finish button. The file (TestFile.txt) will be uploaded to the OvalEdge repository.
- Once the File(s) is uploaded successfully, it will display the below information for each connection.
- Type: An icon to identify a file or a folder.
- Name of file/folders: The name of the file/ folder from the connection.
- Catalog sign: OvalEdge needs users to catalog each file or folder to organize the file/folder metadata. A file must be cataloged before profiling it (similar to crawling a database connection before profiling).
- Cataloging a File:
- The first level of data will be automatically cataloged while creating a connection.
- The Second level of data has to be cataloged manually by clicking the sign for each file or folder in the file manager module. Alternatively, users can catalog multiple folders or files in the data catalog by using the Nine dots options.
- The sign + changes to to indicate to us that the file or folder is cataloged successfully.
- When a file connection is crawled, the files/folders are automatically cataloged and displayed under the Data Catalog > Files module.
- Users, when they add a file/folder manually, would need to load the file from load metadata and should manually catalog the uploaded file with the sign indications as shown above.
- Size of file/folders: The logical/physical size of the file in the system.
- Preview Link: Preview link enables users to navigate the file from OvalEdge by pasting the URL in a new tab.
Data Formats OvalEdge Supports
File Extension |
Format Supported |
Description of Format |
---|---|---|
CSV |
Values separated by comma |
A CSV file stores tabular data in plain text. Each line of the file is a data record. Each record consists of one or more fields, separated by commas. The extensions that are used are .csv and txt. |
CONF |
This file is saved in a plain text format |
CONF files are configuration files used by Unix and Linux systems. |
DDL |
Text |
Database file created in the Data Definition Language (DDL), a language used for describing database schemas; saved in a plain text format and contains commands such as CREATE, USE, ALTER, and DROP |
ENV |
Now Contact Envelope Template |
an application that organizes contact information and day-to-day activities; contains a template print layout for your envelope |
GZ |
ZipFile |
|
HQL |
HQL file is an Apache Hive HiveQL Script. |
|
Parquet |
Apache Parquet |
Is a free and open-source column-oriented data storage format of the Apache Hadoop ecosystem. It is like the other columnar-storage file formats available in Hadoop: RCFile and ORC. |
JSON |
JavaScript Object Notation |
A JSON file is a file that stores simple data structures and objects in JavaScript Object Notation (JSON) format, which is a standard data interchange format. It is primarily used for transmitting data between a web application and a server. |
Pipe Delimited File |
Pipe Delimited File |
A delimited text file is a text file used to store data. Each line represents a single book, company, or other things, and each line has fields separated by the delimiter. Compared to the flat file that uses spaces to force every field to the same width, a delimited file has the advantage of allowing field values of any length |
PROPERTIES
|
Text |
Minecraft Properties File, It is saved in plain text and stores configuration information for the server |
SQL |
Text |
Structured Query Language Data File: It stores SQL statements for creating or modifying database structures, insertions, updates, deletions, or other SQL operations. |
SH |
Text |
The File with.Sh extension is a script programmed for the Unix shell. It contains instructions written in Bash language and can be executed by typing a text command. |
XLS |
Microsoft Office Excel |
Contains rows and columns of cells; each can include data, which can be words, numbers, or formulas that have data and solve equations dynamically. XLS spreadsheets can also contain tables and charts that show all selected sections or data. |
XLSX |
XML Microsoft Office Excel |
A file with the xlsx file extension is a Microsoft Excel Open XML Spreadsheet (XLSX) file created by Microsoft Excel. |
TSV |
Tab-separated values |
A tab-separated values file is a simple text format for storing data in a tabular structure, e.g., database table or spreadsheet data, and a way of exchanging information between databases. Each record in the table is one line of the text file. |
TXT |
Text |
A TXT file is a standard text document that contains plain text. |
YAML |
Text |
It is used for reading and writing data independent of a specific programming language |
Copyright © 2019, OvalEdge LLC, Peachtree Corners GA USA