Advanced Tools

Upload File or Folder 

Use Upload File or Folder to upload data from your storage unit to OvalEdge, where it can be processed and used for reporting. The workflow consists of three steps:

  1. Select Data Lake: A data lake is a central repository where you can store all your structured and unstructured data. The Select Your Data Lake page lists the available File connections and their connection types. Use the search and filter icons on each column to search by File connection name or filter by connection type.
  2. Select Your Directory: A data lake can contain several directories to which data files are uploaded. You can also upload files directly to the data lake.
  3. Upload File or Folder: Lets you select the file or folder to upload.

Steps to upload file or folder

  1. Go to Advanced Tools > Upload File or Folder.
  2. Click Upload File or Folder. By default, Select Data Lake is displayed.
  3. From the existing list, select the connection name (for example, NFS - A) where you wish to store your data.
  4. Click the Next button. The Select Your Directory tab is displayed. From the list of existing directories, select the directory into which you want to import your file. Alternatively, create a new directory through Nine dots and give it a unique name. Click Next.
  5. Set the toggle key to File or Folder, depending on the content you want to upload.
    When the toggle key is enabled, a file is uploaded. When it is disabled, a folder is uploaded.
  6. Click the Select from your computer button and locate the file (for example, TestFile.txt) to upload.
  7. Click the Finish button. The file (TestFile.txt) is uploaded to the OvalEdge repository.
  8. Once the file(s) are uploaded successfully, the following information is displayed for each connection:
    1. Type: An icon that identifies a file or a folder.
    2. Name of file/folder: The name of the file or folder from the connection.
    3. Catalog sign: OvalEdge requires users to catalog each file or folder so that its metadata is organized. A file must be cataloged before it can be profiled (similar to crawling a database connection before profiling it).
    4. Cataloging a File:
      1. The first level of data is cataloged automatically when the connection is created.
      2. The second level of data has to be cataloged manually by clicking the sign for each file or folder in the file manager module. Alternatively, users can catalog multiple folders or files in the data catalog by using the Nine dots options.
      Note:
      1. The + sign changes to indicate that the file or folder has been cataloged successfully.
      2. When a file connection is crawled, the files and folders are automatically cataloged and displayed under the Data Catalog > Files module.
      3. When users add a file or folder manually, they need to load the file using load metadata and then catalog the uploaded file manually, using the sign indications described above.
    5. Size of file/folder: The logical/physical size of the file in the system.
    6. Preview Link: Lets users open the file from OvalEdge by pasting the URL in a new tab.
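
The Type, Name, and Size columns described above correspond to information that any file system exposes. As a rough illustration only (this is not how OvalEdge computes these values, and `describe_entries` is a hypothetical helper name), the following Python sketch lists the same three attributes for the entries of a local directory, assuming that a folder's size is the combined size of the files beneath it:

```python
import tempfile
from pathlib import Path


def describe_entries(directory):
    """Return (type, name, size_in_bytes) for each entry directly inside `directory`."""
    rows = []
    for entry in sorted(Path(directory).iterdir(), key=lambda p: p.name):
        if entry.is_dir():
            # Assumption: a folder's size is the combined size of all files beneath it.
            size = sum(f.stat().st_size for f in entry.rglob("*") if f.is_file())
            rows.append(("folder", entry.name, size))
        else:
            rows.append(("file", entry.name, entry.stat().st_size))
    return rows


if __name__ == "__main__":
    # Build a throwaway directory so the example runs anywhere.
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        (root / "TestFile.txt").write_bytes(b"hello")
        (root / "reports").mkdir()
        (root / "reports" / "a.csv").write_bytes(b"id,name\n")
        for kind, name, size in describe_entries(root):
            print(f"{kind:<6} {name:<14} {size} bytes")
```

Running a check like this on a local folder before uploading makes it easier to compare what you selected with what the file manager shows afterwards.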

Data Formats OvalEdge Supports

| File Extension | Format Supported | Description of Format |
|---|---|---|
| CSV | Values separated by comma | A CSV file stores tabular data in plain text. Each line of the file is a data record. Each record consists of one or more fields, separated by commas. The extensions used are .csv and .txt. |
| CONF | Plain text | CONF files are configuration files used by Unix and Linux systems. |
| DDL | Text | Database file written in the Data Definition Language (DDL), a language used for describing database schemas. It is saved in a plain text format and contains commands such as CREATE, USE, ALTER, and DROP. |
| ENV | Now Contact Envelope Template | Envelope template for Now Contact, an application that organizes contact information and day-to-day activities. It contains a template print layout for an envelope. |
| GZ | ZipFile | |
| HQL | | An HQL file is an Apache Hive HiveQL script. |
| Parquet | Apache Parquet | A free and open-source column-oriented data storage format of the Apache Hadoop ecosystem. It is similar to the other columnar-storage file formats available in Hadoop: RCFile and ORC. |
| JSON | JavaScript Object Notation | A JSON file stores simple data structures and objects in JavaScript Object Notation (JSON) format, which is a standard data interchange format. It is primarily used for transmitting data between a web application and a server. |
| Pipe Delimited File | Pipe Delimited File | A delimited text file is a text file used to store data. Each line represents a single book, company, or other item, and each line has fields separated by the delimiter. Compared to a flat file that uses spaces to force every field to the same width, a delimited file has the advantage of allowing field values of any length. |
| PROPERTIES | Text | Minecraft properties file. It is saved in plain text and stores configuration information for the server. |
| SQL | Text | Structured Query Language data file. It stores SQL statements for creating or modifying database structures, insertions, updates, deletions, or other SQL operations. |
| SH | Text | A file with the .sh extension is a script written for the Unix shell. It contains instructions written in the Bash language and can be executed by typing a text command. |
| XLS | Microsoft Office Excel | Contains rows and columns of cells. Each cell can hold words, numbers, or formulas that are evaluated dynamically. XLS spreadsheets can also contain tables and charts that present selected sections of the data. |
| XLSX | XML Microsoft Office Excel | A file with the .xlsx extension is a Microsoft Excel Open XML Spreadsheet (XLSX) file created by Microsoft Excel. |
| TSV | Tab-separated values | A tab-separated values file is a simple text format for storing data in a tabular structure, for example database table or spreadsheet data, and a way of exchanging information between databases. Each record in the table is one line of the text file. |
| TXT | Text | A TXT file is a standard text document that contains plain text. |
| YAML | Text | Used for reading and writing data independent of a specific programming language. |
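
Before uploading, it can help to check a file's extension against the formats listed above and to confirm that a delimited file parses as expected. The Python sketch below is an illustration under stated assumptions: the `SUPPORTED_EXTENSIONS` set is transcribed by hand from the list above, and `has_supported_extension` and `read_delimited` are hypothetical helper names, not part of OvalEdge. It also shows that CSV, TSV, and pipe-delimited files differ only in the delimiter character.

```python
import csv
import io
from pathlib import Path

# Assumption: extensions transcribed from the supported formats listed above.
SUPPORTED_EXTENSIONS = {
    ".csv", ".conf", ".ddl", ".env", ".gz", ".hql", ".parquet", ".json",
    ".properties", ".sql", ".sh", ".xls", ".xlsx", ".tsv", ".txt", ".yaml",
}


def has_supported_extension(filename):
    """Return True if the file's last extension is in SUPPORTED_EXTENSIONS."""
    return Path(filename).suffix.lower() in SUPPORTED_EXTENSIONS


def read_delimited(text, delimiter):
    """Parse delimited text into a list of records, each a list of field strings."""
    return list(csv.reader(io.StringIO(text), delimiter=delimiter))


if __name__ == "__main__":
    for name in ("TestFile.txt", "DATA.CSV", "photo.png"):
        print(name, has_supported_extension(name))

    # The same two records written with three different delimiters.
    samples = {
        ",": "id,name\n1,Ann\n",
        "\t": "id\tname\n1\tAnn\n",
        "|": "id|name\n1|Ann\n",
    }
    for delimiter, text in samples.items():
        print(repr(delimiter), read_delimited(text, delimiter))
```

The extension check only looks at the file name. It does not verify that the content actually matches the format.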


      Copyright © 2019, OvalEdge LLC, Peachtree Corners GA USA