- Knowledge Base
- Setup Data Catalog
- Profiling
-
Roadmap
-
Knowledgebase Documents
-
Installation and System Setup
-
Setup Data Catalog
-
Connectors
-
Data Discovery
-
Self Service
-
Access Management
-
Data Quality
-
Data Literacy
-
Privacy Compliance
-
Reporting
-
Architecture, Security & Releases
-
Developer's Zone
-
Advanced Tools
-
Record of Processing Activities (ROPA)
-
Others
-
Release6.0 User Guide
-
Release6.1 Features
-
Data Catalog
-
News
-
Deactivated_Old
-
Release6.3 Deep Dive Articles
Introduction to OvalEdge Profiling
Profiling is the process of gathering statistics and informative summaries about the connected data sources that help in:
- Assessing the quality of the data source before using it in an analysis.
- Identifying relationships between the tables at the entity level and patterns between them.
In this process, we obtain a sample of records for getting the data insights. OvalEdge has a system setting (Administration > System Settings > Others tab) which can enable access to the data:
ovaledge.handleprofile.stats: if “true” (default), OvalEdge will fetch and store the sample data - hence, profiling and AI recommendations can be done.
ovaledge.handleprofile.stats: if “false”, OvalEdge will NOT fetch and store any data - hence, profiling and AI recommendations can not be done.
In OvalEdge, profiling collects the following statistics:
- Table Row Count
- Column’s Null Count
- Distinct Count
- Minimum Value, Maximum Value, and Top 50 Values
All other statistics are inferred from the above data points. Profiling can only be performed on databases and files.
You cannot perform profiling on reports.