Connectivity Summary
An out-of-the-box connector is available for the Cassandra database. It supports crawling database objects and profiling data.
Connectivity to the Cassandra database is through the JDBC driver provided by DataStax.
The drivers used by the connector are given below:
Driver / API: DataStax Cassandra driver
Version: 3.x
Details: JDBC 4.2 compliant
Connector Capabilities
The core components of the Java binding are the com.datastax.cassandra files, which contain the Java classes that provide JDBC connectivity and the connection and caching mechanisms for communicating with the Cassandra server.
The connector capabilities are shown below:
Crawling
Supported objects and data types for Crawling are:
Supported Objects | Supported Data Types |
Tables, Table Columns, Views, Functions | All the standard data types of Cassandra |
See the article Crawling Data for more details on crawling.
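As a rough illustration of where the crawled objects live, the sketch below lists metadata queries for tables, columns, views, and functions. It assumes Cassandra 3.0 or later, where schema metadata is stored in the system_schema keyspace. The names CRAWL_QUERIES and crawl_statements are hypothetical, and whether the connector issues these exact queries is an assumption.

```python
# Hypothetical mapping from crawlable object types to CQL metadata queries.
# Assumes Cassandra 3.0 or later, where schema metadata is kept in the
# system_schema keyspace. The "?" is a bind marker for a prepared statement.
CRAWL_QUERIES = {
    "tables": "SELECT table_name FROM system_schema.tables "
              "WHERE keyspace_name = ?",
    "columns": "SELECT table_name, column_name, type FROM system_schema.columns "
               "WHERE keyspace_name = ?",
    "views": "SELECT view_name, base_table_name FROM system_schema.views "
             "WHERE keyspace_name = ?",
    "functions": "SELECT function_name, return_type FROM system_schema.functions "
                 "WHERE keyspace_name = ?",
}


def crawl_statements(keyspace):
    """Return (object_type, cql, params) tuples for one keyspace."""
    return [(kind, cql, (keyspace,)) for kind, cql in CRAWL_QUERIES.items()]


# "sales" is a made-up keyspace name used only for illustration.
for kind, cql, params in crawl_statements("sales"):
    print(kind, "->", cql, params)
```

Each tuple can be handed to whichever client executes CQL in your environment; the sketch only builds the statements and does not open a connection.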
Profiling
Please see Profiling Data for more details on profiling.
Feature | Support | Remarks |
Table Profiling | Row count, Columns count, View sample data | |
View Profiling | Row count, Columns count, View sample data | View is treated as a table for profiling purposes |
Column Profiling | Min, Max, Null count, top 50 values | |
Full Profiling | Supported | |
Sample Profiling | Supported | |
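To make the column profiling statistics concrete, here is a minimal sketch that computes the minimum, maximum, null count, and most frequent values over a list of sampled column values. It runs on the client side over data already fetched. The function name profile_column and the sample data are hypothetical, and this is not a description of how the connector computes its statistics.

```python
from collections import Counter


def profile_column(values, top_n=50):
    """Compute min, max, null count, and top values for one column's sample.

    None stands for a null cell. The non-null values are assumed to be
    mutually comparable (for example, all numbers or all strings).
    """
    non_null = [v for v in values if v is not None]
    return {
        "min": min(non_null) if non_null else None,
        "max": max(non_null) if non_null else None,
        "null_count": len(values) - len(non_null),
        # most_common returns (value, frequency) pairs, most frequent first.
        "top_values": Counter(non_null).most_common(top_n),
    }


# Made-up sample values for a single column.
sample = [3, 7, None, 3, 9, 3, 7, None]
print(profile_column(sample, top_n=2))
```

Sample profiling would pass a subset of rows to such a function, while full profiling would pass every row; the statistics are computed the same way in both cases.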
Querying
Operation | Details |
Select | Supported |
Insert | Not supported by default |
Update | Not supported by default |
Delete | Not supported by default |
Joins within database | Supported |
Joins outside database | Not supported |
Aggregations | Supported |
Group By | Supported |
Order By | Supported |
By default, the service account provided for the connector is used for all query operations. If the service account has write privileges, Insert / Update / Delete queries can be executed.
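The rule above can be sketched as a simple check on the statement's leading keyword. This is a simplified illustration with hypothetical names (is_write_statement, is_allowed). It does not parse CQL, it ignores batches and other statement types, and it is not the connector's actual enforcement logic, which depends on the privileges granted to the service account in Cassandra.

```python
# Statement types the table above lists as not supported by default.
WRITE_KEYWORDS = {"INSERT", "UPDATE", "DELETE"}


def is_write_statement(cql):
    """Return True if the first keyword is INSERT, UPDATE, or DELETE."""
    words = cql.strip().split(None, 1)
    return bool(words) and words[0].upper() in WRITE_KEYWORDS


def is_allowed(cql, account_can_write=False):
    """Allow reads always; allow writes only for an account with write privileges."""
    return account_can_write or not is_write_statement(cql)


# Made-up statements for illustration.
print(is_allowed("SELECT * FROM sales.orders"))
print(is_allowed("DELETE FROM sales.orders WHERE id = 1"))
print(is_allowed("DELETE FROM sales.orders WHERE id = 1", account_can_write=True))
```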
Pre-requisites
To use the connector, the following need to be available:
- Connection details, as specified in the following section.
- A service account for crawling and profiling. The minimum privileges required are:
  - Connection validate
  - Crawl schemas
  - Crawl tables
  - Profile schemas, tables
  - Query logs
  - Get views, procedures, function code
- The JDBC driver, which is provided by default.
Connection Details
The following connection settings should be added for connecting to a Cassandra database:
- Database Type: Cassandra
- Connection Name: Select a Connection name for the Cassandra database. The name that you specify is a reference name to easily identify your Cassandra database connection in OvalEdge.
Example: Cassandra Connection DB1
- Hostname / IP Address: Database instance URL (on-premises/cloud-based)
Example: oval-cassandra.csklygkwz3dx.us-east-1.rds.amazonaws.com
- Port number: 9042
- Sid / Database: Name of the keyspace to connect to.
- Username (optional): User account login credential
- Password (optional): Password
Once connectivity is established, additional configurations for Crawling and Profiling can be specified.
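The connection settings can be pictured as a small record with a few sanity checks, as in the sketch below. The class name CassandraConnection, its field names, and the validation rules are hypothetical and chosen for illustration. The connection name, hostname, and port come from the examples above, while the keyspace name "sales" is made up.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class CassandraConnection:
    """Hypothetical container for the connection settings listed above."""
    connection_name: str
    hostname: str
    keyspace: str                   # the "Sid / Database" field
    port: int = 9042                # port number given in the settings above
    username: Optional[str] = None  # optional
    password: Optional[str] = None  # optional

    def validate(self) -> List[str]:
        """Return a list of problems; an empty list means the settings look usable."""
        problems = []
        if not self.connection_name.strip():
            problems.append("Connection Name is required")
        if not self.hostname.strip():
            problems.append("Hostname / IP Address is required")
        if not self.keyspace.strip():
            problems.append("Sid / Database (keyspace) is required")
        if not 1 <= self.port <= 65535:
            problems.append("Port number must be between 1 and 65535")
        return problems


conn = CassandraConnection(
    connection_name="Cassandra Connection DB1",
    hostname="oval-cassandra.csklygkwz3dx.us-east-1.rds.amazonaws.com",
    keyspace="sales",  # made-up keyspace name
)
print(conn.validate())
```

The sketch only checks that the values are well formed; it does not contact the database, so a passing check says nothing about whether the host is reachable or the credentials are accepted.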
FAQs
- How much does the driver cost?
The Cassandra JDBC driver is available at no additional charge.
- Can I use the driver to access Cassandra from a Linux computer?
Yes, you can use the driver to access Cassandra from Linux, Unix, and other non-Windows platforms.