Cassandra

Connectivity Summary

An out-of-the-box connector is available for the Cassandra database. It supports crawling database objects and profiling data.

Connectivity to the Cassandra database is through the JDBC driver provided by DataStax.

The drivers used by the connector are given below:

Driver / API: DataStax Cassandra driver

Version: 3.x

Details: JDBC 4.2 compliant

Connector Capabilities

The core components of the Java binding are the files named com.datastax.cassandra. They contain the Java classes that provide JDBC connectivity and the connection and caching mechanisms for communicating with the Cassandra server.

The connector capabilities are shown below:

Crawling

Supported objects and data types for Crawling are:

Supported Objects                        Supported Data Types
Tables, Table Columns, Views, Functions  All the standard data types of Cassandra

Please see the article Crawling Data for more details on crawling.

Profiling

Please see Profiling Data for more details on profiling.

Feature           Support                                     Remarks
Table Profiling   Row count, Columns count, View sample data
View Profiling    Row count, Columns count, View sample data  View is treated as a table for profiling purposes
Column Profiling  Min, Max, Null count, top 50 values
Full Profiling    Supported
Sample Profiling  Supported
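The profiling statistics listed above can be illustrated with a minimal Python sketch. It is not the connector's implementation: the function names profile_column and profile_table are hypothetical, the data is held in memory rather than read from Cassandra, and the sketch assumes that all non-null values in a column can be compared with each other.

```python
from collections import Counter
from typing import Any, Dict, Iterable, List


def profile_column(values: Iterable[Any], top_n: int = 50) -> Dict[str, Any]:
    """Compute min, max, null count, and the top_n most frequent non-null values.

    Hypothetical helper for illustration; assumes non-null values are comparable.
    """
    non_null: List[Any] = []
    null_count = 0
    for value in values:
        if value is None:
            null_count += 1
        else:
            non_null.append(value)

    counts = Counter(non_null)
    return {
        "min": min(non_null) if non_null else None,
        "max": max(non_null) if non_null else None,
        "null_count": null_count,
        # List of (value, frequency) pairs, most frequent first.
        "top_values": counts.most_common(top_n),
    }


def profile_table(rows: List[Dict[str, Any]], sample_size: int = 5) -> Dict[str, Any]:
    """Compute row count, column count, and a small sample for a list of row dicts."""
    columns = sorted({name for row in rows for name in row})
    return {
        "row_count": len(rows),
        "column_count": len(columns),
        "sample": rows[:sample_size],
    }
```

For example, profile_column([3, None, 1, 3, 2, None]) reports a minimum of 1, a maximum of 3, and a null count of 2. Full profiling would pass every row to these helpers, while sample profiling would pass only a subset.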

 

Querying

Operation               Details
Select                  Supported
Insert                  Not supported, by default
Update                  Not supported, by default
Delete                  Not supported, by default
Joins within database   Supported
Joins outside database  Not supported
Aggregations            Supported
Group By                Supported
Order By                Supported

By default, the service account provided for the connector is used for all query operations. If the service account has write privileges, Insert / Update / Delete queries can be executed.
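The rule described above (Select is always available, while Insert / Update / Delete depend on the write privileges of the service account) can be sketched as a small check. This is an illustration only: the names query_operation and is_allowed are hypothetical, the check looks only at the first keyword of a statement, and rejecting every other statement type is an assumption made for the sketch, not documented connector behavior.

```python
WRITE_KEYWORDS = {"INSERT", "UPDATE", "DELETE"}


def query_operation(statement: str) -> str:
    """Return the leading keyword of a statement in upper case, or '' if empty."""
    stripped = statement.strip()
    return stripped.split(None, 1)[0].upper() if stripped else ""


def is_allowed(statement: str, account_can_write: bool = False) -> bool:
    """Decide whether a statement may run under the given service account.

    Hypothetical rule for illustration:
    - SELECT is always allowed.
    - INSERT / UPDATE / DELETE are allowed only with write privileges.
    - Anything else is rejected (an assumption of this sketch).
    """
    operation = query_operation(statement)
    if operation == "SELECT":
        return True
    if operation in WRITE_KEYWORDS:
        return account_can_write
    return False
```

With a read-only service account, is_allowed("SELECT * FROM ks.users") is true and is_allowed("DELETE FROM ks.users WHERE id = 1") is false. Passing account_can_write=True makes the second call true as well.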

Pre-requisites

To use the connector, the following must be available:

  • Connection details, as specified in the following section.
  • A service account for crawling and profiling. The minimum privileges required are:
    • Connection validate
    • Crawl schemas
    • Crawl tables
    • Profile schemas, tables
    • Query logs
    • Get views, procedures, function code
  • The JDBC driver, which is provided by default.

Connection Details

The following connection settings should be added for connecting to a Cassandra database:

  • Database Type: Cassandra
  • Connection Name: Enter a connection name for the Cassandra database. The name you specify is a reference name that identifies your Cassandra database connection in OvalEdge.
    Example: Cassandra Connection DB1
  • Hostname / IP Address: Database instance URL (on-premises/cloud-based)
    Example: oval-cassandra.csklygkwz3dx.us-east-1.rds.amazonaws.com
  • Port number: 9042
  • Sid / Database: Name of the keyspace to connect to.
  • Username (optional): Login name of the user account
  • Password (optional): Password of the user account

Once connectivity is established, additional configurations for Crawling and Profiling can be specified.
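Before entering the connection settings, it can help to confirm that the values are complete and that the host accepts TCP connections on the configured port. The following Python sketch shows one way to do that with the standard library. The class CassandraConnectionSettings and the function can_reach are hypothetical names introduced for this example; they are not part of OvalEdge or the DataStax driver. The check tests only network reachability, not authentication or keyspace access.

```python
import socket
from dataclasses import dataclass
from typing import Optional


@dataclass
class CassandraConnectionSettings:
    """Hypothetical container mirroring the connection settings listed above."""

    connection_name: str
    hostname: str
    keyspace: str
    port: int = 9042  # Port number given in the connection settings
    username: Optional[str] = None  # Optional
    password: Optional[str] = None  # Optional

    def validate(self) -> None:
        """Raise ValueError if a required setting is missing or malformed."""
        if not self.connection_name.strip():
            raise ValueError("Connection Name is required")
        if not self.hostname.strip():
            raise ValueError("Hostname / IP Address is required")
        if not self.keyspace.strip():
            raise ValueError("Sid / Database (keyspace) is required")
        if not 1 <= self.port <= 65535:
            raise ValueError("Port number must be between 1 and 65535")


def can_reach(settings: CassandraConnectionSettings, timeout_seconds: float = 3.0) -> bool:
    """Return True if a TCP connection to hostname:port can be opened."""
    try:
        with socket.create_connection(
            (settings.hostname, settings.port), timeout=timeout_seconds
        ):
            return True
    except OSError:
        # Covers DNS failures, refused connections, and timeouts.
        return False
```

A typical use is to build the settings object, call validate(), and then call can_reach(settings). A False result usually points to a wrong hostname or port, or to a firewall between the client and the Cassandra instance.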

FAQs

  1. How much does the driver cost?
    The Cassandra JDBC driver is available at no additional charge.
  2. Can I use the driver to access Cassandra from a Linux computer?
    Yes, you can use the driver to access Cassandra from Linux, Unix, and other non-Windows platforms.