Spotfire® User Guide

Accessing data from Databricks

You can access data from Databricks in Spotfire.

About this task

To connect to data in your Databricks workspace, you use the connector for Databricks. To learn about the functionality and features available when you work with data from these systems, see Connector for Databricks — Features and settings.

Before you begin

Procedure

  1. Open the Files and data flyout, and click Connect to.
  2. In the list of data sources, select Databricks.
  3. In the panel on the right, choose whether to create a new connection or add data from a shared data connection.

Working with and troubleshooting Databricks data connections

About this task

This section describes behavior that is specific to working with data from a Databricks connection in Spotfire.

Databricks cluster that is not running

When connecting to a Databricks cluster or warehouse that is not already running, the first connection attempt will trigger the resource to start. This can take several minutes. You may have to click Connect again if the connection times out.
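If you connect to the same kind of resource from your own scripts rather than from the Spotfire user interface, the "try again after a timeout" behavior described above can be sketched as a retry loop. This is a minimal illustration only, not part of Spotfire or of any Databricks client library. The function name connect_with_retry, its parameters, and the connect callable you pass in are all hypothetical.

```python
import time


def connect_with_retry(connect, attempts=5, initial_delay=30.0, backoff=2.0,
                       sleep=time.sleep):
    """Call connect() until it succeeds or the attempts run out.

    connect is a hypothetical zero-argument callable that returns a
    connection object or raises TimeoutError while the cluster or
    warehouse is still starting.
    """
    delay = initial_delay
    for attempt in range(1, attempts + 1):
        try:
            return connect()
        except TimeoutError:
            # Give up after the last attempt and let the caller see the error.
            if attempt == attempts:
                raise
            # Wait before retrying, and wait longer each time.
            sleep(delay)
            delay *= backoff
```

The sleep parameter is injectable so that the loop can be exercised without real waiting. In normal use, you would leave it at its default.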

Unity Catalog

You can access data from Databricks workspaces that are enabled for Unity Catalog, which means that data is organized in three levels (catalog.schema.table). You can browse the data after connecting, when you select data to include in the connection.
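To make the three-level naming concrete, the sketch below builds a fully qualified table name from its catalog, schema, and table parts. It is an illustration only. The helper names and the example values main, sales, and orders are hypothetical, and the backtick quoting follows the delimited-identifier convention of Databricks SQL.

```python
def quote_identifier(name):
    """Wrap one name part in backticks, doubling any embedded backtick."""
    if not name:
        raise ValueError("identifier parts must not be empty")
    return "`" + name.replace("`", "``") + "`"


def qualified_table_name(catalog, schema, table):
    """Join the three Unity Catalog levels into catalog.schema.table."""
    return ".".join(quote_identifier(part) for part in (catalog, schema, table))


# Hypothetical example values.
print(qualified_table_name("main", "sales", "orders"))
```

Quoting each part separately keeps a dot inside a name from being read as a level separator.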

Converting an existing Databricks connection from the Apache Spark SQL connector

The data connector for Databricks was introduced in Spotfire version 14.6.1. In earlier versions, you accessed data from Databricks with the connector for Apache Spark SQL. If you have existing data connections to Databricks that were created with the Apache Spark SQL connector, and you want to use the stand-alone Databricks connector instead, replace the data connections in your analyses. For details, see Replacing a data source.