LogoLogo
About UsCustomersResourcesGet Started for Free
  • What is Select Star?
  • 🏁Getting Started
    • 1. Data Source Setup
    • 2. Mark Service Accounts
    • 3. Hide Unwanted Datasets
    • 4. Invite Owners
    • 5. Add Documentation
    • Next Steps
  • 🔄Integrations
    • Snowflake
      • Using Key Pair Authentication
      • Using Password Authentication
      • Snowflake Tag Sync
      • Snowflake Key Pair Rotation
      • Cortex Analyst (beta)
    • Databricks
      • Databricks on AWS
      • Databricks on Azure
    • BigQuery
    • AWS Redshift
      • Manual setup
    • Microsoft SQL Server / MS SQL (beta)
      • Query Logs
    • MySQL (beta)
      • Query Logs
    • Oracle (beta)
      • Query Logs
    • Salesforce (beta)
    • DB2 (beta)
    • PostgreSQL
      • AWS Aurora PostgreSQL
      • AWS RDS PostgreSQL
      • PostgreSQL on-prem
    • AWS Glue (beta)
    • dbt
      • dbt Cloud
      • dbt Core (open source)
      • dbt Tags
      • dbt Tests
      • dbt docs Sync
        • Github dbt docs Sync
        • Bitbucket dbt docs Sync
      • dbt Impact Report
      • dbt Project Dependencies
    • Apache Airflow (beta)
    • Tableau
      • Tableau Cloud
      • Tableau Server
    • PowerBI
    • Looker
    • Metabase
    • Fivetran (beta)
    • Mode
    • Sigma Computing
    • Sisense / Periscope (beta)
    • Looker Studio (beta)
    • ThoughtSpot
    • QuickSight (beta)
      • Event Logs
    • Hex (beta)
    • Slack
    • Monte Carlo
    • Private Network
    • Request an Integration
  • ✨Features
    • Search
    • Table Page
    • Database Page
    • Dashboard Page
    • Data Lineage
    • Entity Relationship Diagram (ERD)
    • Queries & Joins
    • Tags
    • Teams
    • Discussion
    • Downstream Notifications
    • Documentation
      • Pages
      • Metrics
        • Metrics Generation
      • Glossary
    • Automated Documentation
    • User Analytics
    • Chrome Extension
      • Organization-wide install
    • Source Tables
    • Cost Analysis
    • Schema Change Detection
    • AI Features & Settings
      • Ask AI Chatbot
    • Request a Feature
  • 🧭Data Discovery
    • Where's my data?
    • Where's my dashboard?
    • How can I get the full context of this data?
    • My dashboard looks off
    • Change management
    • I'm new to the team
    • I have a data question
  • 🗃️Data Management
    • Add Documentation
      • CSV Metadata Upload
    • Collections
    • Tags
    • Data Ownership
    • Sensitive / PII Data
    • Automated PII Detection
  • 📚Learning Data
    • Getting Started: Looker
    • Getting Started: Mode
    • Getting Started: Tableau
    • Getting Started: Snowflake
    • Getting Started: Databricks
    • Getting Started: Data Warehouse
    • Getting Started: BigQuery
      • Nested Fields
    • Getting Started: Sigma
    • Getting Started: ThoughtSpot
  • 🛠️Data Source Management
    • Manage Data Sources
    • Connect Data Source Users to Select Star
    • Custom Attributes
    • Recent Queries
  • 👥User Management
    • Invite Users
    • Roles & Permissions
    • SAML SSO
    • Importing Roles and Teams (Okta)
    • Policy Based Access Control
    • Account and User Settings
  • 💻Select Star API
    • Overview
    • API Token
    • Getting Started
    • Rich Text Descriptions via API
    • Troubleshooting
    • API Examples
    • API Reference
  • 🔓Security & Compliance
  • ❓FAQ
    • Icon Map
  • 📰Changelog
    • May 20, 2025 - Chrome Extension, Notifications, and More!
    • April 16, 2025 - Semantic Models, AI Metrics, and More!
    • March 12, 2025 - Fivetran Integration, Tableau Updates and More!
    • February 6, 2025 - Collections, Slack App Published, Salesforce Formula Lineage and more!
    • December 10, 2024 - Hex Integration, Impact Score & Snowflake Key Pair Authentication!
    • November 13, 2024 - New Navigation, Airflow and More!
    • September 30, 2024 - Upstream Data Quality Issue Tracking & 5 New Integrations!
    • August 30, 2024 - Monte Carlo, dbt Cross-Project Lineage
    • July 31, 2024 - Glossary Import, Lineage Updates & more!
    • July 9, 2024 - Lineage Explorer 2.0, Slack AI and Notifications
    • February 29, 2024 - AI Chat, Schema Change Notifications
    • February 23, 2024 - Manual Lineage Creation
    • November 23, 2023 - Bulk AI Documentation
    • October 19, 2023 - Downstream Notifications
    • October 16, 2023 - New Homepage
    • October 13, 2023 - dbt Impact Report
    • Historical Changelogs
  • Security & Compliance
  • System Status
Powered by GitBook
On this page
  • Before you start
  • 1. Create a Service Principal in Databricks
  • Add a service principal to your Databricks account
  • Add service principals to your account using the account console
  • Add a service principal to a workspace
  • Assign a service principal to a workspace using the account console
  • Grant SQL and Workspace access for a service principal
  • Grant permissions to a catalog for a service principal
  • Grant permission to a workspace for a service principal
  • 2. Generate a Personal Access Token
  • Generate a Personal Access Token
  • 3. Connect Databricks to Select Star
  • 4. Choose catalog and schemas

Was this helpful?

  1. Integrations
  2. Databricks

Databricks on AWS

PreviousDatabricksNextDatabricks on Azure

Last updated 1 year ago

Was this helpful?

Before you start

Ensure Unity Catalog is enabled for your Databricks instance. For details, see .

To connect Databricks to Select Star, you will need...

  • an Databricks instance on AWS. For details, see .

  • Account admin permissions on the Databricks instance

  • Workspace admin permissions on the Databricks instance

Complete all of the following steps to see Databricks metadata, lineage, and popularity in Select Star.

1. Create a Service Principal in Databricks

What is a Service Principal?

A service principal is an identity that you create in Databricks for use with automated tools, jobs, and applications. Service principals give automated tools and scripts API-only access to Databricks resources, providing greater security than using users or groups. It also prevents jobs and automations from failing if a user leaves your organization or a group is modified. For details, see .

Add a service principal to your Databricks account

Account admins can add service principals to your Databricks account using the account console or the System for Cross-domain Identity Management (SCIM) Account API.

Add service principals to your account using the account console

To add a service principal to the account using the account console:

  1. Click User management.

  2. On the Service principals tab, click Add service principal.

  3. Enter a name (SelectStar) for the service principal.

  4. Click Add.

💡 To use service principals, you must add them to a workspace and generate access tokens for them in the workspace.

Add a service principal to a workspace

  • The account console

  • The Workspace Assignment API

Workspace admins can manage service principals in their workspace using the following:

  • The workspace admin console (if the workspace is enabled for identity federation)

  • The workspace-level SCIM (ServicePrincipals) API

  • The Workspace Assignment API (if the workspace is enabled for identity federation)

Assign a service principal to a workspace using the account console

To add service principals to a workspace using the account console, the workspace must be enabled for identity federation.

  1. Click Workspaces.

  2. On the Permissions tab, click Add permissions.

  3. Search for and select the service principal SelectStar and assign the permission level (workspace Admin), and click Save.

Grant SQL and Workspace access for a service principal

To grant SQL Warehouse access for a service principal using the workspace admin console, the workspace must be enabled for identity federation.

  1. As a workspace admin, log in to the Databricks workspace.

  2. Click your username in the top bar of the Databricks workspace and select Admin Console.

  3. Click Settings and select Service principals.

  4. On the Service principals tab, click the service principal that was create in the previous steps.

  5. Select the checkbox for Databricks SQL access and Workspace access, and click Update.

Grant permissions to a catalog for a service principal

  1. Log in to a workspace that is linked to the metastore.

  2. Click Data.

  3. Click the catalog that needs to be granted access to, and select Permissions.

  4. Click Grant.

  5. Select the user/group and grant Privilege presets to Data Reader, and select the checkbox for USE CATALOG, USE SCHEMA and SELECT, and click Grant.

Grant permission to a workspace for a service principal

This step is required to show notebooks in the catalog and notebook lineage.

  1. Log in to a workspace that is linked to the metastore.

  2. Click Workspace and select top folder.

  3. Click Share button.

  4. Select the user/group, then select permission "Can view", and click Add.

2. Generate a Personal Access Token

To authenticate a service principal to APIs on Databricks, an administrator can create a Databricks Personal Access Tokens on behalf of the service principal.

Generate a Personal Access Token

POST https://<deployment name>.cloud.databricks.com/api/2.0/token-management/on-behalf-of/tokens/

When you want to use the Databricks API to generate a Personal Access token on behalf of a user or service principal, use this command.

Use the token value generated from this response as API key.

Request Body

Name
Type
Description

application_id*

String

comment

String

lifetime_seconds

String

Use value = -1 in order for it to live indefinitely


{
    "token_value": "dapia.....", #Use this value
    "token_info": {
        "token_id": "4305bc67998.........",
        "creation_time": 1671720121149,
        "expiry_time": -1,
        "comment": "Service Principal Token. API Test",
        "created_by_id": 355825636633264,
        "created_by_username": "prat@getselectstar.com",
        "owner_id": 4012126671306509
    }
}

{
    "error_code": "INVALID_PARAMETER_VALUE",
    "message": "edxiueirxxxxxxxx does not exist"
}

3. Connect Databricks to Select Star

Go to the Select Star Settings. Click Data in the sidebar, then + Add to create a new Data Source.

Choose Databricks in the Source Type dropdown and provide the following information:

Display Name: This value is Databricks by default, but you can override it if desired.

Access Token: This is the Personal access token from Step 2, which is used to authenticate access to Databricks.

4. Choose catalog and schemas

After you fill in the information, you'll be asked to select the catalog you'd like to load into Select Star.

💡 Select Star will not read queries or metadata or generate lineage for Catalogs, schemas, or tables that are not loaded. Please load all data for which you expect to see lineage.

Select the catalogs and click Next.

For each catalog you selected, you'll be able to select the schemas.

Your metadata should start loading automatically. Please allow 24-48 hours to completely generate popularity and lineage.

When the sync is complete, you'll be able to explore Databricks in Select Star.

See the link below for more information on Databricks in Select Star.

As an account admin, log in to the .

To add a service principal via REST API, see .

Account admins can add service principals to using the following:

As an account admin, log in to the .

To add a service principle to a workspace via admin console or REST API, see .

These are the minimum permissions required for Select Star to collect basic metadata and query history. Query history is also used to generate .

Grant the to the service principal.

Create a Databricks personal access token on behalf of the service principal using the POST /token-management/on-behalf-of/tokens operation in the . An administrator can also list personal access tokens and delete them using the same API.

UUID of the Service Principal, and can be found here -

For detailed, step-by-step instructions for creating access tokens for service principals, see .

Workspace URL: This is the address of the Workspace. This should include the <deployment name>.cloud.databricks.com . Deployment Name can be found in

You can you have loaded if needed.

🔄
Getting Started with Unity catalog
Databricks' documentation
Create a service principal (SelectStar) in Databricks
Generate a Personal Access Token
Connect Databricks to Select Star
Choose Catalogs and Schemas
Manage Service Principal
account console
Add service principals to your account using the SCIM (Account) API
identity-federated workspaces
account console
Add a service principal to a workspace
Data Lineage
Can Use token permission
token management REST API
Service principals for Databricks automation
https://accounts.cloud.databricks.com/workspaces
change the catalogs and schemas
Getting Started: Databricks
https://accounts.cloud.databricks.com/users/serviceprincipals/
Admin Console
Entitlements for service principal
Catalog permissions in the Data Explorer UI
Privileges for service principal or User groups
Folder permissions in the Workspace explore UI
Permission grant in Workspace share