LogoLogo
About UsCustomersResourcesGet Started for Free
  • What is Select Star?
  • 🏁Getting Started
    • 1. Data Source Setup
    • 2. Mark Service Accounts
    • 3. Hide Unwanted Datasets
    • 4. Invite Owners
    • 5. Add Documentation
    • Next Steps
  • 🔄Integrations
    • Snowflake
      • Using Key Pair Authentication
      • Using Password Authentication
      • Snowflake Tag Sync
      • Snowflake Key Pair Rotation
    • Databricks
      • Databricks on AWS
      • Databricks on Azure
    • BigQuery
    • AWS Redshift
      • Manual setup
    • Microsoft SQL Server / MS SQL (beta)
      • Query Logs
    • MySQL (beta)
      • Query Logs
    • Oracle (beta)
      • Query Logs
    • Salesforce (beta)
    • DB2 (beta)
    • PostgreSQL
      • AWS Aurora PostgreSQL
      • AWS RDS PostgreSQL
      • PostgreSQL on-prem
    • AWS Glue (beta)
    • dbt
      • dbt Cloud
      • dbt Core (open source)
      • dbt Tags
      • dbt Tests
      • dbt docs Sync
        • Github dbt docs Sync
        • Bitbucket dbt docs Sync
      • dbt Impact Report
      • dbt Project Dependencies
    • Apache Airflow (beta)
    • Tableau
      • Tableau Cloud
      • Tableau Server
    • PowerBI
    • Looker
    • Metabase
    • Fivetran (beta)
    • Mode
    • Sigma Computing
    • Sisense / Periscope (beta)
    • Looker Studio (beta)
    • ThoughtSpot
    • QuickSight (beta)
      • Event Logs
    • Hex (beta)
    • Slack
    • Monte Carlo
    • Private Network
    • Request an Integration
  • ✨Features
    • Search
    • Table Page
    • Database Page
    • Dashboard Page
    • Data Lineage
    • Entity Relationship Diagram (ERD)
    • Queries & Joins
    • Tags
    • Teams
    • Discussion
    • Downstream Notifications
    • Documentation
      • Pages
      • Metrics
        • Metrics Generation
      • Glossary
    • Automated Documentation
    • User Analytics
    • Chrome Extension
    • Source Tables
    • Cost Analysis
    • Schema Change Detection
    • AI Features & Settings
      • Ask AI Chatbot
    • Request a Feature
  • 🧭Data Discovery
    • Where's my data?
    • Where's my dashboard?
    • How can I get the full context of this data?
    • My dashboard looks off
    • Change management
    • I'm new to the team
    • I have a data question
  • 🗃️Data Management
    • Add Documentation
      • CSV Metadata Upload
    • Collections
    • Tags
    • Data Ownership
    • Sensitive / PII Data
    • Automated PII Detection
  • 📚Learning Data
    • Getting Started: Looker
    • Getting Started: Mode
    • Getting Started: Tableau
    • Getting Started: Snowflake
    • Getting Started: Databricks
    • Getting Started: Data Warehouse
    • Getting Started: BigQuery
      • Nested Fields
    • Getting Started: Sigma
    • Getting Started: ThoughtSpot
  • 🛠️Data Source Management
    • Manage Data Sources
    • Connect Data Source Users to Select Star
    • Custom Attributes
    • Recent Queries
  • 👥User Management
    • Invite Users
    • Roles & Permissions
    • SAML SSO
    • Importing Roles and Teams (Okta)
    • Policy Based Access Control
    • Account and User Settings
  • 💻Select Star API
    • Overview
    • API Token
    • Getting Started
    • Rich Text Descriptions via API
    • Troubleshooting
    • API Examples
    • API Reference
  • 🔓Security & Compliance
  • ❓FAQ
    • Icon Map
  • 📰Changelog
    • April 16, 2025 - Semantic Models, AI Metrics, and More!
    • March 12, 2025 - Fivetran Integration, Tableau Updates and More!
    • February 6, 2025 - Collections, Slack App Published, Salesforce Formula Lineage and more!
    • December 10, 2024 - Hex Integration, Impact Score & Snowflake Key Pair Authentication!
    • November 13, 2024 - New Navigation, Airflow and More!
    • September 30, 2024 - Upstream Data Quality Issue Tracking & 5 New Integrations!
    • August 30, 2024 - Monte Carlo, dbt Cross-Project Lineage
    • July 31, 2024 - Glossary Import, Lineage Updates & more!
    • July 9, 2024 - Lineage Explorer 2.0, Slack AI and Notifications
    • February 29, 2024 - AI Chat, Schema Change Notifications
    • February 23, 2024 - Manual Lineage Creation
    • November 23, 2023 - Bulk AI Documentation
    • October 19, 2023 - Downstream Notifications
    • October 16, 2023 - New Homepage
    • October 13, 2023 - dbt Impact Report
    • Historical Changelogs
  • Security & Compliance
  • System Status
Powered by GitBook
On this page
  • Before you start
  • 1. Connect AWS Glue to Select Star
  • 2. Create an AWS IAM Role ARN using Cloudformation
  • 3. Confirm authorization
  • 4. Choose Catalogs and Databases

Was this helpful?

  1. Integrations

AWS Glue (beta)

PreviousPostgreSQL on-premNextdbt

Last updated 1 year ago

Was this helpful?

Before you start

Ensure AWS Glue database and tables are setup in your AWS Glue instance. For details, see .

To connect AWS Glue to Select Star, you will need to...

1. Connect AWS Glue to Select Star

Select AWS Glue from the Add Data Source menu and provide the

Display Name - This value is AWS Glue by default, but you can overrided.

Region - ID of the AWS region where the cluster was created. For example us-east-2,us-west-1, eu-central-1

2. Create an AWS IAM Role ARN using Cloudformation

Select Star recommends use of AWS CloudFormation to setup integration, which allows you to make necessary changes to the AWS Glue environment in a automatic, transparent, safe and auditable manner.

AWS CloudFormation creates an AWS IAM Role to enable access for Select Star and add it to AWS Glue cluster.

You can pass the link to CloudFormation to the infrastructure team to enable the integration to be created.

1. A simple form will be displayed in Select Star.

2. Select the "Open CloudFormation" button. A new window will open to proceed to the creation of a CloudFormation stack by AWS Management Console. Make sure you are logged into the AWS account in which the AWS Glue cluster exist.

3. The Create Stack form will be displayed. Fill form in the required information:

4. Review the information and under Capabilities choose "I acknowledge that AWS CloudFormation might create IAM resources".

5. Click Create stack.

6. Wait until the stack changes it status to "CREATE_COMPLETE" from "CREATE_IN_PROGRESS" in tab "Stack Info". The operation should take up to 5 minutes. You need to refresh tab to see the progress.

7. After completing stack creation, the Role ARN is available from the "Outputs". Copy and save the RoleArn for later use.

3. Confirm authorization

1. Return to Select Star. You should see a form that allows you to provide "Role ARN". Fill form in the required information:

2. Click Connect.

4. Choose Catalogs and Databases

After you fill in the information, you'll be asked to select the catalogs and databases you'd like to load into Select Star.

Select Star will not read queries or metadata or generate lineage for databases, or tables that are not loaded. Please load all data for which you expect to see lineage.

Select the Catalog from the list (if more than one).

Select Databases from the list.

Click Next and your metadata should start loading automatically. Please allow 24-48 hours to completely generate lineage.

When the sync is complete, you'll be able to explore AWS Glue in Select Star.

The source code of the CloudFormation template along with build scripts and real-time logs of the continuous deployment system is available on public repository on GitHub "" to be freely audited.

Role ARN: Identifier of AWS IAM Role to use by Select Star. You'll see this after completing of the instructions.

You can you have loaded if needed.

🔄
selectstar/cloudformation-templates
change the catalog and databases
step 2.7
Adding an AWS Glue crawler
Connect AWS Glue to Select Star
Create an AWS IAM Role ARN using Cloudformation
Confirm authorization
Choose Catalogs and Databases
Select AWS Glue from the Source Type list
Provide a Role ARN
Outputs tab
Enter Role ARN from Step 2.7
Ingesting Catalog
Select Databases