Skip to content
Brief

Unity Catalog

Data Governance & Catalog

Unified, open-source governance and lineage for all data and AI assets across multi-cloud environments, enabling fine-grained access control and compliance without vendor lock-in.

Last updated May 11, 2026 by the ATDb Editorial Team

Founded
2021
HQ
San Francisco, California, United States
Parent
Connections
9

At a glance

Employees
10000+
7integrations1competitors1corporate family

About

Leading open-source data catalog and governance layer for Lakehouse architectures, competing directly with Snowflake's Apache Polaris and proprietary catalogs like AWS Glue and Google Dataplex.

Unity Catalog is Databricks's unified governance solution for data and AI, providing centralized access control, auditing, lineage tracking, and data discovery across all data assets on the Databricks Lakehouse Platform. Originally launched as a proprietary feature within Databricks in 2021, Unity Catalog was open-sourced in June 2024, allowing organizations to use it independently of the Databricks platform and enabling broader ecosystem adoption. The catalog supports fine-grained governance for tables, files, machine learning models, dashboards, and notebooks, making it one of the more comprehensive metadata and governance layers available in the modern data stack. Its open-source release was a strategic move to compete directly with Snowflake's Apache Polaris (also open-sourced in 2024) and to establish Unity Catalog as a neutral, interoperable standard for data governance across multi-cloud and multi-engine environments. In the AdTech and broader data ecosystem, Unity Catalog is significant for organizations managing large volumes of audience data, campaign performance data, and identity graphs. It enables data mesh architectures, enforces privacy compliance through attribute-based access controls, and provides end-to-end data lineage critical for regulatory requirements like GDPR and CCPA. Its integration with Delta Lake, Apache Spark, and other open formats positions it as a foundational governance layer for enterprises running data-intensive advertising and marketing analytics workloads.

Business model

Open Source + SaaS

Target market

Enterprise

What they offer

  • Unity Catalog (Open Source)

    Open-source universal catalog for data and AI governance, released June 2024, supporting multi-engine and multi-cloud environments.

  • Data Lineage

    Automated end-to-end lineage tracking across tables, notebooks, workflows, and dashboards within the Databricks ecosystem.

  • Fine-Grained Access Control

    Row-level and column-level security with attribute-based access controls for tables, views, volumes, and models.

  • Data Discovery & Search

    Centralized metadata search and tagging to help users find, understand, and trust data assets across the organization.

  • Audit Logging

    Comprehensive audit trails of data access and modifications to support compliance and security investigations.

  • Delta Sharing Integration

    Native integration with Delta Sharing protocol for secure cross-organizational data sharing without data movement.

Key features

Open-source universal data catalogFine-grained row and column-level securityEnd-to-end automated data lineageCentralized metadata managementMulti-cloud and multi-engine supportAI and ML model governanceDelta Sharing for secure external data sharingAttribute-based access control (ABAC)Data tagging and classificationAudit logging and compliance reporting

Use cases

Centralized data governance for enterprise data lakehousesAudience data access control for AdTech platformsGDPR/CCPA compliance enforcement via column masking and row filtersData mesh architecture enablement with decentralized ownershipML model registry and governance for AI-driven advertisingCross-team data discovery and self-service analyticsData lineage tracking for regulatory auditsSecure data sharing with advertising partners via Delta Sharing

Customer segments

Large enterprises with complex data governance needsAdTech and MarTech companies managing audience and campaign dataFinancial services firms with strict compliance requirementsHealthcare organizations managing sensitive dataRetail and e-commerce companies with large data estatesMedia and entertainment companiesData platform and engineering teams

Tech & specs

Technology stack

Apache SparkDelta LakeApache IcebergPythonScalaREST APIsApache ArrowDelta Sharing ProtocolKubernetesAWS / Azure / GCP

Security & compliance

SOC 2 Type IIGDPRCCPAHIPAAISO 27001FedRAMP (via Databricks platform)

Deployment

CloudHybridOn-premise (via open-source self-hosted)

API

Yes

Corporate history
  • 2021Founded
Connection details
See integrations with Unity Catalog (7)

Explore further

2 views