
Databricks

Schema-driven source documentation.

DATABRICKS: 43 fields, 2 examples

Required
Fields required for a valid configuration payload under `config.required`.
| Path | Type | Required | Description | Default | Constraints |
| --- | --- | --- | --- | --- | --- |
| required | object | Yes | | | |
Masked
Sensitive fields under `config.masked` (secrets/credentials).
| Path | Type | Required | Description | Default | Constraints |
| --- | --- | --- | --- | --- | --- |
| masked | object | Yes | | | |
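Because everything under `masked` is a secret, it can help to redact that object before a payload is logged or echoed back. The following is a minimal sketch, assuming the payload is a plain dictionary shaped like the examples on this page; `redact_masked` and the `***` placeholder are hypothetical and not part of Classifyre.

```python
import copy


def redact_masked(payload, placeholder="***"):
    """Return a copy of the payload with every value under "masked" replaced."""
    safe = copy.deepcopy(payload)  # never mutate the caller's payload
    if "masked" in safe:
        safe["masked"] = {key: placeholder for key in safe["masked"]}
    return safe


# Illustrative payload; the token value is a made-up placeholder.
payload = {
    "type": "DATABRICKS",
    "required": {"auth_mode": "PAT_TOKEN"},
    "masked": {"token": "example-secret"},
}
safe_payload = redact_masked(payload)
# safe_payload["masked"] == {"token": "***"}
```

The original payload is left untouched, so the real secret is still available to the code that performs the connection.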
Optional
Optional configuration fields under `config.optional`.
| Path | Type | Required | Description | Default | Constraints |
| --- | --- | --- | --- | --- | --- |
| optional | object | No | | | no extra properties |
| optional.connection | object | No | Databricks API and SQL statement execution tuning options. | | no extra properties |
| optional.connection.max_statement_polls | integer | No | Maximum polling attempts when waiting for SQL statement completion | 30 | min 1, max 120 |
| optional.connection.statement_timeout_seconds | integer | No | Maximum wait timeout for SQL statement execution | 60 | min 5, max 600 |
| optional.connection.timeout_seconds | integer | No | HTTP timeout for Databricks API calls | 30 | min 5, max 300 |
| optional.extraction | object | No | Databricks Unity Catalog extraction feature flags. | | no extra properties |
| optional.extraction.include_column_lineage | boolean | No | Attempt to fetch column-level lineage metadata | false | |
| optional.extraction.include_notebooks | boolean | No | Extract workspace notebook metadata as additional assets | false | |
| optional.extraction.include_pipelines | boolean | No | Extract Delta Live Tables pipeline metadata as additional assets | false | |
| optional.extraction.include_table_lineage | boolean | No | Include table-level lineage links between Unity Catalog tables | true | |
| optional.scope | object | No | Databricks Unity Catalog scope filters. | | no extra properties |
| optional.scope.exclude_catalogs | array | No | Catalog denylist (exact catalog names) | [] | |
| optional.scope.exclude_catalogs[] | string | No | | | |
| optional.scope.exclude_schemas | array | No | Schema denylist. Accepted forms: schema or catalog.schema | ["information_schema"] | |
| optional.scope.exclude_schemas[] | string | No | | | |
| optional.scope.include_catalogs | array | No | Optional catalog allowlist (exact catalog names) | | |
| optional.scope.include_catalogs[] | string | No | | | |
| optional.scope.include_hive_metastore | boolean | No | Include hive_metastore catalog in extraction | false | |
| optional.scope.include_schemas | array | No | Optional schema allowlist. Accepted forms: schema or catalog.schema | | |
| optional.scope.include_schemas[] | string | No | | | |
| optional.scope.include_tables | array | No | Optional table allowlist. Accepted forms: table, schema.table, or catalog.schema.table | | |
| optional.scope.include_tables[] | string | No | | | |
| optional.scope.table_limit_per_schema | integer | No | Optional cap on number of Unity Catalog tables extracted per schema | | min 1 |
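The `optional.connection` limits above can be checked before a payload is submitted. This is a minimal sketch, assuming client-side validation is wanted; the defaults and bounds are copied from the table, while `resolve_connection` is a hypothetical helper and not part of Classifyre.

```python
# (default, minimum, maximum) for each optional.connection field, per the table.
CONNECTION_LIMITS = {
    "max_statement_polls": (30, 1, 120),
    "statement_timeout_seconds": (60, 5, 600),
    "timeout_seconds": (30, 5, 300),
}


def resolve_connection(connection=None):
    """Apply defaults and range checks; raise ValueError on invalid input."""
    connection = connection or {}
    unknown = set(connection) - set(CONNECTION_LIMITS)
    if unknown:
        # The table marks this object as "no extra properties".
        raise ValueError(f"unknown connection keys: {sorted(unknown)}")
    resolved = {}
    for key, (default, minimum, maximum) in CONNECTION_LIMITS.items():
        value = connection.get(key, default)
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, int):
            raise ValueError(f"{key} must be an integer")
        if not minimum <= value <= maximum:
            raise ValueError(f"{key} must be between {minimum} and {maximum}")
        resolved[key] = value
    return resolved


settings = resolve_connection({"statement_timeout_seconds": 120})
# settings == {"max_statement_polls": 30,
#              "statement_timeout_seconds": 120,
#              "timeout_seconds": 30}
```

The same pattern would extend to `optional.scope.table_limit_per_schema` (minimum 1, no documented default or maximum).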
Examples
Reference payloads generated from shared source examples JSON.
Databricks Unity Catalog with PAT token
Extract Unity Catalog tables and lineage with PAT authentication and optional notebook/pipeline metadata

Schedule

{
  "enabled": true,
  "preset": "nightly",
  "cron": "26 0 * * *",
  "timezone": "UTC"
}
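The `cron` value has five space-separated fields. Assuming it follows the conventional five-field cron layout (this page does not state the dialect), the expression above reads as minute 26 of hour 0, every day, in the given timezone. A small sketch that labels the fields; `describe_cron` is a hypothetical helper.

```python
# Conventional five-field cron order; assumed, not confirmed by this page.
CRON_FIELDS = ("minute", "hour", "day_of_month", "month", "day_of_week")


def describe_cron(expression):
    """Split a five-field cron expression into named parts."""
    parts = expression.split()
    if len(parts) != len(CRON_FIELDS):
        raise ValueError("expected five space-separated cron fields")
    return dict(zip(CRON_FIELDS, parts))


nightly = describe_cron("26 0 * * *")
# nightly == {"minute": "26", "hour": "0", "day_of_month": "*",
#             "month": "*", "day_of_week": "*"}
```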

Config Payload

{
  "type": "DATABRICKS",
  "required": {
    "auth_mode": "PAT_TOKEN",
    "workspace_url": "https://adb-3018287583848948.8.azuredatabricks.net",
    "warehouse_id": "85a0db1067b31560"
  },
  "masked": {
    "token": "dapi533087dfbc1a9b17eaa95bbe01440726-2"
  },
  "optional": {
    "scope": {
      "include_catalogs": [
        "main"
      ]
    },
    "extraction": {
      "include_table_lineage": true,
      "include_column_lineage": false,
      "include_notebooks": true,
      "include_pipelines": true
    }
  },
  "sampling": {
    "strategy": "RANDOM",
    "limit": 20,
    "max_columns": 20,
    "max_cell_chars": 512
  }
}
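The payload above sets all four extraction flags explicitly, but each has a default in the Optional table. A minimal sketch of how the effective flags could be resolved, assuming omitted keys fall back to those documented defaults; `effective_extraction` is a hypothetical helper and not part of Classifyre.

```python
# Defaults copied from the optional.extraction rows of the Optional table.
EXTRACTION_DEFAULTS = {
    "include_column_lineage": False,
    "include_notebooks": False,
    "include_pipelines": False,
    "include_table_lineage": True,
}


def effective_extraction(optional):
    """Merge the payload's extraction overrides over the documented defaults."""
    overrides = optional.get("extraction", {})
    unknown = set(overrides) - set(EXTRACTION_DEFAULTS)
    if unknown:
        # optional.extraction is documented as "no extra properties".
        raise ValueError(f"unknown extraction flags: {sorted(unknown)}")
    return {**EXTRACTION_DEFAULTS, **overrides}


flags = effective_extraction({"extraction": {"include_notebooks": True}})
# flags == {"include_column_lineage": False, "include_notebooks": True,
#           "include_pipelines": False, "include_table_lineage": True}
```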
Databricks Unity Catalog with service principal
Use service principal auth for scheduled Databricks ingestion with recency-focused sampling

Schedule

{
  "enabled": true,
  "preset": "weekday_business",
  "cron": "8 15 * * 1-5",
  "timezone": "UTC"
}

Config Payload

{
  "type": "DATABRICKS",
  "required": {
    "auth_mode": "SERVICE_PRINCIPAL",
    "workspace_url": "https://adb-3018287583848948.8.azuredatabricks.net",
    "warehouse_id": "85a0db1067b31560",
    "client_id": "service-principal-client-id"
  },
  "masked": {
    "client_secret": "service-principal-client-secret"
  },
  "optional": {
    "scope": {
      "include_catalogs": [
        "main",
        "finance"
      ],
      "include_hive_metastore": false
    },
    "extraction": {
      "include_table_lineage": true,
      "include_column_lineage": true,
      "include_notebooks": false,
      "include_pipelines": false
    }
  },
  "sampling": {
    "strategy": "LATEST",
    "limit": 30,
    "order_by_column": "updated_at",
    "fallback_to_random": true
  }
}
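Comparing the two examples, the credential fields differ by `auth_mode`: the PAT payload carries `masked.token`, while the service principal payload carries `required.client_id` and `masked.client_secret`. The sketch below checks a payload against that pattern. The mapping is inferred from these two examples only, since the schema tables on this page do not list the individual fields, and `missing_fields` is a hypothetical helper.

```python
# Inferred from the two example payloads; treat as an assumption, not the schema.
COMMON_REQUIRED = ["auth_mode", "workspace_url", "warehouse_id"]
CREDENTIALS_BY_AUTH_MODE = {
    "PAT_TOKEN": {"required": [], "masked": ["token"]},
    "SERVICE_PRINCIPAL": {"required": ["client_id"], "masked": ["client_secret"]},
}


def missing_fields(payload):
    """Return dotted paths of fields that are absent or empty for the auth mode."""
    required = payload.get("required", {})
    masked = payload.get("masked", {})
    mode = required.get("auth_mode")
    if mode not in CREDENTIALS_BY_AUTH_MODE:
        raise ValueError(f"unsupported auth_mode: {mode!r}")
    spec = CREDENTIALS_BY_AUTH_MODE[mode]
    missing = [
        f"required.{key}"
        for key in COMMON_REQUIRED + spec["required"]
        if not required.get(key)
    ]
    missing += [f"masked.{key}" for key in spec["masked"] if not masked.get(key)]
    return missing


incomplete = {
    "type": "DATABRICKS",
    "required": {
        "auth_mode": "SERVICE_PRINCIPAL",
        "workspace_url": "https://adb-3018287583848948.8.azuredatabricks.net",
        "warehouse_id": "85a0db1067b31560",
        "client_id": "service-principal-client-id",
    },
    "masked": {},
}
problems = missing_fields(incomplete)
# problems == ["masked.client_secret"]
```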