Skip to Content
Unified docs shell with shared Classifyre tokens and acid-green highlight accents.
SourcesS3 Compatible Storage

S3 Compatible Storage

Schema-driven source documentation.

S3_COMPATIBLE_STORAGE41 fields2 examples
Commonly Asked Questions
Assistant knowledge mapped to this source type from assistant_knowledge.json.

Required
Fields required for a valid configuration payload under `config.required`.
PathTypeRequiredDescriptionDefaultConstraints
requiredobjectYesno extra properties
required.bucketstringYesBucket name for AWS S3, MinIO, Cloudflare R2, Backblaze B2, Garage, and other S3-compatible endpoints
Masked
Sensitive fields under `config.masked` (secrets/credentials).
PathTypeRequiredDescriptionDefaultConstraints
maskedobjectNoOptional static credentials. Leave empty to use ambient AWS credentials chain.no extra properties
masked.aws_access_key_idstringNoS3-compatible access key ID
masked.aws_secret_access_keystringNoS3-compatible secret access key
masked.aws_session_tokenstringNoOptional session token for temporary credentials
Optional
Optional configuration fields under `config.optional`.
PathTypeRequiredDescriptionDefaultConstraints
optionalobjectNono extra properties
optional.connectionobjectNono extra properties
optional.connection.endpoint_urlstringNoCustom endpoint URL for MinIO/R2/B2/Garage and other S3-compatible providersformat uri
optional.connection.max_keys_per_pageintegerNoMaximum objects requested per provider list API call200min 1, max 1000
optional.connection.max_object_bytesintegerNoMaximum bytes downloaded per object for MIME detection and text extraction5242880min 1024, max 52428800
optional.connection.region_namestringNoRegion (recommended for AWS; required by some S3-compatible providers)
optional.connection.request_timeout_secondsnumberNoNetwork timeout in seconds for list/download operations30min 1, max 300
optional.connection.verify_sslbooleanNoTLS certificate verification toggletrue
optional.scopeobjectNoObject scope and filtering controls.no extra properties
optional.scope.exclude_extensionsarrayNoOptional extension denylist
optional.scope.exclude_extensions[]stringNo
optional.scope.include_content_previewbooleanNoDownload object bytes to infer MIME and extract detector-ready text previewstrue
optional.scope.include_empty_objectsbooleanNoInclude zero-byte objects in extraction resultsfalse
optional.scope.include_extensionsarrayNoOptional extension allowlist (for example, .pdf, .csv, .parquet)
optional.scope.include_extensions[]stringNo
optional.scope.include_object_metadatabooleanNoAttach provider metadata (etag, size, content-type hints, timestamps) to asset checksumstrue
optional.scope.prefixstringNoObject key prefix filter (for example, exports/2026/)
Examples
Reference payloads generated from shared source examples JSON.
S3-compatible blob scan (latest files)
Scan object metadata and file previews from an S3-compatible bucket using endpoint_url for MinIO/R2/B2

Schedule

{
  "enabled": true,
  "preset": "nightly",
  "cron": "9 2 * * *",
  "timezone": "UTC"
}

Config Payload

{
  "type": "S3_COMPATIBLE_STORAGE",
  "required": {
    "bucket": "customer-exports"
  },
  "masked": {
    "aws_access_key_id": "minio-access-key",
    "aws_secret_access_key": "minio-secret-key"
  },
  "optional": {
    "connection": {
      "endpoint_url": "https://storage.example.internal",
      "region_name": "us-east-1",
      "max_object_bytes": 5242880
    },
    "scope": {
      "prefix": "daily/",
      "include_extensions": [
        ".csv",
        ".parquet",
        ".pdf"
      ],
      "include_content_preview": true
    }
  },
  "sampling": {
    "strategy": "LATEST",
    "limit": 100
  }
}
Backblaze B2 S3 endpoint scan
Scan objects from Backblaze B2 via S3-compatible endpoint and region settings

Schedule

{
  "enabled": true,
  "preset": "nightly",
  "cron": "14 2 * * *",
  "timezone": "UTC"
}

Config Payload

{
  "type": "S3_COMPATIBLE_STORAGE",
  "required": {
    "bucket": "b2-random-archive-bucket"
  },
  "masked": {
    "aws_access_key_id": "b2-key-id-7f3c9a4d1e",
    "aws_secret_access_key": "b2-secret-key-Qx7mN2kL9pV4sT1w"
  },
  "optional": {
    "connection": {
      "endpoint_url": "https://s3.us-west-002.backblazeb2.com",
      "region_name": "us-west-002",
      "verify_ssl": true,
      "request_timeout_seconds": 30
    },
    "scope": {
      "include_content_preview": true
    }
  },
  "sampling": {
    "strategy": "LATEST",
    "limit": 100
  }
}