Duplicate
Schema-driven detector documentation.
DUPLICATEplannedP24 params2 examples
Detector Metadata
Capability catalog entry from
all_detectors.json.Categories
QUALITY
Supported Asset Types
TXTTABLEURL
Recommended Model
datasketch (MinHash LSH)Notes
Finds near-duplicate content to reduce noise and repeated policy or legal risk.
Parameters
Configuration parameters for the Duplicate detector. Shared from `GenericDetectorConfig`.
| Parameter | Type | Required | Description | Default | Constraints |
|---|---|---|---|---|---|
| enabled_patterns | array | null | No | List of enabled pattern names | null | — |
| severity_threshold | enum | null | No | Minimum severity to report | null | — |
| confidence_threshold | number | No | Minimum confidence to report (0-1) | 0.7 | min 0, max 1 |
| max_findings | integer | null | No | Maximum number of findings to return | null | — |