Skip to main content
The classify node sends each document page to a specific output branch based on its content. Use it to sort incoming documents by type, topic, or any other classification criteria before applying downstream processing.

When to use classify

  • You receive mixed document types through one trigger and need to apply different processing to each.
  • Different document types call for different extract schemas. Classifying upstream lets each branch use a focused schema instead of one mega-schema covering everything.
  • You want to drop or quarantine documents that don’t fit any expected type. Use the other output for unmatched pages.
  • Use route instead when you’re filtering on a metadata condition that doesn’t require reading the document content.
For mode selection, criteria writing, and confidence tuning, see classify best practices.

Configuration

FieldTypeRequiredDescription
ModeselectYesClassification method: Rules or AI
Routing Rulesrule editorWhen mode is RulesConditions that determine which output branch each page follows
Target Pipelinespipeline pickerWhen mode is AIPipelines to match pages against. Only pipelines with a Forward Trigger are selectable
Confidence ThresholdnumberWhen mode is AIMinimum similarity score (0–1). Pages below it route to the other output. Defaults to 0.7
Force OCRtoggleNoForces OCR processing on the document before classification, even when it contains selectable text

Modes

Rules

Define conditions that match page content or page numbers. Each rule maps to a different output edge. Rules accept Contains text, Not contains text, and Page numbers.

AI

Uses AI embedding similarity to match pages against reference documents in your target pipelines. Select one or more Target Pipelines. Each target pipeline must have a Forward Trigger with reference documents so Ingestly can compare incoming pages against it. Set the Confidence Threshold to control how strict the match must be. Pages with a similarity score at or above the threshold are forwarded to the matching pipeline; pages below fall through to the other output so you can handle them separately.

Inputs and outputs

Allowed inputs: Trigger nodes, route, parse, review. Output: Pages routed to the matching output edge, or to the other edge if no classification matched.

Credits

See credits for pricing. Rules mode costs 0.5 credits per page; AI mode costs 0.2 credits per page. Generating embeddings on the forward trigger when you upload reference documents is billed separately at 0.2 credits per page per reference document.

Classify best practices

Pick a mode, write criteria, and tune confidence

Conditional routing guide

Patterns for branching pipelines on document type

Forward trigger

Configure reference documents for AI classification

Extract action

Extract data after classifying by document type