Skip to main content
When documents are uploaded, Plomo classifies each one against your deal’s taxonomy using multimodal AI. The process is designed to be accurate and transparent — you can see the confidence score and evidence for every classification.

How Documents Are Classified

Plomo reads each document and compares it against your deal’s categories. For each document, it determines:
  • Category — Which top-level group does this belong to?
  • Subcategory — Which specific area within that group?
  • Confidence — How certain is the classification? (0–100%)
  • Evidence — What parts of the document support this classification?

Two Classification Modes

Text Mode

Works for all file types. Plomo extracts the text and sends it to the AI with your taxonomy using a chain-of-thought pipeline. The AI reasons step by step, scoring each category before making a decision. Long documents are intelligently trimmed to preserve the most relevant content while staying within model context limits.

Vision Mode

For PDFs only. Plomo sends the actual PDF pages as images to the multimodal AI. This computer vision approach catches information in tables, charts, headers, and formatting that text extraction might miss — especially useful for financial statements, org charts, and scanned documents. Both modes feed into the same ensemble voting system to ensure accuracy.

Ensemble Voting & Anomaly Detection

Rather than relying on a single AI call, Plomo runs multiple classification rounds with an ensemble approach:
  1. Multi-round inference — Multiple classification attempts with diverse parameters
  2. Anomaly filtering — Suspicious responses are automatically removed
  3. Consensus voting — Final answer selected when consistent agreement is reached

Confidence and Review Status

After classification, each document gets a review status:
ConfidenceStatusWhat It Means
85%+AcceptedHigh confidence — auto-accepted, no action needed
60–84%Needs ReviewModerate confidence — take a quick look
Below 60%UncategorizedLow confidence — classify manually
This tiered approach means you only spend time reviewing the documents that actually need human judgment.

Real-Time Results

Classifications stream to the dashboard in real time over HTTP/2. As each document finishes processing, it appears in the table immediately — you don’t have to wait for the entire batch. Documents are parsed and classified in parallel across multiple worker threads.

Pipeline Optimization (Opt-In)

If you provide labeled examples from past deals, the evolutionary optimizer can improve classification accuracy:
  1. Takes your labeled examples as training data
  2. Evolves better classification prompts through a reflection loop
  3. Evaluates candidates against your data and selects top performers
  4. Saves an ensemble of the best prompts for higher accuracy
The optimized pipeline loads on future runs for improved accuracy on your specific document types and taxonomy.
Plomo does not collect or store your documents for training purposes. Optimization only runs when you explicitly provide labeled data.