
Introduction

Sterndesk - Turn documents into structured data

Sterndesk is a document processing API that extracts structured data from complex documents. It automates workflows that previously required manual data entry, whether you’re processing shipping manifests, research papers, compliance reports, or enterprise records.

What Sterndesk Does

The API accepts documents via upload or URL crawling and returns structured JSON data according to schemas you define. Key capabilities include:
  • Document classification — Automatically identify document types
  • Data extraction — Extract fields, tables, and values using AI
  • Schema-based output — Define exactly what data you need and how it should be structured
  • Batch processing — Process thousands of documents in a single request
  • Flexible ingestion — Feed documents via direct upload, URL crawling, or integrations
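
To make schema-based output concrete, here is a minimal local sketch in Python. It does not call the Sterndesk API. The schema format, the field names (`shipper`, `consignee`, `container_count`, `line_items`), the `conforms` helper, and the sample JSON are all hypothetical and chosen for illustration only. The sketch shows the general idea of checking that returned JSON matches the fields a schema asks for.

```python
import json

# Hypothetical schema: maps each expected field to a Python type.
# This is NOT Sterndesk's actual schema format, only an illustration.
BILL_OF_LADING_SCHEMA = {
    "shipper": str,
    "consignee": str,
    "container_count": int,
    "line_items": list,
}


def conforms(result: dict, schema: dict) -> list[str]:
    """Return a list of problems; an empty list means the result matches."""
    problems = []
    for field_name, expected in schema.items():
        if field_name not in result:
            problems.append(f"missing field: {field_name}")
        elif not isinstance(result[field_name], expected):
            actual = type(result[field_name]).__name__
            problems.append(
                f"{field_name}: expected {expected.__name__}, got {actual}"
            )
    return problems


# Made-up JSON standing in for an extraction result.
sample = json.loads(
    '{"shipper": "Acme Freight", "consignee": "Harbor Co", '
    '"container_count": 3, '
    '"line_items": [{"description": "steel coils", "weight_kg": 1200}]}'
)

if __name__ == "__main__":
    print(conforms(sample, BILL_OF_LADING_SCHEMA))
```

A check like this is useful on the consuming side regardless of how the schema is actually declared: downstream code can reject or flag a result before it reaches a database or workflow.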

Core Concepts

Sterndesk is organized around four main concepts:
Concept | Description
Organizations & Projects | Logical containers for managing access, billing, and grouping related work
Extraction Schemas | Definitions that specify what data to extract and how to structure it
Collectors | Entry points that feed documents into the system (uploads, URLs, integrations)
Extractions | The processed results containing structured data from your documents
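
One way to picture how these concepts relate is as a small data model. The Python sketch below is an assumption, not Sterndesk's actual object model: the class names mirror the concepts above, but every attribute, and the choice to nest schemas, collectors, and extractions under a project, is hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class ExtractionSchema:
    name: str
    fields: dict[str, str]  # hypothetical: field name -> type label


@dataclass
class Collector:
    name: str
    kind: str  # hypothetical labels: "upload", "url", or "integration"


@dataclass
class Extraction:
    document_name: str
    schema_name: str
    data: dict  # structured data produced for one document


@dataclass
class Project:
    # Assumption: a project groups schemas, collectors, and extractions.
    name: str
    schemas: list[ExtractionSchema] = field(default_factory=list)
    collectors: list[Collector] = field(default_factory=list)
    extractions: list[Extraction] = field(default_factory=list)


@dataclass
class Organization:
    name: str
    projects: list[Project] = field(default_factory=list)


# Illustrative wiring with made-up names and values.
schema = ExtractionSchema("manifest", {"vessel": "string", "cargo_tons": "number"})
project = Project(
    name="port-ops",
    schemas=[schema],
    collectors=[Collector("inbox", "upload")],
)
project.extractions.append(
    Extraction("manifest-001.pdf", schema.name, {"vessel": "Example", "cargo_tons": 10})
)
org = Organization("example-org", [project])
```

Read top down, an organization holds projects, a collector brings documents in, a schema says what to pull out, and an extraction is the result for one document.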

Use Cases

Sterndesk is built for software teams handling document-heavy workflows:
  • Maritime & logistics — Process cargo manifests, bills of lading, and shipment documents
  • Ocean enterprise — Extract data from vessel reports, port documentation, and regulatory filings
  • Research & academia — Structure data from scientific papers, surveys, and institutional records
  • Compliance & legal — Parse contracts, audit reports, and regulatory submissions
  • Enterprise operations — Automate invoice processing, procurement documents, and internal records

Next Steps