Skip to main content

Welcome to SoMark

SoMark converts PDFs, PPTs, images, and many other document formats into machine-readable structured output with high accuracy, high speed, and strong cost efficiency, providing high-quality data for LLM training and RAG applications.

99% OCR Accuracy

Industry-leading recognition accuracy with coordinate traceback to pinpoint every element in the source document.

100 Pages in 5 Seconds

High-speed parsing with horizontally scalable cluster deployment for large-scale batch workloads.

Pay As You Go

Usage-based billing or one-time licensing. Private deployment starts from a single RTX 3090 GPU.

21 Component Types

Detects headings, tables, formulas, images, chemical structures, seals, QR codes, and 14 more element types.

Multiple Output Formats

Outputs Markdown, JSON — ready for LLM training pipelines and RAG applications.

Broad Document Coverage

Supports research papers, reports, whitepapers, contracts, scanned books, government files, and more.

Supported file formats

pdf png jpg jpeg bmp tiff jp2 dib ppm pgm pbm gif heic heif webp xpm tga dds xbm doc docx ppt pptx xlsx xlsm xls

Recognized document elements

SoMark can recognize these 21 document element types:
CategoryElements
Text structureTitle title, text block text, header header, footer footer, footnote footnote
Figures and tablesFigure figure, figure caption figure_caption, table table, table caption table_caption
Specialized contentEquation equation, chemical structure cs, chemical equation cs_equation, code block code
Navigation and layoutSidebar sider, table of contents cate, TOC entry cate_item
Education and structured itemsChoice item choice, fill-in-the-blank blank, reference reference
Special elementsQR code qrcode, stamp stamp
Title
Title
Text block
Text block
Figure
Figure
Figure caption
Figure caption
Table
Table
Table caption
Table caption
Equation
Equation
Header
Header
Footer
Footer
Sidebar
Sidebar
Footnote
Footnote
TOC
TOC
TOC entry
TOC entry
Choice
Choice
Code block
Code block
Blank
Blank
Reference
Reference
QR code
QR code
Stamp
Stamp
Chemical structure
Chemical structure
Chemical equation
Chemical equation

Get Started

See the Quickstart Guide to begin; if you want to inspect API capability and limits first, jump to the API overview, and use FAQs for common questions.