Introduction

What is Parsefy?

Parsefy is a universal document extraction engine that uses AI to transform unstructured documents into structured JSON data. Define the data you need with a schema, upload your document, and get back JSON that matches that schema.

Schema-Driven

Define exactly what you need with JSON Schema, Pydantic models, or Zod schemas

Multi-Format Support

Process PDFs with native multimodal AI, and DOCX files through automatic Markdown conversion

High Accuracy

A fallback architecture escalates to more capable models when needed, keeping extractions reliable

Accurate Extraction

Strict extraction rules minimize errors and false data
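
The fallback idea behind High Accuracy can be pictured as a chain of extractors tried in order, from cheapest to most capable. The sketch below is a conceptual illustration only and does not describe Parsefy's internal implementation: the `Tier` type, the `extractWithFallback` function, and the `accept` check are all hypothetical names introduced here.

```typescript
// Conceptual sketch of a fallback chain. All names are hypothetical and do
// not describe Parsefy's internals.
type Tier<T> = {
  name: string;
  run: () => Promise<T>;
};

type FallbackResult<T> = { tier: string; value: T };

// Try each tier in order and return the first result that passes `accept`.
// A tier that throws is treated the same as one whose result is rejected.
async function extractWithFallback<T>(
  tiers: Tier<T>[],
  accept: (value: T) => boolean,
): Promise<FallbackResult<T>> {
  for (const tier of tiers) {
    try {
      const value = await tier.run();
      if (accept(value)) {
        return { tier: tier.name, value };
      }
    } catch {
      // Fall through to the next, more capable tier.
    }
  }
  throw new Error('All tiers failed to produce an acceptable result');
}
```

With Parsefy this escalation happens on the service side, so client code does not need to implement it; the sketch only shows the general pattern.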

Key Features

Feature	Description
Schema Adherence	100% compliance with your JSON Schema definition
PDF Processing	Native multimodal AI processing for PDFs
DOCX Support	Automatic Markdown conversion for Word documents
Smart Fallback	Automatic escalation to more capable models when needed
Confidence Metrics	Built-in quality scoring (0.0 - 1.0) with issue tracking
Rate Limiting	Built-in protection with credits-based and request-rate limits
Playground Mode	Test without an API key (10 credits/day)
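
The Confidence Metrics row describes a score between 0.0 and 1.0 together with tracked issues. One way a caller might act on those values is to send low-confidence results to manual review. This is a minimal sketch under stated assumptions: the field names `confidence` and `issues`, the `decide` helper, and the 0.8 threshold are illustrative and not taken from the Parsefy API, so check the API Reference for the actual response shape.

```typescript
// Hypothetical shape of extraction quality metadata; field names are assumed.
interface ExtractionQuality {
  confidence: number; // 0.0 (lowest) to 1.0 (highest), per the feature table
  issues: string[]; // human-readable problems noted during extraction
}

type Decision = 'accept' | 'review';

// Accept only results that meet the threshold and report no issues.
function decide(quality: ExtractionQuality, threshold = 0.8): Decision {
  if (quality.confidence < 0 || quality.confidence > 1) {
    throw new RangeError('confidence must be between 0.0 and 1.0');
  }
  const clean = quality.issues.length === 0;
  return quality.confidence >= threshold && clean ? 'accept' : 'review';
}
```

The threshold is a business decision: a stricter value sends more documents to review but lowers the chance of accepting a wrong field.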

How It Works

Define Your Schema

Create a JSON Schema or use Pydantic/Zod models to define the data structure you want to extract.

Upload Your Document

Send your PDF or DOCX file to the API along with your schema.

Get Structured Data

Receive JSON data that matches your schema, complete with confidence scores.
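
For the first step, a plain JSON Schema can stand in for a Pydantic or Zod model. The sketch below writes an invoice schema as a TypeScript object literal, with field names that mirror the Quick Example on this page. The constant names are illustrative, and how the schema is passed to the API is not shown because that depends on the SDK or endpoint you use.

```typescript
// A JSON Schema for an invoice, written as a plain object literal.
const invoiceSchema = {
  type: 'object',
  properties: {
    invoice_number: { type: 'string', description: 'The invoice number' },
    total: { type: 'number', description: 'Total amount' },
    vendor: { type: 'string', description: 'Vendor name' },
  },
  required: ['invoice_number', 'total', 'vendor'],
  additionalProperties: false,
} as const;

// Serialize the schema for transports that expect it as a JSON string
// (an assumption; the required format depends on the API).
const invoiceSchemaJson = JSON.stringify(invoiceSchema);
```

The `description` on each property plays the same role as `.describe()` in a Zod schema: it states in words what the field should contain.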

Quick Example

Extract invoice data with a simple API call:

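import { Parsefy } from 'parsefy';
import * as z from 'zod';

// Create the Parsefy client
const client = new Parsefy();

// Describe the fields to extract; each description explains what the field holds
const schema = z.object({
  invoice_number: z.string().describe('The invoice number'),
  total: z.number().describe('Total amount'),
  vendor: z.string().describe('Vendor name'),
});

// Send the document and the schema; `object` holds the extracted data
const { object } = await client.extract({
  file: './invoice.pdf',
  schema,
});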
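
Because the API applies credits-based and request-rate limits, a batch job may want to retry a throttled call instead of failing outright. The following is a generic sketch, not part of the Parsefy SDK: `backoffDelayMs`, `withRetry`, and the `isRetryable` predicate are hypothetical names. This page does not say how a rate-limit error is reported, so the decision about which errors to retry is left to the caller.

```typescript
// Exponential backoff delay: baseMs * 2^attempt, capped at maxMs.
function backoffDelayMs(attempt: number, baseMs = 500, maxMs = 8000): number {
  return Math.min(maxMs, baseMs * 2 ** attempt);
}

// Run `call`, retrying with backoff while `isRetryable(error)` is true.
// `sleep` is injectable so the wait can be replaced in tests.
async function withRetry<T>(
  call: () => Promise<T>,
  isRetryable: (error: unknown) => boolean,
  maxAttempts = 4,
  sleep: (ms: number) => Promise<void> = (ms) =>
    new Promise<void>((resolve) => setTimeout(resolve, ms)),
): Promise<T> {
  for (let attempt = 0; ; attempt++) {
    try {
      return await call();
    } catch (error) {
      const lastAttempt = attempt >= maxAttempts - 1;
      if (lastAttempt || !isRetryable(error)) throw error;
      await sleep(backoffDelayMs(attempt));
    }
  }
}
```

A call such as `client.extract({ file, schema })` could be wrapped by passing it as the `call` argument, together with a predicate that recognizes whatever rate-limit error the SDK raises.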

Get Started

Quick Start

Get up and running with Parsefy in under 5 minutes

API Reference

Explore the complete API documentation

Python SDK

Use Parsefy with Pydantic models

JavaScript SDK

Use Parsefy with Zod schemas
