Extracting data with Documind

Let’s run a sample extraction with the Documind open-source package.

file
string
required

The file URL.

schema
object[]
required

The schema that defines the structure of the data you want to extract. More on schema definitions here.

Currently, only URLs are accepted. Ensure your document is hosted and accessible via a public URL.

Example Output

Once the extraction process is complete, the result will return a structured JSON object with the extracted data:

success
boolean

Indicates whether the extraction was successful or not.

pages
number

The number of pages processed in the document.

data
object

The extracted data based on the schema.

fileName
string

The name of the processed file

Configurations

You can only select one of template, schema, or autoSchema.