Documents Methods

This section shows how to use the methods of the documents client.

Create Messages

Returns

DocumentMessage Object

A DocumentMessage object with the messages created from the document.

from uiform import UiForm

uiclient = UiForm()

# Convert the document into a DocumentMessage.
doc_msg = uiclient.documents.create_messages(
    document="freight/booking_confirmation.jpg",
    modality="text",
    image_settings={
        "correct_image_orientation": True,
        "dpi": 72,
        "image_to_text": "ocr",
        "browser_canvas": "A4",
    },
)

doc_msg.items is a list whose elements are PIL.Image.Image or str objects.
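
Because the items list mixes text and images, it can be handy to separate the two before building a prompt. The sketch below is a library-independent illustration: `split_items` is a hypothetical helper (not part of the uiform SDK), and it treats anything that is not a `str` as an image so that it runs without Pillow installed. The sample list is stand-in data, not real output.

```python
from typing import Any


def split_items(items: list[Any]) -> tuple[list[str], list[Any]]:
    """Separate text chunks from non-text items (assumed to be images)."""
    texts: list[str] = []
    images: list[Any] = []
    for item in items:
        if isinstance(item, str):
            texts.append(item)
        else:
            images.append(item)
    return texts, images


# Stand-in data; in practice this would be doc_msg.items.
placeholder_image = object()
texts, images = split_items(["page 1 text", placeholder_image, "page 2 text"])
```

The relative order of items is preserved within each of the two returned lists.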

Correct image orientation

Returns

PIL.Image.Image

The orientation-corrected image as a PIL Image object.

from uiform import UiForm

uiclient = UiForm()

# Returns the orientation-corrected image as a PIL.Image.Image.
image = uiclient.documents.correct_image_orientation(
    document="freight/booking_confirmation.jpg",
)

Extractions

Returns

ParsedChatCompletion

An OpenAI ParsedChatCompletion object with the extracted data.

from uiform import UiForm

uiclient = UiForm()

# Extract structured data that conforms to the JSON schema below.
completion = uiclient.documents.extractions.parse(
    document="freight/booking_confirmation.jpg",
    model="gpt-4.1-nano",
    json_schema={
      'X-SystemPrompt': 'You are a useful assistant.',
      'properties': {
          'name': {
              'X-FieldPrompt': 'Provide a descriptive and concise name for the event.',
              'description': 'The name of the calendar event.',
              'title': 'Name',
              'type': 'string'
          },
          'date': {
              'X-FieldPrompt': 'Specify the event date in YYYY-MM-DD format.',
              'description': 'The date of the calendar event in ISO 8601 format.',
              'title': 'Date',
              'type': 'string'
          }
      },
      'required': ['name', 'date'],
      'title': 'CalendarEvent',
      'type': 'object'
    },
    modality="text",
    n_consensus=1,  # 1 (the default) disables consensus; values above 1 run the extraction in n-consensus mode
)
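
Once an extraction returns, it can be worth checking the extracted dictionary against the schema before using it downstream. The following is a minimal sketch using only the standard library. `check_event` is a hypothetical helper, the `extracted` dictionary is made-up sample data rather than real output, and the step of pulling the dictionary out of the `ParsedChatCompletion` object is deliberately left out.

```python
from datetime import date

# Trimmed copy of the schema used above (only the parts this check needs).
schema = {
    "properties": {"name": {"type": "string"}, "date": {"type": "string"}},
    "required": ["name", "date"],
}


def check_event(schema: dict, data: dict) -> list[str]:
    """Return a list of problems; an empty list means the data looks usable."""
    problems = [f"missing field: {key}" for key in schema["required"] if key not in data]
    if "date" in data:
        try:
            date.fromisoformat(data["date"])  # accepts YYYY-MM-DD
        except (TypeError, ValueError):
            problems.append(f"date is not a valid ISO 8601 date: {data['date']!r}")
    return problems


# Made-up sample values for illustration only.
extracted = {"name": "Container pickup", "date": "2024-05-17"}
print(check_event(schema, extracted))  # prints []
```

Returning a list of problems instead of raising on the first one makes it easier to log everything that is wrong with a single extraction.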

Templates

templates.documentai

The Document AI templates allow you to extract structured data from documents using predefined templates inspired by Google Document AI. These templates are designed to handle common document types like invoices, receipts, IDs and more.

templates.documentai.parse

function

Extracts structured data from a document using a predefined template inspired by Google Document AI.

template

DocumentAITemplate

required

The template to use for extraction. Must be one of:

"bank_statement" - Extract data from bank statements
"contract" - Extract data from contracts
"driver_license" - Extract data from driver licenses
"expense" - Extract data from expense reports
"identity_proofing" - Extract data for identity verification
"invoice" - Extract data from invoices
"passport" - Extract data from passports
"pay_slip" - Extract data from pay slips
"w2" - Extract data from W-2 forms
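
Since the template parameter only accepts a fixed set of names, validating it on the client side can surface typos before any request is sent. A minimal sketch, assuming nothing beyond the list above; `TemplateName`, `VALID_TEMPLATES`, and `validate_template` are illustrative names, not SDK exports.

```python
from typing import Literal, get_args

# The template names listed above, expressed as a Literal type.
TemplateName = Literal[
    "bank_statement", "contract", "driver_license", "expense",
    "identity_proofing", "invoice", "passport", "pay_slip", "w2",
]
VALID_TEMPLATES = frozenset(get_args(TemplateName))


def validate_template(name: str) -> str:
    """Return the name unchanged if it is a known template, else raise ValueError."""
    if name not in VALID_TEMPLATES:
        raise ValueError(
            f"unknown template {name!r}; expected one of {sorted(VALID_TEMPLATES)}"
        )
    return name


validate_template("invoice")  # passes silently
```

Keeping the names in a `Literal` also lets a type checker flag misspelled templates at call sites that annotate the argument with `TemplateName`.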

document

Path | str | IO[bytes] | MIMEData

required

The document to extract from. Can be a file path, string, bytes IO object, or MIMEData.
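
If you need the raw bytes yourself, for example to hash or cache a file before extraction, the accepted input types can be normalized in caller code. This is a sketch under assumptions: `read_document_bytes` is a hypothetical helper, a `str` is treated as a file path, and `MIMEData` is not handled because its structure is not described here.

```python
import io
from pathlib import Path
from typing import IO, Union


def read_document_bytes(document: Union[Path, str, IO[bytes]]) -> bytes:
    """Return the raw bytes of a document given as a path or a binary stream."""
    if isinstance(document, (str, Path)):
        return Path(document).read_bytes()
    # Otherwise assume a binary file-like object such as open(..., "rb") or io.BytesIO.
    return document.read()


# A binary stream works without touching the filesystem.
data = read_document_bytes(io.BytesIO(b"%PDF-1.7"))
```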

image_settings

ImageSettings

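Optional image preprocessing operations, such as the correct_image_orientation, dpi, image_to_text, and browser_canvas settings used in the Create Messages example.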

model

string

default:"gpt-4o-2024-08-06"

The model to use for extraction.

temperature

float

default:"0"

The sampling temperature to use.

messages

array[ChatCompletionUiformMessage]

default:"[]"

Optional list of previous messages to include.

modality

string

default:"native"

The modality to use for processing the document. Can be:

"native" (default) - Uses the document's native modality based on file type
"image+text" - Processes the document in its native modality and also as text
"text" - Process as text
"image" - Process as image
"audio" - Process as audio
"video" - Process as video

store

boolean

default:"false"

Whether to store the document and extraction results.
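
The documented defaults can be gathered in one place so that call sites only override what they need. The sketch below is an illustration under assumptions: `ParseOptions` and `MODALITIES` are hypothetical names, not SDK types, and they only mirror the defaults and modality values listed above.

```python
from dataclasses import asdict, dataclass

MODALITIES = ("native", "image+text", "text", "image", "audio", "video")


@dataclass(frozen=True)
class ParseOptions:
    """Optional parameters with the defaults documented above."""

    model: str = "gpt-4o-2024-08-06"
    temperature: float = 0.0
    modality: str = "native"
    store: bool = False

    def __post_init__(self) -> None:
        # Reject modality values that are not in the documented list.
        if self.modality not in MODALITIES:
            raise ValueError(f"unsupported modality: {self.modality!r}")


# Override only what differs from the defaults, then turn it into keyword arguments.
options = ParseOptions(modality="text")
kwargs = asdict(options)
```

Assuming the keyword names match the parameter names listed above, `kwargs` could then be unpacked into the parse call alongside the required `template` and `document` arguments.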

Response

DocumentExtractResponse

The extraction response containing the structured data.

from uiform import UiForm

uiclient = UiForm()

# Extract fields using the predefined "invoice" template.
response = uiclient.documents.templates.documentai.parse(
    document="freight/booking_confirmation.jpg",
    model="gpt-4.1-nano",
    template="invoice",
    modality="text",
)
