Patching

Instructor enhances LLM clients by patching them with additional capabilities for structured outputs. This page explains how patching works conceptually. For practical usage, see from_provider.

Recommended Approach

For most use cases, we recommend using from_provider instead of manual patching. It provides a simpler, unified interface that works across all providers. See the Migration Guide if you're using older patching patterns.

What is Patching?

Patching is the process of adding new functionality to existing LLM client objects without modifying their original code. Instructor patches clients to add structured output capabilities:

  • Adds new parameters: response_model, max_retries, and context to completion methods
  • Enables validation: Automatically validates responses against Pydantic models
  • Provides retry logic: Automatically retries when validation fails
  • Maintains compatibility: The patched client still works with all original methods
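
A minimal sketch of these additions in use, assuming from_provider with an illustrative gpt-4o-mini model (the User fields are invented for the example):

import instructor
from pydantic import BaseModel

class User(BaseModel):
    name: str
    age: int

# from_provider returns an already-patched client
client = instructor.from_provider("openai/gpt-4o-mini")

user = client.create(
    response_model=User,  # added parameter: expected output structure
    max_retries=2,        # added parameter: retry on validation failure
    messages=[{"role": "user", "content": "Jason is 25 years old."}],
)
print(user.name, user.age)  # a typed Pydantic instance, not raw JSON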

How Patching Works

When Instructor patches a client, it:

  1. Wraps the completion method: Intercepts calls to create() or chat.completions.create()
  2. Converts schemas: Transforms Pydantic models into provider-specific formats (JSON schema, tool definitions, etc.)
  3. Validates responses: Checks LLM outputs against your Pydantic model
  4. Handles retries: Automatically retries with validation feedback if needed
  5. Returns typed objects: Converts validated JSON into Pydantic model instances
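
Conceptually, the wrapped method behaves like this simplified sketch. It is not Instructor's actual implementation, which also handles modes, streaming, and provider differences; the response shape assumed here is OpenAI's tool-calling format:

from pydantic import BaseModel, ValidationError

# Conceptual sketch of the patching loop
def patched_create(original_create, *, response_model: type[BaseModel],
                   max_retries: int = 0, **kwargs):
    for attempt in range(max_retries + 1):
        raw = original_create(**kwargs)  # call the wrapped method
        json_args = raw.choices[0].message.tool_calls[0].function.arguments
        try:
            # Validate and return a typed Pydantic instance
            return response_model.model_validate_json(json_args)
        except ValidationError as exc:
            if attempt == max_retries:
                raise
            # Feed the validation errors back so the model can self-correct
            kwargs["messages"].append(
                {"role": "user", "content": f"Fix these validation errors: {exc}"}
            )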

Patching Modes

Different providers support different modes for structured extraction. Instructor automatically selects the best mode for each provider, but you can override it:

Tool Calling (TOOLS)

Uses the provider's function/tool calling API. This is the default for OpenAI and provides the most reliable structured outputs.

Supported by: OpenAI, Anthropic (ANTHROPIC_TOOLS), Google (GENAI_TOOLS), Ollama (for supported models)
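
As an illustration, a Pydantic model is translated into a tool definition of roughly this shape; the exact payload Instructor builds varies by provider and version:

from pydantic import BaseModel

class User(BaseModel):
    name: str
    age: int

# Approximate shape of the tool definition sent to the provider
tool = {
    "type": "function",
    "function": {
        "name": "User",
        "parameters": User.model_json_schema(),
    },
}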

JSON Mode

Instructs the model to return JSON directly. Works with most providers but may be less reliable than tool calling.

Supported by: OpenAI, Anthropic, Google, Ollama, and most other providers

Markdown JSON (MD_JSON)

Asks the model to return JSON wrapped in a markdown code block. Not recommended except for specific providers such as Databricks.

Supported by: Databricks, some vision models

Default Modes by Provider

Each provider uses a recommended default mode:

  • OpenAI: Mode.TOOLS (function calling)
  • Anthropic: Mode.ANTHROPIC_TOOLS (tool use)
  • Google: Mode.GENAI_TOOLS (function calling)
  • Ollama: Mode.TOOLS (if model supports it) or Mode.JSON
  • Others: Provider-specific defaults

When using from_provider, these defaults are applied automatically. You can override them with the mode parameter.
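
For instance, forcing plain JSON mode instead of the tool-calling default (a sketch; mode support varies by provider and Instructor version):

import instructor

# Override the OpenAI default of Mode.TOOLS
client = instructor.from_provider(
    "openai/gpt-4o-mini",
    mode=instructor.Mode.JSON,
)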

Manual Patching (Advanced)

If you need to patch a client manually (not recommended for most users):

import openai
import instructor
from pydantic import BaseModel

# Define the expected output structure (fields are illustrative)
class YourModel(BaseModel):
    name: str
    age: int

# Create the base client
client = openai.OpenAI()

# Patch it manually
patched_client = instructor.patch(client, mode=instructor.Mode.TOOLS)

# Now use it (model name and messages are illustrative)
response = patched_client.chat.completions.create(
    model="gpt-4o-mini",
    response_model=YourModel,
    messages=[{"role": "user", "content": "Jason is 25 years old."}],
)

However, using from_provider is simpler and recommended:

import instructor

# Simpler approach: the model name is part of the provider string
client = instructor.from_provider("openai/gpt-4o-mini")

response = client.create(
    response_model=YourModel,  # reusing the model defined above
    messages=[{"role": "user", "content": "Jason is 25 years old."}],
)

What Gets Patched?

Instructor adds these capabilities to patched clients:

New Parameters

  • response_model: A Pydantic model or type that defines the expected output structure
  • max_retries: Number of retry attempts if validation fails (default: 0)
  • context: Additional context for validation hooks
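
A sketch of how these parameters work together, using a standard Pydantic v2 validator to read the context dict (the model, fields, and source text are invented for the example):

import instructor
from pydantic import BaseModel, ValidationInfo, field_validator

client = instructor.from_provider("openai/gpt-4o-mini")
document_text = "The sky was a brilliant shade of orange at dusk."

class Citation(BaseModel):
    quote: str

    @field_validator("quote")
    @classmethod
    def quote_must_exist(cls, v: str, info: ValidationInfo) -> str:
        # The dict passed as `context` is exposed via info.context
        source = (info.context or {}).get("source_text", "")
        if v not in source:
            raise ValueError("quote must appear verbatim in the source text")
        return v

citation = client.create(
    response_model=Citation,
    max_retries=3,                           # retry with validation feedback
    context={"source_text": document_text},  # forwarded to validators
    messages=[{"role": "user", "content": f"Pick a quote from: {document_text}"}],
)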

Enhanced Methods

The patched client's create() method:

  • Accepts the response_model parameter
  • Validates responses automatically
  • Retries on validation failures
  • Returns typed Pydantic objects instead of raw responses

Provider-Specific Considerations

OpenAI

  • Default mode: TOOLS (function calling)
  • Most reliable mode for structured outputs
  • Supports streaming with structured outputs
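
One common streaming pattern uses create_partial to yield progressively more complete objects as tokens arrive (a sketch; exact streaming APIs vary across Instructor versions):

import instructor
from pydantic import BaseModel

class User(BaseModel):
    name: str
    age: int

client = instructor.from_provider("openai/gpt-4o-mini")

# Each iteration yields a User with more fields filled in
for partial_user in client.chat.completions.create_partial(
    response_model=User,
    messages=[{"role": "user", "content": "Jason is 25 years old."}],
):
    print(partial_user)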

Anthropic

  • Default mode: ANTHROPIC_TOOLS (tool use)
  • Uses Claude's native tool calling API
  • Excellent for complex structured outputs

Google Gemini

  • Default mode: GENAI_TOOLS (function calling)
  • Requires jsonref package for tool calling
  • Some limitations with strict validation and enums

Ollama (Local Models)

  • Default mode: TOOLS (if model supports it) or JSON
  • Models such as llama3.1, llama3.2, and mistral-nemo support tool calling
  • Older models fall back to JSON mode
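
One way to use Ollama is through its OpenAI-compatible endpoint, patched in JSON mode for broad model support (a sketch; the model name is illustrative and the api_key value is a required dummy):

import instructor
import openai
from pydantic import BaseModel

class User(BaseModel):
    name: str
    age: int

# Ollama serves an OpenAI-compatible API at /v1
client = instructor.patch(
    openai.OpenAI(base_url="http://localhost:11434/v1", api_key="ollama"),
    mode=instructor.Mode.JSON,  # JSON mode works for older models too
)

user = client.chat.completions.create(
    model="llama3.1",
    response_model=User,
    messages=[{"role": "user", "content": "Jason is 25 years old."}],
)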

When to Use Manual Patching

Manual patching is rarely needed. Use it only if:

  1. You need fine-grained control over the patching process
  2. You're working with a custom client implementation
  3. You're debugging patching behavior

For 99% of use cases, from_provider is the better choice.