Multi-Task and Streaming¶

Using an Iterable lets you extract multiple structured objects from a single LLM call, streaming them as they arrive. This is useful for entity extraction, multi-task outputs, and more.

We recommend using the create_iterable method for most use cases. It's simpler and less error-prone than manually specifying Iterable[...] and stream=True.

Here's a simple example showing how to extract multiple users from a single sentence. You can use either the recommended create_iterable method or the create method with Iterable[User]:

Using create_iterable (recommended)Using create with Iterable[User]

import instructor
from pydantic import BaseModel

client = instructor.from_provider("openai/gpt-4.1-mini")

class User(BaseModel):
    name: str
    age: int

resp = client.chat.completions.create_iterable(
    messages=[
        {
            "role": "user",
            "content": "Ivan is 28, lives in Moscow and his friends are Alex, John and Mary who are 25, 30 and 27 respectively",
        }
    ],
    response_model=User,
)

for user in resp:
    print(user)

Recommended for most use cases. Handles streaming and iteration for you.

import instructor
from pydantic import BaseModel
from typing import Iterable

client = instructor.from_provider("openai/gpt-4.1-mini")

class User(BaseModel):
    name: str
    age: int

resp = client.chat.completions.create(
    messages=[
        {
            "role": "user",
            "content": "Ivan is 28, lives in Moscow and his friends are Alex, John and Mary who are 25, 30 and 27 respectively",
        }
    ],
    response_model=Iterable[User],
)

for user in resp:
    print(user)

Use this if you need more manual control or compatibility with legacy code.

We also support more complex extraction patterns such as Unions as you'll see below out of the box.

Warning

Unions don't work with Gemini because the AnyOf is not supported in the current response schema.

Synchronous Usage¶

Using createUsing create_iterable (recommended)

import instructor
from typing import Iterable, Union, Literal
from pydantic import BaseModel

class Weather(BaseModel):
    location: str
    units: Literal["imperial", "metric"]

class GoogleSearch(BaseModel):
    query: str

client = instructor.from_provider(
    "openai/gpt-4.1-mini", mode=instructor.Mode.TOOLS
)

results = client.chat.completions.create(
    messages=[
        {"role": "system", "content": "You must always use tools"},
        {"role": "user", "content": "What is the weather in toronto and dallas and who won the super bowl?"},
    ],
    response_model=Iterable[Union[Weather, GoogleSearch]],
    stream=True,
)

for item in results:
    print(item)

import instructor
from typing import Union, Literal
from pydantic import BaseModel

class Weather(BaseModel):
    location: str
    units: Literal["imperial", "metric"]

class GoogleSearch(BaseModel):
    query: str

client = instructor.from_provider(
    "openai/gpt-4.1-mini", mode=instructor.Mode.TOOLS
)

results = client.chat.completions.create_iterable(
    messages=[
        {"role": "system", "content": "You must always use tools"},
        {"role": "user", "content": "What is the weather in toronto and dallas and who won the super bowl?"},
    ],
    response_model=Union[Weather, GoogleSearch],
)

for item in results:
    print(item)

Asynchronous Usage¶

Using createUsing create_iterable (recommended)

import instructor
from typing import Iterable, Union, Literal
from pydantic import BaseModel
import asyncio

class Weather(BaseModel):
    location: str
    units: Literal["imperial", "metric"]

class GoogleSearch(BaseModel):
    query: str

aclient = instructor.from_provider(
    "openai/gpt-4.1-mini", async_client=True, mode=instructor.Mode.TOOLS
)

async def main():
    results = await aclient.chat.completions.create(
        messages=[
            {"role": "system", "content": "You must always use tools"},
            {"role": "user", "content": "What is the weather in toronto and dallas and who won the super bowl?"},
        ],
        response_model=Iterable[Union[Weather, GoogleSearch]],
        stream=True,
    )
    async for item in results:
        print(item)

asyncio.run(main())

import instructor
from typing import Union, Literal
from pydantic import BaseModel
import asyncio

class Weather(BaseModel):
    location: str
    units: Literal["imperial", "metric"]

class GoogleSearch(BaseModel):
    query: str

aclient = instructor.from_provider(
    "openai/gpt-4.1-mini", async_client=True, mode=instructor.Mode.TOOLS
)

async def main():
    results = await aclient.chat.completions.create_iterable(
        messages=[
            {"role": "system", "content": "You must always use tools"},
            {"role": "user", "content": "What is the weather in toronto and dallas and who won the super bowl?"},
        ],
        response_model=Union[Weather, GoogleSearch],
    )
    async for item in results:
        print(item)

asyncio.run(main())