Patching¶
Instructor enhances client functionality with three new keywords for backwards compatibility. This allows use of the enhanced client as usual, with structured output benefits.
response_model
: Defines the response type forchat.completions.create
.max_retries
: Determines retry attempts for failedchat.completions.create
validations.validation_context
: Provides extra context to the validation process.
The default mode is instructor.Mode.TOOLS
which is the recommended mode for OpenAI clients. This mode is the most stable and is the most recommended for OpenAI clients. The other modes are for other clients and are not recommended for OpenAI clients.
Tool Calling¶
This is the recommended method for OpenAI clients. It is the most stable as functions is being deprecated soon.
import instructor
from openai import OpenAI
client = instructor.from_openai(OpenAI(), mode=instructor.Mode.TOOLS)
Gemini Tool Calling¶
This method allows us to get structured output from Gemini via tool calling with the Vertex AI SDK.
Note: Gemini Tool Calling is in preview and there are some limitations, you can learn more in the Vertex AI examples notebook.
import instructor
from vertexai.generative_models import GenerativeModel # type: ignore
import vertexai
vertexai.init(project="vertexai-generative-models")
client = instructor.from_vertexai(
client=GenerativeModel("gemini-1.5-pro-preview-0409"),
mode=instructor.Mode.VERTEXAI_TOOLS,
)
Parallel Tool Calling¶
Parallel tool calling is also an option but you must set response_model
to be Iterable[Union[...]]
types since we expect an array of results. Check out Parallel Tool Calling for more information.
import instructor
from openai import OpenAI
client = instructor.from_openai(OpenAI(), mode=instructor.Mode.PARALLEL_TOOLS)
Function Calling¶
Note that function calling is soon to be deprecated in favor of TOOL mode for OpenAI. But will still be supported for other clients.
import instructor
from openai import OpenAI
client = instructor.from_openai(OpenAI(), mode=instructor.Mode.FUNCTIONS)
JSON Mode¶
JSON mode uses OpenAI's JSON fromat for responses. by setting response_format={"type": "json_object"}
in the chat.completions.create
method.
import instructor
from openai import OpenAI
client = instructor.from_openai(OpenAI(), mode=instructor.Mode.JSON)
Gemini JSON Mode¶
This mode uses Gemini's response mimetype field to generate a response in JSON format using the schema provided.
import instructor
import google.generativeai as genai
client = instructor.from_gemini(
genai.GenerativeModel(), mode=instructor.Mode.GEMINI_JSON
)
Markdown JSON Mode¶
This just asks for the response in JSON format, but it is not recommended, and may not be supported in the future, this is just left to support vision models and models provided by Databricks and will not give you the full benefits of instructor.
Experimental
This is not recommended, and may not be supported in the future, this is just left to support vision models and models provided by Databricks.
General syntax:
import instructor
from openai import OpenAI
client = instructor.from_openai(OpenAI(), mode=instructor.Mode.MD_JSON)
Databricks syntax:
import instructor
import os
from openai import OpenAI
DATABRICKS_TOKEN = os.environ.get("DATABRICKS_TOKEN", "")
DATABRICKS_HOST = os.environ.get("DATABRICKS_HOST", "")
# Assuming Databricks environment variables are set
client = instructor.from_openai(
OpenAI(
api_key=DATABRICKS_TOKEN,
base_url=f"{DATABRICKS_HOST}/serving-endpoints",
),
mode=instructor.Mode.MD_JSON,
)