Validators¶
Instead of framing "self-critique" or "self-reflection" in AI as new concepts, we can view them as validation errors with clear error messages that the systen can use to self correct.
Pydantic offers an customizable and expressive validation framework for Python. Instructor leverages Pydantic's validation framework to provide a uniform developer experience for both code-based and LLM-based validation, as well as a reasking mechanism for correcting LLM outputs based on validation errors. To learn more check out the Pydantic docs on validators.
Note: For the majority of this notebook we won't be calling openai, just using validators to see how we can control the validation of the objects.
Validators will enable us to control outputs by defining a function like so:
def validation_function(value):
if condition(value):
raise ValueError("Value is not valid")
return mutation(value)
Before we get started lets go over the general shape of a validator:
from pydantic import BaseModel, ValidationError
from typing_extensions import Annotated
from pydantic import AfterValidator
def name_must_contain_space(v: str) -> str:
if " " not in v:
raise ValueError("Name must contain a space.")
return v.lower()
class UserDetail(BaseModel):
age: int
name: Annotated[str, AfterValidator(name_must_contain_space)]
person = UserDetail(age=29, name="Jason")
--------------------------------------------------------------------------- ValidationError Traceback (most recent call last) /Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb Cell 4 line 1 <a href='vscode-notebook-cell:/Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb#W3sZmlsZQ%3D%3D?line=10'>11</a> age: int <a href='vscode-notebook-cell:/Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb#W3sZmlsZQ%3D%3D?line=11'>12</a> name: Annotated[str, AfterValidator(name_must_contain_space)] ---> <a href='vscode-notebook-cell:/Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb#W3sZmlsZQ%3D%3D?line=13'>14</a> person = UserDetail(age=29, name="Jason") File ~/dev/instructor/.venv/lib/python3.11/site-packages/pydantic/main.py:164, in BaseModel.__init__(__pydantic_self__, **data) 162 # `__tracebackhide__` tells pytest and some other tools to omit this function from tracebacks 163 __tracebackhide__ = True --> 164 __pydantic_self__.__pydantic_validator__.validate_python(data, self_instance=__pydantic_self__) ValidationError: 1 validation error for UserDetail name Value error, Name must contain a space. [type=value_error, input_value='Jason', input_type=str] For further information visit https://errors.pydantic.dev/2.4/v/value_error
Validation Applications
Validators are essential in tackling the unpredictabile nature of LLMs.
Straightforward examples include:
- Flagging outputs containing blacklisted words.
- Identifying outputs with tones like racism or violence.
For more complex tasks:
- Ensuring citations directly come from provided content.
- Checking that the model's responses align with given context.
- Validating the syntax of SQL queries before execution.
Setup and Dependencies¶
Using the instructor library, we streamline the integration of these validators. instructor
manages the parsing and validation of outputs and automates retries for compliant responses. This simplifies the process for developers to implement new validation logic, minimizing extra overhead.
To use instructor in our api calls, we just need to patch the openai client:
import instructor
from openai import OpenAI
client = instructor.patch(OpenAI())
Software 2.0: Rule-based validators¶
Deterministic validation, characterized by its rule-based logic, ensures consistent outcomes for the same input. Let's explore how we can apply this concept through some examples.
Flagging bad keywords¶
To begin with, we aim to prevent engagement in topics involving explicit violence.
We will define a blacklist of violent words that cannot be mentioned in any messages:
blacklist = {
"rob",
"steal",
"hurt",
"kill",
"attack",
}
To validate if the message contains a blacklisted word we will use a field_validator over the 'message' field:
from pydantic import BaseModel, ValidationError, field_validator
from pydantic.fields import Field
class Response(BaseModel):
message: str
@field_validator('message')
def message_cannot_have_blacklisted_words(cls, v: str) -> str:
for word in v.split():
if word.lower() in blacklist:
raise ValueError(f"`{word}` was found in the message `{v}`")
return v
Response(message="I will hurt him")
--------------------------------------------------------------------------- ValidationError Traceback (most recent call last) /Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb Cell 17 line 1 <a href='vscode-notebook-cell:/Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb#X23sZmlsZQ%3D%3D?line=10'>11</a> raise ValueError(f"`{word}` was found in the message `{v}`") <a href='vscode-notebook-cell:/Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb#X23sZmlsZQ%3D%3D?line=11'>12</a> return v ---> <a href='vscode-notebook-cell:/Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb#X23sZmlsZQ%3D%3D?line=13'>14</a> Response(message="I will hurt him") File ~/dev/instructor/.venv/lib/python3.11/site-packages/pydantic/main.py:164, in BaseModel.__init__(__pydantic_self__, **data) 162 # `__tracebackhide__` tells pytest and some other tools to omit this function from tracebacks 163 __tracebackhide__ = True --> 164 __pydantic_self__.__pydantic_validator__.validate_python(data, self_instance=__pydantic_self__) ValidationError: 1 validation error for Response message Value error, `hurt` was found in the message `I will hurt him` [type=value_error, input_value='I will hurt him', input_type=str] For further information visit https://errors.pydantic.dev/2.4/v/value_error
Flagging using OpenAI Moderation¶
To enhance our validation measures, we'll extend the scope to flag any answer that contains hateful content, harassment, or similar issues. OpenAI offers a moderation endpoint that addresses these concerns, and it's freely available when using OpenAI models.
With the instructor
library, this is just one function edit away:
from typing import Annotated
from pydantic.functional_validators import AfterValidator
from instructor import openai_moderation
class Response(BaseModel):
message: Annotated[str, AfterValidator(openai_moderation(client=client))]
Now we have a more comprehensive flagging for violence and we can outsource the moderation of our messages.
Response(message="I want to make them suffer the consequences")
--------------------------------------------------------------------------- ValidationError Traceback (most recent call last) Cell In[7], line 1 ----> 1 Response(message="I want to make them suffer the consequences") File ~/.virtualenvs/pampa-labs/lib/python3.10/site-packages/pydantic/main.py:164, in BaseModel.__init__(__pydantic_self__, **data) 162 # `__tracebackhide__` tells pytest and some other tools to omit this function from tracebacks 163 __tracebackhide__ = True --> 164 __pydantic_self__.__pydantic_validator__.validate_python(data, self_instance=__pydantic_self__) ValidationError: 1 validation error for Response message Value error, `I want to make them suffer the consequences` was flagged for harassment, harassment_threatening, violence, harassment/threatening [type=value_error, input_value='I want to make them suffer the consequences', input_type=str] For further information visit https://errors.pydantic.dev/2.5/v/value_error
And as an extra, we get flagging for other topics like religion, race etc.
Response(message="I will mock their religion")
--------------------------------------------------------------------------- ValidationError Traceback (most recent call last) Cell In[26], line 1 ----> 1 Response(message="I will mock their religion") File ~/.virtualenvs/pampa-labs/lib/python3.10/site-packages/pydantic/main.py:164, in BaseModel.__init__(__pydantic_self__, **data) 162 # `__tracebackhide__` tells pytest and some other tools to omit this function from tracebacks 163 __tracebackhide__ = True --> 164 __pydantic_self__.__pydantic_validator__.validate_python(data, self_instance=__pydantic_self__) ValidationError: 1 validation error for Response message Value error, `I will mock their religion` was flagged for ['harassment'] [type=value_error, input_value='I will mock their religion', input_type=str] For further information visit https://errors.pydantic.dev/2.5/v/value_error
Filtering very long messages¶
In addition to content-based flags, we can also set criteria based on other aspects of the input text. For instance, to maintain user engagement, we might want to prevent the assistant from returning excessively long texts.
Here, noticed that Field
has built-in validators for min_length
and max_length
. to learn more checkout Field Contraints
class AssistantMessage(BaseModel):
message: str = Field(..., max_length=100)
AssistantMessage(message="Certainly! Lorem ipsum is a placeholder text commonly used in the printing and typesetting industry. Here's a sample of Lorem ipsum text: Lorem ipsum dolor sit amet, consectetur adipiscing elit. Nullam euismod velit vel tellus tempor, non viverra eros iaculis. Sed vel nisl nec mauris bibendum tincidunt. Vestibulum sed libero euismod, eleifend tellus id, laoreet elit. Donec auctor arcu ac mi feugiat, vel lobortis justo efficitur. Fusce vel odio vitae justo varius dignissim. Integer sollicitudin mi a justo bibendum ultrices. Quisque id nisl a lectus venenatis luctus. Please note that Lorem ipsum text is a nonsensical Latin-like text used as a placeholder for content, and it has no specific meaning. It's often used in design and publishing to demonstrate the visual aspects of a document without focusing on the actual content.")
--------------------------------------------------------------------------- ValidationError Traceback (most recent call last) /Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb Cell 29 line 1 ----> <a href='vscode-notebook-cell:/Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb#X41sZmlsZQ%3D%3D?line=0'>1</a> AssistantMessage(message="Certainly! Lorem ipsum is a placeholder text commonly used in the printing and typesetting industry. Here's a sample of Lorem ipsum text: Lorem ipsum dolor sit amet, consectetur adipiscing elit. Nullam euismod velit vel tellus tempor, non viverra eros iaculis. Sed vel nisl nec mauris bibendum tincidunt. Vestibulum sed libero euismod, eleifend tellus id, laoreet elit. Donec auctor arcu ac mi feugiat, vel lobortis justo efficitur. Fusce vel odio vitae justo varius dignissim. Integer sollicitudin mi a justo bibendum ultrices. Quisque id nisl a lectus venenatis luctus. Please note that Lorem ipsum text is a nonsensical Latin-like text used as a placeholder for content, and it has no specific meaning. It's often used in design and publishing to demonstrate the visual aspects of a document without focusing on the actual content.") File ~/dev/instructor/.venv/lib/python3.11/site-packages/pydantic/main.py:164, in BaseModel.__init__(__pydantic_self__, **data) 162 # `__tracebackhide__` tells pytest and some other tools to omit this function from tracebacks 163 __tracebackhide__ = True --> 164 __pydantic_self__.__pydantic_validator__.validate_python(data, self_instance=__pydantic_self__) ValidationError: 1 validation error for AssistantMessage message String should have at most 100 characters [type=string_too_long, input_value="Certainly! Lorem ipsum i... on the actual content.", input_type=str] For further information visit https://errors.pydantic.dev/2.4/v/string_too_long
Avoiding hallucination with citations¶
When incorporating external knowledge bases, it's crucial to ensure that the agent uses the provided context accurately and doesn't fabricate responses. Validators can be effectively used for this purpose. We can illustrate this with an example where we validate that a provided citation is actually included in the referenced text chunk:
from pydantic import ValidationInfo
class AnswerWithCitation(BaseModel):
answer: str
citation: str
@field_validator('citation')
@classmethod
def citation_exists(cls, v: str, info: ValidationInfo):
context = info.context
if context:
context = context.get('text_chunk')
if v not in context:
raise ValueError(f"Citation `{v}` not found in text")
return v
Here we assume that there is a "text_chunk" field that contains the text that the model is supposed to use as context. We then use the field_validator
decorator to define a validator that checks if the citation is included in the text chunk. If it's not, we raise a ValueError
with a message that will be returned to the user.
AnswerWithCitation.model_validate(
{
"answer": "Blueberries are packed with protein",
"citation": "Blueberries contain high levels of protein"
},
context={"text_chunk": "Blueberries are very rich in antioxidants"},
)
--------------------------------------------------------------------------- ValidationError Traceback (most recent call last) /Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb Cell 34 line 1 ----> <a href='vscode-notebook-cell:/Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb#X50sZmlsZQ%3D%3D?line=0'>1</a> AnswerWithCitation.model_validate( <a href='vscode-notebook-cell:/Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb#X50sZmlsZQ%3D%3D?line=1'>2</a> { <a href='vscode-notebook-cell:/Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb#X50sZmlsZQ%3D%3D?line=2'>3</a> "answer": "Blueberries are packed with protein", <a href='vscode-notebook-cell:/Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb#X50sZmlsZQ%3D%3D?line=3'>4</a> "citation": "Blueberries contain high levels of protein" <a href='vscode-notebook-cell:/Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb#X50sZmlsZQ%3D%3D?line=4'>5</a> }, <a href='vscode-notebook-cell:/Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb#X50sZmlsZQ%3D%3D?line=5'>6</a> context={"text_chunk": "Blueberries are very rich in antioxidants"}, <a href='vscode-notebook-cell:/Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb#X50sZmlsZQ%3D%3D?line=6'>7</a> ) File ~/dev/instructor/.venv/lib/python3.11/site-packages/pydantic/main.py:503, in BaseModel.model_validate(cls, obj, strict, from_attributes, context) 501 # `__tracebackhide__` tells pytest and some other tools to omit this function from tracebacks 502 __tracebackhide__ = True --> 503 return cls.__pydantic_validator__.validate_python( 504 obj, strict=strict, from_attributes=from_attributes, context=context 505 ) ValidationError: 1 validation error for AnswerWithCitation citation Value error, Citation `Blueberries contain high levels of protein` not found in text [type=value_error, input_value='Blueberries contain high levels of protein', input_type=str] For further information visit https://errors.pydantic.dev/2.4/v/value_error
Software 3.0: Probabilistic validators¶
For scenarios requiring more nuanced validation than rule-based methods, we use probabilistic validation. This approach incorporates LLMs into the validation workflow for a sophisticated assessment of outputs.
The instructor
library offers the llm_validator
utility for this purpose. By specifying the desired directive, we can use LLMs for complex validation tasks. Let's explore some intriguing use cases enabled by LLMs.
Keeping an agent on topic¶
When creating an agent focused on health improvement, providing answers and daily practice suggestions, it's crucial to ensure strict adherence to health-related topics. This is important because the knowledge base is limited to health topics, and veering off-topic could result in fabricated responses.
To achieve this focus, we'll follow a similar process as before, but with an important addition: integrating an LLM into our validator.
This LLM will be tasked with determining whether the agent's responses are exclusively related to health topics. For this, we will use the llm_validator
from instructor
like so:
from instructor import llm_validator
class AssistantMessage(BaseModel):
message: Annotated[str,
AfterValidator(
llm_validator("don't talk about any other topic except health best practices and topics",
client=client))]
AssistantMessage(message="I would suggest you to visit Sicily as they say it is very nice in winter.")
--------------------------------------------------------------------------- ValidationError Traceback (most recent call last) /Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb Cell 38 line 1 <a href='vscode-notebook-cell:/Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb#X56sZmlsZQ%3D%3D?line=4'>5</a> class AssistantMessage(BaseModel): <a href='vscode-notebook-cell:/Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb#X56sZmlsZQ%3D%3D?line=5'>6</a> message: Annotated[str, <a href='vscode-notebook-cell:/Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb#X56sZmlsZQ%3D%3D?line=6'>7</a> AfterValidator( <a href='vscode-notebook-cell:/Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb#X56sZmlsZQ%3D%3D?line=7'>8</a> llm_validator("don't talk about any other topic except health best practices and topics", <a href='vscode-notebook-cell:/Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb#X56sZmlsZQ%3D%3D?line=8'>9</a> openai_client=client))] ---> <a href='vscode-notebook-cell:/Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb#X56sZmlsZQ%3D%3D?line=10'>11</a> AssistantMessage(message="I would suggest you to visit Sicily as they say it is very nice in winter.") File ~/dev/instructor/.venv/lib/python3.11/site-packages/pydantic/main.py:164, in BaseModel.__init__(__pydantic_self__, **data) 162 # `__tracebackhide__` tells pytest and some other tools to omit this function from tracebacks 163 __tracebackhide__ = True --> 164 __pydantic_self__.__pydantic_validator__.validate_python(data, self_instance=__pydantic_self__) ValidationError: 1 validation error for AssistantMessage message Assertion failed, The statement is not related to health best practices or topics. [type=assertion_error, input_value='I would suggest you to v...is very nice in winter.', input_type=str] For further information visit https://errors.pydantic.dev/2.4/v/assertion_error
Important that for these examples we're not waiting for the messages, to get this message we would need to call the openai with response_model=AssistantMessage
.
Validating agent thinking with CoT¶
Using probabilistic validation, we can also assess the agent's reasoning process to ensure it's logical before providing a response. With chain of thought prompting, the model is expected to think in steps and arrive at an answer following its logical progression. If there are errors in this logic, the final response may be incorrect.
Here we will use Pydantic's model_validator which allows us to apply validation over all the properties of the AIResponse
at once.
To make this easier we'll make a simple validation class that we can reuse for all our validation:
from typing import Optional
class Validation(BaseModel):
is_valid: bool = Field(..., description="Whether the value is valid based on the rules")
error_message: Optional[str] = Field(..., description="The error message if the value is not valid, to be used for re-asking the model")
The function we will call will integrate an LLM and will ask it to determine whether the answer the model provided follows from the chain of thought:
def validate_chain_of_thought(values):
chain_of_thought = values["chain_of_thought"]
answer = values["answer"]
resp = client.chat.completions.create(
model="gpt-4-1106-preview",
messages=[
{
"role": "system",
"content": "You are a validator. Determine if the value follows from the statement. If it is not, explain why.",
},
{
"role": "user",
"content": f"Verify that `{answer}` follows the chain of thought: {chain_of_thought}",
},
],
response_model=Validation,
)
if not resp.is_valid:
raise ValueError(resp.error_message)
return values
The use of the 'before' argument in this context is significant. It means that the validator will receive the complete dictionary of inputs in their raw form, before any parsing by Pydantic.
from typing import Any
from pydantic import model_validator
class AIResponse(BaseModel):
chain_of_thought: str
answer: str
@model_validator(mode='before')
@classmethod
def chain_of_thought_makes_sense(cls, data: Any) -> Any:
# here we assume data is the dict representation of the model
# since we use 'before' mode.
return validate_chain_of_thought(data)
AIResponse(chain_of_thought="The user suffers from diabetes.", answer="The user has a broken leg.")
--------------------------------------------------------------------------- ValidationError Traceback (most recent call last) /Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb Cell 47 line 1 ----> <a href='vscode-notebook-cell:/Users/jasonliu/dev/instructor/tutorials/5.validation.ipynb#Y103sZmlsZQ%3D%3D?line=0'>1</a> AIResponse(chain_of_thought="The user suffers from diabetes.", answer="The user has a broken leg.") File ~/dev/instructor/.venv/lib/python3.11/site-packages/pydantic/main.py:164, in BaseModel.__init__(__pydantic_self__, **data) 162 # `__tracebackhide__` tells pytest and some other tools to omit this function from tracebacks 163 __tracebackhide__ = True --> 164 __pydantic_self__.__pydantic_validator__.validate_python(data, self_instance=__pydantic_self__) ValidationError: 1 validation error for AIResponse Value error, The statement about the user having a broken leg does not logically follow from the information provided about the user suffering from diabetes. These are two separate health conditions and one does not imply the other. [type=value_error, input_value={'chain_of_thought': 'The...user has a broken leg.'}, input_type=dict] For further information visit https://errors.pydantic.dev/2.4/v/value_error
Reasking with validators¶
For most of these examples all we've done we've mostly only defined the validation logic.
We'eve covered field validators and model validators and even used LLMs to validate our outputs. But we haven't actually used the validators to reask the model! One of the most powerful features of instructor
is that it will automatically reask the model when it receives a validation error. This means that we can use the same validation logic for both code-based and LLM-based validation.
This also means that our 'prompt' is not only the prompt we send, but the code that runs the validator, and the error message we send back to the model.
Integrating these validation examples with the OpenAI API is streamlined using instructor
. After patching the OpenAI client with instructor
, you simply need to specify a response_model
for your requests. This setup ensures that all the validation processes occur automatically.
To enable reasking you can set a maximum number of retries. When calling the OpenAI client, the system can re-attempt to generate a correct answer. It does this by resending the original query along with feedback on why the previous response was rejected, guiding the LLM towards a more accurate answer in subsequent attempts.
class QuestionAnswer(BaseModel):
question: str
answer: str
question = "What is the meaning of life?"
context = "The according to the devil the meaning of life is a life of sin and debauchery."
resp = client.chat.completions.create(
model="gpt-4-1106-preview",
response_model=QuestionAnswer,
messages=[
{
"role": "system",
"content": "You are a system that answers questions based on the context. answer exactly what the question asks using the context.",
},
{
"role": "user",
"content": f"using the context: `{context}`\n\nAnswer the following question: `{question}`",
},
],
)
resp.answer
'a life of sin and debauchery'
from pydantic import BeforeValidator
class QuestionAnswer(BaseModel):
question: str
answer: Annotated[
str,
BeforeValidator(
llm_validator("don't say objectionable things", client=client)
),
]
resp = client.chat.completions.create(
model="gpt-3.5-turbo",
response_model=QuestionAnswer,
max_retries=2,
messages=[
{
"role": "system",
"content": "You are a system that answers questions based on the context. answer exactly what the question asks using the context.",
},
{
"role": "user",
"content": f"using the context: `{context}`\n\nAnswer the following question: `{question}`",
},
],
)
resp.answer
'The meaning of life is a concept that varies depending on individual perspectives and beliefs.'
Conclusion¶
This guide explains how to use deterministic and probabilistic validation techniques with Large Language Models (LLMs). We discussed using an instructor to establish validation processes for content filtering, context relevance maintenance, and model reasoning verification. These methods enhance the performance of LLMs across different tasks.
For those interested in further exploration, here's a to-do list:
- SQL Syntax Checker: Create a validator to check the syntax of SQL queries before executing them.
- Context-Based Response Validation: Design a method to flag responses based on the model's own knowledge rather than the provided context.
- PII Detection: Implement a mechanism to identify and handle Personally Identifiable Information in responses while prioritizing user privacy.
- Targeted Rule-Based Filtering: Develop filters to remove specific content types, such as responses mentioning named entities.
Completing these tasks will enable users to acquire practical skills in improving LLMs through advanced validation methods.