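Outlines integration

Outlines is a Python library for constrained language generation. It provides a unified interface to a range of language models and supports structured generation through regex matching, type constraints, JSON schemas, and context-free grammars.

Outlines supports multiple backends, including: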

Hugging Face Transformers
llama.cpp
vLLM
MLX

This integration lets you use Outlines models with LangChain through both LLM and chat model interfaces.

Installation and setup

To use Outlines with LangChain, install the Outlines library:

pip install outlines

Depending on the backend you choose, you may need to install additional dependencies:

Transformers: pip install transformers torch datasets
llama.cpp: pip install llama-cpp-python
vLLM: pip install vllm
MLX: pip install mlx
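
Before picking a backend, it can help to check which of these packages are importable in the current environment. The following is a minimal sketch, not part of the integration: the helper name `available_backends` and the mapping from backend names to top-level module names are assumptions made for illustration.

```python
from importlib.util import find_spec

# Assumed mapping from backend name to the top-level module that the
# corresponding pip package installs (illustrative, not from the library).
BACKEND_MODULES = {
    "transformers": "transformers",
    "llamacpp": "llama_cpp",
    "vllm": "vllm",
    "mlxlm": "mlx",
}


def available_backends() -> dict[str, bool]:
    """Report which backend packages can be found, without importing them."""
    return {
        backend: find_spec(module) is not None
        for backend, module in BACKEND_MODULES.items()
    }


if __name__ == "__main__":
    for backend, installed in available_backends().items():
        print(f"{backend}: {'installed' if installed else 'missing'}")
```

Because `find_spec` only locates a module, the check stays cheap even for heavy packages such as vLLM.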

LLM

To use Outlines as an LLM in LangChain, use the Outlines class:

from langchain_community.llms import Outlines

Chat models

To use Outlines as a chat model in LangChain, use the ChatOutlines class:

from langchain_community.chat_models import ChatOutlines

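Model configuration

The Outlines and ChatOutlines classes share similar configuration options:

from langchain_community.llms import Outlines

model = Outlines(
    model="meta-llama/Llama-2-7b-chat-hf",  # Model identifier
    backend="transformers",  # Backend to use (transformers, llamacpp, transformers_vision, vllm, or mlxlm)
    max_tokens=256,  # Maximum number of tokens to generate
    stop=["\n"],  # Optional list of stop strings
    streaming=True,  # Whether to stream the output
    # Additional parameters for structured generation:
    regex=None,
    type_constraints=None,
    json_schema=None,
    grammar=None,
    # Additional model parameters:
    model_kwargs={"temperature": 0.7}
)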
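
The four structured-generation parameters describe alternative ways to constrain the same output, so it seems reasonable to set at most one of them per model. This page does not state that rule, so treat it as an assumption. The hypothetical helper below (`active_constraint` is not a library function) sketches how a configuration could be checked before constructing a model.

```python
from typing import Any, Optional


def active_constraint(**constraints: Any) -> Optional[str]:
    """Return the name of the single constraint that is set, or None.

    Assumption: at most one of regex, type_constraints, json_schema,
    and grammar should be given for one model.
    """
    chosen = [name for name, value in constraints.items() if value is not None]
    if len(chosen) > 1:
        raise ValueError(
            f"Set at most one structured-generation option, got: {chosen}"
        )
    return chosen[0] if chosen else None


if __name__ == "__main__":
    print(active_constraint(regex=r"\d+", type_constraints=None,
                            json_schema=None, grammar=None))
```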

Model identifier

The model parameter can be:

A Hugging Face model name (for example, "meta-llama/Llama-2-7b-chat-hf")
A local path to a model
For GGUF models, a string in the form "repo_id/file_name" (for example, "TheBloke/Llama-2-7B-Chat-GGUF/llama-2-7b-chat.Q4_K_M.gguf")
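
To make the "repo_id/file_name" form concrete, here is a small sketch that splits such an identifier into its two parts. The function `split_gguf_identifier` is hypothetical and only illustrates the string format. It is not necessarily how the integration parses the value internally.

```python
def split_gguf_identifier(model: str) -> tuple[str, str]:
    """Split "owner/repo/file.gguf" into ("owner/repo", "file.gguf")."""
    # The file name is everything after the last slash; the repo id is the rest.
    repo_id, sep, file_name = model.rpartition("/")
    if not sep or "/" not in repo_id or not file_name.endswith(".gguf"):
        raise ValueError(f"Not a 'repo_id/file_name' GGUF identifier: {model!r}")
    return repo_id, file_name


if __name__ == "__main__":
    repo_id, file_name = split_gguf_identifier(
        "TheBloke/Llama-2-7B-Chat-GGUF/llama-2-7b-chat.Q4_K_M.gguf"
    )
    print(repo_id)
    print(file_name)
```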

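Backend options

The backend parameter specifies which backend to use:

"transformers": for Hugging Face Transformers models (default)
"llamacpp": for GGUF models run with llama.cpp
"transformers_vision": for vision-language models (for example, LLaVA)
"vllm": for models served with the vLLM library
"mlxlm": for models run with the MLX framework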

Structured generation

Outlines provides several methods for structured generation:

Regex matching:

model = Outlines(
    model="meta-llama/Llama-2-7b-chat-hf",
    regex=r"((25[0-5]|2[0-4]\d|[01]?\d\d?)\.){3}(25[0-5]|2[0-4]\d|[01]?\d\d?)"
)

This ensures that the generated text matches the specified regular expression (in this example, a dotted-quad IPv4 address).
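
A pattern can be checked with Python's standard `re` module before it is handed to a model. The sketch below reuses the same expression. The helper name `is_ipv4` and the sample strings are illustrative only.

```python
import re

# Same pattern as in the example above: four octets in the range 0-255.
IP_PATTERN = re.compile(
    r"((25[0-5]|2[0-4]\d|[01]?\d\d?)\.){3}(25[0-5]|2[0-4]\d|[01]?\d\d?)"
)


def is_ipv4(text: str) -> bool:
    """Return True only if the whole string matches the pattern."""
    return IP_PATTERN.fullmatch(text) is not None


if __name__ == "__main__":
    for candidate in ["192.168.0.1", "255.255.255.255", "256.1.1.1", "1.2.3"]:
        print(candidate, is_ipv4(candidate))
```

Using `fullmatch` rather than `search` matters here: constrained generation produces text that matches the pattern in full, not text that merely contains a match.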

Type constraints:
```
model = Outlines(
    model="meta-llama/Llama-2-7b-chat-hf",
    type_constraints=int
)
```
This restricts the output to a valid value of the given Python type (int, float, bool, datetime.date, datetime.time, or datetime.datetime).
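
The model still returns text, so a type-constrained result usually needs converting back into a Python value. The helper below is a hypothetical sketch (`parse_constrained` is not part of Outlines or LangChain). It assumes booleans are emitted as the words True or False and that dates and times use ISO 8601 formatting. Neither assumption is confirmed by this page.

```python
import datetime


def parse_constrained(raw: str, target: type):
    """Convert type-constrained text output into a value of `target`."""
    text = raw.strip()
    if target is bool:
        # Assumption: the model emits the literal words True or False.
        if text not in ("True", "False"):
            raise ValueError(f"Not a boolean literal: {text!r}")
        return text == "True"
    if target in (int, float):
        return target(text)
    if target in (datetime.date, datetime.time, datetime.datetime):
        # Assumption: ISO 8601 text such as 2024-01-31 or 12:30:00.
        return target.fromisoformat(text)
    raise TypeError(f"Unsupported target type: {target!r}")


if __name__ == "__main__":
    print(parse_constrained("42", int))
    print(parse_constrained("2024-01-31", datetime.date))
```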

JSON schema:

from pydantic import BaseModel

class Person(BaseModel):
    name: str
    age: int

model = Outlines(
    model="meta-llama/Llama-2-7b-chat-hf",
    json_schema=Person
)

This ensures that the generated output conforms to the specified JSON schema or Pydantic model.

Context-free grammar:

model = Outlines(
    model="meta-llama/Llama-2-7b-chat-hf",
    grammar="""
        ?start: expression
        ?expression: term (("+" | "-") term)*
        ?term: factor (("*" | "/") factor)*
        ?factor: NUMBER | "-" factor | "(" expression ")"
        %import common.NUMBER
    """
)

This generates text that conforms to the specified context-free grammar, written in EBNF format.
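
To see which strings this arithmetic grammar admits, the following sketch hand-writes a recursive-descent recognizer for the same three rules using only the standard library. It is an illustration, not how Outlines enforces grammars. The `NUMBER` pattern is a simplified stand-in for `common.NUMBER` that covers only plain integers and decimals.

```python
import re

# Simplified stand-in for common.NUMBER (integers and decimals only).
NUMBER = re.compile(r"\d+(\.\d+)?")


def accepts(text: str) -> bool:
    """Return True if `text` can be derived from the arithmetic grammar."""
    pos = 0

    def expression() -> bool:
        # expression: term (("+" | "-") term)*
        nonlocal pos
        if not term():
            return False
        while pos < len(text) and text[pos] in "+-":
            pos += 1
            if not term():
                return False
        return True

    def term() -> bool:
        # term: factor (("*" | "/") factor)*
        nonlocal pos
        if not factor():
            return False
        while pos < len(text) and text[pos] in "*/":
            pos += 1
            if not factor():
                return False
        return True

    def factor() -> bool:
        # factor: NUMBER | "-" factor | "(" expression ")"
        nonlocal pos
        if pos < len(text) and text[pos] == "-":
            pos += 1
            return factor()
        if pos < len(text) and text[pos] == "(":
            pos += 1
            if not expression():
                return False
            if pos < len(text) and text[pos] == ")":
                pos += 1
                return True
            return False
        match = NUMBER.match(text, pos)
        if match:
            pos = match.end()
            return True
        return False

    # The whole input must be consumed for the string to be accepted.
    return expression() and pos == len(text)


if __name__ == "__main__":
    for candidate in ["1+2*(3-4)", "-(2.5/5)", "1 + 2", "2*"]:
        print(repr(candidate), accepts(candidate))
```

The grammar as written has no rule for whitespace, so the recognizer rejects input such as "1 + 2" that contains spaces.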

Usage examples

LLM example

from langchain_community.llms import Outlines

llm = Outlines(model="meta-llama/Llama-2-7b-chat-hf", max_tokens=100)
result = llm.invoke("Tell me a short story about a robot.")
print(result)

Chat model example

from langchain_community.chat_models import ChatOutlines
from langchain.messages import HumanMessage, SystemMessage

chat = ChatOutlines(model="meta-llama/Llama-2-7b-chat-hf", max_tokens=100)
messages = [
    SystemMessage(content="You are a helpful AI assistant."),
    HumanMessage(content="What's the capital of France?")
]
result = chat.invoke(messages)
print(result.content)

Streaming example

from langchain_community.chat_models import ChatOutlines
from langchain.messages import HumanMessage

chat = ChatOutlines(model="meta-llama/Llama-2-7b-chat-hf", streaming=True)
for chunk in chat.stream("Tell me a joke about programming."):
    print(chunk.content, end="", flush=True)
print()

Structured output example

from langchain_community.llms import Outlines
from pydantic import BaseModel

class MovieReview(BaseModel):
    title: str
    rating: int
    summary: str

llm = Outlines(
    model="meta-llama/Llama-2-7b-chat-hf",
    json_schema=MovieReview
)
result = llm.invoke("Write a short review for the movie 'Inception'.")
print(result)
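
The LLM interface returns a string, so the structured result above is JSON text rather than a MovieReview object. The sketch below shows one way to turn such text into a typed object. To stay runnable with the standard library alone it uses a dataclass as a stand-in for the Pydantic model, and the `sample` string is hand-written, not real model output.

```python
import json
from dataclasses import dataclass


@dataclass
class MovieReview:
    # Stand-in for the Pydantic model in the example above.
    title: str
    rating: int
    summary: str


def parse_review(raw: str) -> MovieReview:
    """Parse JSON text into a MovieReview, failing on missing or extra keys."""
    data = json.loads(raw)
    return MovieReview(**data)


if __name__ == "__main__":
    # Hand-written sample; a real run would pass the string returned by llm.invoke.
    sample = '{"title": "Inception", "rating": 9, "summary": "A layered heist."}'
    review = parse_review(sample)
    print(review.title, review.rating)
```

With the Pydantic model from the example, `MovieReview.model_validate_json(result)` (Pydantic v2) does the same job and also validates field types.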

Additional features

Accessing the tokenizer

You can access the model's underlying tokenizer:

tokenizer = llm.tokenizer
encoded = tokenizer.encode("Hello, world!")
decoded = tokenizer.decode(encoded)
