记录 LLM 调用

当您直接调用 LLM，而非通过 LangChain 或 LangSmith 支持的集成时，需要提供特定的元数据，以便 LangSmith 能够显示 token 计数、计算成本，并允许您在 Playground 中使用正确的提供商和模型打开运行。一个功能完整的 LLM 追踪有四个要求：

要求	操作	启用功能
1. 设置 `run_type="llm"`	将 `run_type="llm"` 传递给 `@traceable`	LLM 特定渲染、token/成本显示
2. 格式化输入/输出	使用 OpenAI、Anthropic 或 LangChain 消息格式	结构化消息渲染、Playground 支持
3. 设置 `ls_provider` 和 `ls_model_name`	在 `metadata` 中传递两者	成本追踪、Playground 模型选择
4. 提供 token 计数	在运行上设置 `usage_metadata`	token 计数和成本计算

如果您使用 LangChain OSS、OpenAI 包装器或 Anthropic 包装器，这些细节会自动处理。本页示例使用 traceable 装饰器/包装器（Python 和 JS/TS 的推荐方法）。如果您直接使用 RunTree 或 API，同样的要求也适用。

消息格式

追踪自定义模型或自定义输入/输出格式时，必须遵循 LangChain 格式、OpenAI completions 格式或 Anthropic 消息格式。更多详情，请参阅 OpenAI Chat Completions 或 Anthropic Messages 文档。LangChain 格式如下：

Show LangChain 格式

array

required

包含对话内容的消息列表。

string

required

标识消息类型。可选值：system | reasoning | user | assistant | tool

array

required

消息内容。类型化字典列表。

Show 内容选项

string

required

Show text

literal('text')

required

string

required

文本内容。

object[]

文本的注释列表

object

附加的提供商特定数据。

Show reasoning

literal('reasoning')

required

string

required

文本内容。

object

附加的提供商特定数据。

Show image

literal('image')

required

string

指向图像位置的 URL。

string

required

Base64 编码的图像数据。

string

指向外部存储图像的引用 ID（例如，在提供商的文件系统或存储桶中）。

string

图像 MIME 类型（例如，image/jpeg、image/png）。

Show file (例如 PDF)

literal('file')

required

string

指向文件的 URL。

string

required

Base64 编码的文件数据。

string

指向外部存储文件的引用 ID（例如，在提供商的文件系统或存储桶中）。

string

文件 MIME 类型（例如，application/pdf）。

Show audio

literal('audio')

required

string

指向音频文件的 URL。

string

required

Base64 编码的音频数据。

string

指向外部存储音频文件的引用 ID（例如，在提供商的文件系统或存储桶中）。

string

音频 MIME 类型（例如，audio/mpeg、audio/wav）。

Show video

literal('video')

required

string

指向视频文件的 URL。

string

required

Base64 编码的视频数据。

string

指向外部存储视频文件的引用 ID（例如，在提供商的文件系统或存储桶中）。

string

视频 MIME 类型（例如，video/mp4、video/webm）。

Show tool_call

literal('tool_call')

required

string

object

required

传递给工具的参数。

string

此工具调用的唯一标识符。

Show server_tool_call

literal('server_tool_call')

required

string

required

此工具调用的唯一标识符。

string

required

要调用的工具名称。

object

required

传递给工具的参数。

Show server_tool_result

literal('server_tool_result')

required

string

required

对应服务器工具调用的标识符。

string

此工具调用的唯一标识符。

string

required

服务器端工具的执行状态。可选值：success | error。

已执行工具的输出。

string

必须与先前 assistant 消息的 tool_calls[i] 条目的 id 匹配。仅在 role 为 tool 时有效。

object

使用此字段随模型输出发送 token 计数和/或成本。更多详情，请参阅提供 token 和成本信息。

 inputs = {
  "messages": [
    {
      "role": "user",
      "content": [
        {
          "type": "text",
          "text": "Hi, can you tell me the capital of France?"
        }
      ]
    }
  ]
}

outputs = {
  "messages": [
    {
      "role": "assistant",
      "content": [
        {
          "type": "text",
          "text": "The capital of France is Paris."
        },
        {
          "type": "reasoning",
          "text": "The user is asking about..."
        }
      ]
    }
  ]
}

input = {
  "messages": [
    {
      "role": "user",
      "content": [
        {
          "type": "text",
          "text": "What's the weather in San Francisco?"
        }
      ]
    }
  ]
}

outputs = {
  "messages": [
    {
      "role": "assistant",
      "content": [{"type": "tool_call", "name": "get_weather", "args": {"city": "San Francisco"}, "id": "call_1"}],
    },
    {
      "role": "tool",
      "tool_call_id": "call_1",
      "content": [
        {
          "type": "text",
          "text": "{\"temperature\": \"18°C\", \"condition\": \"Sunny\"}"
        }
      ]
    },
    {
      "role": "assistant",
      "content": [
        {
          "type": "text",
          "text": "The weather in San Francisco is 18°C and sunny."
        }
      ]
    }
  ]
}

inputs = {
  "messages": [
    {
      "role": "user",
      "content": [
        {
          "type": "text",
          "text": "What breed is this dog?"
        },
        {
          "type": "image",
          "url": "https://fastly.picsum.photos/id/237/200/300.jpg?hmac=TmmQSbShHz9CdQm0NkEjx1Dyh_Y984R9LpNrpvH2D_U",
          # 作为 url 的替代方案，您可以提供 base64 编码的图像
          # "base64": "<base64 encoded image>",
          "mime_type": "image/jpeg",
        }
      ]
    }
  ]
}

outputs = {
  "messages": [
    {
      "role": "assistant",
      "content": [
        {
          "type": "text",
          "text": "This looks like a Black Labrador."
        }
      ]
    }
  ]
}

input = {
  "messages": [
    {
      "role": "user",
      "content": [
        {
          "type": "text",
          "text": "What is the price of AAPL?"
        }
      ]
    }
  ]
}

output = {
  "messages": [
    {
      "role": "assistant",
      "content": [
        {
          "type": "server_tool_call",
          "name": "web_search",
          "args": {
            "query": "price of AAPL",
            "type": "search"
          },
          "id": "call_1"
        },
        {
          "type": "server_tool_result",
          "tool_call_id": "call_1",
          "status": "success"
        },
        {
          "type": "text",
          "text": "The price of AAPL is $150.00"
        }
      ]
    }
  ]
}

将自定义 I/O 格式转换为 LangSmith 兼容格式

如果您使用自定义输入或输出格式，可以使用 @traceable 装饰器（Python）或 traceable 函数（TS）上的 process_inputs/processInputs 和 process_outputs/processOutputs 函数将其转换为 LangSmith 兼容格式。 process_inputs/processInputs 和 process_outputs/processOutputs 接受函数，允许您在特定追踪的日志记录到 LangSmith 之前转换其输入和输出。它们可以访问追踪的输入和输出，并可以返回包含处理后数据的新字典。以下是使用 process_inputs 和 process_outputs 将自定义 I/O 格式转换为 LangSmith 兼容格式的模板示例：

class OriginalInputs(BaseModel):
    """您的应用的自定义请求形状"""

class OriginalOutputs(BaseModel):
    """您的应用的自定义响应形状。"""

class LangSmithInputs(BaseModel):
    """LangSmith 期望的输入格式。"""

class LangSmithOutputs(BaseModel):
    """LangSmith 期望的输出格式。"""

def process_inputs(inputs: dict) -> dict:
    """字典 -> OriginalInputs -> LangSmithInputs -> 字典"""

def process_outputs(output: Any) -> dict:
    """OriginalOutputs -> LangSmithOutputs -> 字典"""


@traceable(run_type="llm", process_inputs=process_inputs, process_outputs=process_outputs)
def chat_model(inputs: dict) -> dict:
    """
    您的应用的模型调用。保持您的自定义 I/O 形状。
    装饰器调用 process_* 以记录 LangSmith 兼容格式。
    """

在追踪中识别自定义模型

使用自定义模型时，建议同时提供以下 metadata 字段，以便在查看追踪和过滤时识别模型。

ls_provider：模型的提供商，例如 "openai"、"anthropic"。
ls_model_name：模型的名称，例如 "gpt-5.4-mini"、"claude-3-opus-20240229"。

from langsmith import traceable

inputs = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "I'd like to book a table for two."},
]
output = {
    "choices": [
        {
            "message": {
                "role": "assistant",
                "content": "Sure, what time would you like to book the table for?"
            }
        }
    ]
}

@traceable(
    run_type="llm",
    metadata={"ls_provider": "my_provider", "ls_model_name": "my_model"}
)
def chat_model(messages: list):
    return output

chat_model(inputs)

import { traceable } from "langsmith/traceable";

const messages = [
    { role: "system", content: "You are a helpful assistant." },
    { role: "user", content: "I'd like to book a table for two." }
];
const output = {
    choices: [
        {
            message: {
                role: "assistant",
                content: "Sure, what time would you like to book the table for?",
            },
        },
    ],
    usage_metadata: {
        input_tokens: 27,
        output_tokens: 13,
        total_tokens: 40,
    },
};

// 也可以使用以下任一形式：
// const output = {
//     message: {
//         role: "assistant",
//         content: "Sure, what time would you like to book the table for?"
//     }
// };
//
// const output = {
//     role: "assistant",
//     content: "Sure, what time would you like to book the table for?"
// };
//
// const output = ["assistant", "Sure, what time would you like to book the table for?"];

const chatModel = traceable(
    async ({ messages }: { messages: { role: string; content: string }[] }) => {
        return output;
    },
    {
        run_type: "llm",
        name: "chat_model",
        metadata: {
            ls_provider: "my_provider",
            ls_model_name: "my_model"
        }
    }
);

await chatModel({ messages });

如果您实现了自定义流式 chat_model，可以将输出“归约”为与非流式版本相同的格式。这仅在 Python 中受支持：

def _reduce_chunks(chunks: list):
    all_text = "".join([chunk["choices"][0]["message"]["content"] for chunk in chunks])
    return {"choices": [{"message": {"content": all_text, "role": "assistant"}}]}

@traceable(
    run_type="llm",
    reduce_fn=_reduce_chunks,
    metadata={"ls_provider": "my_provider", "ls_model_name": "my_model"}
)
def my_streaming_chat_model(messages: list):
    for chunk in ["Hello, " + messages[1]["content"]]:
        yield {
            "choices": [
                {
                    "message": {
                        "content": chunk,
                        "role": "assistant",
                    }
                }
            ]
        }

list(
    my_streaming_chat_model(
        [
            {"role": "system", "content": "You are a helpful assistant. Please greet the user."},
            {"role": "user", "content": "polly the parrot"},
        ],
    )
)

在 metadata 中设置 ls_model_name 是 LangSmith 识别模型并计算自定义 LLM 追踪成本所必需的。没有它，token 计数可能仍会被记录，但成本不会被估算。

要了解更多关于如何使用 metadata 字段的信息，请参阅添加元数据和标签指南。

提供 token 和成本信息

token 计数支持成本计算，LangSmith 在追踪项目 UI 中显示。有两种提供方式：

在运行树上设置 usage_metadata：在您的 @traceable 函数内调用 get_current_run_tree() / getCurrentRunTree() 并设置 usage_metadata 字段。这不会改变函数的返回值。
在输出中返回 usage_metadata：在函数返回的字典中包含 usage_metadata 作为顶级键。

支持的 `usage_metadata` 字段

字段	类型	描述
`input_tokens`	`int`	总输入/提示 token 数
`output_tokens`	`int`	总输出/补全 token 数
`total_tokens`	`int`	输入 + 输出的总和（可选，可推断）
`input_token_details`	`object`	明细：`cache_read`、`cache_creation`、`cache_read_over_200k`、`ephemeral_5m_input_tokens`、`ephemeral_1h_input_tokens`、`audio`、`text`、`image`
`output_token_details`	`object`	明细：`reasoning`、`audio`、`text`、`image`

要直接发送成本（用于非线性定价），您还可以包含 input_cost、output_cost 和 total_cost 字段。有关配置模型定价和在 UI 中查看成本的详细信息，请参阅成本追踪页面。

首个 token 时间

如果您使用 traceable 或 SDK 包装器之一，LangSmith 将自动填充流式 LLM 运行的首个 token 时间。但是，如果您直接使用 RunTree API，则需要向运行树添加 new_token 事件，以便正确填充首个 token 时间。以下是示例：

from langsmith.run_trees import RunTree
run_tree = RunTree(
    name="CustomChatModel",
    run_type="llm",
    inputs={ ... }
)
run_tree.post()
llm_stream = ...
first_token = None
for token in llm_stream:
    if first_token is None:
      first_token = token
      run_tree.add_event({
        "name": "new_token"
      })
run_tree.end(outputs={ ... })
run_tree.patch()

import { RunTree } from "langsmith";
const runTree = new RunTree({
    name: "CustomChatModel",
    run_type: "llm",
    inputs: { ... },
});
await runTree.postRun();
const llmStream = ...;
let firstToken;
for (const token of llmStream) {
    if (firstToken == null) {
        firstToken = token;
        runTree.addEvent({ name: "new_token" });
    }
}
await runTree.end({
    outputs: { ... },
});
await runTree.patchRun();

Tracing setup

Configuration & troubleshooting

Viewing & managing traces

Automations

Feedback & evaluation

Monitoring & alerting

Data type reference

消息格式

将自定义 I/O 格式转换为 LangSmith 兼容格式

在追踪中识别自定义模型

提供 token 和成本信息

支持的 `usage_metadata` 字段

首个 token 时间

相关内容

​消息格式

​将自定义 I/O 格式转换为 LangSmith 兼容格式

​在追踪中识别自定义模型

​提供 token 和成本信息

​支持的 usage_metadata 字段

​首个 token 时间

​相关内容

消息格式

将自定义 I/O 格式转换为 LangSmith 兼容格式

在追踪中识别自定义模型

提供 token 和成本信息

支持的 `usage_metadata` 字段

首个 token 时间

相关内容