Comet 集成

Comet 机器学习平台与您现有的基础设施和工具集成，让您可以管理、可视化和优化模型——从训练运行到生产监控

230328046 a8b18c51 12e3 4617 9b39 97614a571a2d

在本指南中，我们将演示如何使用 Comet 追踪您的 LangChain 实验、评估指标和 LLM 会话。 示例项目： Comet 与 LangChain 230326720 a9711435 9c6f 4edb a707 94b67271ab25

安装 Comet 及依赖项

pip install -qU comet_ml langchain langchain-openai google-search-results spacy textstat pandas

!{sys.executable} -m spacy download en_core_web_sm

初始化 Comet 并设置凭据

您可以在此处获取 Comet API 密钥，或在初始化 Comet 后点击链接

import comet_ml

comet_ml.init(project_name="comet-example-langchain")

设置 OpenAI 和 SerpApi 凭据

运行以下示例需要 OpenAI API 密钥和 SerpApi API 密钥

import os

os.environ["OPENAI_API_KEY"] = "..."
# os.environ["OPENAI_ORGANIZATION"] = "..."
os.environ["SERPAPI_API_KEY"] = "..."

场景一：仅使用 LLM

from langchain_community.callbacks import CometCallbackHandler
from langchain_core.callbacks import StdOutCallbackHandler
from langchain_openai import OpenAI

comet_callback = CometCallbackHandler(
    project_name="comet-example-langchain",
    complexity_metrics=True,
    stream_logs=True,
    tags=["llm"],
    visualizations=["dep"],
)
callbacks = [StdOutCallbackHandler(), comet_callback]
llm = OpenAI(temperature=0.9, callbacks=callbacks, verbose=True)

llm_result = llm.generate(["Tell me a joke", "Tell me a poem", "Tell me a fact"] * 3)
print("LLM result", llm_result)
comet_callback.flush_tracker(llm, finish=True)

场景二：在链中使用 LLM

from langchain_classic.chains import LLMChain
from langchain_community.callbacks import CometCallbackHandler
from langchain_core.callbacks import StdOutCallbackHandler
from langchain_core.prompts import PromptTemplate
from langchain_openai import OpenAI

comet_callback = CometCallbackHandler(
    complexity_metrics=True,
    project_name="comet-example-langchain",
    stream_logs=True,
    tags=["synopsis-chain"],
)
callbacks = [StdOutCallbackHandler(), comet_callback]
llm = OpenAI(temperature=0.9, callbacks=callbacks)

template = """You are a playwright. Given the title of play, it is your job to write a synopsis for that title.
Title: {title}
Playwright: This is a synopsis for the above play:"""
prompt_template = PromptTemplate(input_variables=["title"], template=template)
synopsis_chain = LLMChain(llm=llm, prompt=prompt_template, callbacks=callbacks)

test_prompts = [{"title": "Documentary about Bigfoot in Paris"}]
print(synopsis_chain.apply(test_prompts))
comet_callback.flush_tracker(synopsis_chain, finish=True)

场景三：使用带工具的智能体

from langchain.agents import create_agent, load_tools
from langchain_community.callbacks import CometCallbackHandler
from langchain_core.callbacks import StdOutCallbackHandler
from langchain_openai import OpenAI

comet_callback = CometCallbackHandler(
    project_name="comet-example-langchain",
    complexity_metrics=True,
    stream_logs=True,
    tags=["agent"],
)
callbacks = [StdOutCallbackHandler(), comet_callback]
llm = OpenAI(temperature=0.9, callbacks=callbacks)

tools = load_tools(["serpapi", "llm-math"], llm=llm, callbacks=callbacks)

agent = create_agent(
    model=llm,
    tools=tools,
)

agent.invoke(
    "Who is Leo DiCaprio's girlfriend? What is her current age raised to the 0.43 power?"
)

comet_callback.flush_tracker(agent, finish=True)

场景四：使用自定义评估指标

CometCallbackManager 还允许您定义和使用自定义评估指标来评估模型的生成输出。让我们看看这是如何工作的。在下面的代码片段中，我们将使用 ROUGE 指标来评估输入提示生成摘要的质量。

pip install -qU  rouge-score

from langchain_classic.chains import LLMChain
from langchain_community.callbacks import CometCallbackHandler
from langchain_core.callbacks import StdOutCallbackHandler
from langchain_core.prompts import PromptTemplate
from langchain_openai import OpenAI
from rouge_score import rouge_scorer


class Rouge:
    def __init__(self, reference):
        self.reference = reference
        self.scorer = rouge_scorer.RougeScorer(["rougeLsum"], use_stemmer=True)

    def compute_metric(self, generation, prompt_idx, gen_idx):
        prediction = generation.text
        results = self.scorer.score(target=self.reference, prediction=prediction)

        return {
            "rougeLsum_score": results["rougeLsum"].fmeasure,
            "reference": self.reference,
        }


reference = """
The tower is 324 metres (1,063 ft) tall, about the same height as an 81-storey building.
It was the first structure to reach a height of 300 metres.

It is now taller than the Chrysler Building in New York City by 5.2 metres (17 ft)
Excluding transmitters, the Eiffel Tower is the second tallest free-standing structure in France .
"""
rouge_score = Rouge(reference=reference)

template = """Given the following article, it is your job to write a summary.
Article:
{article}
Summary: This is the summary for the above article:"""
prompt_template = PromptTemplate(input_variables=["article"], template=template)

comet_callback = CometCallbackHandler(
    project_name="comet-example-langchain",
    complexity_metrics=False,
    stream_logs=True,
    tags=["custom_metrics"],
    custom_metrics=rouge_score.compute_metric,
)
callbacks = [StdOutCallbackHandler(), comet_callback]
llm = OpenAI(temperature=0.9)

synopsis_chain = LLMChain(llm=llm, prompt=prompt_template)

test_prompts = [
    {
        "article": """
                 The tower is 324 metres (1,063 ft) tall, about the same height as
                 an 81-storey building, and the tallest structure in Paris. Its base is square,
                 measuring 125 metres (410 ft) on each side.
                 During its construction, the Eiffel Tower surpassed the
                 Washington Monument to become the tallest man-made structure in the world,
                 a title it held for 41 years until the Chrysler Building
                 in New York City was finished in 1930.

                 It was the first structure to reach a height of 300 metres.
                 Due to the addition of a broadcasting aerial at the top of the tower in 1957,
                 it is now taller than the Chrysler Building by 5.2 metres (17 ft).

                 Excluding transmitters, the Eiffel Tower is the second tallest
                 free-standing structure in France after the Millau Viaduct.
                 """
    }
]
print(synopsis_chain.apply(test_prompts, callbacks=callbacks))
comet_callback.flush_tracker(synopsis_chain, finish=True)

回调追踪器

与 Comet 还有另一种集成：查看示例。

from langchain_community.callbacks.tracers.comet import CometTracer

在 GitHub 上编辑此页面或提交 issue。

通过 MCP 将这些文档连接到 Claude、VSCode 等，获取实时答案。

Popular Providers

Integrations by component

安装 Comet 及依赖项

初始化 Comet 并设置凭据

设置 OpenAI 和 SerpApi 凭据

场景一：仅使用 LLM

场景二：在链中使用 LLM

场景三：使用带工具的智能体

场景四：使用自定义评估指标

回调追踪器

Popular Providers

Integrations by component

​安装 Comet 及依赖项

​初始化 Comet 并设置凭据

​设置 OpenAI 和 SerpApi 凭据

​场景一：仅使用 LLM

​场景二：在链中使用 LLM

​场景三：使用带工具的智能体

​场景四：使用自定义评估指标

​回调追踪器

安装 Comet 及依赖项

初始化 Comet 并设置凭据

设置 OpenAI 和 SerpApi 凭据

场景一：仅使用 LLM

场景二：在链中使用 LLM

场景三：使用带工具的智能体

场景四：使用自定义评估指标

回调追踪器