evaluate() / aevaluate()传递upload_results=False来实现这一点。
这将像往常一样运行您的应用程序和评估器,并返回相同的输出,但不会向LangSmith记录任何内容。这不仅包括实验结果,还包括应用程序和评估器的追踪记录。
如果您想将结果上传到LangSmith,但还需要在脚本中处理它们(用于质量门控、自定义聚合等),请参阅在本地读取实验结果。
示例
让我们来看一个示例: 需要langsmith>=0.2.0。示例也使用了pandas。
| inputs.question | outputs.answer | reference.answer | feedback.is_concise | |
|---|---|---|---|---|
| 0 | What is the largest mammal? | What is the largest mammal? is a good question. I don’t know the answer. | The blue whale | False |
| 1 | What do mammals and birds have in common? | What do mammals and birds have in common? is a good question. I don’t know the answer. | They are both warm-blooded | False |
Connect these docs to Claude, VSCode, and more via MCP for real-time answers.

