Hugging Face Multiagents

Multiagents

tensorflowsmolagents_dochf-notebookszh

alph-notebooks/hf-notebooks / multiagents.ipynb

Export

Run Notebooks

Contents

No cells yet

Add cells to see them here

[ ]

编排 multi-agent 系统 🤖🤝🤖

此notebook将构建一个 multi-agent 网络浏览器：一个有多个代理协作，使用网络进行搜索解决问题的代理系统

ManagedAgent 对象将封装这些管理网络搜索的agent，形成一个简单的层次结构：

              +----------------+
              | Manager agent  |
              +----------------+
                       |
        _______________|______________
       |                              |
  Code interpreter   +--------------------------------+
       tool          |         Managed agent          |
                     |      +------------------+      |
                     |      | Web Search agent |      |
                     |      +------------------+      |
                     |         |            |         |
                     |  Web Search tool     |         |
                     |             Visit webpage tool |
                     +--------------------------------+

我们来一起构建这个系统。运行下列代码以安装依赖包：

!pip install 'smolagents[toolkit]' --upgrade -q

我们需要登录Hugging Face Hub以调用HF的Inference API：

from huggingface_hub import login

login()

⚡️ HF的Inference API 可以快速轻松地运行任何开源模型，因此我们的agent将使用HF的Inference API 中的InferenceClientModel类来调用 Qwen/Qwen3-Next-80B-A3B-Thinking模型。

Note: 基于多参数和部署模型的 Inference API 可能在没有预先通知的情况下更新或替换模型。了解更多信息，请参阅这里。

[ ]

🔍 创建网络搜索工具

虽然我们可以使用已经存在的 WebSearchTool 工具作为谷歌搜索的平替进行网页浏览，然后我们也需要能够查看WebSearchTool找到的页面。为此，我们可以直接导入库的内置 VisitWebpageTool。但是我们将重新构建它以了解其工作原理。

我们将使用markdownify 来从头构建我们的VisitWebpageTool工具。

[ ]

现在我们初始化这个工具并测试它！

[ ]

构建我们的 multi-agent 系统 🤖🤝🤖

现在我们有了所有工具search和visit_webpage，我们可以使用它们来创建web agent。

我们该选取什么样的配置来构建这个agent呢？

网页浏览是一个单线程任务，不需要并行工具调用，因此JSON工具调用对于这个任务非常有效。因此我们选择ToolCallingAgent。
有时候网页搜索需要探索许多页面才能找到正确答案，所以我们更喜欢将 max_steps 增加到10。

[ ]

请注意，我们为这个代理赋予了 name（名称）和 description（描述）属性，这些是必需属性，以便让管理代理能够调用此代理。

然后，我们创建一个管理代理，在初始化时，将受管代理作为 managed_agents 参数传递给它。

由于这个代理的任务是进行规划和思考，高级推理能力会很有帮助，因此 CodeAgent（代码代理）将是最佳选择。

此外，我们要提出一个涉及当前年份并需要进行额外数据计算的问题：所以让我们添加 additional_authorized_imports=["time", "numpy", "pandas"]，以防代理需要用到这些包。

[ ]

可以了！现在让我们运行我们的系统！我们选择一个需要一些计算和研究的问题：

[ ]

我们用这个report 来回答这个问题：

Based on current growth projections and energy consumption estimates, if LLM trainings continue to scale up at the
current rhythm until 2030:

1. The electric power required to power the biggest training runs by 2030 would be approximately 303.74 GW, which
translates to about 2,660,762 GWh/year.

1. Comparing this to countries' electricity consumption:
   - It would be equivalent to about 34% of China's total electricity consumption.
   - It would exceed the total electricity consumption of India (184%), Russia (267%), and Japan (291%).
   - It would be nearly 9 times the electricity consumption of countries like Italy or Mexico.

2. Source of numbers:
   - The initial estimate of 5 GW for future LLM training comes from AWS CEO Matt Garman.
   - The growth projection used a CAGR of 79.80% from market research by Springs.
   - Country electricity consumption data is from the U.S. Energy Information Administration, primarily for the year
2021.

如果scaling hypothesis持续成立的话，我们需要一些庞大的动力配置。我们的agent成功地协作解决了这个任务！✅

💡 你可以轻松地将这个编排扩展到更多的agent：一个执行代码，一个进行网页搜索，一个处理文件加载⋯⋯