ChatHuggingFace

This will help you get started with langchain_huggingface chat models. For detailed documentation of all ChatHuggingFace features and configurations, head to the API reference. For a list of models supported by Hugging Face, check out this page.

Overview

Integration details

Class | Package | Local | Serializable | JS support | Package downloads | Package latest
ChatHuggingFace | langchain-huggingface | ✅ | beta | ❌ | PyPI - Downloads | PyPI - Version

Model features

Tool calling | Structured output | JSON mode | Image input | Audio input | Video input | Token-level streaming | Native async | Token usage | Logprobs

Setup

To access Hugging Face models you'll need to create a Hugging Face account, get an API key, and install the langchain-huggingface integration package.

Credentials

Generate a Hugging Face Access Token and store it as an environment variable: HUGGINGFACEHUB_API_TOKEN

import getpass
import os

if not os.getenv("HUGGINGFACEHUB_API_TOKEN"):
    os.environ["HUGGINGFACEHUB_API_TOKEN"] = getpass.getpass("Enter your token: ")
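As an alternative (a minimal sketch, assuming the huggingface_hub package, which langchain-huggingface depends on), you can log in interactively and let the token be cached locally instead of exporting it yourself:

# Alternative sketch: interactive login via huggingface_hub.
# The token is cached under ~/.cache/huggingface/ and picked up automatically.
from huggingface_hub import login

login()  # prompts for the access token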

Installation

%pip install --upgrade --quiet  langchain-huggingface text-generation transformers google-search-results numexpr langchainhub sentencepiece jinja2 bitsandbytes accelerate

[notice] A new release of pip is available: 24.0 -> 24.1.2
[notice] To update, run: pip install --upgrade pip
Note: you may need to restart the kernel to use updated packages.

Instantiation

You can instantiate a ChatHuggingFace model in two different ways, either from a HuggingFaceEndpoint or from a HuggingFacePipeline.

HuggingFaceEndpoint

from langchain_huggingface import ChatHuggingFace, HuggingFaceEndpoint

llm = HuggingFaceEndpoint(
    repo_id="HuggingFaceH4/zephyr-7b-beta",
    task="text-generation",
    max_new_tokens=512,
    do_sample=False,
    repetition_penalty=1.03,
)

chat_model = ChatHuggingFace(llm=llm)
The token has not been saved to the git credentials helper. Pass `add_to_git_credential=True` in this function directly or `--add-to-git-credential` if using via `huggingface-cli` if you want to set the git credential as well.
Token is valid (permission: fineGrained).
Your token has been saved to /Users/isaachershenson/.cache/huggingface/token
Login successful
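With the wrapper in place, a quick smoke test shows the endpoint responding (a minimal sketch; assumes the endpoint above is reachable and your token is set):

# Sanity check: ChatHuggingFace accepts a plain string as well as message lists
print(chat_model.invoke("Say hello in one short sentence.").content)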

HuggingFacePipeline

from langchain_huggingface import ChatHuggingFace, HuggingFacePipeline

llm = HuggingFacePipeline.from_model_id(
    model_id="HuggingFaceH4/zephyr-7b-beta",
    task="text-generation",
    pipeline_kwargs=dict(
        max_new_tokens=512,
        do_sample=False,
        repetition_penalty=1.03,
    ),
)

chat_model = ChatHuggingFace(llm=llm)
[Output: on first use the model is downloaded from the Hub (config.json, model.safetensors.index.json, eight safetensors shards totalling roughly 14.5 GB, and generation_config.json) and the checkpoint shards are loaded.]
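By default the pipeline loads on CPU. As a sketch of GPU placement (an assumption: a CUDA-capable device with enough memory is available), from_model_id also takes a device index:

# Sketch: pin the pipeline to GPU 0; omitting device keeps it on CPU
llm_gpu = HuggingFacePipeline.from_model_id(
    model_id="HuggingFaceH4/zephyr-7b-beta",
    task="text-generation",
    device=0,  # assumption: a CUDA GPU is present
    pipeline_kwargs=dict(max_new_tokens=512),
)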

Instantiation with quantization

To run a quantized version of the model, you can specify a bitsandbytes quantization config as follows:

from transformers import BitsAndBytesConfig

quantization_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype="float16",
    bnb_4bit_use_double_quant=True,
)

and pass it to the HuggingFacePipeline as part of model_kwargs:

llm = HuggingFacePipeline.from_model_id(
    model_id="HuggingFaceH4/zephyr-7b-beta",
    task="text-generation",
    pipeline_kwargs=dict(
        max_new_tokens=512,
        do_sample=False,
        repetition_penalty=1.03,
        return_full_text=False,
    ),
    model_kwargs={"quantization_config": quantization_config},
)

chat_model = ChatHuggingFace(llm=llm)
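As a rough memory estimate: NF4 stores each weight in 4 bits, so a 7B-parameter model needs about 7e9 × 0.5 bytes ≈ 3.5 GB for its weights, versus roughly 14 GB in float16, which is why the quantized variant fits on a single consumer GPU.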

Invocation

from langchain_core.messages import (
    HumanMessage,
    SystemMessage,
)

messages = [
    SystemMessage(content="You're a helpful assistant"),
    HumanMessage(
        content="What happens when an unstoppable force meets an immovable object?"
    ),
]

ai_msg = chat_model.invoke(messages)
print(ai_msg.content)
According to the popular phrase and hypothetical scenario, when an unstoppable force meets an immovable object, a paradoxical situation arises as both forces are seemingly contradictory. On one hand, an unstoppable force is an entity that cannot be stopped or prevented from moving forward, while on the other hand, an immovable object is something that cannot be moved or displaced from its position. 

In this scenario, it is un
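ChatHuggingFace implements the standard Runnable interface, so the token-level streaming listed under model features above is available via .stream; a minimal sketch reusing the same messages:

# Stream the reply chunk by chunk instead of waiting for the full message
for chunk in chat_model.stream(messages):
    print(chunk.content, end="", flush=True)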

API reference

For detailed documentation of all ChatHuggingFace features and configurations, head to the API reference: https://langchain-python.dev.org.tw/api_reference/huggingface/chat_models/langchain_huggingface.chat_models.huggingface.ChatHuggingFace.html


