Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

is:issue is:open GLM-4-9B、GLM-4-9B-Chat、GLM-4-9B-Chat-HF、GLM-4-9B-Chat-1M、GLM-4-9B-Chat-1M-HF、GLM-4V-9B这几个模型的区别和应用场景是什么呢 #682

Closed
adminadminadminadminadminadminadmin opened this issue Dec 27, 2024 · 1 comment
Assignees

Comments

@adminadminadminadminadminadminadmin

Feature request / 功能建议

请补充一下几个模型的区别和适用场景

Motivation / 动机

获取区别和使用场景

Your contribution / 您的贡献

.

@zRzRzRzRzRzRzR zRzRzRzRzRzRzR self-assigned this Dec 28, 2024
@THUDM THUDM deleted a comment Dec 28, 2024
@zRzRzRzRzRzRzR
Copy link
Member

zRzRzRzRzRzRzR commented Dec 29, 2024

GLM-4-9B 是一个基座模型,不具备对话能力。
GLM-4-9B-Chat 是对话模型,具备工具调用,对话,指令跟随。适用transformers 4.44-4.46(后续版本未测试),支持vLLM(市面上主流的框架这个版本都支持)
GLM-4-9B-Chat-HF 适用于transformers 4.46以后,但是暂未适配vLLM,模型就是GLM-4-9B-Chat
GLM-4-9B-Chat-1M 是长文本模型,支持1M上下文,没有工具调用。GLM-4-9B-Chat 是128K上下文
GLM-4-9B-Chat-1M-HF同理。
GLM-4V-9B视觉理解模型,支持8K上下文,图像固定占用1600token,仅支持一张图像,不支持工具调用,主要完成VQA任务

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants