Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Question]: How to improve parsing and chating speed? #814

Open
wjjf opened this issue May 17, 2024 · 3 comments
Open

[Question]: How to improve parsing and chating speed? #814

wjjf opened this issue May 17, 2024 · 3 comments
Labels
question Further information is requested

Comments

@wjjf
Copy link

wjjf commented May 17, 2024

[Question]: 您好,我在服务器上部署了一套,使用的是千问大模型,服务器16+200G。现在的问题是解析文档有时候会卡住,提问的话如果有答案通常是20s起步。我想知道我需要调整哪方面的参数能优化这一问题?我的性能瓶颈在哪里?

@wjjf wjjf added the question Further information is requested label May 17, 2024
@KevinHuSh
Copy link
Collaborator

-- We use visual model to parsing PDF, so it's slow. We're working on its speed. You can disable layout recognition for PDF in general parsing method. And this can be configured later in knowledgebase configuration.

-- About the chatting speed, we're gona use streamly chat which will bring speedy chating user experience. Today, it's gona release in docker images of dev version.

-- I'm not sure what the 16 refer to. If it's 16GB memory, I'm afraid it's not enough. 32GB will be better.

image

@KevinHuSh KevinHuSh changed the title [Question]: 解析速度慢,回答速度慢 [Question]: How to imporve parsing and chating speed? May 17, 2024
@wjjf
Copy link
Author

wjjf commented May 17, 2024

Thanks!

@JinHai-CN JinHai-CN changed the title [Question]: How to imporve parsing and chating speed? [Question]: How to improve parsing and chating speed? May 18, 2024
@wjjf
Copy link
Author

wjjf commented May 20, 2024

Now I'm using a machine with 32GB of RAM, but the answer speed is still twenty or thirty seconds, what parameters should I adjust to improve it?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants