Add a way to exit chat request early from the chat UI #508

sabaimran · 2023-10-19T18:36:45Z

The local llama chat response can take minutes sometimes. If you want to update the request and tweak it, then this can mean a lot of waiting in order to retry your request. Add some way to send an interrupt signal from the UI to cancel the request.

See relevant discussion on Discord.

sabaimran · 2023-10-23T03:43:11Z

Mini-update. I looked into this today and found that gpt4all only supports shortcircuiting the model response after tokens have already started emitting. So, you can't stop it from 'thinking', so to speak, once it's already been given a query. To that end, I'll update the UI so that you can cancel the query once tokens are being spit out, but not before then.

Hopefully the time to first token issue will be less of a headache for folks using Mistral. That'll become the default model (see commit 0f1ebca) in the next release.

sabaimran added the upgrade New feature or request label Oct 19, 2023

sabaimran mentioned this issue Oct 23, 2023

Allow users to short-circuit LLM response to offline model #515

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a way to exit chat request early from the chat UI #508

Add a way to exit chat request early from the chat UI #508

sabaimran commented Oct 19, 2023

sabaimran commented Oct 23, 2023 •

edited

Add a way to exit chat request early from the chat UI #508

Add a way to exit chat request early from the chat UI #508

Comments

sabaimran commented Oct 19, 2023

sabaimran commented Oct 23, 2023 • edited

sabaimran commented Oct 23, 2023 •

edited