Add new engine to chat/generate features #895

Open
pavaris-pm opened this issue Dec 24, 2023 · 2 comments
Labels
enhancement enhance functionalities

Comments

@pavaris-pm
Contributor

pavaris-pm commented Dec 24, 2023

A couple of days ago, I noticed that our chat/generate features currently use wangchanglm as the LLM for text generation. There is also a new LLM, Typhoon-7b from scb10x, which has made a strong impression among Thai LLMs with its evaluation results on Thai examination tasks. Given this new wave of Thai LLMs, should we add Typhoon-7b as an optional engine in PyThaiNLP? What do you think?

P.S. I'm not sure whether it will produce inappropriate words, since they state that it has no moderation mechanism. I could fine-tune it on a small set of samples (e.g. 1k text samples) to adjust its mood and tone toward more appropriate generation. Suggestions are welcome.
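To make the "optional engine" idea concrete, here is a minimal sketch of how an engine could be selected by name. This is not PyThaiNLP's actual API: `TextEngine`, `register_engine`, `load_engine`, `available_engines`, `TyphoonEngine`, and `EchoEngine` are all hypothetical names, and the Hugging Face model id `scb10x/typhoon-7b` is an assumption. The `transformers` import is kept inside the class so the registry itself works without the library installed.

```python
from typing import Callable, Dict, List, Protocol


class TextEngine(Protocol):
    """Anything that can turn a prompt into generated text."""

    def generate(self, prompt: str, max_new_tokens: int = 64) -> str: ...


# Hypothetical registry: engine name -> zero-argument factory.
_ENGINES: Dict[str, Callable[[], TextEngine]] = {}


def register_engine(name: str, factory: Callable[[], TextEngine]) -> None:
    """Make an engine selectable by name."""
    _ENGINES[name] = factory


def available_engines() -> List[str]:
    """Return the registered engine names in sorted order."""
    return sorted(_ENGINES)


def load_engine(name: str) -> TextEngine:
    """Build the engine registered under `name`, or raise ValueError."""
    try:
        factory = _ENGINES[name]
    except KeyError:
        raise ValueError(
            f"unknown engine {name!r}; choose from {available_engines()}"
        ) from None
    return factory()


class TyphoonEngine:
    """Sketch of a Typhoon-7b wrapper (the model id is an assumption)."""

    def __init__(self, model_id: str = "scb10x/typhoon-7b") -> None:
        # Imported here so the model is only needed when this engine is loaded.
        from transformers import AutoModelForCausalLM, AutoTokenizer

        self.tokenizer = AutoTokenizer.from_pretrained(model_id)
        self.model = AutoModelForCausalLM.from_pretrained(model_id)

    def generate(self, prompt: str, max_new_tokens: int = 64) -> str:
        inputs = self.tokenizer(prompt, return_tensors="pt")
        output_ids = self.model.generate(**inputs, max_new_tokens=max_new_tokens)
        return self.tokenizer.decode(output_ids[0], skip_special_tokens=True)


class EchoEngine:
    """Stand-in engine for trying the registry without downloading a model."""

    def generate(self, prompt: str, max_new_tokens: int = 64) -> str:
        return f"echo: {prompt}"


register_engine("typhoon-7b", TyphoonEngine)
register_engine("echo", EchoEngine)

if __name__ == "__main__":
    print(available_engines())
    print(load_engine("echo").generate("สวัสดี"))
```

Because each factory runs only inside `load_engine`, registering `typhoon-7b` costs nothing until someone asks for it, which is one way an engine could stay optional. Any moderation or fine-tuning step would sit on top of this and is not shown.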

@wannaphong
Member

Yes, I agree.

Typhoon-7b is a bilingual LLM, so I think that if someone trains it for instruction following in English, it should work with Thai too!

I welcome it as long as the model doesn't use data from ChatGPT (for example, ShareGPT, or self-instruct datasets created with ChatGPT data).

@wannaphong wannaphong added this to the Future milestone Dec 26, 2023
@pavaris-pm
Contributor Author

> Yes, I agree.
>
> Typhoon-7b is a bilingual LLM, so I think that if someone trains it for instruction following in English, it should work with Thai too!
>
> I welcome it as long as the model doesn't use data from ChatGPT (for example, ShareGPT, or self-instruct datasets created with ChatGPT data).

That's already part of my plan! Please wait for my upcoming PR krub.

@bact bact added the enhancement enhance functionalities label Feb 11, 2024
@bact bact added this to To do in PyThaiNLP Feb 11, 2024
3 participants