Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

chinese input to lexer? #556

Open
nIxedoahz opened this issue Mar 3, 2023 · 1 comment
Open

chinese input to lexer? #556

nIxedoahz opened this issue Mar 3, 2023 · 1 comment

Comments

@nIxedoahz
Copy link

How do I support Chinese string matching

@Mightyjo
Copy link
Contributor

Mightyjo commented Mar 8, 2023

If you encode your Chinese characters in UTF-8 most features will just work. Character classes (i.e. [aAcC-_] will not work, though. They will treat each byte of the code point as a separate character instead of treating them as a multibyte sequence.

You can achieve something similar to character classes using the alternation operator (e.g. "#!" | "?!").

@westes westes changed the title support chinese input to lexer? Mar 13, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants