skip UTF-8 BOM also #381

kmuto · 2023-01-28T14:35:34Z

I noticed that reading a UTF-8 encoded file with a BOM causes an error. Unfortunately, major Windows software adds a BOM to UTF-8 files.
UTF-16 support is fixed in #277 , but in addition, skipping the UTF-8 BOM (ef bb bf, https://en.wikipedia.org/wiki/Byte_order_mark ) also solves the problem, I believe.

A sample is attached.

before:

OK(withoutBOM) あ
ERROR(withBOM) toml: line 1: expected '.' or '=', but got '\ufeff' instead

after:

OK(withoutBOM) あ
OK(withBOM) あ

mini.zip

arp242 · 2023-01-28T19:45:22Z

Thanks!

kmuto · 2023-01-30T13:48:47Z

@arp242
Thanks for the merge!
Have you decided when you plan to release the next one? If you are keeping the releases at 3 month intervals, I think it's about the right time. 😊

kmuto and others added 2 commits January 28, 2023 23:05

skip UTF-8 BOM also

23a0335

Add test and comment

1854085

arp242 merged commit 1a6ca6e into BurntSushi:master Jan 28, 2023

kmuto mentioned this pull request May 18, 2023

awaiting v1.2.2 release #389

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

skip UTF-8 BOM also #381

skip UTF-8 BOM also #381

kmuto commented Jan 28, 2023

arp242 commented Jan 28, 2023

kmuto commented Jan 30, 2023

skip UTF-8 BOM also #381

skip UTF-8 BOM also #381

Conversation

kmuto commented Jan 28, 2023

arp242 commented Jan 28, 2023

kmuto commented Jan 30, 2023