- Unicode 15.1
- Unicode 15
- Unicode 14
- Unicode 13
- Improve terminal width detection / Windows support
- Unicode 12
- Unicode 11
- Support Unicode 10.0
- Double check UTF-32 only on Ruby versions which contain the bug
- Highlight unassigned codepoints which are ignorable
- Bump symbolify dependency
- Add special characters (U+FFF9 - U+FFFC)
- Non-control separators return ⏎
- Bump characteristics dependency
- Allow GB1988 encoding (7bit ascii-like)
- Extract symbolification logic into extra symbolify gem (includes fixes and non-character detection)
- Update characteristics gem (includes a new blank)
- More consistent handling of bidi controls (always symbolify with alias, but highlight ALM, RLM, LRM as blanks)
- Highlight control chars that are also blanks as blank
- Highlight CGJ as blank
- Highlight NEL as blank (only in Unicode)
- Add Unicode version to
unibits --version
command
- Add mongolian free variation selectors and combining grapheme joiner to interesting codepoints list
- Green highlighting of "marks" in Unicode
- Always use dotted circle for non-spacing marks
- Always prepend enclosing marks with a space
- Update characteristics gem (includes new blanks and UTF-8 dialects with japanese emojis)
- Proper UTF-32 validness / invalid codepoint highlighting, see https://bugs.ruby-lang.org/issues/13292
- Support more encoding: IBMX, CP85X, macX, TIS-620/Windows-874, and KOI8-X
- Highlight non-control formatting characters in pink
- Improve
unibits --help
command
- Support more encodings: ISO-8859-X and Windows-125X
- Add three HANGUL characters (U+115F, U+1160, U+3164) to list of possible white spaces
- Move character handling to separate gem. It is called characteristics.
- Highlight control chars in blue and blanks in light blue
- Handle encodings that are not convertible to UTF-8
- Add variation selectors 17-256 (U+E0100 - U+E01EF)
- Add U+1D159 (MUSICAL SYMBOL NULL NOTEHEAD) to list of possible white spaces
- Bump unicode-categories dependency for more reliable unassigned codepoint detection
- Fix bug that inserted wrong bytes
- Do not display (but highlight) unassigned codepoints
- Support (and highlight) invalid encodings \o/
- Improve character symbolification
- Fix that the Kernel method would not take keyword arguments
- New option for setting a custom output width to use
- New option for activating wide ambiguous characters
- Initial release