Remove redundant file dialog string conversions #1665

tobil4sk · 2023-04-18T12:25:47Z

Currently, there are a lot of redundant string conversions when using the file dialog api. This was mentioned briefly in #1622.

Here is a rough overview of the current situation:

Linux/macOS:

C++: hxstring -> std::wstring -> std::string, passed into tinyfiledialogs utf-8 api, then std::string -> std::wstring -> hxstring
HL: hl_vstring -> std::wstring -> std::string, passed into tinyfiledialogs utf-8 api, then std::string -> std::wstring -> utf-8 bytes -> native utf-16 hashlink string (on Haxe side via String.fromUtf8())

Windows:

C++: hxstring -> std::wstring, passed into tinyfiledialogs utf-16 api, then std::wstring -> hxstring
HL: hl_vstring -> std::wstring, passed into tinyfiledialogs utf-16 api, then std::wstring -> utf-8 bytes -> native utf-16 hashlink string (on Haxe side via String.fromUtf8())

This cleans things up to avoid unnecessary conversions, so now it looks like this in most cases:

Linux/macOS:

C++: hxstring -> utf-8[0], passed into tinyfiledialogs utf-8 api, then utf-8 -> hxstring
HL: hl_vstring -> utf-8, passed into tinyfiledialogs utf-8 api, then utf-8 -> utf-16 hashlink string (on Haxe side via String.fromUtf8())

Windows:

C++: hxstring -> utf-16[0], passed into tinyfiledialogs utf-16 api, then utf-16 -> hxstring
HL: hl_vstring, passed into tinyfiledialogs utf-16 api, then utf-16 -> utf8 -> utf-16 hashlink string (on Haxe side via String.fromUtf8())

[0] hxcpp can use ASCII (which is compatible with utf-8) or utf-16 depending on the string, so these conversions are avoided in some cases.

We can improve Hashlink further (for Windows), by returning utf-16 strings from the apis, however, I'm not sure if this would be considered a breaking change (new lime.hdlls would be incompatible with old lime haxe files).

In #1622, we briefly discussed using the utf-8 tinyfiledialog functions on Windows to unify the api, however, I looked into this and on Windows these functions just convert back to utf-16, so there would still be extra conversions. So I think this is the cleanest solution.

I've tested on Linux and Windows with the code from #1622, and everything works well.

No need to iterate through the whole string to find the length, just to see if it is empty.

nixbody · 2023-04-18T12:36:56Z

Just a little correction here, HXCPP (with smart strings enabled) never uses UTF-8 for strings. It uses ASCII and when any character in a string doesn't fit within ASCII range then the entire string is converted to UTF-16. HXCPP, however, provides a function to convert it's strings to UTF-8.

tobil4sk · 2023-04-18T12:44:19Z

Yes, that's correct. However, an ASCII string is valid utf-8, so when using hxs_utf8 on an hxcpp string which is ASCII encoded, it just returns a pointer to the ASCII string without any conversion being necessary.

nixbody · 2023-04-18T12:57:35Z

I was wondering if you point that out :) I thought you might know, my apologies for stating the obvious.

tobil4sk · 2023-04-18T13:30:11Z

No worries, it's worth making the distinction :)

project/src/ui/FileDialog.cpp

tobil4sk · 2023-04-20T14:24:17Z

There are a few more places where wstrings are converted to utf8 still. These include most of the system functions, though these are windows only, where wchar is equal to char16_t so hl_to_utf8/alloc_wstring can be used there. For System::GetDirectory, it can be modified to avoid wstrings.

The bigger issue is Font::GetFamilyName() in text/Font.cpp. Currently it assumes that wchar is 16 bits, which is only the case on Windows. Even if it used char16_t, the alloc_hxs_utf16() function returns an HxString, and I can't seem to figure out to turn that into a value.

player-03 · 2024-02-02T21:20:25Z

project/include/ui/FileDialog.h

+			static const char* OpenFile (const char* title = 0, const char* filter = 0, const char* defaultPath = 0);
+			static const char* OpenFiles (const char* title = 0, const char* filter = 0, const char* defaultPath = 0);
+			static const char* SaveFile (const char* title = 0, const char* filter = 0, const char* defaultPath = 0);
+		#endif


On the one hand, I'm pretty sure this is what typedefs are for. It would be really nice not to have this kind of duplicate code.

On the other hand, if we aren't careful, new typedefs would just mean new instances where conditional compilation is needed. For instance, a type that mapped to wchar_t on Windows and char elsewhere doesn't help when Windows code needs to use char, so we'd still need #ifdef HX_WINDOWS.

tobil4sk added 2 commits April 18, 2023 12:46

Remove redundant file dialog string conversions

abf217f

Make empty string checks slightly faster

3d8560d

No need to iterate through the whole string to find the length, just to see if it is empty.

tobil4sk mentioned this pull request Apr 18, 2023

UNICODE fixes (clipboard, window title, file dialogs, paths, font glyphs, ...) #1472

Open

nixbody approved these changes Apr 18, 2023

View reviewed changes

project/src/ui/FileDialog.cpp Show resolved Hide resolved

nixbody approved these changes Apr 18, 2023

View reviewed changes

Clean up

4677f70

nixbody approved these changes Apr 20, 2023

View reviewed changes

player-03 reviewed Feb 2, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove redundant file dialog string conversions #1665

Remove redundant file dialog string conversions #1665

tobil4sk commented Apr 18, 2023 •

edited

nixbody commented Apr 18, 2023 •

edited

tobil4sk commented Apr 18, 2023

nixbody commented Apr 18, 2023 •

edited

tobil4sk commented Apr 18, 2023

tobil4sk commented Apr 20, 2023

player-03 Feb 2, 2024

Remove redundant file dialog string conversions #1665

Are you sure you want to change the base?

Remove redundant file dialog string conversions #1665

Conversation

tobil4sk commented Apr 18, 2023 • edited

nixbody commented Apr 18, 2023 • edited

tobil4sk commented Apr 18, 2023

nixbody commented Apr 18, 2023 • edited

tobil4sk commented Apr 18, 2023

tobil4sk commented Apr 20, 2023

player-03 Feb 2, 2024

Choose a reason for hiding this comment

tobil4sk commented Apr 18, 2023 •

edited

nixbody commented Apr 18, 2023 •

edited

nixbody commented Apr 18, 2023 •

edited