You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi! My issue / question revolves mainly around the CC-BY-NC-SA-4.0 license, though I guess it applies generally. It's about the (un?)importance of non-ASCII characters in international licenses.
In particular my issue focuses on the spdx CC that I seek to use, which seems to have a chaotic mixing of “ (U+201C) (line 13), ” (U+201D) (line 13), and " (U+0022) (line 45), as well as ’ (U+2019) (line 32) and ' (U+0027) (line 63), and some mixed dashes vs -. So too the creative commons version seems to use both double-quotes types, but in different places, using one set of non-ASCII double-quotes towards the end.
Question / remarks
My question is therefore "If the quotes differ between license vendors, does it even matter? Can I carefully swap all the non-ASCII codepoints for their ASCII version, and does it have a legal effect?" I am not a lawyer, and I am not seeking strict legal advice, I'm just wondering if I'm being an extreme pedant, or if the mix of quoting styles is there for a reason. It seems unchanged from the second ever commit to this repository (if it's not broken don't fix it). I am naively suspicious that slight punctuation differences could have consequences, therefore I am sticking with the spdx version for now.
I hope this doesn't come across as too rudely English-centric either, yearning for simple ASCII-only files. I can appreciate internationali(s|z)ed versions having say guillemets, and hey, I'm British so I have a non-ASCII £ on my keyboard! It's just about simplicity, Unix / KISS philosophy, important files being uncomplicated, with all glyphs reproducible with a US keyboard, and displayable on a limited tty with a 7/8-bit character set.
Just an aside, it is interesting to learn about Arch's approach to inclusion of license files since at some point the Fedora Project will need to reassess how it handles that. 🤓
Regarding your question about different styles of quote, that is covered in B.6.4 Guideline: Quotes
Any variation of quotations (single, double, curly, etc.) should be considered equivalent.
As @xsuchy already noted, we have a similar guideline for hyphens, dashes, etc.
Those differences are not legally significant in my opinion ;)
The reason that other punctuation needs to match is that the placement of a comma or period can change the meaning. We have recently used markup to indicate a specific comma is optional where we determined it did not/could not change the meaning. (how's that for pedantic!)
Hi! My issue / question revolves mainly around the CC-BY-NC-SA-4.0 license, though I guess it applies generally. It's about the (un?)importance of non-ASCII characters in international licenses.
Context
Up until a few weeks ago, Arch Linux's licenses package used to include two main folders for
/usr/share/licenses/
. One is thespdx
folder which persists today, and sources its files from the spdx/license-list-data repo. The other, namedcommon
, now removed since a commit to the Arch package on GitLab by @dvzrv, used to consist of some common licenses like the GPLs. What was nice about some files undercommon
is that they consisted of only ASCII codepoints. Indeed the version of GPL-3.0 that I use in all of my programming repositories is that version, and is the same (md5 checksum-wise) version still offered on GNU's website. For example on line four it uses(C)
instead of©
which spdx uses.In particular my issue focuses on the spdx CC that I seek to use, which seems to have a chaotic mixing of
“
(U+201C) (line 13),”
(U+201D) (line 13), and"
(U+0022) (line 45), as well as’
(U+2019) (line 32) and'
(U+0027) (line 63), and some mixed dashes vs-
. So too the creative commons version seems to use both double-quotes types, but in different places, using one set of non-ASCII double-quotes towards the end.Question / remarks
My question is therefore "If the quotes differ between license vendors, does it even matter? Can I carefully swap all the non-ASCII codepoints for their ASCII version, and does it have a legal effect?" I am not a lawyer, and I am not seeking strict legal advice, I'm just wondering if I'm being an extreme pedant, or if the mix of quoting styles is there for a reason. It seems unchanged from the second ever commit to this repository (if it's not broken don't fix it). I am naively suspicious that slight punctuation differences could have consequences, therefore I am sticking with the spdx version for now.
I hope this doesn't come across as too rudely English-centric either, yearning for simple ASCII-only files. I can appreciate internationali(s|z)ed versions having say guillemets, and hey, I'm British so I have a non-ASCII
£
on my keyboard! It's just about simplicity, Unix / KISS philosophy, important files being uncomplicated, with all glyphs reproducible with a US keyboard, and displayable on a limited tty with a 7/8-bit character set.See also #1893
(I am very aware that the maintenance of Arch Linux packages is orthogonal to spdx's work, it's just for context 😄 )
The text was updated successfully, but these errors were encountered: