Fix BytePos -> CharPos calculations #6574

jridgewell · 2022-12-03T22:42:53Z

Description:

This fixes the BytePos -> CharPos calculation necessary for source maps. There were a few issues in the old code:

UTF-8 maps 1-3 bytes into 1 UTF-16 char, but 4 bytes into 2 UTF-16 chars
The starting offset was not recorded when we reach the end of the multibyte_chars iteration
The mappings can be unordered, meaning we need to restart UTF-16 offset calculation

BREAKING CHANGE:

Related issue (if exists):

There were a few issues in the old code: 1. UTF-8 maps 1-3 bytes into 1 UTF-16 char, but 4 bytes into 2 UTF-16 chars 2. The starting offset was not recorded when we end the `multibyte_chars` iteration 3. The `mappings` can be unordered, meaning we need to restart UTF-16 offset calculation

kdy1

Thank you so much!

swc-bump:

swc_common

jridgewell · 2022-12-04T04:01:09Z

Sorry, I just pushed a reverse conversion for when the mapping isn't ordered. This hopefully makes it a bit faster.

crates/swc_common/src/source_map.rs

kdy1

Thank you!

jridgewell · 2022-12-04T04:16:49Z

Ah, one of my recent changes is causing a failure. Working to fix it now.

jridgewell · 2022-12-04T04:22:24Z

Should be good now.

IWANABETHATGUY · 2022-12-04T04:39:10Z

It seems that this pr regressed performance again

PR

Main

IWANABETHATGUY · 2022-12-04T04:44:18Z

large.js https://gist.github.com/IWANABETHATGUY/8cbfa7246af46a26cd8480ce7e9a5f47

jridgewell · 2022-12-04T05:50:25Z

Can you explain how you're doing the benchmark? I'm actually seeing an ~2% improvement with that file running time ./target/release/swc compile --source-maps true large.js > /dev/null

kdy1 · 2022-12-04T05:52:14Z

Let's fix performance regression with another PR..

IWANABETHATGUY · 2022-12-04T05:59:28Z

Sorry, you are right, I just using the wrong branch. No performance regression.

jridgewell force-pushed the multibyte-fix branch from a4b8c38 to ef0c185 Compare December 3, 2022 22:46

kdy1 previously approved these changes Dec 4, 2022

View reviewed changes

Implement reverse conversion

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified
Learn about vigilant mode

8ff6494

jridgewell dismissed kdy1’s stale review via 8ff6494 December 4, 2022 04:00

kdy1 reviewed Dec 4, 2022

View reviewed changes

crates/swc_common/src/source_map.rs Outdated Show resolved Hide resolved

Remove debug statement

10bc02c

kdy1 previously approved these changes Dec 4, 2022

View reviewed changes

kdy1 enabled auto-merge (squash) December 4, 2022 04:03

kdy1 added this to the Planned milestone Dec 4, 2022

Fixes

8944cd3

jridgewell dismissed kdy1’s stale review via 8944cd3 December 4, 2022 04:22

kdy1 approved these changes Dec 4, 2022

View reviewed changes

jridgewell mentioned this pull request Dec 4, 2022

SWC generate wrong sourcemap when have mutibyte character after 1.3.20 #6552

Closed

kdy1 disabled auto-merge December 4, 2022 04:40

kdy1 enabled auto-merge (squash) December 4, 2022 05:50

kdy1 disabled auto-merge December 4, 2022 05:50

kdy1 merged commit a203fdb into swc-project:main Dec 4, 2022

jridgewell deleted the multibyte-fix branch December 4, 2022 05:52

kdy1 modified the milestones: Planned, v1.3.22 Dec 9, 2022

kdy1 added this to the v1.3.22 milestone Dec 9, 2022

swc-project locked as resolved and limited conversation to collaborators Jan 8, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GitHub Sponsors

Fix BytePos -> CharPos calculations #6574

Fix BytePos -> CharPos calculations #6574

jridgewell commented Dec 3, 2022 •

edited

Loading

kdy1 left a comment

jridgewell commented Dec 4, 2022

kdy1 left a comment

jridgewell commented Dec 4, 2022

jridgewell commented Dec 4, 2022

IWANABETHATGUY commented Dec 4, 2022

IWANABETHATGUY commented Dec 4, 2022

jridgewell commented Dec 4, 2022

kdy1 commented Dec 4, 2022

IWANABETHATGUY commented Dec 4, 2022

Fix BytePos -> CharPos calculations #6574

Fix BytePos -> CharPos calculations #6574

Conversation

jridgewell commented Dec 3, 2022 • edited Loading

kdy1 left a comment

Choose a reason for hiding this comment

jridgewell commented Dec 4, 2022

kdy1 left a comment

Choose a reason for hiding this comment

jridgewell commented Dec 4, 2022

jridgewell commented Dec 4, 2022

IWANABETHATGUY commented Dec 4, 2022

PR

Main

IWANABETHATGUY commented Dec 4, 2022

jridgewell commented Dec 4, 2022

kdy1 commented Dec 4, 2022

IWANABETHATGUY commented Dec 4, 2022

jridgewell commented Dec 3, 2022 •

edited

Loading