Define which points in JS should have a source map mapping #38

littledan · 2023-04-25T22:49:06Z

Different tools here make different decisions, sometimes leading to lossiness when multiple levels in a build pipeline apply. If we make a definition here, it would lend itself to better testing.

robpalme · 2023-04-26T08:49:31Z

Here's the explainer for this issue.

Background

The most common API for transitively combining a chain of two sourcemaps to produce a single sourcemap is applySourceMap().

The Problem

Unambiguously mapping a coordinate in generated code all the way back to its origin depends on that point being present in each sourcemap in the chain. applySourceMap results in a map that loses fidelity if any point is missing from any map in the chain.

Imagine a low-resolution identity sourcemap A that only has mappings for the start and end of the range, and a higher-resolution sourcemap B that includes an intermediate point.

 ?    ?
 +---------+
A|1        |
 +----+----+
B|1   |2   |
 +----+----+
      ^
      |

Currently applySourceMap will map the intermediate point below B to the first question mark.
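
The loss can be reproduced with a deliberately simplified model. The sketch below assumes single-line maps represented as arrays of column pairs, and a composition step that resolves each point through the nearest preceding mapping in the earlier map. The names `Mapping`, `lookup`, and `compose` are illustrative only and are not the API of any sourcemap library.

```typescript
// Simplified single-line model (an assumption for illustration): a column in
// the generated text points at a column in the text it was produced from.
interface Mapping {
  generatedColumn: number;
  originalColumn: number;
}

// Return the mapping with the greatest generatedColumn that is <= column.
function lookup(map: Mapping[], column: number): Mapping | undefined {
  let best: Mapping | undefined;
  for (const m of map) {
    if (
      m.generatedColumn <= column &&
      (best === undefined || m.generatedColumn > best.generatedColumn)
    ) {
      best = m;
    }
  }
  return best;
}

// Compose two maps: `later` maps final output to intermediate code,
// `earlier` maps intermediate code to the original source.
function compose(later: Mapping[], earlier: Mapping[]): Mapping[] {
  const result: Mapping[] = [];
  for (const m of later) {
    const hit = lookup(earlier, m.originalColumn);
    if (hit !== undefined) {
      result.push({
        generatedColumn: m.generatedColumn,
        originalColumn: hit.originalColumn,
      });
    }
  }
  return result;
}

// A: low-resolution identity map with points at the start and end only.
const mapA: Mapping[] = [
  { generatedColumn: 0, originalColumn: 0 },
  { generatedColumn: 10, originalColumn: 10 },
];

// B: higher-resolution identity map with an intermediate point at column 5.
const mapB: Mapping[] = [
  { generatedColumn: 0, originalColumn: 0 },
  { generatedColumn: 5, originalColumn: 5 },
  { generatedColumn: 10, originalColumn: 10 },
];

const composed = compose(mapB, mapA);
// composed[1] is { generatedColumn: 5, originalColumn: 0 }
```

Both inputs are identity maps, yet the composed map sends column 5 to column 0, because A has no point at column 5 to anchor it.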

Credit to @mariusGundersen for explaining this

Workarounds

For our internal toolchain we mitigate this by ensuring each source transform produces an overly high-resolution sourcemap (close to per-token mappings) to maximize the number of mapping points that can be accurately followed all the way through. The final result is then validated for accuracy.
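
One way to picture the validation step is a check that every point in a later map lands exactly on a point the earlier map also defines. The following is a minimal sketch under a simplified single-line model (arrays of column pairs). `Mapping` and `findUnanchoredPoints` are hypothetical names, and this is not the actual validator used in the toolchain described above.

```typescript
// Simplified single-line model, assumed for illustration.
interface Mapping {
  generatedColumn: number;
  originalColumn: number;
}

// Return the points of `later` whose target column has no exact
// counterpart in `earlier`. These are the points that composition
// would have to approximate.
function findUnanchoredPoints(later: Mapping[], earlier: Mapping[]): Mapping[] {
  return later.filter(
    (m) => !earlier.some((e) => e.generatedColumn === m.originalColumn),
  );
}

const lowResolution: Mapping[] = [
  { generatedColumn: 0, originalColumn: 0 },
  { generatedColumn: 10, originalColumn: 10 },
];

const nearPerToken: Mapping[] = [
  { generatedColumn: 0, originalColumn: 0 },
  { generatedColumn: 5, originalColumn: 5 },
  { generatedColumn: 10, originalColumn: 10 },
];

const laterMap: Mapping[] = [
  { generatedColumn: 0, originalColumn: 0 },
  { generatedColumn: 5, originalColumn: 5 },
  { generatedColumn: 10, originalColumn: 10 },
];

// One unanchored point (column 5) against the low-resolution map.
const lossy = findUnanchoredPoints(laterMap, lowResolution);
// No unanchored points against the higher-resolution map.
const exact = findUnanchoredPoints(laterMap, nearPerToken);
```

An empty result means every point can be followed through the earlier map without falling back to a neighbouring mapping.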

Potential Solution

If there were a specification of the important boundaries in the JS and TS grammars for which mapping points ought to be generated, per-transform sourcemap producers could comply with this to guarantee the composability of those sourcemaps whilst preserving accuracy and minimizing redundant mappings.

Additionally, we could consider specifying a new canonical sourcemap composition algorithm if we judge applySourceMap to be non-optimal.

jridgewell · 2023-04-26T16:53:58Z

Copying a comment I left on Babel last year:

The easiest sourcemaps to generate just mark the beginnings of identifiers, and completely ignore any syntax. The debugging experience will be better with } and now the ( marked. But really, nothing else is strictly necessary. So there's no guarantee that the whitespace/syntax/X directly following an identifier name will be marked.
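
To illustrate the kind of map described here, the sketch below scans a single line of output and records a mapping point at the start of each identifier-like word, ignoring all punctuation. It assumes an identity transform and uses a regular expression in place of a real tokenizer, so keywords count as identifiers too. `identifierStartColumns` is a hypothetical helper, not Babel's implementation.

```typescript
// Collect the columns at which identifier-like words begin.
// A regular expression stands in for a real tokenizer (an assumption made
// to keep the sketch short); it does not understand strings, comments,
// or numeric literals.
function identifierStartColumns(line: string): number[] {
  const identifier = /[A-Za-z_$][A-Za-z0-9_$]*/g;
  const columns: number[] = [];
  let match: RegExpExecArray | null;
  while ((match = identifier.exec(line)) !== null) {
    columns.push(match.index);
  }
  return columns;
}

const columns = identifierStartColumns("foo(bar) { baz }");
// columns is [0, 4, 11]
```

Neither the `(` at column 3 nor the `}` at column 15 receives a point, so a later map that targets one of those columns has nothing exact to resolve against.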

sjrd · 2023-04-26T23:07:51Z

To provide an opposite experience, Scala.js emits precise source maps. Every AST node from the initial parsing maintains a position. Positions are maintained through transformations, and are eventually attached to every node of the JS AST. Every node gives its position to the range of JS characters that it produces. So an addition like a + b maintains three positions: a, + and b.
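
The approach can be sketched as an emitter that records a mapping every time a node writes characters. This is a minimal illustration, not Scala.js code: the `Expr` shape, the `Emitter` class, and the choice to give the binary node's own position to the operator characters are all assumptions made for the example.

```typescript
interface SourcePos {
  line: number;
  column: number;
}

// Every node carries the position it had in the original source.
type Expr =
  | { kind: "ident"; name: string; pos: SourcePos }
  | { kind: "binary"; operator: string; left: Expr; right: Expr; pos: SourcePos };

interface EmittedMapping {
  generatedColumn: number;
  original: SourcePos;
}

class Emitter {
  code = "";
  mappings: EmittedMapping[] = [];

  // Write text and attribute it to the given original position.
  private write(text: string, original: SourcePos): void {
    this.mappings.push({ generatedColumn: this.code.length, original });
    this.code += text;
  }

  emit(expr: Expr): void {
    if (expr.kind === "ident") {
      this.write(expr.name, expr.pos);
    } else {
      this.emit(expr.left);
      this.code += " "; // separators get no point of their own in this sketch
      this.write(expr.operator, expr.pos);
      this.code += " ";
      this.emit(expr.right);
    }
  }
}

// `a + b` as it might have appeared on line 3 of the original source.
const addition: Expr = {
  kind: "binary",
  operator: "+",
  pos: { line: 3, column: 10 },
  left: { kind: "ident", name: "a", pos: { line: 3, column: 8 } },
  right: { kind: "ident", name: "b", pos: { line: 3, column: 12 } },
};

const emitter = new Emitter();
emitter.emit(addition);
// emitter.code is "a + b"; mappings sit at generated columns 0, 2 and 4
```

Because positions travel with the nodes, producing a coarser map would mean adding a filtering step rather than removing work.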

Emitting less accurate source maps would, ironically, require more work for us, I think.

mitsuhiko · 2023-04-28T20:25:15Z

I would already love to just have a hint in the source map itself of what I can expect mapping-wise. That would at least give tools the ability to better tell the user what is going on.

mariusGundersen · 2023-04-28T20:42:07Z

What are the downsides to having positions for each token? It increases the size of the sourcemap and maybe the processing time? Not all tokens need to be individually represented in the sourcemap, I guess.

mariusGundersen · 2023-04-28T20:46:36Z

BTW, this is not just a problem in JS, it's also an issue when compiling Less into CSS. Because of how Less and PostCSS produce sourcemaps, nested declarations end up being mapped to the outer declaration. PostCSS treats the entire rule as one position, while Less maps each part of the rule to different positions. For a nested rule, the first token maps to a different line than the second token, but PostCSS uses the location of the first token. That means that most nested declarations point to line 1, column 1 of a Less file.

robpalme · 2023-04-28T21:00:57Z

What are the downsides to having positions for each token?

I should clarify. Whilst per-token positions might be inefficient, the real goal is to simply agree on a definition of the required mapping points to ensure end-to-end accuracy. If we agree that per-token is the way to go, that would achieve the goal.

jaro-sevcik · 2023-07-05T12:37:31Z

Webpack also offers the possibility of "cheap" source maps (via the devtool config option). Those only include line mappings, not token mappings. As far as I know, this is the default in create-react-app. In Chrome DevTools, we had bugs related to this config (e.g., crbug.com/1422883).

As a result, we should consider specifying line-by-line source maps, and describing how tools should detect and handle those.
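
As a starting point for the detection side, a tool could apply a heuristic over the decoded mappings: if no generated line carries more than one segment, the map is probably line-level. The sketch below assumes the `mappings` field has already been decoded into an array of lines, each an array of segments. `DecodedSegment` and `looksLineOnly` are hypothetical names, and the heuristic itself is an assumption rather than anything the specification or webpack defines.

```typescript
// One decoded segment of a generated line; only the generated column is
// needed for this heuristic.
interface DecodedSegment {
  generatedColumn: number;
}

// Heuristic: treat a map as line-level when no generated line has more
// than one segment. A very small file with a single mapping per line
// would be a false positive.
function looksLineOnly(lines: DecodedSegment[][]): boolean {
  let mappedLines = 0;
  for (const segments of lines) {
    if (segments.length > 1) {
      return false;
    }
    if (segments.length === 1) {
      mappedLines++;
    }
  }
  // A map with no mappings at all gives no evidence either way.
  return mappedLines > 0;
}

const cheapStyle: DecodedSegment[][] = [
  [{ generatedColumn: 0 }],
  [{ generatedColumn: 0 }],
  [],
];

const tokenStyle: DecodedSegment[][] = [
  [{ generatedColumn: 0 }, { generatedColumn: 4 }, { generatedColumn: 9 }],
  [{ generatedColumn: 2 }],
];

const cheapDetected = looksLineOnly(cheapStyle); // true
const tokenDetected = looksLineOnly(tokenStyle); // false
```

A debugger that detects such a map could, for example, fall back to whole-line breakpoints instead of offering column-level ones.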

jkup · 2023-10-11T15:27:26Z

I know this is a correctness issue in the sense that it's unspecified in the spec, but I'm curious if we should re-categorize this as a feature as it will probably take a good amount of work to implement?

jkup · 2024-01-10T17:31:48Z

Some options mentioned in an earlier call:

Every line
Every token
Every breakable position

littledan added the Workstream: Correctness label Apr 25, 2023
