[WIP] Source cache refactor #1827

Draft · wants to merge 20 commits into master
Conversation

@vkleen (Member) commented Feb 23, 2024

No description provided.

@yannham (Member) left a comment
I didn't look into all the details. I think I like the general idea of making the cache... just a cache. Although the previous one had nice encapsulation, it also made extracting some data from it (source ids and the like) quite painful, forcing code to couple to the concrete cache instead of, for example, ImportResolver.

One aspect that really needs to be considered, though, is error tolerance. I see it disappeared from the new cache. We had several series of annoying bugs where some operation should have been error-tolerant but wasn't, or the converse, leading to errors being reported too late in the pipeline. We thus decided to set in stone at the cache level whether we are in error-tolerant mode (basically, the LSP) or not. Doing that on a per-function basis was just too error-prone. I think we should have the same kind of guarantee in the new cache (or it can live somewhere other than the cache, but at least centralized in one object, so that each individual parse doesn't have to think about error tolerance). I honestly don't know these days how the LSP uses the cache. Maybe it's also possible to make the cache strict by default, with specific methods for error-tolerant parsing, and use those only in the LSP. Although I think that's exactly what we used to do, and it was still not entirely reliable.
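One way to get that guarantee is to fix the mode once, at cache construction time, so no individual call site can pick the wrong one. A minimal sketch — all names here (`ErrorTolerance`, `ParseOutcome`) are hypothetical, and "parsing" is faked by counting `<err>` markers:

```rust
use std::collections::HashMap as _; // no external deps needed

// Hypothetical: the tolerance mode is part of the cache itself.
#[derive(Clone, Copy, Debug, PartialEq)]
enum ErrorTolerance {
    Strict,   // the interpreter pipeline: fail fast
    Tolerant, // the LSP: record errors and keep going
}

#[derive(Debug, PartialEq)]
enum ParseOutcome {
    Ok,
    OkWithErrors(usize), // tolerant mode: an AST plus recorded errors
}

struct Cache {
    tolerance: ErrorTolerance,
    // ... sources, entries, etc.
}

impl Cache {
    fn new(tolerance: ErrorTolerance) -> Self {
        Cache { tolerance }
    }

    // Every parse goes through here, so the mode is decided exactly once,
    // when the cache is constructed, instead of per call site.
    fn parse(&self, source: &str) -> Result<ParseOutcome, String> {
        let errors = source.matches("<err>").count();
        match (self.tolerance, errors) {
            (_, 0) => Ok(ParseOutcome::Ok),
            (ErrorTolerance::Tolerant, n) => Ok(ParseOutcome::OkWithErrors(n)),
            (ErrorTolerance::Strict, _) => Err("parse error".to_owned()),
        }
    }
}
```

The point of the sketch is only that "strict vs. tolerant" is a property of the object, not an argument to each function.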

#[cfg(feature = "nix-experimental")]
Nix,
Raw,
}
@yannham (Member) commented Mar 1, 2024

I wonder if we should mix that with this module. In particular because it's been in the back of my head to move the (pure) Nickel parser in a separate crate, so that

  1. It can be used by other crates to make tooling around Nickel without depending on the whole of nickel-lang-core
  2. It's a separate compilation unit that we don't need to even look at if we didn't touch the grammar

I guess it's still doable, but if possible we should be wary of coupling the "parse pure Nickel code" part too tightly with the "parse generic Nickel inputs, including JSON" part.

@vkleen (Member, Author) commented Mar 5, 2024
Those are good points. I'm thinking of renaming the prepare.rs module to a driver module, where this functionality might fit better. That should also give us a better place to track error tolerance.
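For illustration, the decoupling could look roughly like this. Everything below is a hypothetical sketch (the `drive` function and the toy "parser" are not from this PR); the idea is just that format dispatch lives in the driver, while the pure Nickel parser stays self-contained and could move to its own crate:

```rust
#[derive(Clone, Copy)]
enum InputFormat {
    Nickel,
    Raw,
}

// Stand-in for the AST both paths produce.
#[derive(Debug, PartialEq)]
enum Term {
    Num(f64),
    Str(String),
}

// The pure parser: knows nothing about JSON, TOML, Nix, or raw inputs,
// so it could live in a separate crate. (Here it only "parses" numbers.)
fn parse_nickel(source: &str) -> Result<Term, String> {
    source
        .trim()
        .parse::<f64>()
        .map(Term::Num)
        .map_err(|e| e.to_string())
}

// The driver is the single place that knows about every input format.
fn drive(format: InputFormat, source: &str) -> Result<Term, String> {
    match format {
        InputFormat::Nickel => parse_nickel(source),
        // Raw inputs bypass the parser and become a string literal.
        InputFormat::Raw => Ok(Term::Str(source.to_owned())),
    }
}
```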

-    file_id: FileId,
-) -> Result<CacheOp<()>, Vec<Diagnostic<FileId>>> {
+    file_id: CacheKey,
+) -> Result<(), Vec<Diagnostic<CacheKey>>> {
Member

I vaguely remember CacheOp<()> being a useful distinction from () to avoid cycles/loops when typechecking or importing circular stuff. But I'm not sure.

Member Author

Grepping through the source code, CacheOp is by now used in a single place in the LSP (apart from the cache itself), and that's in an extension function on the cache. So I think it's safe to drop it from the external interface of the cache.
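For readers without the code at hand: as I understand it (treat the exact shape as an assumption, the toy `square_cached` below is made up), `CacheOp<T>` just tags a result with whether the work was freshly performed or was already in the cache, which is the distinction the earlier comment recalls being useful against cycles:

```rust
use std::collections::HashMap;

// Assumed shape: a result plus a "did we actually do the work?" flag,
// letting callers detect (and break) redundant re-processing.
#[derive(Debug, PartialEq)]
enum CacheOp<T> {
    Done(T),   // the operation ran just now
    Cached(T), // the result was already in the cache
}

// A toy memoized computation using it.
fn square_cached(cache: &mut HashMap<u32, u32>, n: u32) -> CacheOp<u32> {
    if let Some(&v) = cache.get(&n) {
        CacheOp::Cached(v)
    } else {
        let v = n * n;
        cache.insert(n, v);
        CacheOp::Done(v)
    }
}
```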

}
_ => Ok(cache
.get_parse_errors(cache_key)
.expect("Any parsed entry should have a corresponding entry in parse_errors")
@yannham (Member) commented Mar 1, 2024

Should we encode that with an enum, instead of a separate EntryState in a struct?

Member Author

I'm not sure why I wanted the parse errors to be stored separately from the cache entries, but that seems silly to me now. I've changed the code to store parse errors directly in the cache entry for parsed entries. The only wrinkle is that it would be annoying to keep parse errors around after the entry is advanced from "parsed" to "typechecked". But would there be any reason to proceed to typechecking for a source file that doesn't parse correctly, even in the LSP?
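Concretely, the enum encoding suggested above could carry the errors in the state itself. A sketch with stand-in names (none of this is the PR's actual code); keeping the errors in the typechecked state as well would sidestep the "wrinkle" of losing them when the entry advances:

```rust
// Stand-ins for the real AST and error types.
type Ast = String;
type ParseError = String;

#[derive(Debug)]
enum EntryState {
    Parsed { ast: Ast, errors: Vec<ParseError> },
    // Carrying the errors forward keeps them reportable (e.g. by the
    // LSP) even after the entry advances past parsing.
    Typechecked { ast: Ast, errors: Vec<ParseError> },
}

impl EntryState {
    // Parse errors are reachable from any state, no separate map needed.
    fn parse_errors(&self) -> &[ParseError] {
        match self {
            EntryState::Parsed { errors, .. }
            | EntryState::Typechecked { errors, .. } => errors,
        }
    }

    // Advancing the state moves the errors along instead of dropping them.
    fn typecheck(self) -> EntryState {
        match self {
            EntryState::Parsed { ast, errors } => {
                EntryState::Typechecked { ast, errors }
            }
            done => done,
        }
    }
}
```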

@yannham (Member) commented Mar 7, 2024

Ah, I believe we do proceed with typechecking on a source file that doesn't parse correctly, if only because typechecking is actually the driver for the whole code analysis (goto(ref|def), hover, etc.) and we want to provide analysis for as much code as possible. In fact, the typechecker explicitly ignores parse errors in the AST for that very reason:

Term::ParseError(_) => Ok(()),
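As a toy illustration of that arm (the real typechecker is far richer than this; only the `ParseError` case mirrors the quoted line), the walk stays total because parse-error nodes are treated as holes, so analysis still covers every well-formed subterm around them:

```rust
// Minimal AST for the sketch.
enum Term {
    Num(f64),
    App(Box<Term>, Box<Term>),
    ParseError(String),
}

// The walk never aborts on a parse-error node.
fn walk(term: &Term) -> Result<(), String> {
    match term {
        Term::Num(_) => Ok(()),
        Term::App(f, arg) => {
            walk(f)?;
            walk(arg)
        }
        // Already reported at parse time; a hole as far as the walk
        // is concerned, mirroring `Term::ParseError(_) => Ok(())`.
        Term::ParseError(_) => Ok(()),
    }
}
```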

@github-actions github-actions bot temporarily deployed to pull request March 5, 2024 12:38 Inactive