Allow to name existentials in pattern-matching #9584

garrigue · 2020-05-20T13:30:41Z

This attempts to implement a solution suggested by @yallop in #7074 to the problem of naming existentials in pattern-matching.
For the sake of simplicity(?) the syntax is:

type _ ty = Int : int ty
type dyn = Dyn : 'a ty * 'a -> dyn
let f = function Dyn (type a) (w, x : a ty * a) -> ignore (x : a)

I.e. the type is given as a tuple rather than on individual arguments.
Ellipses are allowed, but all existentials must be matched.

gasche · 2020-09-01T15:27:50Z

@trefis pinged me about this PR just now. I like the proposed syntax, much better than the one in #9579. There is broad consensus in #7074 that this syntax is acceptable (@yallop, @lpw25 and myself supported this syntax, @craigfe had solid arguments against the other proposal, etc.).

I remain of the opinion that it should ideally be possible to have something "more local" in the annotating-the-argument parts, for example C (type a) ((x : a), f) should be accepted just as well as C (type a b) (x, f : a * (a -> b)), but this could come later as a relaxation of the current feature. Even in that relaxed world, the syntax you just proposed would be accepted and make sense.

(I'm only semi-happy with the * which entertains the confusion between tuple types and multi-argument constructors, but it is consistent with the GADT declaration syntax so probably fine.)

gasche · 2020-09-01T15:29:00Z

I think it would be nice to move forward: we have a syntax we like, now we need to check that the implementation is reasonably consensual (maybe @trefis or @lpw25 could chime in with their perspective here), and seriously consider this for inclusion.

lpw25

LGTM. A few small things to improve, but the code looks correct to me. Well done everyone involved for finding something we can all agree to.

parsing/ast_iterator.ml

parsing/ast_mapper.ml

parsing/printast.ml

typing/ctype.ml

typing/typecore.ml

typing/untypeast.ml

parsing/parser.mly

parsing/parsetree.mli

garrigue · 2021-01-25T05:33:40Z

After many weeks and two rebases, I think this PR is now ready for inclusion.
I also added an entry for the manual.
@lpw25 already gave his approval, but since there are a number of changes, it would be good if at least one of you could give a fresh look.

gasche · 2021-01-25T10:18:07Z

typing/typedtree.mli

@@ -89,7 +89,7 @@ and 'k pattern_desc =
         *)
  | Tpat_construct :
      Longident.t loc * Types.constructor_description *
-        value general_pattern list ->
+        value general_pattern list * (Ident.t loc list * core_type) option ->
      value pattern_desc
        (** C                []
            C P              [P]


The comment here should be updated, just like in parsetree.mli (thanks for that change, by the way).

lpw25

Spotted a couple of small things to change and maybe a small bug. Good to go once they are addressed.

typing/tast_iterator.ml

typing/typecore.ml

typing/printpat.ml

gasche · 2021-01-26T09:25:45Z

Note: I think that the feature is very useful/important and that the design we ended up with is good. Thanks a lot @garrigue for this work.

alainfrisch · 2021-01-26T09:48:46Z

parsing/parsetree.mli

+        (* C                    None
+           C P                  Some (P, None)
+           C (P1, ..., Pn)      Some (Ppat_tuple [P1; ...; Pn], None)
+           C (type a b) (P : T) Some (P, Some ([a; b], T))


Sorry, I'm late in the discussion, but shouldn't we encode a regular Ppat_constraint to keep the core_type, i.e. just use:

Ppat_construct of loc * (pattern * string loc list) option

(with an empty list when there is no (type ...)) Requiring to have the inner pattern be a Ppat_constraint when the list is not empty is ok, imo.

I'm concerned that the current representation changes the way an existing constraint would be represented when one adds a type binding (C (P : T) --> C (type a) (P : T')).

Can the type-checked behave differently on C (type a) (P : T) and C (type a) ((P : T) : T)?

Btw, is it an error to bind type names which are not used in the pattern?

I thought about suggesting such nesting, but:

Currently the annotation is interpreted in the context of the type variables in a very specific way, so it makes sense to keep the two pieces together at the constructor site.

Even with the representation you propose, my understanding is that the type-checker would have to lookup the Tpat_constraint node and use its payload specifically (not go through the usual Tpat_constraint path in this case), so you would not get the same-behavior guarantees that you are asking about.

The handling of existential variables is very specific because, morally, their actual binding is computed by unifying the annotation with the constructor return type, and then making them rigid for the rest of the type-checking. This is not the behavior you would get if you checked the whole subpattern before rigidifying (I think that behavior would make sense and we discussed it earlier, but @garrigue says it is harder to implement), nor of course the behavior you would get by rigidifying before checking the annotation or using flexible variables.

Note: the previous discussion I was referring to is starting (approximatively) at #7074 (comment) .

Thanks @gasche for the pointer.

If the type-checking is not an extension of what happens for C (P : T), reusing the same concrete syntax can really be misleading, no?

Independently of what the type checker does, having different representation for the T in C (type a) (P : T) and in C (P : T) is confusing for people writing syntactic tools. I'm not shocked that, short of having a cleaner solution, the type-checker does something special with Ppat_constraint on top of the pattern when existential type names are bound. We might be able to change the behavior of the type-checker later, while keeping the same syntax and Parsetree representation.

If the type-checking is not an extension of what happens for C (P : T), reusing the same concrete syntax can really be misleading, no?

I think that the two readings of C (P : T) (as a constraint, and as a return-type-annotated-constructor pattern with no existential variables bound) do have the same meaning. What I meant is that if we think of the type-checking of a pattern P as generating an inference constraint (or some other "result", for example in an non-inferred world taking an scrutinee type and returning types for the bound variables), then the result of type-checking of C (type a...) (P : T) cannot be computed in a modular lay, as an action on the result of the type-checking of (P : T).

Independently of what the type checker does, having different representation for the T in C (type a) (P : T) and in C (P : T) is confusing for people writing syntactic tools.

This is a good point.

(When I say "merge this one", I'm not actually sure that all pending questions have been resolved, this is for @lpw25 to confirm. So it may not be the unique blocker yet.)

I'm fine with either the current version or the version where the check is completely moved to the type-checker and the parser is happy to parse patterns with existential binders and no type constraints. I dislike any version where what the type can represent and what the parser can produce are not the same.

I just realize that it is wrong to say that

C (type a) (pats : ty)

is equivalent to

C (pats : ty)

when a occurs nowhere: if C has arity 2 or more, pats must syntactically be a list of patterns, and the latter would not be typable. So, sorry, this is really a new syntax.
@gasche indeed suggested that it would be nicer to allow type constraints on each argument, but I answered that it would be difficult to implement it cleanly.
I will still give it a try, just to see how it would work.

So, IIUC, for a pattern with 2 arguments:

C _ is allowed but C (type a) _ is rejected(?).

C (type a) ((x, y) : t) can be allowed (for a correct type) (?) but C ((x, y) : t) is always rejected.

Is that right? Are there other differences?

The situation with n-ary arguments was already quite confusing, I fear this is getting even worse...

Not quite correct.

C _ is allowed, but C (type a) _ is rejected by the type checker (not the parser) independently of the number of arguments. The (type a) notation inside patterns is restricted to existential variables, and requires to bind all of them, which you cannot do without a type annotation, hence the rejection.

Annotations are always allowed for multiary types (the internal parenthesis are optional).

type ('a, 'b) pair = Pair of 'a * 'b;; type t = int * bool;; let f = function Pair ((x,y) : t) -> x;; val f : (int, bool) pair -> int = <fun> type closure = Clos : ('a -> int) * 'a -> closure;; type 'a app = ('a -> int) * 'a;; let f = function Clos (type a) ((x,y) : a app) -> x y;; val f : closure -> int = <fun>

The result looks more regular to me.

lpw25 · 2021-01-26T10:42:49Z

You replied "done" to a few remarks, but I don't see the changes. A forgotten push maybe?

garrigue · 2021-01-26T12:36:28Z

You replied "done" to a few remarks, but I don't see the changes. A forgotten push maybe?

No, this should be there. They are marked as outdated, so the code should have changed.
If there is still a problem, please add a new comment.

garrigue · 2021-01-29T12:52:08Z

I have create another PR implementing(?) @alainfrisch 's idea (#10180).
Tell me which you prefer.
I don't like the syntax for type annotations, but the problem is indeed with ocaml syntax.

garrigue · 2021-01-31T06:34:03Z

I have merged most of the suggestions here and in #10180 and rebased.

the parsetree represention uses a Ppat_constraint, which is checked in Typecore
type annotations are only allowed on the argument seen as a tuple, but this is now also allowed for normal constructors too

garrigue · 2021-01-31T06:49:27Z

I'm getting a compilation error in the flambda backend, but I don't see how this can be related to this PR.
Will probably have to wait until this is fixed and rebase.

garrigue · 2021-01-31T08:28:15Z

Note also that neither of these approaches work for GADT constructors with an inline record.
The problem is not just that we are not allowed to name the record, but that there is no way to determine the order of the existentials it introduces.
A solution would be to allow typing a part of the arguments, delaying the GADT constructors they may contain, then check that the existentials are bound, and only then restart typing those eventual GADT constructors.
But this can probably wait for a future PR...

garrigue · 2021-01-31T09:33:05Z

I'm really confused about what is happening. Apparently the bug comes from this PR, but the error message is:

./boot/ocamlrun ./boot/ocamlc -g -nostdlib -I boot -use-prims runtime/primitives -strict-sequence -principal -absname -w +a-4-9-40-41-42-44-45-48-66 -warn-error A -bin-annot -safe-string -strict-formats -I utils -I parsing -I typing -I bytecomp -I file_formats -I lambda -I middle_end -I middle_end/closure -I middle_end/flambda -I middle_end/flambda/base_types -I asmcomp -I asmcomp/debug -I driver -I toplevel -c middle_end/flambda/simple_value_approx.ml -I middle_end/flambda
Fatal error: exception Invalid_argument("output_value: functional value")
make[2]: *** [middle_end/flambda/simple_value_approx.cmo] Error 2

Very confusing.
I hope it would disappear with a bootstrap, but actually this is even worse: before it occurred only when compiling with ocamlopt, but after bootstrap with ocamlc too.

garrigue · 2021-01-31T13:23:47Z

I could eventually fix the problem by being more conservative, but there is a mysterious interaction at work here.

garrigue · 2021-01-31T14:15:38Z

OK, I found the reason: I had forgotten to reinstate processing of the annotation core_type in Tast_mapper and Tast_iterator. It seems that this is used somewhere, and resulted probably in a malformed typedtree. I still do not understand how this can lead to outputting a functional value, but at least the original cause is now clear.

I'm still keeping the conservative approach for already valid annotations, as removing a Ppat_constraint node could change the behavior of build_as_type for instance.

lpw25 · 2021-02-01T09:07:41Z

I still do not understand how this can lead to outputting a functional value, but at least the original cause is now clear.

The typed AST mapper is used for iterating over all the environments in the AST and calling [Env.keep_only_summary] on them before outputing the cmt file.

garrigue · 2021-02-01T11:16:42Z

Thanks for the explanation, this now makes sense.

Is everybody happy with this last iteration of the PR?
If nobody protests, I'm going to merge it tomorrow, as syntax PRs are a pain to rebase...

Octachron · 2021-02-02T14:24:18Z

This is minor, but am I right to think that there is no support for naming existentially quantified row variable:

type t = X: [> `A | `B ] -> t
let f (X (type x) (v:??)) = ()

?

garrigue · 2021-02-04T06:02:13Z

Thanks to everybody in the discussion, and good annotating!

johnwhitington · 2021-02-08T15:15:34Z

The directory manual/manual was moved to manual/src by 2aeb55a a few days before your merge, but your merge has recreated manual/refman/exten_camltex.tex in manual/manual, orphaning it. The full manual build now fails on trunk (although make html seems to work, luckily).

cc @Octachron

Octachron · 2021-02-08T15:54:01Z

Thanks for the notice! This should be fixed in trunk.

@garrigue , to be safe, on the manual side, was there only a new paragraph in the GADT section?

garrigue · 2021-02-09T12:37:43Z

Yes, that’s all I added.
I thought I understood the change of the manual, but this PR went through too many rebases.

Octachron · 2021-02-09T14:44:02Z

Thanks for the confirmation. Indeed rebase across repertory moves tend to be painful, sorry for the noise!

…t-principality-and-gadts Update a testcase in principality-and-gadts.ml to reflect changes in ocaml#9584

Patterns without named existentials are not correctly constructed. Issue introduced in ocaml#9584

…mar13/fix-typing-gadts-test-principality-and-gadts Update a testcase in principality-and-gadts.ml to reflect changes in ocaml#9584

garrigue mentioned this pull request May 20, 2020

There is no easy way to give names to existential variables introduced by GADT pattern-matching #7074

Closed

garrigue force-pushed the name_existentials branch from 951cf8b to 6cb3500 Compare June 3, 2020 09:56

garrigue requested a review from gasche June 3, 2020 09:59

lpw25 approved these changes Nov 27, 2020

View reviewed changes

gasche reviewed Nov 27, 2020

View reviewed changes

parsing/parser.mly Outdated Show resolved Hide resolved

parsing/parsetree.mli Outdated Show resolved Hide resolved

garrigue force-pushed the name_existentials branch from 6cb3500 to 0374bc0 Compare December 14, 2020 10:56

garrigue force-pushed the name_existentials branch from 0374bc0 to df5df3e Compare January 5, 2021 02:17

garrigue force-pushed the name_existentials branch from 2718514 to 9f6cf10 Compare January 25, 2021 05:25

gasche reviewed Jan 25, 2021

View reviewed changes

lpw25 approved these changes Jan 25, 2021

View reviewed changes

alainfrisch reviewed Jan 26, 2021

View reviewed changes

lpw25 approved these changes Jan 27, 2021

View reviewed changes

garrigue mentioned this pull request Jan 29, 2021

Name existentials : new approach #10180

Closed

garrigue force-pushed the name_existentials branch from dac29a4 to 45d7cc1 Compare January 31, 2021 06:30

garrigue added 5 commits February 3, 2021 18:38

fix mysterious problem by using Tpat_constraint where possible

61ea150

test

b9aadff

add example of multiary annotation

e1058f2

typo

d2e50fa

fix Tast_mapper and Tast_iterator

aa12a07

garrigue force-pushed the name_existentials branch from 7dfd2b8 to aa12a07 Compare February 3, 2021 09:42

garrigue added the merge-me label Feb 3, 2021

garrigue merged commit 89aae98 into ocaml:trunk Feb 4, 2021

gasche mentioned this pull request Feb 19, 2021

Allow a let definition to introduce locally abstract types #10237

Open

314eter mentioned this pull request Feb 27, 2021

Support new features (OCaml 4.12+) tree-sitter/tree-sitter-ocaml#50

Merged

garrigue added a commit to garrigue/ocaml that referenced this pull request Mar 3, 2021

Allow to name existentials in pattern-matching (ocaml#9584)

aae3a3b

lpw25 mentioned this pull request Mar 4, 2021

[trunk] Bound types via GADT are not resolved when referred via a module #10271

Closed

shubhamkumar13 mentioned this pull request Mar 24, 2021

Update a testcase in principality-and-gadts.ml to reflect changes in #9584 ocaml-multicore/ocaml-multicore#510

Merged

smuenzel pushed a commit to smuenzel/ocaml that referenced this pull request Mar 30, 2021

Allow to name existentials in pattern-matching (ocaml#9584)

e0eb793

EduardoRFS pushed a commit to esy-ocaml/ocaml that referenced this pull request May 17, 2021

update testcase to reflect changes in ocaml#9584

6c9adac

EduardoRFS pushed a commit to esy-ocaml/ocaml that referenced this pull request May 17, 2021

Merge pull request ocaml#510 from shubhamkumar13/fix-typing-gadts-tes…

c5f7152

…t-principality-and-gadts Update a testcase in principality-and-gadts.ml to reflect changes in ocaml#9584

lpw25 mentioned this pull request May 27, 2021

Introduce local type variables in patterns #9579

Closed

voodoos added a commit to voodoos/ocaml that referenced this pull request Aug 31, 2021

Add a test illustrating wrong untyping

452a9e1

Patterns without named existentials are not correctly constructed. Issue introduced in ocaml#9584

voodoos mentioned this pull request Aug 31, 2021

Fix untypeast for patterns #10593

Merged

sadiqj pushed a commit to sadiqj/ocaml that referenced this pull request Jan 10, 2022

update testcase to reflect changes in ocaml#9584

33f6e28

trefis mentioned this pull request Feb 3, 2022

Readability improvements around instance_constructor #10987

Merged

This was referenced Mar 30, 2022

Ppxlib.0.26.0 compatibility aantron/bisect_ppx#400

Merged

Ppxlib.0.26.0 compatibility ocaml-gospel/gospel#173

Merged

gasche mentioned this pull request Nov 19, 2022

Allow existential types introduced in a constructor pattern to be bound without tuple type constraints patterns #11491

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow to name existentials in pattern-matching #9584

Allow to name existentials in pattern-matching #9584

garrigue commented May 20, 2020

gasche commented Sep 1, 2020

gasche commented Sep 1, 2020

lpw25 left a comment

garrigue commented Jan 25, 2021

gasche Jan 25, 2021

lpw25 left a comment

gasche commented Jan 26, 2021

alainfrisch Jan 26, 2021

gasche Jan 26, 2021

gasche Jan 26, 2021

alainfrisch Jan 26, 2021

gasche Jan 26, 2021

gasche Jan 27, 2021

lpw25 Jan 27, 2021

garrigue Jan 29, 2021

alainfrisch Jan 29, 2021

garrigue Feb 2, 2021

lpw25 commented Jan 26, 2021

garrigue commented Jan 26, 2021

garrigue commented Jan 29, 2021

garrigue commented Jan 31, 2021

garrigue commented Jan 31, 2021

garrigue commented Jan 31, 2021

garrigue commented Jan 31, 2021

garrigue commented Jan 31, 2021

garrigue commented Jan 31, 2021

lpw25 commented Feb 1, 2021

garrigue commented Feb 1, 2021

Octachron commented Feb 2, 2021 •

edited

garrigue commented Feb 4, 2021

johnwhitington commented Feb 8, 2021

Octachron commented Feb 8, 2021

garrigue commented Feb 9, 2021

Octachron commented Feb 9, 2021

Allow to name existentials in pattern-matching #9584

Allow to name existentials in pattern-matching #9584

Conversation

garrigue commented May 20, 2020

gasche commented Sep 1, 2020

gasche commented Sep 1, 2020

lpw25 left a comment

Choose a reason for hiding this comment

garrigue commented Jan 25, 2021

Choose a reason for hiding this comment

lpw25 left a comment

Choose a reason for hiding this comment

gasche commented Jan 26, 2021

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

lpw25 commented Jan 26, 2021

garrigue commented Jan 26, 2021

garrigue commented Jan 29, 2021

garrigue commented Jan 31, 2021

garrigue commented Jan 31, 2021

garrigue commented Jan 31, 2021

garrigue commented Jan 31, 2021

garrigue commented Jan 31, 2021

garrigue commented Jan 31, 2021

lpw25 commented Feb 1, 2021

garrigue commented Feb 1, 2021

Octachron commented Feb 2, 2021 • edited

garrigue commented Feb 4, 2021

johnwhitington commented Feb 8, 2021

Octachron commented Feb 8, 2021

garrigue commented Feb 9, 2021

Octachron commented Feb 9, 2021

Octachron commented Feb 2, 2021 •

edited