
Audit for missing fold implementations. #755

Open
jswrenn opened this issue Sep 19, 2023 · 27 comments

Comments

@jswrenn
Member

jswrenn commented Sep 19, 2023

Implementing fold for our iterators allows us to provide more performant internal iteration. We provide it in some cases, but not all. I'd like us to take three steps towards resolving this:

  1. Take an inventory of whether our adapters provide fold.
  2. Implement fold for missing adapters.
  3. (Potentially!) Implement a clippy lint to prevent regressions.

Happy to provide mentorship for any of these items.
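
As a rough illustration of what a fold specialization buys us, here is a made-up mapping adapter (not one of our actual adapters) whose fold delegates to the inner iterator's fold instead of calling next in a loop:

struct MapAdapter<I, F> {
    iter: I,
    f: F,
}

impl<I, F, B> Iterator for MapAdapter<I, F>
where
    I: Iterator,
    F: FnMut(I::Item) -> B,
{
    type Item = B;

    fn next(&mut self) -> Option<B> {
        self.iter.next().map(&mut self.f)
    }

    // The specialization: drive the inner iterator's fold, so the whole loop
    // runs as internal iteration instead of repeated next calls.
    fn fold<Acc, G>(self, init: Acc, mut g: G) -> Acc
    where
        G: FnMut(Acc, B) -> Acc,
    {
        let MapAdapter { iter, mut f } = self;
        iter.fold(init, move |acc, item| g(acc, f(item)))
    }
}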

@scottmcm
Contributor

Note that the vast majority of the time, you should be overriding fold, not for_each.

The default for_each just calls fold with a trivial () accumulator, which is easily optimized out (since in the default Rust calling convention, it's not even passed).

Thus the only reason to override for_each in addition to fold is if you can optimize something by knowing that the accumulation is stateless, but offhand I can't recall any case where I've seen that happen.
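
To make that concrete, here is a standalone model of the point (a sketch, not the exact std source): the default for_each is just fold with a () accumulator, so fold is the method worth specializing.

fn for_each_via_fold<I, F>(iter: I, mut f: F)
where
    I: Iterator,
    F: FnMut(I::Item),
{
    // The unit accumulator carries no state and costs nothing, so all the
    // real work happens inside fold.
    iter.fold((), move |(), item| f(item));
}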

@jswrenn
Member Author

jswrenn commented Sep 19, 2023

Whoops, yes, of course. I'll modify the issue accordingly.

@jswrenn jswrenn changed the title from "Audit for missing for_each implementations." to "Audit for missing fold implementations." Sep 19, 2023
@Philippe-Cholet
Member

Philippe-Cholet commented Sep 20, 2023

TODO list to track fold specializations (ordered by filepaths and line numbers)

Should be ignored:

And rfold:

I think we should mention this issue in related PRs to help maintain this big todo list.

@phimuemue
Member

Once we do this, we should test all of them via test_specializations.
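
Roughly, such a test in "tests/specializations.rs" takes the shape below. This is only a sketch: it assumes (as described later in this thread) that the existing test_specializations helper accepts a clonable iterator by reference, and the exact signature may differ.

quickcheck! {
    fn put_back_n(base: Vec<u8>, pushed: Vec<u8>) -> bool {
        // Build a PutBackN over `base`, then put the `pushed` items back.
        let mut it = itertools::put_back_n(base);
        for x in pushed {
            it.put_back(x);
        }
        test_specializations(&it);
        true
    }
}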

@Owen-CH-Leung
Contributor

Owen-CH-Leung commented Sep 22, 2023

I'm interested in contributing to this crate and I dug a bit into the source code. I think one potential candidate for overriding fold is Combinations.

https://github.com/rust-itertools/itertools/blob/master/src/combinations.rs#L100

What do you guys think?

I can create a PR in the coming few days to implement this.

@phimuemue
Member

@Philippe-Cholet I think you're the most knowledgeable there. Could you mentor/review this?

@Philippe-Cholet
Member

Philippe-Cholet commented Sep 22, 2023

I previously had some thoughts about splitting next into two methods to make an nth that would create only one vector instead of n+1 vectors. I thought earlier that it would be better done before fold, but no: fold needs to create each item anyway. I guess fold would be an interesting and total rewrite of next.
@Owen-CH-Leung Hi, sure you can help. And I'll be there to "mentor/review" it.

@Philippe-Cholet
Member

@jswrenn @phimuemue I've been editing my comment above a lot, and I would like feedback on it.
The list currently consists of all structs implementing Iterator, maybe we should ignore some of them?
Should we consider rfold specializations as well (small list above)?

@DavidCoy77

@jswrenn, I'd love to get started contributing to Rust, but I'd need a little help getting started since this would be my first ever GitHub contribution. You mention you're happy to provide mentorship in your initial post, so I'm eager to get to work if you could help me out with any of the items on the checklist. Thanks!

@Philippe-Cholet
Member

@DavidCoy77 I guess that specializing PutBackN::fold would be a good first pull request. You could look at #772 to find out what's expected.

@jswrenn
Member Author

jswrenn commented Oct 24, 2023

Hi @DavidCoy77, apologies for not replying to your emails — I've been traveling. I agree with @Philippe-Cholet that our fold implementations are the best place to start.

@DavidCoy77

Great! Thank you for the help @jswrenn and @Philippe-Cholet. I've spent some time looking at the WithPosition::fold in #772 that Philippe mentioned, and unfortunately I have a hard time seeing what would be asked of me for PutBackN::fold, so if you could offer additional guidance it would be greatly appreciated.

@Philippe-Cholet
Member

Philippe-Cholet commented Oct 25, 2023

EDIT: The text below is a bit outdated, as benchmarks and specializations are now written for most of our iterators. Writing the fold method (and running a benchmark command before/after) is enough to contribute. Example: #820.


@DavidCoy77 I guess there are a lot of new things here (I did not know criterion and quickcheck before coming here, and was not that familiar with fold either).

  • Are you familiar with Iterator::fold? With it, we consume the iterator and therefore consume all items. Specializing it can be faster than calling next repeatedly, which is what happens without specialization. You might not use fold directly, but by default a few iterator methods are built on fold, such as for_each, which I believe is better known/used(?).
  • Are you familiar with itertools::PutBackN? I know I was not, but I like the doctest example of its put_back method.
  • The core of the pull request would be to write PutBackN::fold.
  • In "benches/bench1.rs", we would write a benchmark function (using criterion) to measure the time improvement from specializing PutBackN::fold, as we don't want to slow things down. Soon enough, this should be automatically provided by a macro I'm writing in another PR.
  • In "tests/specializations.rs", we write a function inside a "quickcheck" macro (it's property-based testing: basically it tests our function against many inputs, and when it fails, it tries to find the smallest input that fails; super useful for finding edge cases we did not consider properly). In that file, test_specializations takes a clonable iterator and tests the "fold" specialization amongst other method specializations, which simply allows us to write less code.

You can focus on figuring out PutBackN::fold first.

pub struct PutBackN<I: Iterator> {
    top: Vec<I::Item>,
    iter: I,
}
// From the next method, we want elements of `top` in the reversed order then, only then, the elements from `iter` (normal order).
fn next(&mut self) -> Option<Self::Item> {
    self.top.pop().or_else(|| self.iter.next())
}

EDIT: WithPosition::fold was a huge rewrite of WithPosition::next, not the simplest one. #774 might be a better example, and it also shows that we should delegate to fold when possible. I think PutBackN::fold will be an even better example once it's done.
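
For orientation only, a minimal sketch of that direction (drain top in reverse, then delegate to the inner iterator's own fold); an assumption about the shape of the code, not the merged implementation:

fn fold<B, F>(self, init: B, mut f: F) -> B
where
    F: FnMut(B, Self::Item) -> B,
{
    // First the put-back items, in the same reversed order as `next`,
    // then hand the remaining work to the inner iterator's fold.
    let acc = self.top.into_iter().rev().fold(init, &mut f);
    self.iter.fold(acc, f)
}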

@Philippe-Cholet
Member

@jswrenn I'm curious but don't understand the exact intention behind this yet:

  1. (Potentially!) Implement a clippy lint to prevent regressions.

A lint that warns us about missing fold specializations, is that it?!

@jswrenn
Member Author

jswrenn commented Jan 25, 2024

Yes, the idea is to add lints to clippy itself to warn us about missing fold specializations. For any adaptors where we explicitly didn't want to specialize fold, we would need to write #[allow(clippy::missing_iterator_fold)]. This would make auditing for missing fold implementations as simple as running cargo clippy, and would provide an automated warning if we ever tried to add a new adaptor without a fold implementation.
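
For illustration only (SomeAdaptor is made up and the lint name is the hypothetical one proposed above), opting out for a single adaptor would look something like this:

struct SomeAdaptor<I> {
    iter: I,
}

// Hypothetical lint name; the attribute records that this adaptor's fold is
// deliberately left unspecialized.
#[allow(clippy::missing_iterator_fold)]
impl<I: Iterator> Iterator for SomeAdaptor<I> {
    type Item = I::Item;

    fn next(&mut self) -> Option<Self::Item> {
        self.iter.next()
    }
}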

@luander

luander commented Feb 1, 2024

I'm looking to give this issue a try.
I started with Combinations, and I wonder what folding on that one would look like, with LazyBuffer and such.
I can also use some mentoring/walkthrough of the crate if someone is willing to show me around.

@Philippe-Cholet
Member

Philippe-Cholet commented Feb 1, 2024

@luander I can happily mentor you on this.

  • Specializations are tested in "tests/specializations.rs", so testing it is a command line away (cargo test combinations is fast enough).
  • Specializations are benchmarked in "benches/specializations.rs", so benchmarking it would be a command line away too (cargo bench --bench specializations combinations/fold to ignore others). But building this particular benchmark is really slow (almost 2 minutes for me), so the simplest thing to do is to temporarily comment out unrelated benchmarks (without committing).
  • I'm not sure what it would look like exactly. I guess we might want to do some work like next does while the lazy buffer is not yet full, then some simpler work once we know for sure it is full.
    In the process, we might want to expand LazyBuffer's abilities.
    Specializing Combinations::fold would definitely not be as straightforward as for other iterators, but it's an interesting one.

EDIT: Alternatively, ZipEq::fold would be simpler.
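
For the ZipEq route, a rough sketch of a fold that delegates to the first inner iterator's fold; this assumes ZipEq stores its two inner iterators as fields a and b, and the real field names and panic message may differ:

fn fold<B, F>(self, init: B, mut f: F) -> B
where
    F: FnMut(B, Self::Item) -> B,
{
    let mut b = self.b;
    // Drive the first iterator's fold and pull from the second in lockstep,
    // panicking on a length mismatch just like next does.
    let acc = self.a.fold(init, |acc, x| {
        let y = b.next().expect("zip_eq: iterators have different lengths");
        f(acc, (x, y))
    });
    assert!(b.next().is_none(), "zip_eq: iterators have different lengths");
    acc
}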

@phimuemue
Member

@jswrenn I'm curious but don't understand the exact intention behind this yet:

  1. (Potentially!) Implement a clippy lint to prevent regressions.

A lint that warns us about missing fold specializations, is that it?!

If clippy does not accept the lint, we could maybe (cleverly ab)use missing_trait_methods.

@Philippe-Cholet
Member

Philippe-Cholet commented Feb 1, 2024

If clippy does not accept the lint, we could maybe (cleverly ab)use missing_trait_methods.

I think it'll be accepted (and for size_hint and [try_][r]fold too). But probably not for all iterator methods we could want to specialize, like count etc. (that might be too much to ask).
I ran cargo clippy -- -Wclippy::missing_trait_methods, got 6752 warnings.
I added the #[warn(...)] to Powerset only and got only 73 warnings.
If clippy could display JSON data instead of pretty text (or hack info from it), maybe we could leverage that though.

EDIT: I might keep my fork and have a branch implementing other lints if keeping it updated is simple enough.

@Philippe-Cholet
Member

Update on the lint idea: After discussion on Zulip, missing_trait_methods might accept a configuration (in clippy.toml) instead of a new lint, which would be too specific to a single trait method when it could be generalized. It would not be as flexible as we might want, but it would be decent.
There are alternatives though: fork clippy (but maintenance?), use clippy's sister project marker (great developer experience, but it's a work in progress; I can't identify an iterator implementation with it yet), or dylint (should be nice enough, but my first try was a failure).

@phimuemue
Member

If clippy does not accept the lint, we could maybe (cleverly ab)use missing_trait_methods.

I think it'll be accepted (and for size_hint and [try_][r]fold too). But probably not for all iterator methods we could want to specialize, like count etc. (that might be too much to ask). I ran cargo clippy -- -Wclippy::missing_trait_methods, got 6752 warnings. I added the #[warn(...)] to Powerset only and got only 73 warnings. If clippy could display JSON data instead of pretty text (or hack info from it), maybe we could leverage that though.

EDIT: I might keep my fork and have a branch implementing other lints if keeping it updated is simple enough.

My poor man's approach to find missing folds (not thoroughly tested, but one gets the idea):

cargo clippy --message-format json -- --warn clippy::missing_trait_methods | jq .message.rendered | grep "\bfold\b"

@Philippe-Cholet
Member

Philippe-Cholet commented Feb 6, 2024

@phimuemue Oh... I find it embarrassing that I was aware of --message-format json for "check" (never used) but not for "clippy".

After some tinkering on jq playground, I think I have a new git alias (it's not git-related but useful anyway):

missing-trait-methods = "!f() { cargo clippy --quiet --message-format json -- --warn clippy::missing_trait_methods | jq -r '. | select(.message.code.code == \"clippy::missing_trait_methods\") | select(.message.message | endswith(\"`'$2'`\")) | .message.spans[0] | .file_name + \":\" + (.line_start | tostring) + \"    \" + .text[0].text' | grep $1 | uniq; }; f"

then

git missing-trait-methods Iterator fold
git missing-trait-methods DoubleEndedIterator rfold
Output for `git missing-trait-methods Iterator size_hint`

src\adaptors\mod.rs:498    impl<B, F, I> Iterator for Batching<I, F>
src\groupbylazy.rs:387    impl<'a, K, I, F> Iterator for Groups<'a, K, I, F>
src\groupbylazy.rs:440    impl<'a, K, I, F> Iterator for Group<'a, K, I, F>
src\groupbylazy.rs:556    impl<'a, I> Iterator for Chunks<'a, I>
src\groupbylazy.rs:599    impl<'a, I> Iterator for Chunk<'a, I>
src\grouping_map.rs:25    impl<K, V, I, F> Iterator for MapForGrouping<I, F>
src\sources.rs:127    impl<A, St, F> Iterator for Unfold<St, F>

@Philippe-Cholet
Member

@luander Can I help you more on Combinations::fold?
Alternatively, I suggested ZipEq::fold, but if you want to work on Combinations, specializing nth would be appreciated (it would solve #301 and close the outdated and stuck #329).

@Philippe-Cholet
Member

@jswrenn A lint missing_iterator_fold was not particularly well received: too specific to iterators basically.
The main alternative: clippy::missing_trait_methods could be configurable to limit warnings (6538 right now):

# clippy.toml
missing_trait_methods = [
    "core::iter::Iterator::fold",
    "core::iter::DoubleEndedIterator::rfold",
    "core::iter::Iterator::size_hint",
]

The downside is that allowing the lint on an impl Iterator would allow not just one missing specialization but all of them. But if we limit ourselves to size_hint/fold/rfold, then this seems good enough to me.

Is that okay with you? If it is, I'll work on making the lint configurable.

@jswrenn
Member Author

jswrenn commented Feb 22, 2024

Making clippy::missing_trait_methods configurable sounds fantastic!

@kinto-b
Contributor

kinto-b commented Apr 15, 2024

@Philippe-Cholet Hi there, on Combinations::fold, once #914 is merged, I think something like this should work as the core of an implementation:

// increment_indices returns true once we run out of combinations
while !self.increment_indices() {
    let combination: Vec<_> = self.indices.iter().map(|i| self.pool[*i].clone()).collect();
    acc = f(acc, combination);
}

If you think that looks reasonable I'll put through a PR.

Edit: I tried this out and, although it works, it is markedly slower than the unspecialized implementation

@Philippe-Cholet
Member

I tried this out and, although it works, it is markedly slower than the unspecialized implementation

@kinto-b We won't specialize it if it's slower (but that alone is valuable information). About combinations, the slowest thing is heap-allocating the generated items. fold by nature generates them all (to give them to f), so we can't win anything on that side.
The only thing Combinations::fold could do is use simpler code after the initialization phase, whereas the default fold, by repeatedly calling next, has "init code" lying around unused after a few steps; that's basically what you did. But even if it were a perf win, I did not expect much out of it, since heap allocation is the bottleneck here and nothing else really impacts performance (or at least I think so).
I kinda wonder if mutating a combination IN-PLACE and giving out clones would be faster, but it's a lot of work for a probably small win (if any): while !self.increment_indices_and_comb(&mut comb) { acc = f(acc, comb.clone()); } or something like that.

I will eventually work again on configuring clippy::missing_trait_methods and (if accepted) we will be able to mark that some fold implementations are allowed to remain unspecialized and ensure that all the others are specialized.
