On error callback #89

samix73 · 2023-02-10T14:51:32Z

Register a callback that is called every time a task returns an error.

camdencheek

Nice! I'd considered this previously, but found that I never actually had a real use case for it. Do you have an example of how you'd want to use this?

camdencheek · 2023-02-11T14:59:28Z

pool/error_pool.go

+		if p.errorCallBack != nil {
+			p.errorCallBack(err)
+		}


Should this be behind the mutex? It seems useful to be able to guarantee that the callback will never be called concurrently.

I have also thought about that but if the callback is doing something time intensive it would block the whole pool.
I don't really have a strong opinion about this, it's a matter communication to the user so they know how it behaves either way.

Yeah, makes sense. I also don't feel very strongly. I think it's fine the way it is, but we should document that the callback must be safe to call from multiple goroutines.

pool/context_pool.go

samix73 · 2023-02-11T17:12:38Z

My current use case for this I want to log the errors as they happen but I don't want to necessarily Stop the pool. Example would be:

func errorCB(ctx context.Context, err error) {
	if err != nil {
		log.Print(err)
	}
}

type handler struct {
	pool *pool.ContextPool
}

func (h *handler) hello(w http.ResponseWriter, req *http.Request) {
	h.pool.Go(func(ctx context.Context) error {
		time.Sleep(time.Second)
		return errors.New("oh no!")
	})
}

func (h *handler) close() error {
	return h.pool.Wait()
}

func main() {
	h := &handler{
		pool: pool.New().ContextPool().WithErrorCallback(errorCB),
	}
	defer func() {
		if err := h.close(); err != nil {
			log.Print(err)
		}
	}()

	http.HandleFunc("/hello", h.hello)

	http.ListenAndServe(":8090", nil)
}

codecov-commenter · 2023-02-12T14:55:13Z

Codecov Report

Merging #89 (fcd160c) into main (daaaa39) will decrease coverage by 0.91%.
The diff coverage is 66.66%.

@@            Coverage Diff             @@
##             main      #89      +/-   ##
==========================================
- Coverage   99.28%   98.37%   -0.91%     
==========================================
  Files          12       12              
  Lines         419      431      +12     
==========================================
+ Hits          416      424       +8     
- Misses          3        7       +4

Impacted Files	Coverage Δ
pool/context_pool.go	`87.87% <0.00%> (-12.13%)`	⬇️
pool/error_pool.go	`100.00% <100.00%> (ø)`

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

camdencheek · 2023-02-12T18:17:31Z

pool/context_pool.go

+// WithErrorCallback configures the pool to call f everytime a task returns error.
+func (p *ContextPool) WithErrorCallback(f func(err error)) *ContextPool {


Should we expose an equivalent on ResultErrorPool and ResultContextPool?

camdencheek · 2023-02-12T18:37:50Z

Thinking out loud a bit here. Apologies if my comments below aren't clear -- I'm struggling to put words to my thoughts.

In general, I am a little concerned about this change because it encourages a pattern of long-lived pools. The problem with long-lived pools is that 1) errors are not propagated until the pool completes, and 2) panics are not propagated until the pool completes. This PR covers case 1, but it does not cover case 2.

In your example, you spawn a goroutine per request. However, the HTTP response writer is never actually used and the goroutine does not complete before the request completes, so the client will never see result or error. In practice, a HTTP server will parallelize requests, so a pool is not needed here. Additionally, handling requests should almost be done synchronously because you still need to write the response to the http.ResponseWriter when you complete the operation.

I know this probably sounds a bit pedantic, but I think adding streaming capabilities to pool might be a mistake because it suggests usage patterns that really aren't great. An ErrorPool handles a set of tasks where the result of all those tasks might be an error. Using the error early means that this is no longer true. Instead, I'd probably recommend using a Pool (rather than an ErrorPool) and handling the errors in a streaming manner. If you handling the errors as you go, the error return value isn't really useful anyways.

samix73 · 2023-02-13T17:21:59Z

Excellent points, For the case of handling panics I think it wouldn't be a bad idea to add a PanicCallback as well and once the callback is invoked we can clear the recovered value from the Catcher and if no callback registered to propagate the panic as previously

func (p *Catcher) tryRecover() {
	if val := recover(); val != nil {
		rp := NewRecoveredPanic(1, val)
		if p.onPanicCallback == nil {
			p.recovered.CompareAndSwap(nil, &rp)
		} else {
		 	p.onPanicCallback(&rp)
		}
	}
}

In my example I've kept code short for the sake of brevity, think of the goroutine invocation within the HTTP request would just initiate a background process which shouldn't interrupt other invocations of the same task in case of error but the error should be dealt with in some way (reported/logged, etc.).

Could you please elaborate a bit more on "I think adding streaming capabilities to pool might be a mistake", since I believe the Error/Panic Callbacks would/should not really promote any streaming patterns (and why streaming patterns are bad in this case?)?

Moreover, I have been thinking what if we extend these callbacks to Pre/Post hooks which would be respectively invoked before and after the each task. Let me know what you think about it?

bobheadxi · 2023-02-13T18:05:12Z

@camdencheek chiming in here on behalf of Cloud - we recently had a similar use case, where specifically we needed to do error handling after the pool becomes aware of the error (adds it to the underling errPool) - we were also tempted by the ability to have pool jobs error to add streaming handling of errors, and ended up getting bitten by the gotcha that cancellation will not happen until we actually return it. I pondered an error-callback addition for that use case briefly, but we eventually switched to using a stream instead, and building our own error handling - this worked much better because stream offers a more explicit hook for handling the results of each job.

[...] I think adding streaming capabilities to pool might be a mistake because it suggests usage patterns that really aren't great. An ErrorPool handles a set of tasks where the result of all those tasks might be an error. Using the error early means that this is no longer true. Instead, I'd probably recommend using a Pool (rather than an ErrorPool) and handling the errors in a streaming manner. If you handling the errors as you go, the error return value isn't really useful anyways.

+1 on this - IMO if you want to expose "job lifecycle" concepts, it probably should live in stream instead. This also takes care of issues like "does my callback need to be concurrency-safe" (for stream, no, but we could add new types of streams that do everything concurrently), and keeps pool as a purely "I have a group of work, do it all together and then give me aggregated results" model

(aside - now that I think of it, I think I like the old naming of lib/group better now, since it more directly implies a "bounded group of work" whereas pools in Go are often long-lived resource pools, in the context of which the attempted usage in #89 (comment) kind of makes sense, even though the design of pool is not for this kind of long-lived-pool-of-resources use case)

bobheadxi · 2023-02-13T18:10:19Z

since I believe the Error/Panic Callbacks would/should not really promote any streaming patterns (and why streaming patterns are bad in this case?)?

I think a callback that happens as soon as something happens is what Camden means by "streaming", and as I note in my previous comment I think the intended model is the pool package lets you do a bounded set of work and collect the results, whereas the stream package lets you execute on those results as they come in (and this is where we might want to add extended lifecycle/streaming hooks, instead of having it in pool types)

samix73 added 2 commits February 10, 2023 13:51

add on error callback

5d42ec6

add comment ContextPoo.WithError

c5fe20a

camdencheek reviewed Feb 11, 2023

View reviewed changes

remove unused errorCallBack on ContextPool

fcd160c

camdencheek reviewed Feb 12, 2023

View reviewed changes

bobheadxi mentioned this pull request Feb 16, 2023

FanOut/Until Discussion #86

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

On error callback #89

On error callback #89

samix73 commented Feb 10, 2023

camdencheek left a comment

camdencheek Feb 11, 2023

samix73 Feb 11, 2023 •

edited

camdencheek Feb 12, 2023

samix73 commented Feb 11, 2023

codecov-commenter commented Feb 12, 2023

camdencheek Feb 12, 2023

camdencheek commented Feb 12, 2023

samix73 commented Feb 13, 2023

bobheadxi commented Feb 13, 2023 •

edited

bobheadxi commented Feb 13, 2023

		// WithErrorCallback configures the pool to call f everytime a task returns error.
		func (p ContextPool) WithErrorCallback(f func(err error)) ContextPool {

On error callback #89

Are you sure you want to change the base?

On error callback #89

Conversation

samix73 commented Feb 10, 2023

camdencheek left a comment

Choose a reason for hiding this comment

camdencheek Feb 11, 2023

Choose a reason for hiding this comment

samix73 Feb 11, 2023 • edited

Choose a reason for hiding this comment

camdencheek Feb 12, 2023

Choose a reason for hiding this comment

samix73 commented Feb 11, 2023

codecov-commenter commented Feb 12, 2023

Codecov Report

camdencheek Feb 12, 2023

Choose a reason for hiding this comment

camdencheek commented Feb 12, 2023

samix73 commented Feb 13, 2023

bobheadxi commented Feb 13, 2023 • edited

bobheadxi commented Feb 13, 2023

samix73 Feb 11, 2023 •

edited

bobheadxi commented Feb 13, 2023 •

edited