
WIP: Parallel SAMIN #843

Draft · wants to merge 2 commits into master

Conversation


@MatFi MatFi commented Aug 14, 2020

My problems often involve functions that are expensive to evaluate. However, Julia IMHO offers hardly any native packages with parallel methods for (global) optimization (besides BlackBoxOptim.jl). I have now made your SAMIN method cluster-compatible and would like to contribute this to Optim.jl. What do you think?

To do:

  • Worker based parallelization
  • Threaded parallelization (needed?)
  • Correct implementation of the f_calls stats (currently it just uses the iteration counter)

From my first benchmarks, the pay-off comes once a single f call costs more than 1e-5 s (all workers on the same machine).

[benchmark figure]

@ChrisRackauckas
Contributor

This is great! However, you are doing too much work. Instead of trying to implement every form of parallelism yourself, which you can't do exhaustively (what about GPUs? TPUs? ...), it would be better if this were just a batch interface: give the user an array of arrays (or a matrix of x values) and have them return a vector of objective values. Done that way, every form of parallelism is supported.
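
To make the suggestion concrete, a minimal sketch of what such a batch objective could look like on the user's side (batch_objective and the stand-in objective are illustrative, not an existing Optim.jl API):

```julia
# Hypothetical user-side batch objective: the solver hands over a vector
# of candidate points and gets a vector of objective values back. How
# those values are computed (threads, pmap, GPU, MPI, ...) is up to the user.
function batch_objective(xs::Vector{Vector{Float64}})
    fx = Vector{Float64}(undef, length(xs))
    Threads.@threads for i in eachindex(xs)
        fx[i] = sum(abs2, xs[i])  # stand-in for an expensive objective
    end
    return fx
end
```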

@codecov

codecov bot commented Aug 14, 2020

Codecov Report

Merging #843 into master will increase coverage by 0.17%.
The diff coverage is 82.71%.


@@            Coverage Diff             @@
##           master     #843      +/-   ##
==========================================
+ Coverage   81.48%   81.65%   +0.17%     
==========================================
  Files          43       43              
  Lines        2684     2720      +36     
==========================================
+ Hits         2187     2221      +34     
- Misses        497      499       +2     
Impacted Files                                    Coverage Δ
src/Optim.jl                                      100.00% <ø> (ø)
src/multivariate/solvers/constrained/samin.jl     80.00% <82.71%> (+3.37%) ⬆️

Δ = absolute <relative> (impact), ø = not affected, ? = missing data

@pkofod
Member

pkofod commented Aug 14, 2020

I have made something along the lines of what Chris mentions for Particle Swarm; you can have a look at it here: https://gist.github.com/pkofod/c6f0dc28588e1d65b521ba785405aff2

@pkofod
Member

pkofod commented Aug 14, 2020

I also have the same algorithm in a slightly different flavor here: https://github.com/pkofod/NLSolvers.jl/blob/bc8b1941218fea9349dc9797ed32bb577c59289c/src/optimize/randomsearch/particleswarm.jl#L48. The idea there is that the objective's type determines how it receives the states: either as a whole batch, or one at a time, through the batched_value call.
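
Roughly, the dispatch idea reads like this (the struct names are illustrative paraphrases, not the actual NLSolvers.jl definitions; only batched_value is the name from the comment above):

```julia
abstract type AbstractBatchObjective end

# Objective evaluated one point at a time by the solver:
struct PerPointObjective{F} <: AbstractBatchObjective
    f::F
end

# Objective that receives the whole batch and parallelizes internally:
struct BatchObjective{F} <: AbstractBatchObjective
    f::F
end

batched_value(obj::PerPointObjective, xs) = map(obj.f, xs)  # one call per state
batched_value(obj::BatchObjective, xs) = obj.f(xs)          # one call per batch
```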

@MatFi
Author

MatFi commented Aug 14, 2020

it would be good if this was just a batch interface

I definitely see the advantages of doing this, but keep in mind that some functions vary a lot in evaluation time (e.g. integrating stiff differential equations). I would therefore like to keep the possibility of asynchronous evaluation, because a batch-based implementation idles too much in such cases. (Of course, one can always synchronize the asynchronous procedure on a full batch to allow for both.)
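
A minimal sketch of that asynchronous pattern, assuming Distributed's default dynamic scheduling (expensive_f and xs are placeholders):

```julia
using Distributed
addprocs(4)

@everywhere expensive_f(x) = sum(abs2, x)  # stand-in for a costly objective

xs = [rand(3) for _ in 1:32]
# pmap schedules dynamically by default: each worker pulls the next point
# as soon as it finishes, so fast evaluations never wait on slow ones.
fx = pmap(expensive_f, xs)
```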

If you do it like that, all forms of parallelism are implemented.

Actually, that just shifts the implementation to the user, but of course the most common cases can be covered with a suitable dispatch.

I have made something like what Chris mentions for Particle Swarm that you can have a look at

Thanks, I was just about to ask for examples... Extremely helpful 👍

@ChrisRackauckas
Contributor

I definitely see the advantages of doing this, but keep in mind that some functions vary a lot in evaluation time (e.g. integrating stiff differential equations). I would therefore like to keep the possibility of asynchronous evaluation, because a batch-based implementation idles too much in such cases.

That's exactly the reason why it should be on the caller's side. Sometimes that's the case: with integration of stiff stochastic differential equations I've seen whopping three-orders-of-magnitude timing differences on the same equation (due to switching off a steady state). However, you can't rely on this, because the spawn cost for threads is high: if you have an ODE that finishes in about 1 ms, then spawning a task per evaluation for dynamic scheduling, or using pmap, is far too slow. So you can't just have "threading", you need:

  1. threading with dynamic scheduling
  2. threading with static scheduling
  3. threading with partial static scheduling (clumping but not the number of threads)
  4. distributed dynamic
  5. distributed static
  6. distributed + threads in all combinations
  7. GPU via CuArray
  8. GPU via DiffEqGPU
  9. GPU via KernelAbstractions
  10. MPI (since Distributed doesn't scale all that well)
  11. MPI+CUDA (the Clima setup)

Those are eleven forms of parallelism we are actively using in projects right now with stiff differential equations, all for different purposes due to the trade-offs. It's just so much easier to tell users who want asynchronous evaluation to loop over @spawn themselves (see the sketch below) than to try to handle parallelism efficiently inside the library, since "efficiently" means something different to every user.
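
For illustration, such a user-side @spawn loop could be as short as this (batch_objective and f are placeholder names, assuming the batch interface proposed above):

```julia
f(x) = sum(abs2, x)  # stand-in for an expensive objective

# One task per point, fetched in order: scheduling stays dynamic even
# when evaluation times differ by orders of magnitude.
function batch_objective(xs)
    tasks = [Threads.@spawn f(x) for x in xs]
    return fetch.(tasks)
end
```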

@lrnv

lrnv commented Nov 12, 2020

Hi,

Would it be difficult to integrate @pkofod's gist into Optim.jl? I don't think I have enough understanding of Julia's internals to do it myself, as the code @pkofod produced is quite different from the content of the ParticleSwarm.jl file in this repo...

But if you think it is easily doable and can point me in the right direction, I might do it :)
