Recording primal/dual iterates #791

Open
FSchmidtDIW opened this issue Oct 14, 2024 · 18 comments · May be fixed by #806
Comments

@FSchmidtDIW

Hi Oscar,

In this working paper you show a very nice plot of how the first-stage states converge over the iterations. Is there a quick way to record the current (first-stage) choices at a given iteration?

I was going to do it like this, but it seems a little impractical (I have not run this yet):

out = Dict{Int64,Dict}()

SDDP.train(
    model;
    time_limit = 18 * 3600,
    iteration_limit = 100,
    stopping_rules = [SDDP.FirstStageStoppingRule()],
    cut_type = SDDP.SINGLE_CUT,
    forward_pass = SDDP.RegularizedForwardPass(),
    log_file = string(today()) * "sddp_serial.log",
)
k = 100
while k <= 1000
    out[k] = SDDP.simulate(model, 1, ..., [capacities], ...)
    SDDP.train(
        model;
        time_limit = 18 * 3600,
        add_to_existing_cuts = true,
        iteration_limit = 10,
        stopping_rules = [SDDP.FirstStageStoppingRule()],
        cut_type = SDDP.SINGLE_CUT,
        forward_pass = SDDP.RegularizedForwardPass(),
        log_file = string(today()) * "sddp_serial.log",
    )
    k += 10
end

Thank you,

Felix

[image: plot of first-stage state values by iteration, as in the working paper]

@odow commented Oct 14, 2024

I was going to do it like this but it seems a little impractical

This is, in fact, exactly what @jarandh did 😄

@odow commented Oct 14, 2024

It's one reason that we didn't really report computation times in the paper, because there was so much other stuff going on just so we could make this one pretty picture.

@FSchmidtDIW

Haha :) understood! Do you think it would be worth creating a CapExForwardPass that records iterates when the first stage is deterministic? Otherwise, I will see how the approach above works for me. Would evaluating a decision rule for the first stage be quicker than simulating a (dummy) trajectory?
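For concreteness, the decision-rule variant I have in mind would be something like this untested sketch (I'm assuming SDDP.DecisionRule / SDDP.evaluate are the right tools for this, and :capacity is just a placeholder for my actual first-stage state):

# Untested sketch: query the policy at the (deterministic) first-stage node
# instead of simulating a dummy trajectory. `:capacity` is a placeholder name.
rule = SDDP.DecisionRule(model; node = 1)
solution = SDDP.evaluate(rule; incoming_state = Dict(:capacity => 0.0))
solution.outgoing_state  # the first-stage choice implied by the current cuts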

@odow commented Oct 15, 2024

I just remembered that there is:

forward_pass_callback::Function = (x) -> nothing,

You could try:

trajectories = Any[]
SDDP.train(model; forward_pass_callback = trajectory -> push!(trajectories, trajectory))

@FSchmidtDIW

Works like a charm!

I used the following to obtain a DataFrame of first-stage states by iteration:

using DataFrames

capacities = Any[]
# This depends on the first node being deterministic:
SDDP.train(model; forward_pass_callback = trajectory -> push!(capacities, trajectory.sampled_states[1]))
capacity_df = reduce(vcat, [DataFrame(keys(c) .=> values(c)) for c in capacities])
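For the plotting step, a minimal sketch (assuming Plots.jl; the columns are whatever the first-stage states are called in the model) would be:

using Plots

# One line per first-stage state, plotted against the training iteration.
plt = plot(; xlabel = "Iteration", ylabel = "First-stage state value")
for col in names(capacity_df)
    plot!(plt, capacity_df[!, col]; label = col)
end
plt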

Thanks!

@FSchmidtDIW

Hi, sorry to reopen this. I'm struggling to do this in a parallel version of my code. I've tried using SharedArrays.jl, but a SharedArray cannot hold the Dict elements of the capacities vector. Alternatively, I could store the iterates locally on each worker and combine them later; I'm just not sure how to do that. I know this is more a question about distributed computing in general than about SDDP.jl. Sorry about that :)

@odow commented Nov 23, 2024

What parallel scheme?

I'd advise against using the Asynchronous one. Use the new Threaded instead.

Start Julia with julia -t N where N is the number of threads, then do something like:

capacities = Any[]
my_lock = ReentrantLock()
function callback(trajectory)
    # Guard the shared vector: several threads may finish a forward pass at the same time.
    lock(my_lock) do
        push!(capacities, trajectory.sampled_states[1])
    end
    return
end
SDDP.train(model; forward_pass_callback = callback, parallel_scheme = SDDP.Threaded())

odow reopened this Nov 23, 2024
@FSchmidtDIW

Great, I will try this. Indeed, I've been using the asynchronous mode and it's been working quite well, but I'll make the switch to Threaded then.
Thank you very much!

@odow commented Nov 24, 2024

I don't have an easy way to use forward_pass_callback with Distributed. You'd probably need to use a Channel, but it'd get quite complicated.
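For reference, the rough shape of that Channel idea would be something like the untested sketch below; the tricky part is that the callback closes over a RemoteChannel and has to be shipped to every worker along with the model, which is exactly where it gets complicated:

using Distributed
addprocs(4)
@everywhere using SDDP

# A channel owned by the main process that every worker can put! into.
results = RemoteChannel(() -> Channel{Any}(10_000))

SDDP.train(
    model;
    parallel_scheme = SDDP.Asynchronous(),
    forward_pass_callback = t -> put!(results, t.sampled_states[1]),
)

# Drain the channel on the main process after training.
capacities = Any[]
while isready(results)
    push!(capacities, take!(results))
end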

@odow commented Nov 24, 2024

Note that Threaded works best when the number of nodes is >> the number of threads.

@FSchmidtDIW

Thank you! That is nodes in the policy graph, right? Does that mean that you also parallelise the backward pass?

@odow commented Nov 24, 2024

Yes, nodes in the graph.

The forward and backward pass are conducted asynchronously in parallel across the threads.

The differences are:

  • SDDP.Asynchronous has a complete copy of the graph in each process, and it periodically shares cuts between processes. This requires a lot of memory because there are multiple copies of the problem, and there is a lot of data movement between processes. But it theoretically scales to any number of processes. (Although in practice, you'll find that performance quickly tapers off with more processes.)
  • SDDP.Threaded has a single copy of the model in shared memory. Each thread does forward and backward passes asynchronously and in parallel on the same graph. There is a lock at each node, so that only one thread can be solving subproblems at a node at one time. Therefore, we can have at most as many threads as there are nodes in the graph. But things work better when there are many more nodes than threads.
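As a concrete launch sketch (start Julia with, for example, julia -t 8; the model.nodes check and the iteration limit are only illustrative):

using SDDP

# Effective parallelism is capped by the number of nodes in the graph,
# so warn if the graph is small relative to the thread count.
if length(model.nodes) < Threads.nthreads()
    @warn "Fewer nodes than threads: some threads will spend most of their time waiting."
end

SDDP.train(model; parallel_scheme = SDDP.Threaded(), iteration_limit = 100)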

@FSchmidtDIW

Very cool! Thank you! The first test run on my problem with Threaded looks very promising!

@odow commented Nov 25, 2024

I'll point you to the docstring: https://sddp.dev/stable/apireference/#SDDP.Threaded

It's still somewhat experimental. It should work for most standard use cases, but if you've written any custom plugins, you need to be very careful that they are themselves thread-safe.

But yeah, assuming it works and you're running on a single machine, it is much, much better than before.

@FSchmidtDIW

Thank you! It looks like the Threaded option does not work with a RegularizedForwardPass, right? Since the latter leads to significant performance gains over a vanilla ForwardPass in my case, I'd be interested in fixing this.
Why exactly does it fail with regularization?

@odow commented Nov 25, 2024

Why exactly does it fail with regularization?

I assume we need to make it thread-safe.

@odow commented Nov 25, 2024

The issue is that we modify the first-stage bounds for the forward pass:

old_bounds = Dict{Symbol,Tuple{Float64,Float64}}()
for (k, v) in node.states
    if has_lower_bound(v.out) && has_upper_bound(v.out)
        old_bounds[k] = (l, u) = (lower_bound(v.out), upper_bound(v.out))
        x = get(fp.trial_centre, k, model.initial_root_state[k])
        set_lower_bound(v.out, max(l, x - fp.ρ * (u - l)))
        set_upper_bound(v.out, min(u, x + fp.ρ * (u - l)))
    end
end
pass = forward_pass(model, options, fp.forward_pass)
for (k, (l, u)) in old_bounds
    fp.trial_centre[k] = pass.sampled_states[1][k]
    set_lower_bound(node.states[k].out, l)
    set_upper_bound(node.states[k].out, u)
end

The fix might not be trivial.

@FSchmidtDIW

I see! Well, it seems like the Threaded version without regularization is still much faster than serial with regularization, at least in my case. Thank you!

odow linked a pull request Nov 27, 2024 that will close this issue