How I enable to get trained policy of Isaac-Repose-Cube-Allegro-v0 task? #1442
Unanswered
cold-young
asked this question in
Q&A
Replies: 2 comments 2 replies
-
Thanks for posting this. By expert policy, do you mean the final trained policy? You may find them in this directory:
|
Beta Was this translation helpful? Give feedback.
1 reply
-
I think the first step is for us to reproduce what you're seeing. Can you give me reproduction steps for your two policies so I can see if I get the same results? Are you using the samples found here directly: https://isaac-sim.github.io/IsaacLab/main/source/overview/environments.html Are these also the shadow hand examples you're looking for? |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi there,
I have been trying to train the Isaac-Repose-Cube-Allegro-v0 task using the skrl PPO algorithm.
(num_envs 2048, <1 hour)
Even though the reinforcement learning reward graph has converged, I cannot get an expert policy.
I think I need well defined shaped rewards or other learning approaches like imitation learning.
By the way, I found a good demonstration of in-hand manipulation with shadow hand. on this page
To acquire expert policy, what i do ?
Thanks
Beta Was this translation helpful? Give feedback.
All reactions