Unable to reproduce the reported results #8

InoriJam · 2021-08-07T10:21:50Z

I retrained your model using the default hyperparameters in run.py, but my results are not close to the reported ones: the score is still too low after 2000 episodes. Could you give me any advice on reproducing your results?

DiegoCefalo · 2021-10-03T13:01:01Z

Bumping this issue. I'm barely getting more than 2000 points, even after trying different hyperparameters.

SSSKJ · 2022-08-18T15:16:53Z

Same problem. Can anyone give me some advice about it? Thanks

nuno-faria · 2024-04-07T20:44:44Z

Sorry for the delayed response. Since the learning is initially based on random plays, it could be the case that the agent never ended up choosing a good move that it could learn from. To improve the probability of higher scores, I recommend increasing the number of episodes to explore, and decreasing the batch size to make it train faster. For example:

episodes = 3000
epsilon_stop_episode = 2000
mem_size = 1000
batch_size = 128
replay_start_size = 1000 # important, must be <= mem_size so the model can be trained
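To illustrate why replay_start_size must not exceed mem_size, here is a minimal sketch. It assumes the agent keeps its experience in a bounded buffer and only starts training once the buffer holds at least replay_start_size (and batch_size) entries. The helper can_start_training is hypothetical and is not taken from the repository.

```python
from collections import deque


def can_start_training(mem_size: int, replay_start_size: int,
                       batch_size: int, steps: int) -> bool:
    """Return True if training would begin within `steps` stored transitions.

    Assumption: memory is a bounded deque, and training is gated on the
    buffer length reaching both replay_start_size and batch_size.
    """
    memory = deque(maxlen=mem_size)  # oldest entries are dropped once full
    for step in range(steps):
        memory.append(step)  # stand-in for one stored transition
        if len(memory) >= replay_start_size and len(memory) >= batch_size:
            return True
    return False


# Values from the comment above: the buffer can reach the threshold.
print(can_start_training(mem_size=1000, replay_start_size=1000,
                         batch_size=128, steps=5000))  # True

# Threshold larger than the buffer: len(memory) is capped at 1000.
print(can_start_training(mem_size=1000, replay_start_size=2000,
                         batch_size=128, steps=5000))  # False
```

Under these assumptions, the second configuration never trains regardless of how many episodes are played, because the buffer length can never reach the start threshold.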

In addition to this, it might also be worth creating a larger neural network:

n_neurons = [32, 32, 32]
activations = ['relu', 'relu', 'relu', 'linear']
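As a rough sanity check on these two lists, the sketch below assumes that n_neurons gives the hidden layer sizes and that activations holds one entry per hidden layer plus one for the output layer, which is why it has four entries for three hidden layers. The function describe_layers, the input_size of 4, and the single output unit are hypothetical values chosen for illustration, not confirmed details of the project.

```python
def describe_layers(input_size, n_neurons, activations, output_size=1):
    """List (units, activation, parameter_count) for each dense layer.

    Assumption: fully connected layers with biases; the last activation
    belongs to the output layer.
    """
    if len(activations) != len(n_neurons) + 1:
        raise ValueError("activations needs one entry per hidden layer "
                         "plus one for the output layer")
    layers = []
    fan_in = input_size
    for units, activation in zip(list(n_neurons) + [output_size], activations):
        params = fan_in * units + units  # weights plus biases
        layers.append((units, activation, params))
        fan_in = units
    return layers


n_neurons = [32, 32, 32]
activations = ['relu', 'relu', 'relu', 'linear']

# input_size=4 is a hypothetical feature count used only for this example.
for units, activation, params in describe_layers(4, n_neurons, activations):
    print(units, activation, params)
```

A mismatch between the two list lengths raises an error here, which is a cheap way to catch a configuration typo before starting a long training run.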

I tested this recently and ended up reaching 300k points before stopping the process.
