Unable to reproduce the reported results #8

Open
InoriJam opened this issue Aug 7, 2021 · 3 comments

InoriJam commented Aug 7, 2021

I retrained your model using the default hyperparameters in run.py, but my results are nowhere near the reported ones: the score is still very low after 2000 episodes. Could you give me any advice on how to reproduce your results?

@DiegoCefalo

Bumping this issue. I'm barely getting more than 2000 points, even after trying different hyperparameters.


SSSKJ commented Aug 18, 2022

Same problem. Can anyone give me some advice about it? Thanks

Owner

nuno-faria commented Apr 7, 2024

Sorry for the delayed response. Since learning is initially based on random plays, it could be that the agent never happened to choose a good move that it could learn from. To improve the probability of higher scores, I recommend increasing the number of episodes spent exploring and decreasing the batch size so that training runs faster. For example:

episodes = 3000
epsilon_stop_episode = 2000
mem_size = 1000
batch_size = 128
replay_start_size = 1000 # important, must be <= mem_size so the model can be trained
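
The constraint in the last comment above can be checked before starting a long run. The following is only a sketch: `check_replay_config` and `epsilon_at` are hypothetical helpers, not functions from this repository, and the linear decay from 1.0 down to 0.0 is an assumption about how `epsilon_stop_episode` is used rather than something confirmed in this thread.

```python
def check_replay_config(mem_size, replay_start_size):
    """Return a list of problems that would keep training from ever starting.

    Hypothetical helper: it only encodes the rule stated above,
    replay_start_size <= mem_size.
    """
    problems = []
    if replay_start_size > mem_size:
        problems.append(
            "replay_start_size is larger than mem_size, so the memory "
            "can never hold enough samples to start training"
        )
    return problems


def epsilon_at(episode, epsilon_stop_episode, epsilon_start=1.0, epsilon_min=0.0):
    """Exploration rate at a given episode, ASSUMING a linear decay."""
    if episode >= epsilon_stop_episode:
        return epsilon_min
    fraction = episode / epsilon_stop_episode
    return epsilon_start - fraction * (epsilon_start - epsilon_min)


# Values suggested in the comment above.
print(check_replay_config(mem_size=1000, replay_start_size=1000))  # no problems found
print(check_replay_config(mem_size=1000, replay_start_size=2000))  # one problem reported
print(epsilon_at(1000, epsilon_stop_episode=2000))  # halfway through exploration
```

Under that assumed schedule, raising `epsilon_stop_episode` to 2000 keeps the agent making some random moves for two thirds of the 3000 episodes, which is the extra exploration the suggestion is aiming for.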

In addition to this, it might also be worth creating a larger neural network:

n_neurons = [32, 32, 32]
activations = ['relu', 'relu', 'relu', 'linear']
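
Note that `activations` has one more entry than `n_neurons`, presumably because the last activation belongs to the output layer. A rough sketch of how the two lists could pair up is shown here; `layer_plan` and `count_parameters` are hypothetical helpers, and both the single output unit and the input size of 4 are assumptions made for illustration, not values taken from this thread.

```python
def layer_plan(n_neurons, activations, output_size=1):
    """Pair each layer width with its activation.

    Assumes the final activation is for an output layer of
    `output_size` units (an assumption, not confirmed above).
    """
    if len(activations) != len(n_neurons) + 1:
        raise ValueError(
            "expected one activation per hidden layer plus one for the output layer"
        )
    sizes = list(n_neurons) + [output_size]
    return list(zip(sizes, activations))


def count_parameters(input_size, plan):
    """Count weights and biases of a fully connected network."""
    total = 0
    previous = input_size
    for units, _activation in plan:
        total += previous * units + units  # weights + biases
        previous = units
    return total


plan = layer_plan([32, 32, 32], ['relu', 'relu', 'relu', 'linear'])
for units, activation in plan:
    print(units, activation)

# input_size=4 is a placeholder; use the real state size of the environment.
print(count_parameters(4, plan))
```

Even with three hidden layers of 32 units the network stays small, so the extra capacity should add little to the training time per batch.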

I tested this recently and ended up reaching 300k points before stopping the process.
