You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I retrained your model using the default hyperparameters in run.py, but my results are not similar to the reported results, the score is still too low after 2000 episodes. Could you please give me any advice to reproduce your results?
The text was updated successfully, but these errors were encountered:
Sorry for the delayed response. Since the learning is initially based on random plays, it could be the case that the agent never ended up choosing a good move that it could learn from. To improve the probability of higher scores, I recommend increasing the number of episodes to explore, and decreasing the batch size to make it train faster. For example:
episodes=3000epsilon_stop_episode=2000mem_size=1000batch_size=128replay_start_size=1000# important, must be <= mem_size so the model can be trained
In addition to this, it might also be worth creating a larger neural network:
I retrained your model using the default hyperparameters in run.py, but my results are not similar to the reported results, the score is still too low after 2000 episodes. Could you please give me any advice to reproduce your results?
The text was updated successfully, but these errors were encountered: