What is the difference between the final episodes of training and testing in DQN?

https://datascience.stackexchange.com/questions/38934

hyperparameter
training
dqn
hyperparameter-tuning

31-10-2019

Question

What is the difference between running the final episode in training mode and running in test mode in DQN?

Is there any difference beyond this: after training and tuning the hyperparameters, we test for one episode without any exploration? In other words, if we train for n episodes, is test mode equivalent to running episode n+1 of training mode without exploring? Is that correct?

Why do some DQN test scripts run the test for multiple episodes?
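To make the two modes in the question concrete, here is a minimal sketch. It is not a DQN: a tabular Q-learning agent on a made-up five-state chain stands in for the network, and every name (`select_action`, `run_episode`, the chain environment, the hyperparameter values) is an assumption chosen for illustration. Training mode explores with epsilon > 0 and updates the value estimates; test mode uses epsilon = 0 and leaves them untouched.

```python
import random

def select_action(q_values, epsilon, rng):
    """Epsilon-greedy: random action with probability epsilon, else argmax."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])

def run_episode(q_table, epsilon, learn, rng, alpha=0.1, gamma=0.9, max_steps=50):
    """One episode on a hypothetical chain: states 0..4, action 1 moves right,
    action 0 moves left, reward 1.0 on reaching state 4 (terminal)."""
    state, total = 0, 0.0
    for _ in range(max_steps):
        action = select_action(q_table[state], epsilon, rng)
        next_state = min(state + 1, 4) if action == 1 else max(state - 1, 0)
        done = next_state == 4
        reward = 1.0 if done else 0.0
        if learn:
            # Q-learning update; in a DQN this would be a gradient step.
            bootstrap = 0.0 if done else gamma * max(q_table[next_state])
            q_table[state][action] += alpha * (reward + bootstrap - q_table[state][action])
        total += reward
        state = next_state
        if done:
            break
    return total

rng = random.Random(0)
q_table = [[0.0, 0.0] for _ in range(5)]

# Training mode: exploration on (epsilon > 0), learning on.
for _ in range(300):
    run_episode(q_table, epsilon=0.5, learn=True, rng=rng)

# Test mode: exploration off (epsilon = 0), learning off, several episodes.
snapshot = [row[:] for row in q_table]
test_returns = [run_episode(q_table, epsilon=0.0, learn=False, rng=rng) for _ in range(10)]
mean_return = sum(test_returns) / len(test_returns)
print(test_returns, mean_return)
```

In this sketch both the environment and the greedy policy are deterministic, so all ten test episodes produce the same return and one episode would be enough. When the environment dynamics or the initial state are random, returns vary from episode to episode, which is one plausible reason test scripts average over several episodes.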

No correct solution

Licensed under: CC-BY-SA with attribution

Not affiliated with datascience.stackexchange