[{"title":"( 44 个子文件 63KB ) batch_rl:Atari 2600游戏上的离线强化学习(又名批量强化学习)-源码","children":[{"title":"batch_rl-master","children":[{"title":"README.md <span style='color:#111;'> 8.09KB </span>","children":null,"spread":false},{"title":"batch_rl","children":[{"title":"baselines","children":[{"title":"configs","children":[{"title":"random.gin <span style='color:#111;'> 1.31KB </span>","children":null,"spread":false},{"title":"quantile.gin <span style='color:#111;'> 1.43KB </span>","children":null,"spread":false},{"title":"dqn.gin <span style='color:#111;'> 1.52KB </span>","children":null,"spread":false}],"spread":true},{"title":"train.py <span style='color:#111;'> 2.68KB </span>","children":null,"spread":false},{"title":"agents","children":[{"title":"__init__.py <span style='color:#111;'> 608B </span>","children":null,"spread":false},{"title":"random_agent.py <span style='color:#111;'> 1.26KB </span>","children":null,"spread":false},{"title":"dqn_agent.py <span style='color:#111;'> 2.29KB </span>","children":null,"spread":false},{"title":"quantile_agent.py <span style='color:#111;'> 2.68KB </span>","children":null,"spread":false}],"spread":true},{"title":"__init__.py <span style='color:#111;'> 608B </span>","children":null,"spread":false},{"title":"run_experiment.py <span style='color:#111;'> 1017B </span>","children":null,"spread":false},{"title":"replay_memory","children":[{"title":"logged_replay_buffer.py <span style='color:#111;'> 5.04KB </span>","children":null,"spread":false},{"title":"__init__.py <span style='color:#111;'> 608B </span>","children":null,"spread":false},{"title":"logged_prioritized_replay_buffer.py <span style='color:#111;'> 5.77KB </span>","children":null,"spread":false}],"spread":true}],"spread":true},{"title":"tests","children":[{"title":"atari_init_test.py <span style='color:#111;'> 1.60KB </span>","children":null,"spread":false},{"title":"fixed_replay_runner_test.py <span style='color:#111;'> 2.86KB </span>","children":null,"spread":false}],"spread":true},{"title":"fixed_replay","children":[{"title":"configs","children":[{"title":"quantile.gin <span style='color:#111;'> 1.50KB </span>","children":null,"spread":false},{"title":"c51.gin <span style='color:#111;'> 1.70KB </span>","children":null,"spread":false},{"title":"multi_head_dqn.gin <span style='color:#111;'> 1.62KB </span>","children":null,"spread":false},{"title":"rem.gin <span style='color:#111;'> 1.63KB </span>","children":null,"spread":false},{"title":"dqn.gin <span style='color:#111;'> 1.62KB </span>","children":null,"spread":false}],"spread":true},{"title":"train.py <span style='color:#111;'> 3.24KB </span>","children":null,"spread":false},{"title":"agents","children":[{"title":"multi_network_dqn_agent.py <span style='color:#111;'> 3.08KB </span>","children":null,"spread":false},{"title":"multi_head_dqn_agent.py <span style='color:#111;'> 3.14KB </span>","children":null,"spread":false},{"title":"__init__.py <span style='color:#111;'> 608B </span>","children":null,"spread":false},{"title":"dqn_agent.py <span style='color:#111;'> 3.56KB </span>","children":null,"spread":false},{"title":"rainbow_agent.py <span style='color:#111;'> 3.59KB </span>","children":null,"spread":false},{"title":"quantile_agent.py <span style='color:#111;'> 3.63KB </span>","children":null,"spread":false}],"spread":true},{"title":"__init__.py <span style='color:#111;'> 608B </span>","children":null,"spread":false},{"title":"run_experiment.py <span style='color:#111;'> 4.33KB </span>","children":null,"spread":false},{"title":"replay_memory","children":[{"title":"fixed_replay_buffer.py <span style='color:#111;'> 6.61KB </span>","children":null,"spread":false}],"spread":true}],"spread":true},{"title":"multi_head","children":[{"title":"multi_network_dqn_agent.py <span style='color:#111;'> 8.86KB </span>","children":null,"spread":false},{"title":"multi_head_dqn_agent.py <span style='color:#111;'> 5.57KB </span>","children":null,"spread":false},{"title":"atari_helpers.py <span style='color:#111;'> 14.20KB </span>","children":null,"spread":false},{"title":"__init__.py <span style='color:#111;'> 608B </span>","children":null,"spread":false},{"title":"quantile_agent.py <span style='color:#111;'> 9.30KB </span>","children":null,"spread":false}],"spread":true},{"title":"__init__.py <span style='color:#111;'> 608B </span>","children":null,"spread":false}],"spread":true},{"title":"LICENSE <span style='color:#111;'> 11.15KB </span>","children":null,"spread":false},{"title":"CONTRIBUTING.md <span style='color:#111;'> 1.23KB </span>","children":null,"spread":false},{"title":"online","children":[{"title":"configs","children":[{"title":"quantile.gin <span style='color:#111;'> 1.57KB </span>","children":null,"spread":false},{"title":"c51.gin <span style='color:#111;'> 1.50KB </span>","children":null,"spread":false},{"title":"rem.gin <span style='color:#111;'> 1.40KB </span>","children":null,"spread":false},{"title":"dqn.gin <span style='color:#111;'> 1.32KB </span>","children":null,"spread":false}],"spread":true},{"title":"train.py <span style='color:#111;'> 2.30KB </span>","children":null,"spread":false}],"spread":true}],"spread":true}],"spread":true}]