Skip to content

Commit 19db772

Browse files
N-step discounting in QRDQN
1 parent b9791c7 commit 19db772

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

deep_rl/agent/QuantileRegressionDQN_agent.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -62,7 +62,7 @@ def compute_loss(self, transitions):
6262

6363
rewards = tensor(transitions.reward).unsqueeze(-1)
6464
masks = tensor(transitions.mask).unsqueeze(-1)
65-
quantiles_next = rewards + self.config.discount * masks * quantiles_next
65+
quantiles_next = rewards + self.config.discount ** self.config.n_step * masks * quantiles_next
6666

6767
quantiles = self.network(states)['quantile']
6868
actions = tensor(transitions.action).long()

0 commit comments

Comments
 (0)