Gamma

Gamma or discount factor is a way to calculate the importance of actions based on the temporal proximity

Gamma is a way to calculate the importance of action. The following formula can be used to calculate the weighted reward.

gammaactionsrewardgamma^{actions} * reward

Example

Let's now calculate some rewards where 1 step is 1 point towards our reward, this should be linear right?

Number of actions

Raw reward

Reward after Gamma

10

10

9.511

1000

1000

6.654

10000

10000

1.701e-18

Last updated

Was this helpful?