Continuous Space RL

Introduction

Discrete vs Continuous

  • Discrete:

    • Grid-based world, finite number of states and actions.

    • Chess grids,

  • Continuous Actions

    • real physical environment. robot control, positions

If there are a very large number of actions, TD control-SarsaMax needs to iterate for all those possible numbers of actions, which would increase the calculation load.

Discretization: rounding to finite numbers

Discretize into finite states uniformly or non-uniformly. Example is an occupancy grid with equal size grids or non-uniform grid size

Tile Coding

Coarse Coding

  • Use sparse data

  • Narrow generalization

  • Broad generalization

  • Asymmetric generalization

Function Approximation

Use parameters to shape the function that approximates the continuous value-state functions

Linear Function Approximation

  • Define the cost function of error (approx function - true function)

  • And minimize the cost function using gradient descent.

  • Only for linear relationship between the input and output

Action-Vector approximation

Kernel Functions / Feature Transformation

If the relationship is non-linear? Pass the relationship( ) to a non-linear function, also known as activation function as in neural network.

  • E.g. Radial Basis Functions

Last updated