CtrlK

Continuous Space RL

Introduction

Discrete vs Continuous

Discrete:
- Grid-based world, finite number of states and actions.
- Chess grids,
Continuous Actions
- real physical environment. robot control, positions

If there are a very large number of actions, TD control-SarsaMax needs to iterate for all those possible numbers of actions, which would increase the calculation load.

Discretization: rounding to finite numbers

Discretize into finite states uniformly or non-uniformly. Example is an occupancy grid with equal size grids or non-uniform grid size

Tile Coding

Coarse Coding

Use sparse data
Narrow generalization
Broad generalization
Asymmetric generalization

Function Approximation

Use parameters to shape the function that approximates the continuous value-state functions

Linear Function Approximation

Define the cost function of error (approx function - true function)
And minimize the cost function using gradient descent.
Only for linear relationship between the input and output

Action-Vector approximation

Kernel Functions / Feature Transformation

If the relationship is non-linear? Pass the relationship( ) to a non-linear function, also known as activation function as in neural network.

E.g. Radial Basis Functions

PreviousBasics NextIntroduction

Last updated 3 years ago

Was this helpful?