Extract action and observation information that you can use to create other environments or agents.
The reinforcement learning environment for this example is the simple longitudinal dynamics for an ego car and a lead car. The training goal is to make the ego car travel at a set velocity while maintaining a safe distance from the lead car by controlling longitudinal acceleration and braking. This example uses the same vehicle model as the Adaptive Cruise Control System Using Model Predictive Control (Model Predictive Control Toolbox) example.
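To make the environment description concrete, the following Python sketch shows one way simple longitudinal dynamics and a velocity-dependent safe distance could be modeled. It is an illustration only, not the Simulink model used in this example: the first-order acceleration lag, the time constant `tau`, the step size `dt`, and the spacing parameters `default_spacing` and `time_gap` are all assumed values, and the names `CarState`, `step_car`, `safe_distance`, and `relative_distance` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class CarState:
    position: float      # m
    velocity: float      # m/s
    acceleration: float  # m/s^2


def step_car(state: CarState, accel_cmd: float,
             dt: float = 0.1, tau: float = 0.5) -> CarState:
    """Advance one car by one forward-Euler step.

    Assumed model: the actual acceleration follows the commanded
    acceleration through a first-order lag, tau * da/dt + a = accel_cmd,
    while velocity and position integrate the current state.
    """
    acceleration = state.acceleration + dt * (accel_cmd - state.acceleration) / tau
    velocity = state.velocity + dt * state.acceleration
    position = state.position + dt * state.velocity
    return CarState(position, velocity, acceleration)


def safe_distance(ego_velocity: float,
                  default_spacing: float = 10.0,
                  time_gap: float = 1.4) -> float:
    """Assumed spacing policy: a standstill gap plus a time-gap term."""
    return default_spacing + time_gap * ego_velocity


def relative_distance(lead: CarState, ego: CarState) -> float:
    """Distance from the ego car to the lead car (positive when the lead is ahead)."""
    return lead.position - ego.position
```

In a sketch like this, the agent's action would be `accel_cmd` for the ego car, and the training goal amounts to tracking the set velocity whenever `relative_distance` exceeds `safe_distance`, and otherwise slowing down to restore the gap.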
Open the model and create the reinforcement learning environment.