Got bad performance when reproducing HACO

Hi, I just attempted to reproduce HACO with keyboard control by running "train_haco_keyboard_easy.py", but the training performance was unsatisfactory.

In the early stage, I could see the model improving with the help of human interventions. After around 20 to 40 iterations, the car had learned some driving skills and occasionally managed to reach the destination, albeit inconsistently. However, after a few more iterations, strange things occurred: the car failed to start normally and would brake suddenly while driving. It seems the model forgot the skills it had previously learned, and its performance worsened.

Could you please explain the reasons behind this issue? Is it related to improper timing for human intervention, an excessive focus on exploration, or some other factor?

The screenshot below shows the evaluation results from running "eval_haco.py" with EPISODE_NUM_PER_CKPT = 2.
<img width="453" alt="eval_res" src="https://github.com/decisionforce/HACO/assets/33610398/da0f4b98-fa52-4047-991e-044ff5ea7fe6">
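
One caveat on the numbers above: with only two episodes per checkpoint, the measured success rate is very coarse, so part of the fluctuation in the plot may be evaluation noise rather than a real change in the policy. Below is a rough sketch of how coarse it is. It is plain Python, not part of the HACO code; the function names and the 0.5 success probability are illustrative assumptions, and episodes are assumed to be independent.

```python
import math
from typing import List


def possible_success_rates(episodes: int) -> List[float]:
    """All success-rate values that can be observed with `episodes` episodes."""
    return [k / episodes for k in range(episodes + 1)]


def success_rate_std_error(true_success_prob: float, episodes: int) -> float:
    """Standard error of the mean success rate, treating each episode
    as an independent Bernoulli trial (an assumption)."""
    return math.sqrt(true_success_prob * (1.0 - true_success_prob) / episodes)


if __name__ == "__main__":
    # Hypothetical true success probability of 0.5, chosen only for illustration.
    for n in (2, 10, 50):
        print(n, len(possible_success_rates(n)), success_rate_std_error(0.5, n))
```

With two episodes, a checkpoint can only score 0, 0.5, or 1, so raising EPISODE_NUM_PER_CKPT would probably give a clearer picture. That said, the failure to start and the sudden braking I described were also clearly visible during training.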




