Hi, there are some trained models, but I am not sure which one is the best.

As shown by the tensorbord results, there are 4 models(i.e. blue, red, cyan and orange) in total. Obviously:
The problem now is that I can't tell which one is better. Especially for red one and blue one. Even using them in the brain and inferencing in the practical game environment, they perform similarly. Who can tell me how to compare these models deeply from other aspects like Losses etc.? And explain the specific meaning of the stuff shown in the results like Episode Length, Value Loss etc.? It's weird that there is a sharp rise for value loss about 4M steps for blue one, I have no idea about the meaning of this..
Read the documentation.
https://github.com/Unity-Technologies/ml-agents/blob/master/docs/Using-Tensorboard.md
Thanks @mansto0. @gzrjzcx Please refer to our doc on how to interpret the tensorboard result. Also generally if you define your reward correctly, the model with the highest reward should be the best one.
@xiaomaogy Hi, what's the specific meaning of the correct reward? In terms of this case, the blue one has the highest value because it only set positive reward without penalty. I am confused about how to compare these blue and red two models.
In other words, how to compare the reward function? Or how to confirm if this reward function is correct? When inferencing with these two models, they both take similar actions.
If they take similar action, and you can't differentiate between them, then they are the same. You can add some logging to summarize the agent's behavior to make it more accurate.
Using different rewards/parameters can make it difficult to directly compare the results.
A simple thing you can do is: make a test scene with a concrete objective (doing the task faster, staying alive longer, collecting more objects, etc) then run each model on that scene and collect their statistics.
Thanks for the discussion - closing this issue due to inactivity. Feel free to open a new issue if you're still running into problems.
Most helpful comment
Read the documentation.
https://github.com/Unity-Technologies/ml-agents/blob/master/docs/Using-Tensorboard.md