variable sharability among critic and actor #2

PeiYingjun · 2018-08-03T14:32:11Z

Thanks for the reply. I have been busy with another project for the last few days and only recently found some spare time.
I noticed that in comm_net, the variables of the communication part (and maybe the encoder part as well) are not shared between the critic and the actor.
Is that how it is supposed to be in algorithms trained with DDPG, like comm_net?

Coac · 2018-08-03T15:50:04Z

Well, I am not sure, but it seems that many actor-critic architectures share the core layers and just use different heads. Both parts need to understand the environment, so sharing the features of the world might make training faster.
If you have time to try it, do not hesitate to open a PR.
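
To make the "shared core, different heads" idea concrete, here is a minimal sketch in plain Python. It is not the code from this repository: `SharedEncoder`, `Actor`, `Critic`, the layer sizes, and the single tanh layer are all hypothetical, chosen only to show both heads holding a reference to the same encoder variables.

```python
import math
import random


def linear(weights, bias, x):
    """Return weights @ x + bias for plain Python lists."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def init_layer(rng, n_in, n_out):
    """Small random weights and zero biases for an n_in -> n_out layer."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


class SharedEncoder:
    """Hypothetical shared core: one tanh layer over the observation."""

    def __init__(self, rng, obs_dim, hidden_dim):
        self.weights, self.bias = init_layer(rng, obs_dim, hidden_dim)

    def __call__(self, obs):
        return [math.tanh(v) for v in linear(self.weights, self.bias, obs)]


class Actor:
    """Actor head: maps the shared features to a bounded action."""

    def __init__(self, rng, encoder, hidden_dim, action_dim):
        self.encoder = encoder  # a reference to the shared object, not a copy
        self.weights, self.bias = init_layer(rng, hidden_dim, action_dim)

    def __call__(self, obs):
        features = self.encoder(obs)
        return [math.tanh(v) for v in linear(self.weights, self.bias, features)]


class Critic:
    """Critic head: maps the shared features plus the action to a scalar Q-value."""

    def __init__(self, rng, encoder, hidden_dim, action_dim):
        self.encoder = encoder  # the same encoder instance the actor uses
        self.weights, self.bias = init_layer(rng, hidden_dim + action_dim, 1)

    def __call__(self, obs, action):
        features = self.encoder(obs)
        return linear(self.weights, self.bias, features + list(action))[0]


rng = random.Random(0)
encoder = SharedEncoder(rng, obs_dim=4, hidden_dim=8)
actor = Actor(rng, encoder, hidden_dim=8, action_dim=2)
critic = Critic(rng, encoder, hidden_dim=8, action_dim=2)

obs = [0.5, -0.2, 0.1, 0.9]
action = actor(obs)
q_value = critic(obs, action)
```

One thing to keep in mind with this layout: because the encoder is a single object, any update to its weights changes the output of both heads. In a DDPG-style setup that means the actor loss and the critic loss would both push on the same encoder variables, so whether sharing actually helps here is something to check experimentally.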

PeiYingjun · 2018-08-03T16:09:21Z

Exactly, I'm trying to rewrite the code.
