Scalable accelerated decentralized multi-robot policy search in continuous observation spaces