Xiangyu Zhang; PhD student; reinforcement learning, intelligent wireless communication
In 2016, DeepMind first proposed Policy Distillation, an idea that draws on supervised learning's model compression and