Towards efficient deep learning in computer vision via network sparsity and distillation