Statistics for Addressing deep reinforcement learning: empirical algorithm performance evaluations∗