Deep convolutional Q-learning