Q-learning using a neural network