Demonstrating basic Q-learning algorithm