Theory behind TRPO