Topics in Low-Rank Markov Decision Process: Applications in Policy Gradient, Model Estimation and Markov Games