Visual Temporal Prediction: Representation, Estimation, and Modeling