Performance Modeling And Optimization For Machine Learning Workloads