Flover: A Temporal Fusion Framework for Efficient Autoregressive Model Parallel Inference
J. Yao, N. Alnaasan, T. Chen, A. Shafi, H. Subramoni, D. Panda
30th IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, & ANALYTICS,
Dec 2023.