[R] SPDF - Sparse Pre-training and Dense Fine-tuning for Large Language Models Submitted by CS-fan-101 t3_11xskuk on March 21, 2023 at 8:03 PM in MachineLearning 18 comments 47
[R] Introducing SIFT: A New Family of Sparse Iso-FLOP Transformations to Improve the Accuracy of Computer Vision and Language Models Submitted by CS-fan-101 t3_11yzsz6 on March 22, 2023 at 10:50 PM in MachineLearning 38 comments 77