view article Article Introducing RWKV - An RNN with the advantages of a transformer +2 May 15, 2023 • 23
view article Article GaLore: Advancing Large Model Training on Consumer-grade Hardware +7 Mar 20, 2024 • 32
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27, 2024 • 627
StableIdentity: Inserting Anybody into Anywhere at First Sight Paper • 2401.15975 • Published Jan 29, 2024 • 18