Addition is almost all you need: Compressing large language models with double binary factorization
Vladimír Boža, Vladimír Macko.
Action editor: Hao Tang.
openreview.net/forum?id=k5kUK…
#quantization #binary #factorization


