Add metadata and link to paper
#12 opened 8 months ago
by
nielsr
can you provide wikitest ppl and c4 ppl separately?
#11 opened over 1 year ago
by
sheropen-2
Can you provide more details on the training?
1
#10 opened over 1 year ago
by
dequ777
Any plans to use MQA (multi-query attention) or GQA (grouped-query attention) in the future?
#9 opened over 1 year ago
by
graefics
Efficient Inference Kernel Support for 1.58bit.
❤️
10
#8 opened over 1 year ago
by
LeiWang1999
This code from BitLinear doesn't make sense
1
#7 opened over 1 year ago
by
qmsoqm
Is it bitnet {-1,0,1}?
4
#6 opened over 1 year ago
by
Remek
ValueError: Tokenizer class BitnetTokenizer does not exist or is not currently imported.
👍
2
4
#5 opened over 1 year ago
by
ryanzhangofficial
Longer inference time
2
#4 opened over 1 year ago
by
dittops
Why are these models fp32?
👀
3
5
#2 opened over 1 year ago
by
supercharge19
Is there a chat/instruct model in plans?
2
#1 opened over 1 year ago
by
MrVodnik