Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Jackmin108
/
glm-0.5B-old
like
1
Text Generation
Transformers
Safetensors
English
Chinese
glm4_moe
conversational
custom_code
arxiv:
2508.06471
License:
mit
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
glm-0.5B-old
Commit History
Update modeling_glm4_moe.py
5600396
verified
Jackmin108
commited on
Sep 16
Update modeling_glm4_moe.py
640157d
verified
Jackmin108
commited on
Sep 16
add interleave
edf0859
Jackmin108
commited on
Aug 28
use sdpa
43b51fc
Jackmin108
commited on
Aug 28
weights
2ddfe31
Jackmin108
commited on
Aug 28
use tt moe
1fb9eb6
Jackmin108
commited on
Aug 28
original impl
ad420a7
Jackmin108
commited on
Aug 28
initial commit
0cd34cf
verified
Jackmin108
commited on
Aug 28