Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
1
Zeliang Zhang
zeliang0426
Follow
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 9 hours ago
Directional Reasoning Injection for Fine-Tuning MLLMs
authored
a paper
about 1 month ago
One Forward is Enough for Neural Network Training via Likelihood Ratio Method
authored
a paper
about 1 month ago
Video Understanding with Large Language Models: A Survey
View all activity
Organizations
None yet
zeliang0426
's models
63
Sort: Recently updated
zeliang0426/DS-LLama-vanilla-6K
Updated
Sep 12
zeliang0426/Distill-LLama-8B-4k
Text Generation
•
8B
•
Updated
Aug 28
•
13
zeliang0426/Distill-LLama-8B-5k-try_2
Text Generation
•
8B
•
Updated
Aug 28
•
12
zeliang0426/Distill-LLama-8B-5k-smallLR
Text Generation
•
8B
•
Updated
Aug 28
•
13
zeliang0426/Distill-LLama-8B-6k
Text Generation
•
8B
•
Updated
Aug 28
•
13
zeliang0426/Distill-LLama-8B-7k
Text Generation
•
8B
•
Updated
Aug 24
•
12
zeliang0426/Distill-LLama-8B-5k
Text Generation
•
8B
•
Updated
Aug 22
•
11
zeliang0426/Distill-LLama-8B-5k-largeLR
Updated
Aug 22
zeliang0426/Qwen25-3-Think-nglobal_16
Text Generation
•
3B
•
Updated
Aug 20
zeliang0426/Qwen25-3-Think-nglobal_32
Text Generation
•
3B
•
Updated
Aug 20
zeliang0426/Qwen25-3-Think-no_global
Text Generation
•
3B
•
Updated
Aug 20
zeliang0426/Qwen25-3-Think-nglobal_48
Text Generation
•
3B
•
Updated
Aug 20
zeliang0426/Qwen25-3-Cache-Sink
Text Generation
•
3B
•
Updated
Aug 20
zeliang0426/Distill_Llama_Darpo-full-lora-3k
Updated
Aug 18
zeliang0426/SFT-Full
Updated
Aug 17
zeliang0426/SFT-Think
Text Generation
•
3B
•
Updated
Aug 16
zeliang0426/SFT-Cache
Updated
Aug 16
zeliang0426/RM-Think
Text Generation
•
3B
•
Updated
Aug 16
•
12
zeliang0426/RM-Cache
Updated
Aug 16
zeliang0426/Long_Distill_Llama_Darpo-cache-adapter-3k
Text Generation
•
8B
•
Updated
Aug 15
•
12
zeliang0426/Distill_Llama_SFT-full
Updated
Aug 15
zeliang0426/DS_Darpo-full-lora-3k
Updated
Aug 15
zeliang0426/DS_Llama_Darpo-cache-lora-3k
Updated
Aug 15
zeliang0426/Short_DS_Llama_Darpo-cache-adapter-3k
Text Generation
•
7B
•
Updated
Aug 15
•
12
zeliang0426/DS_Llama_Darpo-cache-adapter-3k
Text Generation
•
7B
•
Updated
Aug 15
•
12
zeliang0426/Distill_Llama_Darpo-cache-adapter-3k
Text Generation
•
8B
•
Updated
Aug 15
•
12
zeliang0426/Distill_Llama_Darpo-cache-lora-3k
Updated
Aug 15
zeliang0426/8k_Distill_Llama_Darpo-cache-adapter-3k
Text Generation
•
8B
•
Updated
Aug 15
•
12
zeliang0426/SuperLong_Distill_Llama_Darpo-cache-lora-3k
Updated
Aug 14
zeliang0426/SuperLong_Distill_Llama_Darpo-cache-adapter-3k
Updated
Aug 14
Previous
1
2
3
Next