Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
wh-zhu
's Collections
PSFT
Realigner-TrRa
Weak-to-Strong
Realigner-InRa
Realigner-TrRa
updated
May 29
Upvote
-
wh-zhu/DeepSeek-R1-TrRa-1.5B-lambda_2
2B
•
Updated
Jun 17
•
1
wh-zhu/DeepSeek-R1-TrRa-1.5B-lambda_5
2B
•
Updated
Jun 17
wh-zhu/DeepSeek-R1-TrRa-1.5B-lambda_10
2B
•
Updated
Jun 17
wh-zhu/DeepSeek-R1-TrRa-iter1-1.5B-lambda_2
2B
•
Updated
Jun 17
wh-zhu/DeepSeek-R1-TrRa-iter2-1.5B-lambda_2
2B
•
Updated
Jun 17
wh-zhu/DeepSeek-R1-TrRa-1.5B_lambda_0.5
2B
•
Updated
Jun 17
wh-zhu/DeepSeek-R1-TrRa-1.5B_lambda_1.5
2B
•
Updated
Jun 17
Upvote
-
Share collection
View history
Collection guide
Browse collections