Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
wh-zhu 's Collections
PSFT
Realigner-TrRa
Weak-to-Strong
Realigner-InRa

Realigner-TrRa

updated May 29
Upvote
-

  • wh-zhu/DeepSeek-R1-TrRa-1.5B-lambda_2

    2B • Updated Jun 17 • 1

  • wh-zhu/DeepSeek-R1-TrRa-1.5B-lambda_5

    2B • Updated Jun 17

  • wh-zhu/DeepSeek-R1-TrRa-1.5B-lambda_10

    2B • Updated Jun 17

  • wh-zhu/DeepSeek-R1-TrRa-iter1-1.5B-lambda_2

    2B • Updated Jun 17

  • wh-zhu/DeepSeek-R1-TrRa-iter2-1.5B-lambda_2

    2B • Updated Jun 17

  • wh-zhu/DeepSeek-R1-TrRa-1.5B_lambda_0.5

    2B • Updated Jun 17

  • wh-zhu/DeepSeek-R1-TrRa-1.5B_lambda_1.5

    2B • Updated Jun 17
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs