Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Angel Raychev's picture

Angel Raychev PRO

AngelRaychev
·

AI & ML interests

None yet

Recent Activity

updated a dataset about 4 hours ago
RLAIF/ultrafeedback-binarized
published a dataset 1 day ago
RLAIF/ultrafeedback-binarized
updated a dataset about 1 month ago
RLAIF/gm_toy_example
View all activity

Organizations

RLAIF's profile picture SynthLabs's profile picture GenRM: Generative Reward Models's profile picture Stanford University's profile picture RL from Synthetic Feedback's profile picture Reinforcement Learning Fine-Tuning's profile picture

models 93

AngelRaychev/1.5B-value-iteration_2

Text Generation • 2B • Updated Jun 9 • 9

AngelRaychev/1.5B-policy-iteration_2

Text Generation • Updated Jun 9 • 13

AngelRaychev/1.5B-value-iteration_1

Text Generation • 2B • Updated Jun 9 • 12

AngelRaychev/1.5B-policy-iteration_1

Text Generation • Updated Jun 9 • 43

AngelRaychev/3B-sos-iteration_0

Text Generation • 3B • Updated Jun 4 • 6

AngelRaychev/1.5B-value-iteration_5

Text Generation • 2B • Updated Jun 4 • 7

AngelRaychev/1.5B-policy-iteration_5

Text Generation • Updated Jun 4 • 12

AngelRaychev/1.5B-value-iteration_4

Text Generation • 2B • Updated Jun 4 • 11

AngelRaychev/1.5B-policy-iteration_4

Text Generation • Updated Jun 4 • 6

AngelRaychev/1.5B-value-iteration_3

Text Generation • 2B • Updated Jun 4 • 7
View 93 models

datasets 1

AngelRaychev/dpo_uf_rejudged_mixed_openorca_with_gold_labels_kl_estimation

Viewer • Updated Aug 21 • 65.6k • 18
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs