Angel Raychev's picture

Angel Raychev PRO

AngelRaychev

·

AI & ML interests

None yet

Recent Activity

updated a dataset about 4 hours ago

RLAIF/ultrafeedback-binarized

published a dataset 1 day ago

RLAIF/ultrafeedback-binarized

updated a dataset about 1 month ago

RLAIF/gm_toy_example

View all activity

Organizations

models 93

AngelRaychev/1.5B-value-iteration_2

Text Generation • 2B • Updated Jun 9 • 9

AngelRaychev/1.5B-policy-iteration_2

Text Generation • Updated Jun 9 • 13

AngelRaychev/1.5B-value-iteration_1

Text Generation • 2B • Updated Jun 9 • 12

AngelRaychev/1.5B-policy-iteration_1

Text Generation • Updated Jun 9 • 43

AngelRaychev/3B-sos-iteration_0

Text Generation • 3B • Updated Jun 4 • 6

AngelRaychev/1.5B-value-iteration_5

Text Generation • 2B • Updated Jun 4 • 7

AngelRaychev/1.5B-policy-iteration_5

Text Generation • Updated Jun 4 • 12

AngelRaychev/1.5B-value-iteration_4

Text Generation • 2B • Updated Jun 4 • 11

AngelRaychev/1.5B-policy-iteration_4

Text Generation • Updated Jun 4 • 6

AngelRaychev/1.5B-value-iteration_3

Text Generation • 2B • Updated Jun 4 • 7

datasets 1

AngelRaychev/dpo_uf_rejudged_mixed_openorca_with_gold_labels_kl_estimation

Viewer • Updated Aug 21 • 65.6k • 18