AI & ML interests
None yet
Organizations
Text Classification • 7B • Updated • 15
vwxyzjn/online_dpo_vllm_thread_beta_0.03__allenai_open_instruct_dev
Updated
vwxyzjn/reward_modeling__EleutherAI_pythia-14m
Updated • 10
vwxyzjn/online_dpo_vllm__vwxyzjn_btulu
Updated • 10
vwxyzjn/online_dpo_vllm__allenai_llama-3-tulu-2-8b
Updated • 13
Text Generation • 8B • Updated • 8
vwxyzjn/online_dpo_tulu_2
Text Generation • Updated • 8
vwxyzjn/reward_modeling__allenai_llama-3-tulu-2-8b
Updated • 14
vwxyzjn/online_dpo__cleanrl_EleutherAI_pythia-1b-deduped__sft__tldr
Updated
vwxyzjn/online_dpo__EleutherAI_pythia-14m
Updated
vwxyzjn/online_dpo__EleutherAI_pythia-1b-deduped
Updated
vwxyzjn/tulu3_7b_llama3-10000-max-samples
vwxyzjn/reward_modeling__EleutherAI_pythia-1b-deduped
Updated
vwxyzjn/EleutherAI_pythia-14m__reward_modeling__tldr
Updated
vwxyzjn/rejection_sampling_23251
Updated
vwxyzjn/summarize_from_feedback_details
Updated
vwxyzjn/online_dpo_llmjudge_tldr_6.9b
Text Generation • 7B • Updated • 10
vwxyzjn/online_dpo_llmjudge
Text Generation • 1B • Updated • 7
vwxyzjn/online_dpo_llmjudge_tldr
Updated
vwxyzjn/online_dpo_tldr_6.9b
Text Generation • 7B • Updated • 10
Text Generation • 1B • Updated • 14