Gaoxiang Luo's picture

Gaoxiang Luo

luo00042

AI & ML interests

None yet

Recent Activity

updated a model 3 days ago

luo00042/RLSR-batch-level-hotpot-qwen2.5-7b-lora

published a model 4 days ago

luo00042/RLSR-only-batch-level-math-qwen2.5-7b-lora

published a model 5 days ago

luo00042/RLSR-only-batch-level-hotpot-qwen2.5-7b-lora

View all activity

Organizations

None yet

models 146

luo00042/RLSR-batch-level-hotpot-qwen2.5-7b-lora

Updated 3 days ago

luo00042/RLSR-only-batch-level-math-qwen2.5-7b-lora

Updated 4 days ago

luo00042/RLSR-only-batch-level-hotpot-qwen2.5-7b-lora

Updated 5 days ago

luo00042/RLSR-batch-level-math-qwen2.5-7b-lora

Updated 5 days ago

luo00042/RLSR-g32-4h-sym0.75-ranking-index-random-conf-soft-running-reweighted-risk-math-qwen2.5-7b-lora

Updated 7 days ago

luo00042/RLSR-4h-sym-ranking-index-median-conf-soft-running-reweighted-risk-hotpot-qwen2.5-7b-lora

Updated 11 days ago

luo00042/RLSR-4h-2-sym-ranking-index-random-conf-soft-running-reweighted-risk-hotpot-qwen2.5-7b-lora

Updated 11 days ago

luo00042/RLSR-4h-sym0.75-ranking-median-reweighted-risk-hotpot-qwen2.5-7b-lora

Updated 12 days ago

luo00042/RLSR-4h-sym-buf1000-ranking-median-reweighted-risk-hotpot-qwen2.5-7b-lora

Updated 12 days ago

luo00042/RLSR-4h-sym-buf100-ranking-median-reweighted-risk-hotpot-qwen2.5-7b-lora

Updated 12 days ago

View 146 models

datasets 0

None public yet