Ryokan Ri

ryo0634

https://ryou0634.github.io/

Ryou0634

AI & ML interests

Multilingual NLP, Pretrained Language Models, Information Retrieval

Papers 1

arxiv:2402.11485

Models 21

ryo0634/TinySwallow-1.5B-Math-DPO

Text Generation • 2B • Updated Sep 4, 2025 • 3

ryo0634/TinySwallow-1.5B-Math-SFT

Text Generation • 2B • Updated Sep 4, 2025 • 1

ryo0634/Swallow-7b-hf-oasst1-21k-ja-alert-dpo-100-steps-beta-2e-1

Text Generation • 7B • Updated Aug 6, 2024 • 1

ryo0634/Swallow-7b-hf-oasst1-21k-ja-alert-dpo-100-steps-beta-1e-1

Text Generation • 7B • Updated Aug 6, 2024 • 1

ryo0634/Swallow-7b-hf-oasst1-21k-ja-hh-rlhf-12k-ja-200-steps

Text Generation • 7B • Updated Aug 6, 2024 • 1

ryo0634/Swallow-7b-hf-oasst1-21k-ja-hh-rlhf-12k-ja-safety-150-steps

Text Generation • 7B • Updated Aug 6, 2024 • 2

ryo0634/Swallow-7b-hf-oasst1-21k-ja-hh-rlhf-12k-ja-100-steps

Text Generation • 7B • Updated Aug 6, 2024 • 1

ryo0634/Swallow-7b-hf-oasst1-21k-ja-aio-retriever-200-steps

Text Generation • 7B • Updated Aug 5, 2024 • 1

ryo0634/Swallow-7b-hf-oasst1-21k-ja-hh-rlhf-12k-ja

Text Generation • 7B • Updated Aug 4, 2024 • 1

ryo0634/Swallow-7b-plus-hf-oasst1-21k-ja

Text Generation • 7B • Updated Jul 25, 2024 • 2

Datasets 22

ryo0634/gsm8k-ja-noisy-dpo-on-policy-4

Viewer • Updated Sep 4, 2025 • 890 • 4

ryo0634/gsm8k-ja-noisy-dpo-on-policy-3

Viewer • Updated Sep 4, 2025 • 900 • 4

ryo0634/gsm8k-ja-noisy-dpo-on-policy

Viewer • Updated Sep 3, 2025 • 706 • 3

ryo0634/gsm8k-ja-noisy-dpo-on-policy-2

Viewer • Updated Sep 3, 2025 • 1.07k • 2

ryo0634/gsm8k-ja-noisy-dpo

Viewer • Updated Sep 3, 2025 • 1k • 2

ryo0634/gsm8k-ja-noisy-sft

Viewer • Updated Jul 28, 2025 • 1k • 8

ryo0634/gsm8k-ja-filtered-dev

Viewer • Updated Jul 27, 2025 • 400 • 20

ryo0634/gsm8k-ja-filtered-sft

Viewer • Updated Jul 27, 2025 • 3k • 22

ryo0634/math-short-thought-filtered

Viewer • Updated May 23, 2025 • 757 • 4

ryo0634/math-thought-filtered

Viewer • Updated May 23, 2025 • 923 • 4
