Austral-GLM4-KTO / README.md
metadata
datasets:
  - Delta-Vector/Tauri-IF-AM-Thinking
  - Delta-Vector/Tauri-KTO-Instruct-mix-v3
  - Delta-Vector/Tauri-LIT-RL-KTO
  - Delta-Vector/Tauri-Helpsteer3-Edit
  - Delta-Vector/Tauri-Physical-Reasoning
  - Delta-Vector/Tauri-Helpsteer-3-Preference-KTO
  - Delta-Vector/Tauri-Opus-Accepted-GPT-Rejected-Opus-Writing-Prompts
  - Delta-Vector/Tauri-KTO-Instruct-Mix
  - Delta-Vector/Tauri-Purpura-Arkhaios-CC-KTO
  - Delta-Vector/Tauri-Opus-accepted-hermes-rejected-shuffled
base_model:
  - Delta-Vector/Austral-GLM4-SFT

This is the KTO checkpoint of my GLM4 training run. Use the downstream version, -Winton, instead.

Delta-Vector/Tauri-IF-AM-Thinking
Delta-Vector/Tauri-KTO-Instruct-mix-v3
Delta-Vector/Tauri-LIT-RL-KTO
Delta-Vector/Tauri-Helpsteer3-Edit
Delta-Vector/Tauri-Physical-Reasoning
Delta-Vector/Tauri-Helpsteer-3-Preference-KTO
Delta-Vector/Tauri-Opus-Accepted-GPT-Rejected-Opus-Writing-Prompts
Delta-Vector/Tauri-KTO-Instruct-Mix
Delta-Vector/Tauri-Purpura-Arkhaios-CC-KTO
Delta-Vector/Tauri-Opus-accepted-hermes-rejected-shuffled

The datasets used are listed above.

Weights & Biases run: https://wandb.ai/new-eden/Austral-32B/runs/r8fnw6t9?nw=nwuserdeltavector