Merge method: linear
Highest precision: dtype: float32 + out_dtype: bfloat16
Context length: 262,144
Temperature=0.6, TopP=0.95, TopK=20, MinP=0
The following YAML configuration was used to produce this model:
models:
  - model: Qwen/Qwen3-30B-A3B-Thinking-2507
    parameters:
      weight: 0.9
  - model: Qwen/Qwen3-Coder-30B-A3B-Instruct
    parameters:
      weight: 0.1
merge_method: linear
tokenizer_source: Qwen/Qwen3-30B-A3B-Thinking-2507
dtype: float32
out_dtype: bfloat16
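
As a rough illustration of what this configuration asks for, the sketch below applies a linear merge to toy parameter lists: each output value is the weighted average of the matching values from the source models, accumulated at higher precision and then rounded to bfloat16. This is not the merge tool's actual code. The helper names `to_bfloat16` and `linear_merge` are hypothetical, Python floats stand in for the float32 working dtype, dividing by the sum of the weights is an assumption (it changes nothing here, since 0.9 + 0.1 = 1), and the rounding helper ignores NaN and infinity.

```python
import struct


def to_bfloat16(x: float) -> float:
    """Round x to the nearest bfloat16 value (ties to even), returned as a float.

    Hypothetical helper for illustration; NaN and infinity are not handled.
    """
    # Reinterpret the float32 encoding of x as an unsigned 32-bit integer.
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    # bfloat16 keeps the top 16 bits; add a bias so truncation rounds to nearest-even.
    bias = 0x7FFF + ((bits >> 16) & 1)
    bits = (bits + bias) & 0xFFFF0000
    return struct.unpack("<f", struct.pack("<I", bits))[0]


def linear_merge(tensors, weights, normalize=True):
    """Weighted average of equally shaped flat parameter lists.

    Assumption: weights are divided by their sum when normalize is True.
    """
    total = sum(weights) if normalize else 1.0
    merged = []
    for values in zip(*tensors):
        # Accumulate at higher precision (standing in for dtype: float32).
        acc = sum(w * v for w, v in zip(weights, values)) / total
        # Store the result at lower precision (standing in for out_dtype: bfloat16).
        merged.append(to_bfloat16(acc))
    return merged


# Toy stand-ins for one tensor from each source model.
thinking_params = [1.0, 2.0]
coder_params = [3.0, 4.0]
merged = linear_merge([thinking_params, coder_params], [0.9, 0.1])
print(merged)
```

Averaging first and rounding once at the end limits rounding error to a single bfloat16 conversion per parameter, which appears to be the point of pairing dtype: float32 with out_dtype: bfloat16.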