Dark-Quill-12B
This is a merge of pre-trained language models created using mergekit.
Merge Details
Merge Method
This model was merged using the Linear DELLA merge method using Vortex5/MegaMoon-Karcher-12B as a base.
Models Merged
The following models were included in the merge:
Configuration
The following YAML configuration was used to produce this model:
models:
- model: yamatazen/LinearWriter-12B
parameters:
weight: [0.5, 0.4, 0.4, 0.8, 0.8]
density: 0.6
epsilon: 0.2
- model: ReadyArt/Omega-Darker_The-Final-Directive-12B
parameters:
weight: [0.7, 0.7, 0.5, 0.5, 0.5]
density: 0.5
epsilon: 0.2
merge_method: della_linear
base_model: Vortex5/MegaMoon-Karcher-12B
parameters:
lambda: 0.9
normalize: true
dtype: bfloat16
tokenizer:
source: yamatazen/LinearWriter-12B
- Downloads last month
- 10
Model tree for Vortex5/Dark-Quill-12B
Merge model
this model
