--- base_model: - DMindAI/DMind-1-mini - Qwen/Qwen3-14B - soob3123/GrayLine-Qwen3-14B - ValiantLabs/Qwen3-14B-Cobalt2 - ValiantLabs/Qwen3-14B-Esper3 library_name: transformers license: apache-2.0 language: - en pipeline_tag: text-generation tags: - mergekit - merge - esper - esper-3 - dmind - dmind-1-mini - cobalt - cobalt-2 - grayline - valiant - valiant-labs - qwen - qwen-3 - qwen-3-14b - 14b - reasoning - web3 - code - code-instruct - python - javascript - dev-ops - jenkins - terraform - scripting - powershell - azure - aws - gcp - cloud - problem-solving - architect - engineer - developer - creative - analytical - expert - rationality - math - math-reasoning - math-instruct - uncensored - unfiltered - amoral-ai - conversational - chat - instruct --- # sequelbox/Qwen3-14B-Esper3Mix This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit), combining several Qwen 3 14b finetunes to maximize reasoning performance. ## Merge Details ### Merge Method This model was merged using the [DELLA](https://arxiv.org/abs/2406.11617) merge method using [Qwen/Qwen3-14B](https://huggingface.co/Qwen/Qwen3-14B) as a base. ### Models Merged The following models were included in the merge: * [DMindAI/DMind-1-mini](https://huggingface.co/DMindAI/DMind-1-mini) * [soob3123/GrayLine-Qwen3-14B](https://huggingface.co/soob3123/GrayLine-Qwen3-14B) * [ValiantLabs/Qwen3-14B-Cobalt2](https://huggingface.co/ValiantLabs/Qwen3-14B-Cobalt2) * [ValiantLabs/Qwen3-14B-Esper3](https://huggingface.co/ValiantLabs/Qwen3-14B-Esper3) ### Configuration The following YAML configuration was used to produce this model: ```yaml merge_method: della dtype: bfloat16 parameters: normalize: true models: - model: ValiantLabs/Qwen3-14B-Esper3 parameters: density: 0.25 weight: 0.4 - model: ValiantLabs/Qwen3-14B-Cobalt2 parameters: density: 0.25 weight: 0.25 - model: DMindAI/DMind-1-mini parameters: density: 0.25 weight: 0.25 - model: soob3123/GrayLine-Qwen3-14B parameters: density: 0.25 weight: 0.25 base_model: Qwen/Qwen3-14B ```