ArtusDev committed
Commit d340858 · verified · 1 Parent(s): e803ca5

Upload folder using huggingface_hub

.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
  *.zip filter=lfs diff=lfs merge=lfs -text
  *.zst filter=lfs diff=lfs merge=lfs -text
  *tfevents* filter=lfs diff=lfs merge=lfs -text
+ tokenizer.json filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,106 @@
+ ---
+ base_model:
+ - SillyTilly/ServiceNow-AI-Apriel-Nemotron-15b-Thinker-Chatml
+ ---
+ # Join our Discord! https://discord.gg/BeaverAI
+ ## More than 8000 members strong 💪 A hub for users and makers alike!
+ ---
+ ## Drummer is open for new opportunities! Contact me through any of these channels: https://linktr.ee/thelocaldrummer
+ ### Thank you to everyone who subscribed through [Patreon](https://www.patreon.com/TheDrummer). Your support helps me chug along in this brave new world.
+
+ ### FAQ for those out of the loop
+
+ <details>
+ <summary>🐶 Who is Drummer?</summary>
+
+ Hi! I'm Drummer. I'm a software engineer with experience in JavaScript, Golang, and Python, and generally engineering the crap out of things.
+
+ Why I'm in the AI space:
+
+ - **Exploration:** Everyone is trying to figure out how AI works and what it's capable of. I am too - just not in creating the smartest, safest model at all costs.
+ - **Upskilling:** The world is headed towards AI, and it is here to stay. This has been my way of brushing up on this new form of computing.
+ - **Value:** I yearn to create value. I feel satisfaction and fulfillment in providing something meaningful for others.
+ - **Fun:** It's just fun using and making models. It's also fun coming up with theories and realizing them in practice (training AI).
+
+ I started my tuning venture back in mid-2024 when I wanted to improve a model's literary capabilities.
+ I've come a long way since then, and I have branched out and specialized.
+ Foundational models today are optimized for non-creative uses, and I believe there is a place for AI in creativity and entertainment.
+
+ I am here to take *the road less traveled by*.
+
+ </details>
+
+ <details>
+ <summary>❓ What are my models like?</summary>
+
+ **Bottom line:** My models are usually geared towards creativity, usability, and entertainment!
+
+ While intelligence, correctness, and problem-solving are not my priority, they are still among the many qualities I want in my models.
+
+ The primary goal is to enhance the experience for users looking to use models for creative purposes, and for other use cases that require no alignment.
+
+ In an effort to make it clear to myself and to others what I'm aiming for, I've identified certain qualities that my users often want:
+
+ Creativity
+ - **Writing:** Does it string together words and sentences in a pleasant and effective way? Does it feel like a writer?
+ - **Dynamism:** How good is the AI at being compelling and intriguing in its storytelling?
+ - **Imagination:** Can the AI navigate through a plethora of possibilities? Can it skirt incoherence and rise to absolute coherence by the end?
+
+ (Dis)alignment
+ - **Attitude:** Does it refuse, in either soft or hard ways? Does it lean towards certain corporate/religious/political ethics and beliefs? How does it see the user and itself?
+ - **Morality:** Does it know ethics? Is its language infected with forced positivity? If not, can it still moralize over difficult and dubious themes?
+ - **Formatting:** How stubborn is it with its established formatting? Can it create effective and novel formats to answer the prompt?
+
+ Intelligence
+ - **Adherence:** Can it follow instructions? Does it stick to the prompt? Can it understand you?
+ - **Knowledge:** Does it know about the world, in both fictional and non-fictional ways?
+ - **Perception:** Can it handle nuance, complexity, and logic?
+
+ If a model doesn't excel in one of these qualities, or if it's overall mediocre for its size, then I will most likely iterate until I get something right.
+
+ </details>
+
+ <details>
+ <summary>💡 Philosophy</summary>
+
+ A person is defined by the language they use. Not whether they speak in English or German, but in how they perceive reality.
+
+ Just as we associate a serial killer with a mind that can't map 'murder' to 'evil', an innocent person is a mind that simply can't imagine 'murder'. They get confused when forced to deal with such subjects.
+
+ An AI's use of language speaks volumes about its 'perception' of reality. If a language model has been skewed and limited to a positive perception, then its ability to imagine is also limited.
+
+ Finetuning is an opportunity to adjust and broaden the language. Corporations use it to achieve safety and compliance. I'm here to
+
+ </details>
+
+ ---
+
+ [Drummer](https://huggingface.co/TheDrummer) proudly presents...
+
+ # Snowpiercer 15B v4 🚅
+
+ ![image](https://cdn-uploads.huggingface.co/production/uploads/65f2fd1c25b848bd061b5c2e/hpyQk-GEawD0IQjtgXWie.png)
+
+ ## Usage
+
+ - ChatML
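The card names ChatML as the prompt format: each turn is wrapped in `<|im_start|>{role}` and `<|im_end|>` markers, and generation is kicked off by an open assistant turn. A minimal plain-Python sketch of that format (the helper name and the sample conversation are made up for illustration):

```python
def chatml_prompt(messages, add_generation_prompt=True):
    """Format a list of {role, content} messages as a ChatML prompt string."""
    prompt = ""
    for m in messages:
        prompt += f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n"
    if add_generation_prompt:
        # Leave the assistant turn open so the model completes it.
        prompt += "<|im_start|>assistant\n"
    return prompt

messages = [
    {"role": "system", "content": "You are a storyteller."},
    {"role": "user", "content": "Begin the tale."},
]
print(chatml_prompt(messages))
```

In practice the bundled `chat_template.jinja` (and `tokenizer.apply_chat_template` in frontends that honor it) produces this same layout.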
+
+ # Description
+
+ > This is pretty good - feels better than prior Snowpiercer versions!
+
+ > It's better than any 12B model I've tried.
+
+ > It feels comparable to the last gen of 24Bs.
+
+ ![image](https://cdn-uploads.huggingface.co/production/uploads/65f2fd1c25b848bd061b5c2e/If4P05cHqj3NasKuD8fyE.png)
+
+ ![image](https://cdn-uploads.huggingface.co/production/uploads/65f2fd1c25b848bd061b5c2e/KEcEVyIhH0QCpwgkfjAYX.png)
100
+ ## Links
101
+ - Original: https://huggingface.co/TheDrummer/Snowpiercer-15B-v4
102
+ - GGUF: https://huggingface.co/TheDrummer/Snowpiercer-15B-v4-GGUF
103
+ - iMatrix (recommended): https://huggingface.co/bartowski/TheDrummer_Snowpiercer-15B-v4-GGUF
104
+ - EXL3: https://huggingface.co/ArtusDev/TheDrummer_Snowpiercer-15B-v4-EXL3
105
+
106
+ `config-v3c`
chat_template.jinja ADDED
@@ -0,0 +1,15 @@
+ {% if 'role' in messages[0] %}{% for message in messages %}{% if message['role'] == 'user' %}{{'<|im_start|>user
+ ' + message['content'] + '<|im_end|>
+ '}}{% elif message['role'] == 'assistant' %}{{'<|im_start|>assistant
+ ' + message['content'] + '<|im_end|>
+ ' }}{% else %}{{ '<|im_start|>system
+ ' + message['content'] + '<|im_end|>
+ ' }}{% endif %}{% endfor %}{% if add_generation_prompt %}{{ '<|im_start|>assistant
+ ' }}{% endif %}{% else %}{% for message in messages %}{% if message['from'] == 'human' %}{{'<|im_start|>user
+ ' + message['value'] + '<|im_end|>
+ '}}{% elif message['from'] == 'gpt' %}{{'<|im_start|>assistant
+ ' + message['value'] + '<|im_end|>
+ ' }}{% else %}{{ '<|im_start|>system
+ ' + message['value'] + '<|im_end|>
+ ' }}{% endif %}{% endfor %}{% if add_generation_prompt %}{{ '<|im_start|>assistant
+ ' }}{% endif %}{% endif %}
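The template accepts two message schemas: standard `role`/`content`, and ShareGPT-style `from`/`value` where `human` maps to a user turn, `gpt` to an assistant turn, and anything else falls through to a system turn. A plain-Python sketch of that dispatch logic (the function name is made up; this mirrors the template rather than executing it):

```python
def render_chatml(messages, add_generation_prompt=False):
    """Mirror the chat template's logic: accept either role/content or
    ShareGPT-style from/value messages and emit a ChatML prompt."""
    # The template branches on whether the first message has a 'role' key.
    sharegpt = "role" not in messages[0]
    role_map = {"human": "user", "gpt": "assistant"}
    out = []
    for m in messages:
        if sharegpt:
            role = role_map.get(m["from"], "system")  # else-branch -> system
            text = m["value"]
        else:
            role = m["role"] if m["role"] in ("user", "assistant") else "system"
            text = m["content"]
        out.append(f"<|im_start|>{role}\n{text}<|im_end|>\n")
    if add_generation_prompt:
        out.append("<|im_start|>assistant\n")
    return "".join(out)

# Both schemas should yield the same prompt:
a = render_chatml([{"role": "user", "content": "Hi"}], add_generation_prompt=True)
b = render_chatml([{"from": "human", "value": "Hi"}], add_generation_prompt=True)
```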
config.json ADDED
@@ -0,0 +1,41 @@
+ {
+   "architectures": [
+     "MistralForCausalLM"
+   ],
+   "attention_dropout": 0.0,
+   "bos_token_id": 1,
+   "dtype": "bfloat16",
+   "eos_token_id": 2,
+   "head_dim": 128,
+   "hidden_act": "silu",
+   "hidden_size": 5120,
+   "initializer_range": 0.02,
+   "intermediate_size": 14336,
+   "max_position_embeddings": 65536,
+   "model_type": "mistral",
+   "num_attention_heads": 32,
+   "num_hidden_layers": 50,
+   "num_key_value_heads": 8,
+   "pad_token_id": 10,
+   "rms_norm_eps": 1e-05,
+   "rope_scaling": null,
+   "rope_theta": 1000000.0,
+   "sliding_window": null,
+   "tie_word_embeddings": false,
+   "transformers_version": "4.57.1",
+   "unsloth_version": "2025.4.7",
+   "use_cache": false,
+   "vocab_size": 131072,
+   "quantization_config": {
+     "quant_method": "exl3",
+     "version": "0.0.14",
+     "bits": 6.0,
+     "head_bits": 6,
+     "calibration": {
+       "rows": 250,
+       "cols": 2048
+     },
+     "out_scales": "auto",
+     "codebook": "mcg"
+   }
+ }
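As a sanity check, the shape hyperparameters in this config are consistent with the "15B" in the model name. A back-of-the-envelope parameter count for a dense Mistral-style model with separate embedding and output matrices (standard q/k/v/o attention projections and a gated SiLU MLP, using the values above):

```python
# Shape hyperparameters taken from config.json above.
hidden, layers, vocab = 5120, 50, 131072
inter, n_heads, n_kv, head_dim = 14336, 32, 8, 128

attn = hidden * n_heads * head_dim      # q_proj
attn += 2 * hidden * n_kv * head_dim    # k_proj + v_proj (grouped-query: 8 KV heads)
attn += n_heads * head_dim * hidden     # o_proj
mlp = 3 * hidden * inter                # gate, up, and down projections
norms = 2 * hidden                      # input + post-attention RMSNorm
per_layer = attn + mlp + norms

# tie_word_embeddings is false, so embed_tokens and lm_head are separate.
embeddings = 2 * vocab * hidden
total = layers * per_layer + embeddings + hidden  # + final norm

print(f"{total / 1e9:.2f}B parameters")  # ≈ 14.97B
```

At the quantization config's 6.0 bits per weight, that is roughly 11 GB of quantized tensors, which lines up with the two safetensors shards below.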
generation_config.json ADDED
@@ -0,0 +1,11 @@
+ {
+   "_from_model_config": true,
+   "bos_token_id": 1,
+   "do_sample": true,
+   "eos_token_id": [
+     2
+   ],
+   "max_length": 65536,
+   "pad_token_id": 10,
+   "transformers_version": "4.57.1"
+ }
model-00001-of-00002.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:f5c7ce2e67855c8a16c172d66ebe2913b3b8208b9420c3f8de08e61b4607cc5e
+ size 8505773620
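The safetensors entries shown in this diff are git-lfs pointer files rather than the weights themselves: three `key value` lines giving the spec version, the sha256 of the real blob, and its size in bytes. A small sketch of parsing one (the function name is made up; the pointer text is the one shown above):

```python
def parse_lfs_pointer(text):
    """Parse a git-lfs pointer file into its version, sha256, and size."""
    # Each line is "key value"; split on the first space only.
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    return {
        "version": fields["version"],
        "sha256": fields["oid"].removeprefix("sha256:"),
        "size": int(fields["size"]),
    }

pointer = """\
version https://git-lfs.github.com/spec/v1
oid sha256:f5c7ce2e67855c8a16c172d66ebe2913b3b8208b9420c3f8de08e61b4607cc5e
size 8505773620
"""
info = parse_lfs_pointer(pointer)
print(f"shard size: {info['size'] / 1e9:.2f} GB")
```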
model-00002-of-00002.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:b8d1b4f9cf8bf75d0650fe2354ad5f1dbbc148fe491d6b96a119d82cfcab8391
+ size 3573710696
model.safetensors.index.json ADDED
The diff for this file is too large to render. See raw diff
 
quantization_config.json ADDED
The diff for this file is too large to render. See raw diff
 
special_tokens_map.json ADDED
@@ -0,0 +1,30 @@
+ {
+   "bos_token": {
+     "content": "<s>",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   },
+   "eos_token": {
+     "content": "<|im_end|>",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   },
+   "pad_token": {
+     "content": "<pad>",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   },
+   "unk_token": {
+     "content": "<unk>",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   }
+ }
tokenizer.json ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:41c323e23875139dce13b6e6eeb3c31e2f1d259d590cee328ba4793bd8b053cc
+ size 17078334
tokenizer_config.json ADDED
The diff for this file is too large to render. See raw diff