Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

EldritchLabs
/
Kraken-12B-v0

Text Generation
Transformers
Safetensors
NeMo
English
mistral
creative
creative writing
fiction writing
plot generation
sub-plot generation
story generation
scene continue
storytelling
fiction story
science fiction
romance
all genres
story
writing
vivid prosing
vivid writing
fiction
roleplaying
float32
swearing
rp
horror
della
Merge
mergekit
conversational
text-generation-inference
Model card Files Files and versions
xet
Community
1

Instructions to use EldritchLabs/Kraken-12B-v0 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • Transformers

    How to use EldritchLabs/Kraken-12B-v0 with Transformers:

    # Use a pipeline as a high-level helper
    from transformers import pipeline
    
    pipe = pipeline("text-generation", model="EldritchLabs/Kraken-12B-v0")
    messages = [
        {"role": "user", "content": "Who are you?"},
    ]
    pipe(messages)
    # Load model directly
    from transformers import AutoTokenizer, AutoModelForCausalLM
    
    tokenizer = AutoTokenizer.from_pretrained("EldritchLabs/Kraken-12B-v0")
    model = AutoModelForCausalLM.from_pretrained("EldritchLabs/Kraken-12B-v0")
    messages = [
        {"role": "user", "content": "Who are you?"},
    ]
    inputs = tokenizer.apply_chat_template(
    	messages,
    	add_generation_prompt=True,
    	tokenize=True,
    	return_dict=True,
    	return_tensors="pt",
    ).to(model.device)
    
    outputs = model.generate(**inputs, max_new_tokens=40)
    print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
  • NeMo

    How to use EldritchLabs/Kraken-12B-v0 with NeMo:

    # tag did not correspond to a valid NeMo domain.
  • Inference
  • Notebooks
  • Google Colab
  • Kaggle
  • Local Apps
  • vLLM

    How to use EldritchLabs/Kraken-12B-v0 with vLLM:

    Install from pip and serve model
    # Install vLLM from pip:
    pip install vllm
    # Start the vLLM server:
    vllm serve "EldritchLabs/Kraken-12B-v0"
    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:8000/v1/chat/completions" \
    	-H "Content-Type: application/json" \
    	--data '{
    		"model": "EldritchLabs/Kraken-12B-v0",
    		"messages": [
    			{
    				"role": "user",
    				"content": "What is the capital of France?"
    			}
    		]
    	}'
    Use Docker
    docker model run hf.co/EldritchLabs/Kraken-12B-v0
  • SGLang

    How to use EldritchLabs/Kraken-12B-v0 with SGLang:

    Install from pip and serve model
    # Install SGLang from pip:
    pip install sglang
    # Start the SGLang server:
    python3 -m sglang.launch_server \
        --model-path "EldritchLabs/Kraken-12B-v0" \
        --host 0.0.0.0 \
        --port 30000
    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:30000/v1/chat/completions" \
    	-H "Content-Type: application/json" \
    	--data '{
    		"model": "EldritchLabs/Kraken-12B-v0",
    		"messages": [
    			{
    				"role": "user",
    				"content": "What is the capital of France?"
    			}
    		]
    	}'
    Use Docker images
    docker run --gpus all \
        --shm-size 32g \
        -p 30000:30000 \
        -v ~/.cache/huggingface:/root/.cache/huggingface \
        --env "HF_TOKEN=<secret>" \
        --ipc=host \
        lmsysorg/sglang:latest \
        python3 -m sglang.launch_server \
            --model-path "EldritchLabs/Kraken-12B-v0" \
            --host 0.0.0.0 \
            --port 30000
    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:30000/v1/chat/completions" \
    	-H "Content-Type: application/json" \
    	--data '{
    		"model": "EldritchLabs/Kraken-12B-v0",
    		"messages": [
    			{
    				"role": "user",
    				"content": "What is the capital of France?"
    			}
    		]
    	}'
  • Docker Model Runner

    How to use EldritchLabs/Kraken-12B-v0 with Docker Model Runner:

    docker model run hf.co/EldritchLabs/Kraken-12B-v0
Kraken-12B-v0
24.5 GB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 13 commits
Naphula's picture
Naphula
Update README.md
aaf4c59 verified 2 months ago
  • .gitattributes
    1.67 kB
    Upload 10 files 3 months ago
  • Kraken.png
    1.65 MB
    xet
    Upload 10 files 3 months ago
  • README.md
    64 kB
    Update README.md 2 months ago
  • chat_template.jinja
    203 Bytes
    Upload 10 files 3 months ago
  • config.json
    639 Bytes
    Upload 10 files 3 months ago
  • kraken_audit.png
    225 kB
    xet
    Upload 10 files 3 months ago
  • mergekit_config.yml
    22.1 kB
    Upload 10 files 3 months ago
  • model-00001-of-00005.safetensors
    4.87 GB
    xet
    Upload model-00001-of-00005.safetensors with huggingface_hub 3 months ago
  • model-00002-of-00005.safetensors
    4.91 GB
    xet
    Upload model-00002-of-00005.safetensors with huggingface_hub 3 months ago
  • model-00003-of-00005.safetensors
    4.91 GB
    xet
    Upload model-00003-of-00005.safetensors with huggingface_hub 3 months ago
  • model-00004-of-00005.safetensors
    4.91 GB
    xet
    Upload model-00004-of-00005.safetensors with huggingface_hub 3 months ago
  • model-00005-of-00005.safetensors
    4.91 GB
    xet
    Upload model-00005-of-00005.safetensors with huggingface_hub 3 months ago
  • model.safetensors.index.json
    30.3 kB
    Upload 10 files 3 months ago
  • special_tokens_map.json
    443 Bytes
    Upload 10 files 3 months ago
  • tokenizer.json
    17.1 MB
    xet
    Upload 10 files 3 months ago
  • tokenizer_config.json
    188 kB
    Upload 10 files 3 months ago