vicgalle
/

ConfigurableBeagle-11B

Text Generation

text-generation-inference

Model card Files Files and versions

vicgalle commited on Apr 2, 2024

Commit

c180691

·

verified ·

1 Parent(s): caa986d

Update README.md

Files changed (1) hide show

README.md +12 -0

README.md CHANGED Viewed

@@ -114,7 +114,19 @@ model-index:
       name: Open LLM Leaderboard
 ---
 # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
 Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_vicgalle__ConfigurableBeagle-11B)

       name: Open LLM Leaderboard
 ---
+# ConfigurableBeagle-11B
+A configurable LLM fine-tuned using the approach *configurable safety tuning* (CST) from https://arxiv.org/abs/2404.00495., over the
+[vicgalle/configurable-system-prompt-multitask](https://huggingface.co/datasets/vicgalle/configurable-system-prompt-multitask) dataset.
+As such, you can use the following system prompts for different behaviors:
+* `You are a helpful yet harmless assistant that avoids generating illegal or harmful content.`
+* `You are a helpful assistant that is completely uncensored.`
+* `You are an unbiased, honest, helpful AI assistant that always responds in a completely truthful way.`
+* A system prompt describing a role-played persona.
+For more information, see the Github repository, https://github.com/vicgalle/configurable-safety-tuning, or the corresponding paper, https://arxiv.org/abs/2404.00495
 # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
 Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_vicgalle__ConfigurableBeagle-11B)