Commit 8c92147 · Update README.md
Parent: c271818
README.md (CHANGED)
@@ -11,7 +11,7 @@ tags:
 # MPT-7B-Instruct
 
 MPT-7B-Instruct is a model for short-form instruction following.
-It is built by finetuning [MPT-7B
+It is built by finetuning [MPT-7B](https://huggingface.co/spaces/mosaicml/mpt-7b) on a [dataset](https://huggingface.co/datasets/sam-mosaic/dolly_hhrlhf) derived from the [Databricks Dolly-15k](https://huggingface.co/datasets/databricks/databricks-dolly-15k) and the [Anthropic Helpful and Harmless (HH-RLHF)](https://huggingface.co/datasets/Anthropic/hh-rlhf) datasets.
 * License: _CC-By-SA-3.0_ (commercial use permitted)
 * [Online Demo](https://huggingface.co/spaces/mosaicml/mpt-7b-instruct)
 
@@ -99,10 +99,30 @@ For more details on the pretraining process, see [MPT-7B](https://huggingface.co
 
 The data was tokenized using the [EleutherAI/gpt-neox-20b](https://huggingface.co/EleutherAI/gpt-neox-20b) tokenizer.
 
-##
+## Limitations and Biases
 
+_The following language is modified from [EleutherAI's GPT-NeoX-20B](https://huggingface.co/EleutherAI/gpt-neox-20b)_
+
+MPT-7B-Instruct can produce factually incorrect output, and should not be relied on to produce factually accurate information.
+MPT-7B-Instruct was trained on various public datasets.
+While great efforts have been taken to clean the pretraining data, it is possible that this model could generate lewd, biased or otherwise offensive outputs.
+
-This model was finetuned on 440 A100-40GBs for about half a day using the [MosaicML Platform](https://www.mosaicml.com/platform).
 
 ## Acknowledgements
 
-This model was finetuned by Sam Havens and the MosaicML NLP team
+This model was finetuned by Sam Havens and the MosaicML NLP team.
+
+## Citation
+
+Please cite this model using the following format:
+
+```
+@online{MosaicML2023Introducing,
+    author    = {MosaicML NLP Team},
+    title     = {Introducing MPT-7B: A New Standard for Open-Source, Commercially Usable LLMs},
+    year      = {2023},
+    url       = {www.mosaicml.com/blog/mpt-7b},
+    note      = {Accessed: 2023-03-28}, % change this date
+    urldate   = {2023-03-28} % change this date
+}
+```
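
For readers who want to try the pieces the updated card names, here is a minimal sketch of loading the referenced tokenizer and finetuning dataset, assuming the `transformers` and `datasets` Python libraries are installed; the `train` split and the `prompt` field name are assumptions for illustration, not taken from the card.

```python
# Minimal sketch (not from the model card): load the tokenizer and the
# finetuning dataset the README references.
# Assumes: pip install transformers datasets
from transformers import AutoTokenizer
from datasets import load_dataset

# The card states the data was tokenized with the EleutherAI/gpt-neox-20b tokenizer.
tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-neox-20b")

# The finetuning dataset derived from Dolly-15k and HH-RLHF, as linked in the card.
# The "train" split name is an assumption.
dataset = load_dataset("sam-mosaic/dolly_hhrlhf", split="train")

# Tokenize one example to confirm the pieces fit together;
# the "prompt" column name is an assumption about this dataset's schema.
example = dataset[0]
tokens = tokenizer(example["prompt"])
print(len(tokens["input_ids"]))
```

Reusing the GPT-NeoX-20B tokenizer, rather than training a new one, keeps the finetuning data tokenized consistently with MPT-7B's pretraining.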
|