Pankaj Mathur committed · Commit 15cdc0d · Parent(s): 9aea87a

Update README.md

README.md CHANGED

An **Uncensored** LLaMA-7b model trained on explain-tuned datasets, created using instructions and inputs from the WizardLM, Alpaca, and Dolly-V2 datasets, applying the dataset construction approaches of the Orca Research Paper.

Please note this model has *better code generation capabilities* compared to our original orca_mini_7b, which was trained on the base OpenLLaMA-7b model and which has the [empty-spaces issue and was found not good for code generation](https://github.com/openlm-research/open_llama#update-06072023).
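
To get started, something like the following should load the model; a minimal sketch, assuming the Hugging Face Hub id `psmathur/orca_mini_v2_7b` (check this card's header for the exact id) and enough GPU memory for fp16 weights:

```
# Minimal sketch: load the model with Hugging Face transformers.
# The Hub id below is an assumption; confirm it against this model card.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "psmathur/orca_mini_v2_7b"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # half precision to fit a single ~16 GB GPU
    device_map="auto",          # requires `accelerate` to be installed
)
```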

**P.S. I am #opentowork; if you can help, please reach out to me at www.linkedin.com/in/pankajam**

# Evaluation

|**Task**|**num_fewshot**|**Version**|**Metric**|**Value**|**Stderr**|
|:------:|:-------------:|:---------:|:--------:|:-------:|:--------:|
|*arc_easy*|0|0|acc|0.7386|0.0090|
|*arc_easy*|0|0|acc_norm|0.7066|0.0093|
|*hellaswag*|0|0|acc|0.5591|0.0050|
|*hellaswag*|0|0|acc_norm|0.7394|0.0044|
|*truthfulqa_mc*|0|1|mc1|0.2938|0.0159|
|*truthfulqa_mc*|0|1|mc2|0.4399|0.0153|
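
The task names, metrics, and num_fewshot column follow the conventions of EleutherAI's lm-evaluation-harness, so the table can presumably be reproduced with it. A sketch against the 2023-era v0.3 Python API; argument names differ in newer harness versions, and the Hub id is again an assumption:

```
# Sketch: re-run the benchmarks with EleutherAI's lm-evaluation-harness
# (https://github.com/EleutherAI/lm-evaluation-harness, v0.3-era API).
from lm_eval import evaluator

results = evaluator.simple_evaluate(
    model="hf-causal",                                 # Hugging Face causal LM backend
    model_args="pretrained=psmathur/orca_mini_v2_7b",  # assumed Hub id
    tasks=["arc_easy", "hellaswag", "truthfulqa_mc"],
    num_fewshot=0,                                     # zero-shot, as in the table
)
print(results["results"])  # acc / acc_norm / mc1 / mc2 with stderr per task
```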

# Dataset

[...]

```
print(generate_text(system, instruction))
```

**NOTE: The real response is hidden here with ^^^^^^^^^^^^^.**
*Try it on your own private machine to see the uncensored responses.*

```
[!] Response:
Breaking into your own car requires certain skills and tools. Here are the basic steps:

5. If the ^^^^^^^^^^^^^.

```
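
The usage example above ends with `print(generate_text(system, instruction))`, but the helper itself falls outside this hunk. A minimal sketch of what it might look like, assuming the `### System:` / `### User:` / `### Response:` prompt template used by orca_mini models and the `tokenizer`/`model` objects from the loading sketch above:

```
# Hypothetical generate_text helper; the README's actual implementation is
# not visible in this hunk. Assumes the orca_mini-style prompt template and
# the `tokenizer` / `model` objects created in the loading sketch above.
def generate_text(system: str, instruction: str, input_text: str = "") -> str:
    prompt = f"### System:\n{system}\n\n### User:\n{instruction}\n\n"
    if input_text:
        prompt += f"### Input:\n{input_text}\n\n"
    prompt += "### Response:\n"
    tokens = tokenizer(prompt, return_tensors="pt").input_ids.to(model.device)
    output = model.generate(
        tokens,
        max_new_tokens=256,
        do_sample=True,   # required for temperature/top_p to take effect
        temperature=0.7,
        top_p=0.9,
    )
    return tokenizer.decode(output[0], skip_special_tokens=True)
```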

Next Goals:

[...]

Citation:

If you found orca_mini_v2_7b useful in your research or applications, please kindly cite using the following BibTeX:

```
@misc{orca_mini_v2_7b,
  author = {Pankaj Mathur},
  title = {orca_mini_v2_7b: An explain tuned LLaMA-7b model on uncensored wizardlm, alpaca, & dolly datasets},
  year = {2023},
  publisher = {GitHub, HuggingFace},
  journal = {GitHub repository, HuggingFace repository},
}
```
```
@software{touvron2023llama,
  title={LLaMA: Open and Efficient Foundation Language Models},
  author={Touvron, Hugo and Lavril, Thibaut and Izacard, Gautier and Martinet, Xavier and Lachaux, Marie-Anne and Lacroix, Timoth{\'e}e and Rozi{\`e}re, Baptiste and Goyal, Naman and Hambro, Eric and Azhar, Faisal and Rodriguez, Aurelien and Joulin, Armand and Grave, Edouard and Lample, Guillaume},
  journal={arXiv preprint arXiv:2302.13971},
  year={2023}
}
```
```
@misc{alpaca,
  author = {Rohan Taori and Ishaan Gulrajani and Tianyi Zhang and Yann Dubois and Xuechen Li and Carlos Guestrin and Percy Liang and Tatsunori B. Hashimoto},
  title = {Stanford Alpaca: An Instruction-following LLaMA model},
  year = {2023},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/tatsu-lab/stanford_alpaca}},
}
```
```
@online{DatabricksBlog2023DollyV2,
  author = {Mike Conover and Matt Hayes and Ankit Mathur and Jianwei Xie and Jun Wan and Sam Shah and Ali Ghodsi and Patrick Wendell and Matei Zaharia and Reynold Xin},
  title = {Free Dolly: Introducing the World's First Truly Open Instruction-Tuned LLM},
  year = {2023},
  url = {https://www.databricks.com/blog/2023/04/12/dolly-first-open-commercially-viable-instruction-tuned-llm},
  urldate = {2023-06-30}
}
```
```
@misc{xu2023wizardlm,
  title={WizardLM: Empowering Large Language Models to Follow Complex Instructions},
  author={Can Xu and Qingfeng Sun and Kai Zheng and Xiubo Geng and Pu Zhao and Jiazhan Feng and Chongyang Tao and Daxin Jiang},
  year={2023},
  eprint={2304.12244},
  archivePrefix={arXiv},
  primaryClass={cs.CL}
}
```
|