mzbac commited on
Commit
81743c1
·
verified ·
1 Parent(s): dca21a3

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +20 -0
README.md CHANGED
@@ -1,3 +1,23 @@
1
  ---
2
  license: mit
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: mit
3
  ---
4
+
5
+ # Qlora adapters for Mixtral-8x7B-v0.1-hf-4bit-mlx
6
+
7
+ ## fine-tuned on guanaco dataset
8
+
9
+ ## inference vis mlx-lm
10
+ ```
11
+ from mlx_lm import load, generate
12
+
13
+ model, tokenizer = load("mlx-community/Mixtral-8x7B-v0.1-hf-4bit-mlx",adapter_file="adapters.npz")
14
+
15
+ generate(model=model, tokenizer=tokenizer, prompt="### Human: write a quick sort in python.\n### Assistant: ", max_tokens=500, verbose=True,temp=0.3)
16
+ ```
17
+
18
+ ## serve as an API Service
19
+
20
+ ```
21
+ pip install mlx-llm-server
22
+ mlx-llm-server --model-path mlx-community/Mixtral-8x7B-v0.1-hf-4bit-mlx --adapter-file adapters.npz
23
+ ```