# DeepSeek V3.1
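First, convert the Hugging Face model weights to this project's format. Set `HF_CKPT_PATH` (the downloaded Hugging Face checkpoint), `SAVE_PATH` (the output directory), and `MP` (the model-parallel degree) in your shell beforehand.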
```bash
export EXPERTS=256
python convert.py --hf-ckpt-path ${HF_CKPT_PATH} --save-path ${SAVE_PATH} --n-experts ${EXPERTS} --model-parallel ${MP}
```
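The command above depends on several shell variables, and an unset one expands to an empty string, leaving the flag without a value. A minimal sketch of a pre-flight check is shown below; `require_vars` is a hypothetical helper written for illustration and is not part of this project.

```bash
# Hypothetical helper (not part of this project): succeed only if every
# named variable is set and non-empty; report the missing ones on stderr.
# Pass trusted variable names only, since the lookup goes through eval.
require_vars() {
    missing=0
    for name in "$@"; do
        # Indirect lookup: copy the value of the variable called $name.
        eval "value=\${${name}:-}"
        if [ -z "${value}" ]; then
            echo "missing: ${name}" >&2
            missing=1
        fi
    done
    return "${missing}"
}

# Usage: stop before the conversion if anything is unset.
# require_vars HF_CKPT_PATH SAVE_PATH EXPERTS MP || exit 1
```

The helper uses only POSIX shell features, so it should behave the same under `bash` and `sh`. Note that it leaves `missing`, `name`, and `value` set in the calling shell.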
Then chat with the DeepSeek model interactively:
```bash
export CONFIG=config_671B_v3.1.json
# T is the sampling temperature; set it in your shell before running.
torchrun --nproc-per-node ${MP} generate.py --ckpt-path ${SAVE_PATH} --config ${CONFIG} --interactive --temperature ${T}
```