Update README.md

We introduce two innovative techniques: Gating Logit Normalization, which enhances expert diversification, and Adaptive Auxiliary Loss Coefficients, which allow for layer-specific adjustment of auxiliary loss coefficients.
Skywork-MoE demonstrates comparable or superior performance to models with more total parameters or more activated parameters, such as Grok-1, DBRX, Mixtral 8x22B, and DeepSeek-V2.

# News and Updates

* 2024.6.3 We release the **Skywork-MoE-Base** model.

# Table of contents

- [👨💻Benchmark Results](#Benchmark-Results)
- [🏆Demonstration of Hugging Face Model Inference](#Demonstration-of-HuggingFace-Model-Inference)
- [📕Demonstration of vLLM Model Inference](#Demonstration-of-vLLM-Model-Inference)
- [🤝Contact Us and Citation](#Contact-Us-and-Citation)

# Download URL

|  | HuggingFace Model | ModelScope Model | Wisemodel Model |
|:-------:|:-----------:|:-----------------------------:|:-----------------------------:|
| **Skywork-MoE-Base** | 🤗 [Skywork-MoE-Base](https://github.com/SkyworkAI/Skywork-MoE) | 🤖 [Skywork-MoE-Base](https://www.modelscope.cn/models/skywork/Skywork-MoE-base) | 👾 [Skywork-MoE-Base](https://wisemodel.cn/models/Skywork/Skywork-MoE-base) |
| **Skywork-MoE-Base-FP8** | 🤗 [Skywork-MoE-Base-FP8](https://github.com/SkyworkAI/Skywork-MoE) | 🤖 | 👾 |

# Benchmark Results

We evaluated the Skywork-MoE-Base model on various popular benchmarks, including C-Eval, MMLU, CMMLU, GSM8K, MATH, and HumanEval.

<img src="misc/skywork_moe_base_evaluation.png" alt="Image" width="600" height="280">

## Quickstart with vLLM

We provide a method to quickly deploy the Skywork-MoE-Base model with vLLM.

Under FP8 precision, you can run Skywork-MoE-Base on just 8 x RTX 4090 GPUs.

You can get the source code from [`vllm`](https://github.com/SkyworkAI/vllm).
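
To make the quickstart concrete, here is a minimal offline-inference sketch using vLLM's Python API. The model path, the `quantization="fp8"` flag, and `tensor_parallel_size=8` (matching the 8 x RTX 4090 setup above) are illustrative assumptions, not commands taken from this repository.

```python
from vllm import LLM, SamplingParams

# Minimal sketch: the model id / local checkpoint path and the flags below
# are assumptions; adjust them to your environment and to the SkyworkAI
# vllm fork linked above.
llm = LLM(
    model="Skywork/Skywork-MoE-Base-FP8",  # assumed id; replace with your local path
    quantization="fp8",                    # FP8 weights, as discussed above
    tensor_parallel_size=8,                # e.g. 8 x RTX 4090
    trust_remote_code=True,
)

params = SamplingParams(temperature=0.8, max_tokens=64)
outputs = llm.generate(["The mixture-of-experts architecture"], params)
print(outputs[0].outputs[0].text)
```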

```
docker run \
    ...
    registry.cn-wulanchabu.aliyuncs.com/triple-mu/skywork-moe-vllm:v1
```

Now you can run the Skywork-MoE model for fun!

### Text Completion
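
As a sketch of what a completion request could look like once the container is up, assuming the container serves a vLLM OpenAI-compatible endpoint on port 8000 and registers the model under the name `Skywork-MoE-Base` (both assumptions):

```python
from openai import OpenAI

# Assumption: the docker image above exposes a vLLM OpenAI-compatible
# server on localhost:8000; vLLM servers accept any placeholder API key.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

completion = client.completions.create(
    model="Skywork-MoE-Base",        # assumed served model name
    prompt="The capital of France is",
    max_tokens=32,
    temperature=0.8,
)
print(completion.choices[0].text)
```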