Instructions to use prithivMLmods/Megatron-Corpus-14B-Exp with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use prithivMLmods/Megatron-Corpus-14B-Exp with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="prithivMLmods/Megatron-Corpus-14B-Exp") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("prithivMLmods/Megatron-Corpus-14B-Exp") model = AutoModelForCausalLM.from_pretrained("prithivMLmods/Megatron-Corpus-14B-Exp") messages = [ {"role": "user", "content": "Who are you?"}, ] inputs = tokenizer.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use prithivMLmods/Megatron-Corpus-14B-Exp with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "prithivMLmods/Megatron-Corpus-14B-Exp" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "prithivMLmods/Megatron-Corpus-14B-Exp", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/prithivMLmods/Megatron-Corpus-14B-Exp
- SGLang
How to use prithivMLmods/Megatron-Corpus-14B-Exp with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "prithivMLmods/Megatron-Corpus-14B-Exp" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "prithivMLmods/Megatron-Corpus-14B-Exp", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "prithivMLmods/Megatron-Corpus-14B-Exp" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "prithivMLmods/Megatron-Corpus-14B-Exp", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use prithivMLmods/Megatron-Corpus-14B-Exp with Docker Model Runner:
docker model run hf.co/prithivMLmods/Megatron-Corpus-14B-Exp
Megatron-Corpus-14B-Exp
Megatron-Corpus-14B-Exp is based on the Qwen 2.5 14B modality architecture, designed to enhance the reasoning capabilities of 14B-parameter models. It has been fine-tuned on a synthetic dataset based on math corpus, further optimizing its chain-of-thought (CoT) reasoning and logical problem-solving abilities. The model demonstrates significant improvements in context understanding, structured data processing, and long-context comprehension, making it ideal for complex reasoning tasks, instruction-following, and text generation.
Key Improvements
- Advanced Reasoning & Logic: Optimized for multi-step problem-solving, logical deduction, and contextual analysis.
- Fine-Tuned Instruction Following: Generates precise responses, structured outputs (e.g., JSON), and extended long-form text (8K+ tokens).
- Greater Adaptability: Excels in role-playing, multi-turn dialogues, and diverse system prompts.
- Long-Context Support: Handles up to 128K tokens and generates up to 8K tokens per output.
- Multilingual Proficiency: Supports over 29 languages, including Chinese, English, French, Spanish, Portuguese, German, and more.
Quickstart with Transformers
from transformers import AutoModelForCausalLM, AutoTokenizer
model_name = "prithivMLmods/Megatron-Corpus-14B-Exp"
model = AutoModelForCausalLM.from_pretrained(
model_name,
torch_dtype="auto",
device_map="auto",
trust_remote_code=True
)
tokenizer = AutoTokenizer.from_pretrained(model_name)
prompt = "Explain the concept of logical reasoning in AI."
messages = [
{"role": "system", "content": "You are an expert AI assistant specialized in reasoning and logic."},
{"role": "user", "content": prompt}
]
text = tokenizer.apply_chat_template(
messages,
tokenize=False,
add_generation_prompt=True
)
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)
generated_ids = model.generate(
**model_inputs,
max_new_tokens=512
)
generated_ids = [
output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]
response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
print(response)
Intended Use
- Advanced Logical & Analytical Reasoning: Designed for problem-solving, multi-step deductions, and cognitive reasoning tasks.
- Mathematical & Scientific Computation: Supports theorem proving, complex calculations, and scientific knowledge retrieval.
- Code Generation & Debugging: Generates optimized code, detects errors, and improves programming workflows.
- Structured Data Analysis: Processes tables, JSON, and structured formats for data-centric applications.
- Multilingual Reasoning & Translation: High proficiency across 29+ languages for international applications.
- Extended Text Generation: Capable of generating research papers, instructional guides, and in-depth reports.
Limitations
- High Computational Requirements: Due to its 14B parameters and 128K context support, it requires powerful GPUs or TPUs for efficient inference.
- Language-Specific Variability: Performance may differ across supported languages, especially for low-resource languages.
- Potential Error Accumulation: Long-form text generation can introduce inconsistencies over extended outputs.
- Limited Real-World Awareness: Knowledge is restricted to training data and may not reflect recent world events.
- Prompt Sensitivity: The quality of responses depends on the specificity and clarity of the input prompt.
Open LLM Leaderboard Evaluation Results
Detailed results can be found here! Summarized results can be found here!
| Metric | Value (%) |
|---|---|
| Average | 35.55 |
| IFEval (0-Shot) | 49.83 |
| BBH (3-Shot) | 47.92 |
| MATH Lvl 5 (4-Shot) | 34.29 |
| GPQA (0-shot) | 15.10 |
| MuSR (0-shot) | 18.85 |
| MMLU-PRO (5-shot) | 47.33 |
- Downloads last month
- 15
Model tree for prithivMLmods/Megatron-Corpus-14B-Exp
Base model
Qwen/Qwen2.5-14BCollections including prithivMLmods/Megatron-Corpus-14B-Exp
Evaluation results
- averaged accuracy on IFEval (0-Shot)Open LLM Leaderboard49.830
- normalized accuracy on BBH (3-Shot)test set Open LLM Leaderboard47.920
- exact match on MATH Lvl 5 (4-Shot)test set Open LLM Leaderboard34.290
- acc_norm on GPQA (0-shot)Open LLM Leaderboard15.100
- acc_norm on MuSR (0-shot)Open LLM Leaderboard18.850
- accuracy on MMLU-PRO (5-shot)test set Open LLM Leaderboard47.330
