Update README.md

README.md (CHANGED)
@@ -27,7 +27,7 @@ tags:
 
 A specialized 4B parameter model fine-tuned for function calling and tool usage, optimized for local deployment with llama-cpp-python.
 
-##
+## Features
 
 - **4B Parameters** - Sweet spot for local deployment
 - **Function Calling** - Fine-tuned on 60K function calling examples
@@ -36,7 +36,7 @@ A specialized 4B parameter model fine-tuned for function calling and tool usage,
 - **Production Ready** - 0.518 training loss
 - **262K Context** - Large context window for complex tasks
 
-##
+## Model Details
 
 - **Base Model**: Qwen3-4B-Instruct-2507
 - **Fine-tuning**: LoRA on Salesforce xlam-function-calling-60k dataset
@@ -44,7 +44,7 @@ A specialized 4B parameter model fine-tuned for function calling and tool usage,
 - **Architecture**: Qwen3 with specialized tool calling tokens
 - **License**: Apache 2.0
 
-##
+## Installation
 
 ### Quick Install
 
@@ -93,7 +93,7 @@ CMAKE_ARGS="-DLLAMA_CUBLAS=on" pip install llama-cpp-python
 CMAKE_ARGS="-DLLAMA_BLAS=on -DLLAMA_BLAS_VENDOR=OpenBLAS" pip install llama-cpp-python
 ```
 
-##
+## Quick Start
 
 ### Option 1: Using the Run Script
 
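The Quick Start section named in this hunk drives the model through llama-cpp-python's OpenAI-style chat API. As a hedged illustration (the `get_weather` schema below is invented for the example, not taken from this repository), a tool-enabled request payload can be assembled as plain Python data:

```python
import json

# Hypothetical weather-tool schema in the OpenAI function-calling style;
# the name and parameters are illustrative, not taken from this repo.
weather_tool = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Get the current weather for a city",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}

# llama-cpp-python's create_chat_completion accepts an OpenAI-style
# messages/tools payload, so a request could be built like this:
request = {
    "messages": [{"role": "user", "content": "What's the weather in Paris?"}],
    "tools": [weather_tool],
}
print(json.dumps(request, indent=2))
```

The same dictionary shape works whether the model is served locally or behind an OpenAI-compatible endpoint.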
@@ -181,7 +181,7 @@ tool_calls = extract_tool_calls(response_text)
 print(f"Tool calls: {tool_calls}")
 ```
 
-##
+## Examples
 
 ### 1. Weather Tool Calling
 
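The hunk above carries `tool_calls = extract_tool_calls(response_text)` as context without showing the helper itself. A minimal sketch of such a parser, assuming the Qwen-style convention of wrapping each call in `<tool_call>...</tool_call>` tags around a JSON object (an assumption, not confirmed by this diff):

```python
import json
import re

def extract_tool_calls(response_text: str) -> list:
    """Parse JSON payloads out of <tool_call>...</tool_call> blocks.

    Assumes the Qwen-style tag convention; malformed JSON is skipped
    rather than raised, since model output is not guaranteed valid.
    """
    calls = []
    for block in re.findall(r"<tool_call>(.*?)</tool_call>", response_text, re.DOTALL):
        try:
            calls.append(json.loads(block))
        except json.JSONDecodeError:
            continue
    return calls

demo = '<tool_call>{"name": "get_weather", "arguments": {"city": "Paris"}}</tool_call>'
print(extract_tool_calls(demo))
# [{'name': 'get_weather', 'arguments': {'city': 'Paris'}}]
```

Skipping malformed blocks instead of raising keeps a chat loop alive when the model emits a truncated call.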
@@ -469,31 +469,11 @@ qwen3-4b-toolcall-llamacpp/
 └── .gitignore              # Git ignore file
 ```
 
-##
+## License
 
-
-2. Create a feature branch
-3. Make your changes
-4. Add tests if applicable
-5. Submit a pull request
+This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
 
-##
-
-This project is licensed under the Apache 2.0 License - see the [LICENSE](LICENSE) file for details.
-
-## Acknowledgments
-
-- **Qwen Team** - For the base Qwen3-4B-Instruct model
-- **Salesforce** - For the xlam-function-calling-60k dataset
-- **llama.cpp** - For the efficient inference engine
-- **Manojb** - For quantization and optimization
-
-## Support
-
-- **Issues**: [GitHub Issues](https://github.com/yourusername/qwen3-4b-toolcall-llamacpp/issues)
-- **Discussions**: [GitHub Discussions](https://github.com/yourusername/qwen3-4b-toolcall-llamacpp/discussions)
-
-## Related Projects
+## Related Projects
 
 - [llama-cpp-python](https://github.com/abetlen/llama-cpp-python) - Python bindings for llama.cpp
 - [Qwen3](https://huggingface.co/Qwen/Qwen3-4B-Instruct-2507) - Base model
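To round out the tool-calling workflow the README's Examples section describes: once tool calls have been parsed from the model's reply, a small dispatcher can route each one to a local Python function. A minimal sketch; the tool name and implementation are hypothetical, not part of this repository:

```python
import json

def get_weather(city: str) -> str:
    # Stand-in for a real weather lookup.
    return f"Sunny in {city}"

# Registry mapping tool names (as the model emits them) to callables.
TOOLS = {"get_weather": get_weather}

def dispatch(tool_call: dict) -> str:
    """Route one parsed tool call to its local implementation."""
    func = TOOLS[tool_call["name"]]
    args = tool_call["arguments"]
    if isinstance(args, str):  # some models emit arguments as a JSON string
        args = json.loads(args)
    return func(**args)

print(dispatch({"name": "get_weather", "arguments": {"city": "Paris"}}))
# Sunny in Paris
```

In a full chat loop the string returned by `dispatch` would be fed back to the model as a tool-result message.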