Darwin-Qwen3-4B / README.md

Update README.md

9c9b6d3 verified 3 months ago

4.48 kB

	---
	license: apache-2.0
	tags:
	- merge
	- evolutionary
	- Darwin-A2AP
	base_model:
	- Qwen/Qwen3-4B-Instruct-2507
	- Qwen/Qwen3-4B-Thinking-2507
	---

	<div align="center">
	<span style="font-family: default; font-size: 1.5em;">Darwin-Qwen3-4B</span>
	<div>
	evolutionary algorithm 'Darwin A2AP' 🤔
	</div>
	</div>
	<br>
	<div align="center" style="line-height: 1;">
	<a href=" https://discord.gg/openfreeai" style="margin: 2px;">
	<img alt="OpenFree AI Discord Server" src="https://img.shields.io/badge/Discord-000000?style=for-the-badge&logo=discord&logoColor=000&logoColor=white" style="display: inline-block; vertical-align: middle;"/>
	</a>
	<a href="https://huggingface.co/VIDraft" style="margin: 2px;">
	<img alt="HF Page" src="https://img.shields.io/badge/VIDraft-fcd022?style=for-the-badge&logo=huggingface&logoColor=000&labelColor" style="display: inline-block; vertical-align: middle;"/>
	</a>
	</div>

	# openfree/Darwin-Qwen3-4B
	This model is automatically merged using evolutionary algorithm 'Darwin A2AP' v3.2

	# Overview
	This study introduces a new paradigm of AI model fusion. Traditional "model merging" techniques have been restricted to models of the same family (e.g., transformer-based LLMs). We transcend this limitation by proposing a method to directly collide and fuse the core representational structures (DNA) of entirely different species — such as transformers and diffusion models. This approach acts as an "AI particle accelerator," colliding fundamentally distinct elements of intelligence to uncover new possibilities.
	The paper and source code (to be released on GitHub and Hugging Face) are currently under preparation and will be made publicly available soon. They will be released in a reproducible and extensible form for anyone to explore.

	## Contribution
	Breaking the Species Barrier
	Fusion of fundamentally different models such as transformers and diffusion architectures.
	Realization of cross-species model merging once deemed impossible.

	## AI Embryo Creation
	Formation of an initial “AI embryo” based on fused DNA.
	The embryo is not confined to a single domain or function but serves as the foundation for multi-capability intelligence.

	## Virtual Evolutionary Environment
	AI embryos are placed into a simulated environment spanning thousands of generations.
	Through survival and adaptation, natural selection drives evolution beyond the limitations of parent models, producing new offspring models.

	## Merge Information
	Father Model 1: Qwen/Qwen3-4B-Instruct-2507
	Mother Model 2: Qwen/Qwen3-4B-Thinking-2507
	Validation Task Accuracy: 88.56%
	Note: The above accuracy is a proxy metric used for merge ratio optimization.
	Algorithm Version: Darwin A2AP Enhanced v3.2

	## ⚠️ Notice
	The actual language generation performance of this model requires separate evaluation.
	The validation score above is not an LLM benchmark score.

	## ⚠️ Benchmarking Test Results
	<p align="center"> <img src="BenchmarkResult.png" alt="Darwin-Qwen3-4B BenchMark Result" width="600"/> </p>


	## Use_Example

	```python
	from transformers import AutoModelForCausalLM, AutoTokenizer

	model = AutoModelForCausalLM.from_pretrained("openfree/Darwin-Qwen3-4B")
	tokenizer = AutoTokenizer.from_pretrained("openfree/Darwin-Qwen3-4B")

	# 추론 예시
	inputs = tokenizer("Hello, how are you?", return_tensors="pt")
	outputs = model.generate(**inputs)
	```


	# Strengths & Features
	## Cross-Domain Intelligence
	Example: Legal LLM × Medical LLM → instantly produces a “Forensic LLM.”
	This is not mere knowledge aggregation but the creation of new intelligence at the intersection of domains.

	## Extreme Efficiency
	Achieves results at roughly 1/10,000 of the time and cost compared to training a new foundation model.
	Accessible via a simple click-based process.

	## Unified Intelligence
	Escapes confinement to a single domain by organically merging multiple expertises.
	Provides an experimental basis for integrated reasoning and creativity with AGI-like qualities.

	## Reproducibility & Openness
	Source code and models will be fully released on GitHub and Hugging Face.
	Researchers and developers can freely reproduce, experiment, and expand.

	# Outlook
	This research opens the door to a new generation of model creation, expressed as “Foundation a + Foundation b = Foundation abXc.” It represents far more than a reduction in training costs, serving as a critical turning point for future studies on the evolution and fusion of AI intelligence.