Improve model card: Add metadata, paper link, and detailed description
#1 opened by nielsr (HF Staff)
This PR significantly improves the model card for the Co-rewarding-I: Qwen3-8B-Base model by adding essential metadata and enriching its content.
Key updates include:
- Adding `pipeline_tag: text-generation` to categorize the model for text generation tasks, improving discoverability.
- Specifying `library_name: transformers`, as the model uses a `Qwen3ForCausalLM` architecture and is compatible with the Hugging Face Transformers library.
- Including `license: apache-2.0`, based on common practices for similar models and the consensus from colleagues.
- Updating the main title of the model card to reflect the associated paper: Co-rewarding: Stable Self-supervised RL for Eliciting Reasoning in Large Language Models.
- Providing a more detailed description of the model, summarizing its context and contribution from the paper's abstract.
- Ensuring clear links to the paper on Hugging Face and the original GitHub repository.
- Adding the BibTeX citation for proper academic attribution.
These changes make the model card more informative, discoverable, and user-friendly on the Hugging Face Hub.
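For reference, the three metadata keys listed above would typically sit in the YAML front matter at the top of the model card's `README.md`. The following is a minimal sketch that assumes only those three keys; the merged card may order them differently or contain additional fields.

```yaml
---
# Sketch of the model card front matter, limited to the keys named in this PR.
# Any other fields in the actual card are not shown here.
pipeline_tag: text-generation   # categorizes the model for text generation tasks
library_name: transformers      # model uses the Qwen3ForCausalLM architecture
license: apache-2.0
---
```

The Hub reads this block to populate the model's task filter, library tag, and license badge, which is why these keys affect discoverability.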
resistz changed pull request status to merged