umd-zhou-lab/controllable-wizardlm-7b
Text Generation • 7B
Machine Learning, Natural Language Processing, Optimization, Multi-Modality, Artificial Intelligence
Multi-Turn Reflective Masking Elicits Reasoning in Mask Diffusion Models
Skip a Layer or Loop It? Learning Program-of-Layers in LLMs