How to use Qwen/Qwen-Image-Edit-2511 with Diffusers:
pip install -U diffusers transformers accelerate
import torch
from diffusers import DiffusionPipeline
from diffusers.utils import load_image

# switch to "mps" for apple devices
pipe = DiffusionPipeline.from_pretrained("Qwen/Qwen-Image-Edit-2511", dtype=torch.bfloat16, device_map="cuda")

prompt = "Turn this cat into a dog"
input_image = load_image("https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/cat.png")
image = pipe(image=input_image, prompt=prompt).images[0]
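If the snippet above fails, a stale or missing package is one thing worth ruling out before blaming the model. The following is a minimal sketch, not part of the original instructions: the helper name `installed_versions` is hypothetical, and it only reports which versions are installed without judging whether they are new enough.

```python
from importlib import metadata


def installed_versions(packages=("torch", "diffusers", "transformers", "accelerate")):
    """Return {package: version string, or None if not installed}.

    Reads package metadata only, so nothing heavy gets imported.
    """
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    for name, version in installed_versions().items():
        print(f"{name}: {version or 'NOT INSTALLED'}")
```

Posting this output along with the full traceback makes it easier for others to tell an environment problem from a model problem.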
As the title says. Is something wrong with my environment?
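After holding it in for this long, what they finally squeezed out is one big pile of crap. I'm dying of laughter. I also feel that, apart from some progress in understanding and following prompts, the visual quality of the output is hard to put into words, and it looks nothing like the reference image. Also, once the editing instruction gets even slightly complex, consistency drops off a cliff.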
After two months of testing QIT (Qwen Image Edit) 2511: its prompt following is worse than FIT1.0, and its stability and realism are even worse than 2509.
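The results really are bad. Have you tried fine-tuning? Direct inference quality is just too poor right now, so I plan to try fine-tuning first.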