Text-to-Image
Diffusers
Safetensors
English
ZImagePipeline

Blog post doesn't mention base model

#81
by DioxideSiO2 - opened

https://tongyi-mai.github.io/Z-Image-blog/

"We are publicly releasing two specialized models on Z-Image: Z-Image-Turbo for generation and Z-Image-Edit for editing."

Does this mean the base model is no longer scheduled for release? I hope it comes out 😢

According to the forum,they will most likely break their promises.

Alibaba is a business company.

Both the huggingface and github READMEs list base as to be released. Blog post was probably just focused on the two "production" models they intend to focus on (pre-trained/distilled for ready-to-use drop in purposes). Turbo for fast gen, Edit for, well, editing. I promise they know Base is still absolutely necessary for fine tuning and lora training. They'd be truly dumb not to release it, and so they will.

image

According to the forum,they will most likely break their promises.

I think that not keeping their promises by refusing to release the base or edit version as open source would be a huge mistake on their part. I don't think their goal is to disappoint the open source community and lose their trust. They know very well that many open source users are also users who pay for certain pro APIs models. But treating people in the community like idiots is shooting themselves in the foot.

I'm not sure it's necessarily strategically useful to adopt the same stance as Black Forest Labs for the Tongyi-MAI team.

Then honestly promising a model dedicated to fine-tuning, purely dedicated to the open-source community, only to ultimately say, “No, fuck off, pay for our API after all,” would be the worst move.

Sign up or log in to comment