Flex.2-preview / README.md
metadata
license: apache-2.0
pipeline_tag: text-to-image
library_name: gguf
tags:
  - flex
  - flux
  - gguf
  - safetensors
base_model:
  - ostris/Flex.2-preview

Info

Various quantizations of hf:ostris/Flex.2-preview.

Safetensors

| Filename | Quant Type | File Size | Description | Example Image |
| --- | --- | --- | --- | --- |
| Flex.2-preview-fp8_e4m3fn.safetensors | F8_E4M3FN | 8.16GB | - | - |
| Flex.2-preview-fp8_e4m3fn_scaled.safetensors | F8_E4M3FN | 8.17GB | Scale per weight tensor | - |
| Flex.2-preview-fp8_e5m2_scaled.safetensors | F8_E5M2 | 8.17GB | Scale per weight tensor | - |
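
One plausible reading of "Scale per weight tensor" is that each weight tensor is divided by a single scale factor so that its largest magnitude lands on the largest finite FP8 value, and the scale is stored next to the tensor for dequantization. The sketch below illustrates that idea in plain Python. It is an assumption about the scheme, not the script used to produce these files; `round_to_fp8`, `quantize_scaled` and `dequantize` are hypothetical helper names, and overflow is handled by saturating.

```python
import math

# Largest finite magnitude of each FP8 format named in the table.
FP8_MAX = {"e4m3fn": 448.0, "e5m2": 57344.0}
# (mantissa bits, smallest normal exponent) of each format.
FP8_LAYOUT = {"e4m3fn": (3, -6), "e5m2": (2, -14)}


def round_to_fp8(x, fmt):
    """Round x to the nearest value representable in fmt, saturating at the maximum."""
    mant_bits, min_exp = FP8_LAYOUT[fmt]
    limit = FP8_MAX[fmt]
    if x == 0.0:
        return 0.0
    # frexp returns x = m * 2**e with 0.5 <= |m| < 1, so the binary exponent is e - 1.
    # Clamping to min_exp makes small values fall onto the subnormal grid.
    exp = max(math.frexp(x)[1] - 1, min_exp)
    step = 2.0 ** (exp - mant_bits)
    rounded = round(x / step) * step
    return max(-limit, min(limit, rounded))


def quantize_scaled(weights, fmt):
    """Return (fp8_values, scale) using one scale for the whole tensor (assumed scheme)."""
    largest = max(abs(w) for w in weights)
    scale = largest / FP8_MAX[fmt] if largest > 0 else 1.0
    return [round_to_fp8(w / scale, fmt) for w in weights], scale


def dequantize(values, scale):
    """Recover approximate weights from FP8 values and the stored scale."""
    return [v * scale for v in values]


# Hypothetical weights, for illustration only.
weights = [0.02, -0.5, 0.131, 1.2]
quantized, scale = quantize_scaled(weights, "e4m3fn")
restored = dequantize(quantized, scale)
for original, approx in zip(weights, restored):
    print(f"{original:+.4f} -> {approx:+.4f}")
```

Under this reading the scaled files are only slightly larger than the unscaled one, because they add a single extra number per weight tensor.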

Pure GGUF

  • pure, conversion from safetensors BF16 via F32 gguf
  • architecture: flex.2 (because not all tensor shapes match flux)
  • no imatrix was used to quantize
  • biases and norms: F32
  • img_in.weight: BF16 (due to tensor shape and block sizes)
  • everything else according to file type
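
The per-tensor type rules in the list above can be sketched as a small selection function. This is a minimal illustration, not the conversion tool's actual code: the function names are hypothetical, matching norms by the substring "norm" is an assumption about tensor naming, and the row length of 196 used at the end is assumed for illustration only. The block sizes are the standard GGML ones (32 elements for the legacy and IQ4_NL types, 256 for K-quants).

```python
# Elements per quantization block for the file types offered in this repo.
BLOCK_SIZE = {
    "BF16": 1,
    "Q8_0": 32,
    "Q5_1": 32,
    "Q5_0": 32,
    "Q4_1": 32,
    "Q4_0": 32,
    "IQ4_NL": 32,
    "Q6_K": 256,
    "Q3_K_S": 256,
}


def fits_blocks(row_length, quant_type):
    """A row can be block-quantized only if it splits into whole blocks."""
    return row_length % BLOCK_SIZE[quant_type] == 0


def pick_tensor_type(name, file_type):
    """Choose the storage type for one tensor, following the rules listed above."""
    # Biases and norms stay in full precision (name matching is an assumption).
    if name.endswith(".bias") or "norm" in name:
        return "F32"
    # img_in.weight is kept in BF16 because its shape does not fit the block sizes.
    if name == "img_in.weight":
        return "BF16"
    # Everything else follows the file type.
    return file_type


# Hypothetical tensor names, for illustration only.
for tensor_name in ["img_in.weight", "img_in.bias", "some_block.linear.weight"]:
    print(tensor_name, "->", pick_tensor_type(tensor_name, "Q4_0"))

# Assumed row length of 196 for img_in.weight: not a multiple of 32 or 256.
print(fits_blocks(196, "Q8_0"), fits_blocks(196, "Q6_K"))
```

Keeping the one awkward tensor in BF16 costs very little space, since it is small next to the transformer blocks.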

| Filename | Quant Type | File Size | Description / L2 Loss Step 25 | Example Image |
| --- | --- | --- | --- | --- |
| Flex.2-preview-BF16.gguf | BF16 | 16.3GB | - | - |
| Flex.2-preview-Q8_0.gguf | Q8_0 | 8.68GB | TBC | - |
| Flex.2-preview-Q6_K.gguf | Q6_K | 6.70GB | TBC | - |
| Flex.2-preview-Q5_1.gguf | Q5_1 | 6.13GB | TBC | - |
| Flex.2-preview-Q5_0.gguf | Q5_0 | 5.62GB | TBC | - |
| Flex.2-preview-Q4_1.gguf | Q4_1 | 5.11GB | TBC | - |
| Flex.2-preview-IQ4_NL.gguf | IQ4_NL | 4.60GB | TBC | - |
| Flex.2-preview-Q4_0.gguf | Q4_0 | 4.60GB | TBC | - |
| Flex.2-preview-Q3_K_S.gguf | Q3_K_S | 3.52GB | TBC | - |
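
The file sizes in the pure GGUF table follow closely from the bits per weight of each quant type. As a rough check, the sketch below scales the BF16 file size by the ratio of bits per weight, using the standard GGML block layouts (bytes per block divided by elements per block). It ignores the tensors kept in F32 or BF16 and the file header, so it is an approximation, and `estimate_size_gb` is a hypothetical helper name.

```python
# Bits per weight = bytes per block * 8 / elements per block (GGML block layouts).
BITS_PER_WEIGHT = {
    "BF16": 16.0,
    "Q8_0": 34 * 8 / 32,     # 32 int8 values + fp16 scale
    "Q6_K": 210 * 8 / 256,
    "Q5_1": 24 * 8 / 32,
    "Q5_0": 22 * 8 / 32,
    "Q4_1": 20 * 8 / 32,
    "IQ4_NL": 18 * 8 / 32,
    "Q4_0": 18 * 8 / 32,
    "Q3_K_S": 110 * 8 / 256,
}

# Sizes as listed in the table above, in GB.
LISTED_GB = {
    "Q8_0": 8.68,
    "Q6_K": 6.70,
    "Q5_1": 6.13,
    "Q5_0": 5.62,
    "Q4_1": 5.11,
    "IQ4_NL": 4.60,
    "Q4_0": 4.60,
    "Q3_K_S": 3.52,
}


def estimate_size_gb(bf16_size_gb, quant_type):
    """Scale the BF16 size by the bits-per-weight ratio (ignores F32/BF16 tensors)."""
    return bf16_size_gb * BITS_PER_WEIGHT[quant_type] / BITS_PER_WEIGHT["BF16"]


for quant_type, listed in LISTED_GB.items():
    estimate = estimate_size_gb(16.3, quant_type)
    print(f"{quant_type:7s} estimated {estimate:.2f}GB, listed {listed:.2f}GB")
```

The estimates come out slightly below the listed sizes, which is consistent with biases, norms and img_in.weight being stored at higher precision than the file type.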

Fluxified GGUF

  • conversion from safetensors BF16 via F32 gguf
  • img_in.weight truncated to the first 16 latent channels
  • as a result, inpainting and control image input no longer work
  • should be a drop-in replacement for FLUX
  • architecture: flux
  • dynamic quantization?

| Filename | Quant Type | File Size | Description / L2 Loss Step 25 | Example Image |
| --- | --- | --- | --- | --- |
| Flex.2-preview-fluxified-Q8_0.gguf | Q8_0 | 8.39GB | TBC | - |
| Flex.2-preview-fluxified-Q3_K_S.gguf | Q3_K_S | 3.52GB | TBC | - |
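
The truncation of img_in.weight described above can be pictured as keeping only the leading input columns of the weight matrix. The sketch below is an illustration under stated assumptions, not the conversion code used here: it assumes the weight is laid out as rows of input features, that latents are packed in 2x2 patches with channels varying slowest (so 16 latent channels occupy the first 16 * 2 * 2 = 64 input features), and that the original row length is 196. `truncate_input_features` is a hypothetical helper name.

```python
def truncate_input_features(weight, latent_channels=16, patch=2):
    """Keep only the input columns that belong to the first latent channels.

    weight is a list of rows, one row per output feature, each row holding
    one value per input feature (assumed layout).
    """
    keep = latent_channels * patch * patch
    for row in weight:
        if len(row) < keep:
            raise ValueError("row is shorter than the number of columns to keep")
    return [row[:keep] for row in weight]


# Tiny stand-in matrix: 3 output features, 196 input features (assumed width).
weight = [[float(column) for column in range(196)] for _ in range(3)]
fluxified = truncate_input_features(weight)
print(len(weight[0]), "->", len(fluxified[0]))
```

Dropping the remaining columns removes the inputs that would carry the inpainting and control latents, which matches the lost features noted in the list above.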