Fara-7B-mlx-metal-int4

This is a quantized INT4 model based on Apple MLX Framework Fara-7B. You can deploy it on Apple Silicon devices (M1,M2,M3,M4).

Note: This is unoffical version,just for test and dev.

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support