LFM2-8B-A1B-GGUF / README.md

mlabonne

Update README.md

e927537 verified 16 days ago

preview code

raw

history blame contribute delete

7.11 kB

metadata

library_name: transformers
license: other
license_name: lfm1.0
license_link: LICENSE
language:
  - en
  - ar
  - zh
  - fr
  - de
  - ja
  - ko
  - es
pipeline_tag: text-generation
tags:
  - liquid
  - lfm2
  - edge
  - moe
  - llama.cpp
  - gguf
base_model:
  - LiquidAI/LFM2-8B-A1B

LFM2-8B-A1B

LFM2 is a new generation of hybrid models developed by Liquid AI, specifically designed for edge AI and on-device deployment. It sets a new standard in terms of quality, speed, and memory efficiency.

We're releasing the weights of our first MoE based on LFM2, with 8.3B total parameters and 1.5B active parameters.

LFM2-8B-A1B is the best on-device MoE in terms of both quality (comparable to 3-4B dense models) and speed (faster than Qwen3-1.7B).
Code and knowledge capabilities are significantly improved compared to LFM2-2.6B.
Quantized variants fit comfortably on high-end phones, tablets, and laptops.

Find more information about LFM2-8B-A1B in our blog post.

🏃 How to run LFM2

Example usage with llama.cpp:

llama-cli -hf LiquidAI/LFM2-8B-A1B-GGUF