Instructions to use ziweek/gemma-2b-it-award-factory with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use ziweek/gemma-2b-it-award-factory with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="ziweek/gemma-2b-it-award-factory", filename="gemma_2b_it_award_factory_v2.gguf", )
llm.create_chat_completion( messages = "No input example has been defined for this model task." )
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- llama.cpp
How to use ziweek/gemma-2b-it-award-factory with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf ziweek/gemma-2b-it-award-factory:Q5_K_M # Run inference directly in the terminal: llama-cli -hf ziweek/gemma-2b-it-award-factory:Q5_K_M
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf ziweek/gemma-2b-it-award-factory:Q5_K_M # Run inference directly in the terminal: llama-cli -hf ziweek/gemma-2b-it-award-factory:Q5_K_M
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf ziweek/gemma-2b-it-award-factory:Q5_K_M # Run inference directly in the terminal: ./llama-cli -hf ziweek/gemma-2b-it-award-factory:Q5_K_M
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf ziweek/gemma-2b-it-award-factory:Q5_K_M # Run inference directly in the terminal: ./build/bin/llama-cli -hf ziweek/gemma-2b-it-award-factory:Q5_K_M
Use Docker
docker model run hf.co/ziweek/gemma-2b-it-award-factory:Q5_K_M
- LM Studio
- Jan
- Ollama
How to use ziweek/gemma-2b-it-award-factory with Ollama:
ollama run hf.co/ziweek/gemma-2b-it-award-factory:Q5_K_M
- Unsloth Studio new
How to use ziweek/gemma-2b-it-award-factory with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for ziweek/gemma-2b-it-award-factory to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for ziweek/gemma-2b-it-award-factory to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for ziweek/gemma-2b-it-award-factory to start chatting
- Docker Model Runner
How to use ziweek/gemma-2b-it-award-factory with Docker Model Runner:
docker model run hf.co/ziweek/gemma-2b-it-award-factory:Q5_K_M
- Lemonade
How to use ziweek/gemma-2b-it-award-factory with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull ziweek/gemma-2b-it-award-factory:Q5_K_M
Run and chat with the model
lemonade run user.gemma-2b-it-award-factory-Q5_K_M
List all available models
lemonade list
This is fine-tuned model
Python 3.10.14
Tue Sep 10 13:58:31 2024
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 555.58.02 Driver Version: 556.12 CUDA Version: 12.5 |
|-----------------------------------------+------------------------+----------------------+
| GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|=========================================+========================+======================|
| 0 NVIDIA GeForce RTX 4070 ... On | 00000000:01:00.0 On | N/A |
| N/A 39C P8 1W / 140W | 211MiB / 8188MiB | 0% Default |
| | | N/A |
+-----------------------------------------+------------------------+----------------------+
+-----------------------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=========================================================================================|
| No running processes found |
+-----------------------------------------------------------------------------------------+
request,title,winner,description,publisher
νμ νμλ€μ μμΌμ μ±κΈ°λ μ¬λμκ² μμ μ£Όκ³ μΆμ΄.","μμΌ νλλμ","μ΄μμΌ","νμ νμλ€μ μμΌμ μμ§ μκ³ μ±κΈ°λ©° λͺ¨λμκ² λ°λ»ν λ§μμ μ λ¬νλ κ·Έμ μΈμ¬ν¨μ μΉμ°¬νκΈ° μν΄ μ΄ μμ μμ¬ν©λλ€.","μ¬λ΄ μμΌ μΆν λͺ¨μ"
"νμ μ¬λλ€μ λ²μ λ¬Έμ λ₯Ό μ€μ¬νλ νμ¬μκ² μμ μ£Όκ³ μΆμ΄.","곡μ ν νμ¬μ","μ΅νμ¬","μ¬νμ λ²κ³Ό μ μλ₯Ό 곡μ νκ² μ€μ¬νλ©° μ¬λλ€μ λ¬Έμ λ₯Ό ν΄κ²°νλ κ·Έμ μ±
μκ°μ μΉμ°¬νκΈ° μν΄ μ΄ μμ μμ¬ν©λλ€.","λ²μ νμ¬ νν"
"νμ κ°μ‘±μ μ¬μ§μ μ°μ΄μ£Όλ μ¬λμκ² μμ μ£Όκ³ μΆμ΄.","μ¬μ§μ¬μ","κΉμ¬μ§","κ°μ‘±μ μμ€ν μκ°μ μ¬μ§μΌλ‘ λ¨κΈ°λ©° μΆμ΅μ κΈ°λ‘νλ κ·Έμ λμ°λ―Έμ μ¬μΈν¨μ μΉμ°¬νκΈ° μν΄ μ΄ μμ μμ¬ν©λλ€.","κ°μ‘± μ¬μ§ κΈ°λ‘ νν"
...
# νμ μλ‘μ΄ μννΈμ¨μ΄λ₯Ό κ°λ°νλ κ°λ°μμκ² μμ μ£Όκ³ μΆμ΄
{'title':'μ½λ λ§λ²μ¬μ', 'winner':'μ΅κ°λ°', 'description':'νμ μ μΈ μννΈμ¨μ΄λ₯Ό κ°λ°ν΄ μΈμμ λ³νμν€λ κ·Έμ λ°μ΄λ κ°λ° λ₯λ ₯μ μΉμ°¬νκΈ° μν΄ μ΄ μμ μμ¬ν©λλ€.', 'publisher':'μννΈμ¨μ΄ κ°λ° νν'}
- Downloads last month
- 22
Inference Providers NEW
This model isn't deployed by any Inference Provider. π Ask for provider support