runtime error
Exit code: 1. Reason: MB/s]
model-00003-of-00004.safetensors: 76%|███████▌ | 3.29G/4.33G [00:09<00:02, 415MB/s]
model-00003-of-00004.safetensors: 90%|████████▉ | 3.89G/4.33G [00:10<00:00, 465MB/s]
model-00003-of-00004.safetensors: 100%|██████████| 4.33G/4.33G [00:11<00:00, 392MB/s]
model-00004-of-00004.safetensors: 0%| | 0.00/1.10G [00:00<?, ?B/s]
model-00004-of-00004.safetensors: 6%|▌ | 67.1M/1.10G [00:01<00:30, 34.0MB/s]
model-00004-of-00004.safetensors: 31%|███ | 336M/1.10G [00:02<00:05, 132MB/s]
model-00004-of-00004.safetensors: 82%|████████▏ | 894M/1.10G [00:04<00:00, 280MB/s]
model-00004-of-00004.safetensors: 100%|██████████| 1.10G/1.10G [00:04<00:00, 254MB/s]
Loading checkpoint shards: 0%| | 0/4 [00:00<?, ?it/s]
Loading checkpoint shards: 100%|██████████| 4/4 [00:00<00:00, 2384.48it/s]
generation_config.json: 0%| | 0.00/117 [00:00<?, ?B/s]
generation_config.json: 100%|██████████| 117/117 [00:00<00:00, 935kB/s]
Traceback (most recent call last):
  File "/home/user/app/app.py", line 7, in <module>
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, device_map="auto", torch_dtype="auto", trust_remote_code=True)
  File "/usr/local/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 571, in from_pretrained
    return model_class.from_pretrained(
  File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 309, in _wrapper
    return func(*args, **kwargs)
  File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 4667, in from_pretrained
    dispatch_model(model, **device_map_kwargs)
  File "/usr/local/lib/python3.10/site-packages/accelerate/big_modeling.py", line 504, in dispatch_model
    raise ValueError(
ValueError: You are trying to offload the whole model to the disk. Please use the `disk_offload` function instead.