Spaces

VIDraft
/

Local-RAG-llama-3-8b

Runtime error

App Files Files Community

runtime error

Exit code: 1. Reason: 0004.safetensors: 0%| | 0.00/1.17G [00:00<?, ?B/s][A model-00004-of-00004.safetensors: 8%|▊ | 90.3M/1.17G [00:01<00:13, 79.2MB/s][A model-00004-of-00004.safetensors: 60%|█████▉ | 699M/1.17G [00:02<00:01, 367MB/s] [A model-00004-of-00004.safetensors: 100%|██████████| 1.17G/1.17G [00:02<00:00, 471MB/s] INFO:accelerate.utils.modeling:Based on the current allocation process, no modules could be assigned to the following devices due to insufficient memory: - cpu: 2101346304 bytes required These minimum requirements are specific to this allocation attempt and may vary. Consider increasing the available memory for these devices to at least the specified minimum, or adjusting the model config. Loading checkpoint shards: 0%| | 0/4 [00:00<?, ?it/s][A Loading checkpoint shards: 100%|██████████| 4/4 [00:00<00:00, 106184.91it/s] generation_config.json: 0%| | 0.00/184 [00:00<?, ?B/s][A generation_config.json: 100%|██████████| 184/184 [00:00<00:00, 2.24MB/s] Traceback (most recent call last): File "/home/user/app/app.py", line 554, in <module> initialize_model_and_tokenizer() File "/home/user/app/app.py", line 137, in initialize_model_and_tokenizer global_model = AutoModelForCausalLM.from_pretrained( File "/usr/local/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 604, in from_pretrained return model_class.from_pretrained( File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 288, in _wrapper return func(*args, kwargs) File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 5279, in from_pretrained dispatch_model(model, device_map_kwargs) File "/usr/local/lib/python3.10/site-packages/accelerate/big_modeling.py", line 504, in dispatch_model raise ValueError( ValueError: You are trying to offload the whole model to the disk. Please use the `disk_offload` function instead.

Container logs:

Fetching error logs...