Gamahea committed on
Commit 990bc24 · 1 Parent(s): 0170ae2

Completely disable pin_memory and num_workers for ZeroGPU compatibility

- Set pin_memory=False (ZeroGPU doesn't provide persistent CUDA access)
- Set num_workers=0 (DataLoader multiprocessing doesn't work reliably under ZeroGPU)
- The previous fix checked torch.cuda.is_available(), but ZeroGPU reports True without granting actual CUDA access
- This ensures compatibility with both CPU and ZeroGPU environments (a standalone sketch of the resulting loader setup follows below)
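For context, here is a minimal standalone sketch of the loader configuration this commit settles on. The dataset and batch size are placeholders, not values taken from the repository:

import torch
from torch.utils.data import DataLoader, TensorDataset

# Placeholder dataset standing in for the real LoRA training data.
train_dataset = TensorDataset(torch.randn(64, 16), torch.randint(0, 2, (64,)))

# Hard-disable pinned memory and worker processes: on ZeroGPU,
# torch.cuda.is_available() can report True even though the process has no
# persistent CUDA context, so pin_memory=True would fail, and spawning
# DataLoader worker processes is unreliable in that sandbox.
train_loader = DataLoader(
    train_dataset,
    batch_size=4,        # stand-in for training_config['batch_size']
    shuffle=True,
    num_workers=0,       # single-process data loading
    pin_memory=False,    # no pinned host memory without real CUDA access
)

for batch_features, batch_labels in train_loader:
    pass  # training step would go here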

backend/services/lora_training_service.py CHANGED
@@ -303,23 +303,22 @@ class LoRATrainingService:
         )
 
         # Create data loaders
-        # Only use pin_memory if CUDA is actually available
-        use_pin_memory = torch.cuda.is_available()
-
+        # Disable pin_memory and num_workers for compatibility with ZeroGPU and CPU
+        # pin_memory requires persistent CUDA access which ZeroGPU doesn't provide at this stage
         train_loader = DataLoader(
             train_dataset,
             batch_size=self.training_config['batch_size'],
             shuffle=True,
-            num_workers=2,
-            pin_memory=use_pin_memory
+            num_workers=0,
+            pin_memory=False
         )
 
         val_loader = DataLoader(
             val_dataset,
             batch_size=self.training_config['batch_size'],
             shuffle=False,
-            num_workers=2,
-            pin_memory=use_pin_memory
+            num_workers=0,
+            pin_memory=False
         )
 
         # Initialize model (placeholder - actual implementation would load DiffRhythm2)
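A possible follow-up, sketched here as an assumption rather than part of this commit: re-enable the faster settings only when the process runs on a dedicated, persistent GPU. The SPACES_ZERO_GPU environment variable name is an assumption about how a ZeroGPU Space advertises itself, not something confirmed by this repository:

import os
import torch

def loader_kwargs() -> dict:
    """Pick DataLoader settings based on the runtime.

    Assumption: ZeroGPU Spaces expose an environment marker (here
    SPACES_ZERO_GPU). On such runtimes torch.cuda.is_available() may
    return True without a persistent CUDA context, so the conservative
    settings are kept regardless of that check.
    """
    on_zerogpu = os.environ.get("SPACES_ZERO_GPU", "").lower() in ("1", "true")
    if on_zerogpu or not torch.cuda.is_available():
        return {"num_workers": 0, "pin_memory": False}
    # Dedicated, persistent GPU: worker processes and pinned memory are safe.
    return {"num_workers": 2, "pin_memory": True}

# Usage (hypothetical):
# DataLoader(train_dataset, batch_size=8, shuffle=True, **loader_kwargs())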