stock-trading-rl-agent / trainer_MyTestExp_20250704_170925.log
2025-07-04 17:09:25,503 - trainer_MyTestExp - INFO - info:66 - 📄 Loading config from experiments/MyTestExp/config.json
2025-07-04 17:09:25,504 - trainer_MyTestExp - INFO - info:66 - ✅ Custom configuration loaded and merged
2025-07-04 17:09:25,505 - trainer_MyTestExp - INFO - info:66 - 🔧 Configuration: {
  "data": {
    "tickers": [
      "AAPL",
      "MSFT",
      "GOOGL",
      "AMZN",
      "TSLA"
    ],
    "period": "5y",
    "interval": "1d",
    "use_sp500": false
  },
  "environment": {
    "initial_balance": 10000,
    "transaction_cost": 0.001,
    "max_position_size": 1.0,
    "lookback_window": 60,
    "reward_type": "return"
  },
  "training": {
    "algorithm": "PPO",
    "total_timesteps": 500000,
    "learning_rate": 0.0003,
    "batch_size": 64,
    "n_epochs": 10,
    "gamma": 0.99,
    "eval_freq": 1000,
    "n_eval_episodes": 5,
    "save_freq": 10000,
    "seed": 42
  }
}
2025-07-04 17:09:25,506 - trainer_MyTestExp - INFO - info:66 - 💾 Configuration saved to experiments/MyTestExp/config.json
2025-07-04 17:09:25,507 - trainer_MyTestExp - INFO - info:66 - 🎯 Trainer initialized for experiment: MyTestExp
2025-07-04 17:09:25,508 - trainer_MyTestExp - INFO - info:66 - 📁 Experiment directory: experiments/MyTestExp
2025-07-04 17:09:25,738 - trainer_MyTestExp - INFO - info:66 - ⏱️ Starting Environment Creation...
2025-07-04 17:09:25,738 - trainer_MyTestExp - INFO - info:66 - Environment configuration:
2025-07-04 17:09:25,739 - trainer_MyTestExp - INFO - info:66 - initial_balance: 10000
2025-07-04 17:09:25,740 - trainer_MyTestExp - INFO - info:66 - transaction_cost: 0.001
2025-07-04 17:09:25,741 - trainer_MyTestExp - INFO - info:66 - max_position_size: 1.0
2025-07-04 17:09:25,741 - trainer_MyTestExp - INFO - info:66 - lookback_window: 60
2025-07-04 17:09:25,742 - trainer_MyTestExp - INFO - info:66 - reward_type: return
2025-07-04 17:09:25,742 - trainer_MyTestExp - INFO - info:66 - Creating environments: |██████░░░░░░░░░░░░░░░░░░░░░░░░| 20.0% (1/5)
2025-07-04 17:09:25,746 - trainer_MyTestExp - INFO - info:66 - Creating environments: |████████████░░░░░░░░░░░░░░░░░░| 40.0% (2/5)
2025-07-04 17:09:25,748 - trainer_MyTestExp - INFO - info:66 - Creating environments: |██████████████████░░░░░░░░░░░░| 60.0% (3/5)
2025-07-04 17:09:25,750 - trainer_MyTestExp - INFO - info:66 - Creating environments: |████████████████████████░░░░░░| 80.0% (4/5)
2025-07-04 17:09:25,757 - trainer_MyTestExp - INFO - info:66 - Creating environments: |██████████████████████████████| 100.0% (5/5)
2025-07-04 17:09:25,759 - trainer_MyTestExp - INFO - info:66 - ✅ Created environments for 5 stocks
2025-07-04 17:09:25,759 - trainer_MyTestExp - INFO - info:66 - ⏱️ Environment Creation completed in 0:00:00.021808
2025-07-04 17:09:25,761 - trainer_MyTestExp - INFO - info:66 - ⏱️ Starting Model Evaluation...
2025-07-04 17:09:25,762 - trainer_MyTestExp - INFO - info:66 - Evaluating AMZN: |██████░░░░░░░░░░░░░░░░░░░░░░░░| 20.0% (1/5)