2025-07-04 17:09:25,503 - trainer_MyTestExp - INFO - info:66 - 📄 Loading config from experiments/MyTestExp/config.json
2025-07-04 17:09:25,504 - trainer_MyTestExp - INFO - info:66 - ✅ Custom configuration loaded and merged
2025-07-04 17:09:25,505 - trainer_MyTestExp - INFO - info:66 - 🔧 Configuration: {
  "data": {
    "tickers": [
      "AAPL",
      "MSFT",
      "GOOGL",
      "AMZN",
      "TSLA"
    ],
    "period": "5y",
    "interval": "1d",
    "use_sp500": false
  },
  "environment": {
    "initial_balance": 10000,
    "transaction_cost": 0.001,
    "max_position_size": 1.0,
    "lookback_window": 60,
    "reward_type": "return"
  },
  "training": {
    "algorithm": "PPO",
    "total_timesteps": 500000,
    "learning_rate": 0.0003,
    "batch_size": 64,
    "n_epochs": 10,
    "gamma": 0.99,
    "eval_freq": 1000,
    "n_eval_episodes": 5,
    "save_freq": 10000,
    "seed": 42
  }
}
2025-07-04 17:09:25,506 - trainer_MyTestExp - INFO - info:66 - 💾 Configuration saved to experiments/MyTestExp/config.json
2025-07-04 17:09:25,507 - trainer_MyTestExp - INFO - info:66 - 🎯 Trainer initialized for experiment: MyTestExp
2025-07-04 17:09:25,508 - trainer_MyTestExp - INFO - info:66 - 📁 Experiment directory: experiments/MyTestExp
2025-07-04 17:09:25,738 - trainer_MyTestExp - INFO - info:66 - ⏱️  Starting Environment Creation...
2025-07-04 17:09:25,738 - trainer_MyTestExp - INFO - info:66 - πŸͺ Environment configuration:
2025-07-04 17:09:25,739 - trainer_MyTestExp - INFO - info:66 -   initial_balance: 10000
2025-07-04 17:09:25,740 - trainer_MyTestExp - INFO - info:66 -   transaction_cost: 0.001
2025-07-04 17:09:25,741 - trainer_MyTestExp - INFO - info:66 -   max_position_size: 1.0
2025-07-04 17:09:25,741 - trainer_MyTestExp - INFO - info:66 -   lookback_window: 60
2025-07-04 17:09:25,742 - trainer_MyTestExp - INFO - info:66 -   reward_type: return
2025-07-04 17:09:25,742 - trainer_MyTestExp - INFO - info:66 - Creating environments: |██████░░░░░░░░░░░░░░░░░░░░░░░░| 20.0% (1/5)
2025-07-04 17:09:25,746 - trainer_MyTestExp - INFO - info:66 - Creating environments: |████████████░░░░░░░░░░░░░░░░░░| 40.0% (2/5)
2025-07-04 17:09:25,748 - trainer_MyTestExp - INFO - info:66 - Creating environments: |██████████████████░░░░░░░░░░░░| 60.0% (3/5)
2025-07-04 17:09:25,750 - trainer_MyTestExp - INFO - info:66 - Creating environments: |████████████████████████░░░░░░| 80.0% (4/5)
2025-07-04 17:09:25,757 - trainer_MyTestExp - INFO - info:66 - Creating environments: |██████████████████████████████| 100.0% (5/5)
2025-07-04 17:09:25,759 - trainer_MyTestExp - INFO - info:66 - ✅ Created environments for 5 stocks
2025-07-04 17:09:25,759 - trainer_MyTestExp - INFO - info:66 - ⏱️  Environment Creation completed in 0:00:00.021808
2025-07-04 17:09:25,761 - trainer_MyTestExp - INFO - info:66 - ⏱️  Starting Model Evaluation...
2025-07-04 17:09:25,762 - trainer_MyTestExp - INFO - info:66 - Evaluating AMZN: |██████░░░░░░░░░░░░░░░░░░░░░░░░| 20.0% (1/5)