- ================================================================================
- FinalGift Hybrid Scientist Agent - Dual-Process Intelligence
- INTENT MODES:
- LEARN: Exploring new strategies, hypothesis testing
- EXPLOIT: Executing known patterns, using learned knowledge
- EXECUTION STYLES:
- INTUITIVE: Fast policy network (reflexive, System 1)
- DELIBERATE: Slow world-model planning (analytical, System 2)
- Execution blend controlled by capability growth (dBI)
- TRAINING MODE: GENERALIST
- Games: 6 (as66-821a4dcad9c2, ft09-b7249182, lp85-e179de251, ls20-e98196ab, sp80-445eab21, vc33-cbded52d)
- Episodes: Unlimited
- Checkpoint: checkpoint_latest.pt
- WEIGHTED SAMPLING: 3x more attempts for unsolved games (<50% win rate)
- ================================================================================
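The banner's WEIGHTED SAMPLING line (3x more attempts for games under a 50% win rate) can be sketched as weighted random selection. This is a minimal illustration only, not the agent's actual code: the function names, the win-rate dictionary, and reading "3x more attempts" as a sampling weight of 3 are all assumptions.

```python
import random

def sampling_weights(win_rates, threshold=0.5, boost=3.0):
    """Give games below the win-rate threshold `boost` times the base weight.

    Assumption: "3x more attempts" is read as a relative sampling weight of 3.
    """
    return {game: (boost if rate < threshold else 1.0)
            for game, rate in win_rates.items()}

def pick_game(win_rates, rng=random):
    """Draw one game id, favouring unsolved games."""
    weights = sampling_weights(win_rates)
    games = list(weights)
    return rng.choices(games, weights=[weights[g] for g in games], k=1)[0]

# Hypothetical win rates, for illustration only (not taken from the log).
win_rates = {"as66-821a4dcad9c2": 0.0, "ft09-b7249182": 0.8}
weights = sampling_weights(win_rates)
```

Under these assumptions the unsolved game is drawn about three times as often as the solved one.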
- ================================================================================
- SYSTEM-3 REFLECTIVE META-CONTROLLER: ENABLED
- Reflective Cycle Frequency: Every 25 episodes
- Safe Reconfiguration Actions: 14
- - zeno_alpha (mode sensitivity)
- - reward_weight (RL vs self-supervised)
- - empowerment_weight (curiosity)
- - zeno_beta (attractor smoothing)
- - learning_rate (adaptation speed)
- Rollback Threshold: dBI < -0.01
- ================================================================================
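The rollback rule in the banner (dBI < -0.01) suggests a reconfiguration that is reverted when capability growth drops too far. The sketch below is a guess at that shape. `apply_with_rollback`, the `config` dictionary, and the `measure_dbi` callback are hypothetical, and the log does not say whether the raw or a smoothed dBI is compared against the threshold.

```python
ROLLBACK_THRESHOLD = -0.01  # from the banner: "Rollback Threshold: dBI < -0.01"

def should_rollback(dbi, threshold=ROLLBACK_THRESHOLD):
    """True when the measured dBI falls below the rollback threshold."""
    return dbi < threshold

def apply_with_rollback(config, name, new_value, measure_dbi):
    """Set config[name] to new_value, then revert it if dBI drops too far.

    `measure_dbi` is a hypothetical callback that returns the dBI observed
    after the change. Returns True if the change was kept.
    """
    previous = config[name]
    config[name] = new_value
    if should_rollback(measure_dbi(config)):
        config[name] = previous  # undo the reconfiguration
        return False
    return True
```

For example, a `learning_rate` change followed by a measured dBI of -0.05 would be reverted, while one followed by +0.02 would be kept.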
- Checking for checkpoint: checkpoint_latest.pt
- File exists: False
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 1/15
- ================================================================================
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [HER] Applied hindsight relabeling to failed episode (reward=0.00, length=49)
- [Ep0] NOT STORED: No level progress (reward=0.00)
- [Running Abstractive Planning...]
- [HYBRID RULES] Allocated rule bank for as66-821a4dcad9c2 (1 games total)
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [Training from Abstract Experiences...]
- Abstract Value Loss: 11.9153
- Abstract Policy Loss: 1.7911
- [RETRY] Episode ended with no reward. Failed attempt.
- [RETRY] Attempt 1/15. Retrying...
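The two action lines above show the API reporting `[1, 2, 3, 4, 6]` while the agent's mask uses keys `0` to `5` plus `'click'`. One mapping consistent with that output is sketched here. The table `API_TO_MASK_KEY` is inferred from this single log sample and is an assumption, in particular the entries for API actions 5 and 7, which never appear as available in the log.

```python
# Assumed mapping from API action ids to the agent's mask keys.
# Only ids 1-4 and 6 are confirmed by the log; 5 and 7 are guesses.
API_TO_MASK_KEY = {1: 0, 2: 1, 3: 2, 4: 3, 5: 4, 6: "click", 7: 5}

def build_action_mask(available_actions):
    """Mark each mask key True if its API action id is currently available."""
    available = set(available_actions)
    return {key: (api_id in available)
            for api_id, key in API_TO_MASK_KEY.items()}

mask = build_action_mask([1, 2, 3, 4, 6])
```

With the availability list from the log, this yields the same key/value pairs as the logged mask, though in a different key order.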
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 2/15
- ================================================================================
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [API REWARD] score: 0 -> 1, reward: 1.00
- [LEVEL 1 COMPLETE @ step 14] Episode reward: 1.0
- [MEMORY] NEW level 1 beaten in 14 steps
- [TRUNCATED] Storing 15/77 actions (up to and including reward step)
- [Ep0] STORED MEMORY: score=1.00, length=15, levels=1 (with transfer data)
- [SAVED] episodic_memory.pt updated immediately
- [CRYSTALLIZATION ACTIVE] Game as66-821a4dcad9c2: Reached score 1.0/9.0
- [CRYSTALLIZATION] Progress: 11%, strength: 0.11
- [CRYSTALLIZATION] Best: 1.0/9.0, Strength=0.11, Dampening=0.91
- [SAVED] crystallization.pt updated (new best: 1.0)
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 2, Valid BIs: 2
- [dBI DEBUG] Last 5 BIs: ['0.0252', '0.0354']
- [dBI DEBUG] dbi_raw = 0.035355 - 0.025163 = 0.010193
- [Training from Abstract Experiences...]
- Abstract Value Loss: 15.8189
- Abstract Policy Loss: 1.8064
- [RETRY] NEW PROGRESS! Reached 1 NEW levels. Best: 0 -> 1/9
- [RETRY] Resetting to Attempt 1/15 due to progress. Retrying...
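The CRYSTALLIZATION lines report a strength and a dampening value for each best score. The numbers printed in this log (0.11/0.91, 0.22/0.82, 0.33/0.73, 0.44/0.64) fit `strength = best / max_score` and `dampening = 1 - 0.8 * strength`. The 0.8 coefficient is fitted to the logged values and is not confirmed by the source, so treat this as a sketch.

```python
def crystallization_strength(best_score, max_score):
    """Fraction of the game's levels reached so far (e.g. 1/9)."""
    return best_score / max_score

def dampening(strength, slope=0.8):
    """Assumed linear dampening; slope=0.8 is fitted to the logged values."""
    return 1.0 - slope * strength

# (best score, rounded strength, rounded dampening) for the bests seen in the log
pairs = [(k,
          round(crystallization_strength(k, 9), 2),
          round(dampening(crystallization_strength(k, 9)), 2))
         for k in (1, 2, 3, 4)]
```

The "Progress: 11%" line matches the same strength value expressed as a rounded percentage.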
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 1/15
- ================================================================================
- [INITIAL SCORE] API started at score=1, initializing levels_completed=1
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=1.00)
- [CRYSTALLIZATION] Best: 1.0/9.0, Strength=0.11, Dampening=0.91
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 3, Valid BIs: 3
- [dBI DEBUG] Last 5 BIs: ['0.0252', '0.0354', '0.0828']
- [dBI DEBUG] dbi_raw = 0.082848 - 0.035355 = 0.047493
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 1/9
- [RETRY] Attempt 1/15. Retrying...
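The dBI DEBUG lines show `dbi_raw` as the latest BI minus the previous one, next to the last five BIs formatted to four decimals. A minimal sketch of that arithmetic follows. The function names are hypothetical, and how BI itself is computed is not shown in the log.

```python
def dbi_raw(bi_history):
    """Difference between the two most recent BI values, or None if fewer than two."""
    if len(bi_history) < 2:
        return None
    return bi_history[-1] - bi_history[-2]

def last_bis(bi_history, n=5):
    """Format the most recent n BI values the way the debug line prints them."""
    return [f"{bi:.4f}" for bi in bi_history[-n:]]

# BI values reconstructed from the dbi_raw lines above.
history = [0.025163, 0.035355, 0.082848]
delta = dbi_raw(history)
shown = last_bis(history)
```

Returning `None` for a single-entry history is an assumption, though it would fit the first attempt printing no dBI line.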
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 2/15
- ================================================================================
- [INITIAL SCORE] API started at score=1, initializing levels_completed=1
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=1.00)
- [CRYSTALLIZATION] Best: 1.0/9.0, Strength=0.11, Dampening=0.91
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 4, Valid BIs: 4
- [dBI DEBUG] Last 5 BIs: ['0.0252', '0.0354', '0.0828', '0.1452']
- [dBI DEBUG] dbi_raw = 0.145243 - 0.082848 = 0.062395
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 1/9
- [RETRY] Attempt 2/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 3/15
- ================================================================================
- [INITIAL SCORE] API started at score=1, initializing levels_completed=1
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=1.00)
- [CRYSTALLIZATION] Best: 1.0/9.0, Strength=0.11, Dampening=0.91
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 5, Valid BIs: 5
- [dBI DEBUG] Last 5 BIs: ['0.0252', '0.0354', '0.0828', '0.1452', '0.1238']
- [dBI DEBUG] dbi_raw = 0.123755 - 0.145243 = -0.021488
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 1/9
- [RETRY] Attempt 3/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 4/15
- ================================================================================
- [INITIAL SCORE] API started at score=1, initializing levels_completed=1
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=1.00)
- [CRYSTALLIZATION] Best: 1.0/9.0, Strength=0.11, Dampening=0.91
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 6, Valid BIs: 6
- [dBI DEBUG] Last 5 BIs: ['0.0354', '0.0828', '0.1452', '0.1238', '0.1065']
- [dBI DEBUG] dbi_raw = 0.106548 - 0.123755 = -0.017207
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 1/9
- [RETRY] Attempt 4/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 5/15
- ================================================================================
- [INITIAL SCORE] API started at score=1, initializing levels_completed=1
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=1.00)
- [CRYSTALLIZATION] Best: 1.0/9.0, Strength=0.11, Dampening=0.91
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 7, Valid BIs: 7
- [dBI DEBUG] Last 5 BIs: ['0.0828', '0.1452', '0.1238', '0.1065', '0.1090']
- [dBI DEBUG] dbi_raw = 0.109029 - 0.106548 = 0.002480
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 1/9
- [RETRY] Attempt 5/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 6/15
- ================================================================================
- [INITIAL SCORE] API started at score=1, initializing levels_completed=1
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [API REWARD] score: 1 -> 2, reward: 1.00
- [LEVEL 2 COMPLETE @ step 10] Episode reward: 2.0
- [CRYSTALLIZATION] Game as66-821a4dcad9c2: NEW BEST 2.0/9.0 (strength: 0.22)
- [CRYSTALLIZATION] Best: 2.0/9.0, Strength=0.22, Dampening=0.82
- [SAVED] crystallization.pt updated (new best: 2.0)
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 8, Valid BIs: 8
- [dBI DEBUG] Last 5 BIs: ['0.1452', '0.1238', '0.1065', '0.1090', '0.0846']
- [dBI DEBUG] dbi_raw = 0.084611 - 0.109029 = -0.024417
- [Training from Abstract Experiences...]
- [RETRY] NEW PROGRESS! Reached 1 NEW levels. Best: 1 -> 2/9
- [RETRY] Resetting to Attempt 1/15 due to progress. Retrying...
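The RETRY lines suggest a loop in which the attempt counter advances after a failed attempt and resets to 1 whenever a new level is reached. A sketch under that reading follows. `run_with_retries` and `play_attempt` are hypothetical names, and the log does not show what happens once all 15 attempts are used.

```python
def run_with_retries(play_attempt, max_attempts=15):
    """Retry until `max_attempts` consecutive attempts bring no new level.

    `play_attempt(attempt)` is a hypothetical callback returning the number
    of levels completed at the end of that attempt.
    """
    best = 0
    attempt = 1
    while attempt <= max_attempts:
        levels = play_attempt(attempt)
        if levels > best:
            best = levels   # new progress: remember it and reset the counter
            attempt = 1
        else:
            attempt += 1    # no progress: use up one attempt
    return best
```

With a budget of 3 and per-attempt results of 0, 1, 1, 1, 1, the counter would run 1, 2, 1, 2, 3 before giving up with a best of 1.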
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 1/15
- ================================================================================
- [INITIAL SCORE] API started at score=2, initializing levels_completed=2
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=2.00)
- [CRYSTALLIZATION] Best: 2.0/9.0, Strength=0.22, Dampening=0.82
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 9, Valid BIs: 9
- [dBI DEBUG] Last 5 BIs: ['0.1238', '0.1065', '0.1090', '0.0846', '0.1647']
- [dBI DEBUG] dbi_raw = 0.164687 - 0.084611 = 0.080075
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 2/9
- [RETRY] Attempt 1/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 2/15
- ================================================================================
- [INITIAL SCORE] API started at score=2, initializing levels_completed=2
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=2.00)
- [CRYSTALLIZATION] Best: 2.0/9.0, Strength=0.22, Dampening=0.82
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 10, Valid BIs: 10
- [dBI DEBUG] Last 5 BIs: ['0.1065', '0.1090', '0.0846', '0.1647', '0.1388']
- [dBI DEBUG] dbi_raw = 0.138787 - 0.164687 = -0.025900
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 2/9
- [RETRY] Attempt 2/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 3/15
- ================================================================================
- [INITIAL SCORE] API started at score=2, initializing levels_completed=2
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=2.00)
- [CRYSTALLIZATION] Best: 2.0/9.0, Strength=0.22, Dampening=0.82
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 11, Valid BIs: 11
- [dBI DEBUG] Last 5 BIs: ['0.1090', '0.0846', '0.1647', '0.1388', '0.0920']
- [dBI DEBUG] dbi_raw = 0.091963 - 0.138787 = -0.046824
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 2/9
- [RETRY] Attempt 3/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 4/15
- ================================================================================
- [INITIAL SCORE] API started at score=2, initializing levels_completed=2
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=2.00)
- [CRYSTALLIZATION] Best: 2.0/9.0, Strength=0.22, Dampening=0.82
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 12, Valid BIs: 12
- [dBI DEBUG] Last 5 BIs: ['0.0846', '0.1647', '0.1388', '0.0920', '0.0769']
- [dBI DEBUG] dbi_raw = 0.076917 - 0.091963 = -0.015046
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 2/9
- [RETRY] Attempt 4/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 5/15
- ================================================================================
- [INITIAL SCORE] API started at score=2, initializing levels_completed=2
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=2.00)
- [CRYSTALLIZATION] Best: 2.0/9.0, Strength=0.22, Dampening=0.82
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 13, Valid BIs: 13
- [dBI DEBUG] Last 5 BIs: ['0.1647', '0.1388', '0.0920', '0.0769', '0.1244']
- [dBI DEBUG] dbi_raw = 0.124397 - 0.076917 = 0.047480
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 2/9
- [RETRY] Attempt 5/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 6/15
- ================================================================================
- [INITIAL SCORE] API started at score=2, initializing levels_completed=2
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=2.00)
- [CRYSTALLIZATION] Best: 2.0/9.0, Strength=0.22, Dampening=0.82
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 14, Valid BIs: 14
- [dBI DEBUG] Last 5 BIs: ['0.1388', '0.0920', '0.0769', '0.1244', '0.1167']
- [dBI DEBUG] dbi_raw = 0.116657 - 0.124397 = -0.007740
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 2/9
- [RETRY] Attempt 6/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 7/15
- ================================================================================
- [INITIAL SCORE] API started at score=2, initializing levels_completed=2
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=2.00)
- [CRYSTALLIZATION] Best: 2.0/9.0, Strength=0.22, Dampening=0.82
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 15, Valid BIs: 15
- [dBI DEBUG] Last 5 BIs: ['0.0920', '0.0769', '0.1244', '0.1167', '0.1243']
- [dBI DEBUG] dbi_raw = 0.124306 - 0.116657 = 0.007648
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 2/9
- [RETRY] Attempt 7/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 8/15
- ================================================================================
- [INITIAL SCORE] API started at score=2, initializing levels_completed=2
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [API REWARD] score: 2 -> 3, reward: 1.00
- [LEVEL 3 COMPLETE @ step 20] Episode reward: 3.0
- [CRYSTALLIZATION] Game as66-821a4dcad9c2: NEW BEST 3.0/9.0 (strength: 0.33)
- [CRYSTALLIZATION] Best: 3.0/9.0, Strength=0.33, Dampening=0.73
- [SAVED] crystallization.pt updated (new best: 3.0)
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 16, Valid BIs: 16
- [dBI DEBUG] Last 5 BIs: ['0.0769', '0.1244', '0.1167', '0.1243', '0.1961']
- [dBI DEBUG] dbi_raw = 0.196069 - 0.124306 = 0.071763
- [Training from Abstract Experiences...]
- [RETRY] NEW PROGRESS! Reached 1 NEW levels. Best: 2 -> 3/9
- [RETRY] Resetting to Attempt 1/15 due to progress. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 1/15
- ================================================================================
- [INITIAL SCORE] API started at score=3, initializing levels_completed=3
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [API REWARD] score: 3 -> 4, reward: 1.00
- [LEVEL 4 COMPLETE @ step 26] Episode reward: 4.0
- [CRYSTALLIZATION] Game as66-821a4dcad9c2: NEW BEST 4.0/9.0 (strength: 0.44)
- [CRYSTALLIZATION] Best: 4.0/9.0, Strength=0.44, Dampening=0.64
- [SAVED] crystallization.pt updated (new best: 4.0)
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 17, Valid BIs: 17
- [dBI DEBUG] Last 5 BIs: ['0.1244', '0.1167', '0.1243', '0.1961', '0.1432']
- [dBI DEBUG] dbi_raw = 0.143177 - 0.196069 = -0.052892
- [Training from Abstract Experiences...]
- [RETRY] NEW PROGRESS! Reached 1 NEW levels. Best: 3 -> 4/9
- [RETRY] Resetting to Attempt 1/15 due to progress. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 1/15
- ================================================================================
- [INITIAL SCORE] API started at score=4, initializing levels_completed=4
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=4.00)
- [CRYSTALLIZATION] Best: 4.0/9.0, Strength=0.44, Dampening=0.64
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 18, Valid BIs: 18
- [dBI DEBUG] Last 5 BIs: ['0.1167', '0.1243', '0.1961', '0.1432', '0.1530']
- [dBI DEBUG] dbi_raw = 0.153047 - 0.143177 = 0.009871
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 4/9
- [RETRY] Attempt 1/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 2/15
- ================================================================================
- [INITIAL SCORE] API started at score=4, initializing levels_completed=4
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=4.00)
- [CRYSTALLIZATION] Best: 4.0/9.0, Strength=0.44, Dampening=0.64
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 19, Valid BIs: 19
- [dBI DEBUG] Last 5 BIs: ['0.1243', '0.1961', '0.1432', '0.1530', '0.1501']
- [dBI DEBUG] dbi_raw = 0.150057 - 0.153047 = -0.002991
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 4/9
- [RETRY] Attempt 2/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 3/15
- ================================================================================
- [INITIAL SCORE] API started at score=4, initializing levels_completed=4
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=4.00)
- [CRYSTALLIZATION] Best: 4.0/9.0, Strength=0.44, Dampening=0.64
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.1961', '0.1432', '0.1530', '0.1501', '0.1669']
- [dBI DEBUG] dbi_raw = 0.166927 - 0.150057 = 0.016870
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 4/9
- [RETRY] Attempt 3/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 4/15
- ================================================================================
- [INITIAL SCORE] API started at score=4, initializing levels_completed=4
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=4.00)
- [CRYSTALLIZATION] Best: 4.0/9.0, Strength=0.44, Dampening=0.64
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.1432', '0.1530', '0.1501', '0.1669', '0.1649']
- [dBI DEBUG] dbi_raw = 0.164945 - 0.166927 = -0.001982
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 4/9
- [RETRY] Attempt 4/15. Retrying...
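From this point on "Total history" stays at 20 while new BI values keep arriving, which would fit a fixed-size history window. A bounded deque gives that behaviour, as sketched below. The use of `collections.deque` is an assumption, and the log shows only the cap of 20, not the container.

```python
from collections import deque

HISTORY_LIMIT = 20  # the log's "Total history" stops growing at 20

bi_history = deque(maxlen=HISTORY_LIMIT)

def record_bi(value):
    """Append a BI value; the oldest entry is dropped once the window is full."""
    bi_history.append(value)
    return len(bi_history)

# Feed 25 placeholder values; the window never exceeds 20 entries.
sizes = [record_bi(i / 100) for i in range(25)]
```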
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 5/15
- ================================================================================
- [INITIAL SCORE] API started at score=4, initializing levels_completed=4
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=4.00)
- [CRYSTALLIZATION] Best: 4.0/9.0, Strength=0.44, Dampening=0.64
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.1530', '0.1501', '0.1669', '0.1649', '0.1431']
- [dBI DEBUG] dbi_raw = 0.143101 - 0.164945 = -0.021845
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 4/9
- [RETRY] Attempt 5/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 6/15
- ================================================================================
- [INITIAL SCORE] API started at score=4, initializing levels_completed=4
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=4.00)
- [CRYSTALLIZATION] Best: 4.0/9.0, Strength=0.44, Dampening=0.64
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.1501', '0.1669', '0.1649', '0.1431', '0.2021']
- [dBI DEBUG] dbi_raw = 0.202140 - 0.143101 = 0.059039
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 4/9
- [RETRY] Attempt 6/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 7/15
- ================================================================================
- [INITIAL SCORE] API started at score=4, initializing levels_completed=4
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=4.00)
- [CRYSTALLIZATION] Best: 4.0/9.0, Strength=0.44, Dampening=0.64
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.1669', '0.1649', '0.1431', '0.2021', '0.1598']
- [dBI DEBUG] dbi_raw = 0.159752 - 0.202140 = -0.042388
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 4/9
- [RETRY] Attempt 7/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 8/15
- ================================================================================
- [INITIAL SCORE] API started at score=4, initializing levels_completed=4
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=4.00)
- [CRYSTALLIZATION] Best: 4.0/9.0, Strength=0.44, Dampening=0.64
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.1649', '0.1431', '0.2021', '0.1598', '0.1673']
- [dBI DEBUG] dbi_raw = 0.167327 - 0.159752 = 0.007575
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 4/9
- [RETRY] Attempt 8/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 9/15
- ================================================================================
- [INITIAL SCORE] API started at score=4, initializing levels_completed=4
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=4.00)
- [CRYSTALLIZATION] Best: 4.0/9.0, Strength=0.44, Dampening=0.64
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.1431', '0.2021', '0.1598', '0.1673', '0.1786']
- [dBI DEBUG] dbi_raw = 0.178576 - 0.167327 = 0.011249
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 4/9
- [RETRY] Attempt 9/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 10/15
- ================================================================================
- [INITIAL SCORE] API started at score=4, initializing levels_completed=4
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=4.00)
- [CRYSTALLIZATION] Best: 4.0/9.0, Strength=0.44, Dampening=0.64
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.2021', '0.1598', '0.1673', '0.1786', '0.1760']
- [dBI DEBUG] dbi_raw = 0.175956 - 0.178576 = -0.002620
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 4/9
- [RETRY] Attempt 10/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 11/15
- ================================================================================
- [INITIAL SCORE] API started at score=4, initializing levels_completed=4
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [API REWARD] score: 4 -> 5, reward: 1.00
- [LEVEL 5 COMPLETE @ step 69] Episode reward: 5.0
- [CRYSTALLIZATION] Game as66-821a4dcad9c2: NEW BEST 5.0/9.0 (strength: 0.56)
- [CRYSTALLIZATION] Best: 5.0/9.0, Strength=0.56, Dampening=0.56
- [SAVED] crystallization.pt updated (new best: 5.0)
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.1598', '0.1673', '0.1786', '0.1760', '0.2006']
- [dBI DEBUG] dbi_raw = 0.200621 - 0.175956 = 0.024664
- [Training from Abstract Experiences...]
- [RETRY] NEW PROGRESS! Reached 1 NEW levels. Best: 4 -> 5/9
- [RETRY] Resetting to Attempt 1/15 due to progress. Retrying...
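Note on rounding in the dbi_raw lines: the printed difference is sometimes one unit off in the last digit from what the printed operands give (0.200621 - 0.175956 is 0.024665, logged as 0.024664). This is presumably because the subtraction runs on unrounded values and each number is rounded separately for display. A minimal sketch for checking such lines with a tolerance follows. The helper name `check_dbi_line` and the 2e-6 tolerance are assumptions chosen for this log format.

```python
import re
from typing import Optional

# Matches lines such as: "[dBI DEBUG] dbi_raw = 0.200621 - 0.175956 = 0.024664"
DBI_LINE = re.compile(r"dbi_raw = (-?\d+\.\d+) - (-?\d+\.\d+) = (-?\d+\.\d+)")


def check_dbi_line(line: str, tol: float = 2e-6) -> Optional[bool]:
    """Return True if the logged difference matches the logged operands
    within `tol`, False if it does not, and None for unrelated lines.

    Three values each rounded to 6 decimals can disagree by up to
    1.5e-6, so the default tolerance is set slightly above that.
    """
    match = DBI_LINE.search(line)
    if match is None:
        return None
    latest, previous, logged = (float(g) for g in match.groups())
    return abs((latest - previous) - logged) <= tol


print(check_dbi_line("- [dBI DEBUG] dbi_raw = 0.200621 - 0.175956 = 0.024664"))
print(check_dbi_line("- [Training Models...]"))
```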
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 1/15
- ================================================================================
- [INITIAL SCORE] API started at score=5, initializing levels_completed=5
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=5.00)
- [CRYSTALLIZATION] Best: 5.0/9.0, Strength=0.56, Dampening=0.56
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.1673', '0.1786', '0.1760', '0.2006', '0.1348']
- [dBI DEBUG] dbi_raw = 0.134790 - 0.200621 = -0.065831
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 5/9
- [RETRY] Attempt 1/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 2/15
- ================================================================================
- [INITIAL SCORE] API started at score=5, initializing levels_completed=5
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=5.00)
- [CRYSTALLIZATION] Best: 5.0/9.0, Strength=0.56, Dampening=0.56
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.1786', '0.1760', '0.2006', '0.1348', '0.2048']
- [dBI DEBUG] dbi_raw = 0.204762 - 0.134790 = 0.069971
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 5/9
- [RETRY] Attempt 2/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 3/15
- ================================================================================
- [INITIAL SCORE] API started at score=5, initializing levels_completed=5
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=5.00)
- [CRYSTALLIZATION] Best: 5.0/9.0, Strength=0.56, Dampening=0.56
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.1760', '0.2006', '0.1348', '0.2048', '0.1584']
- [dBI DEBUG] dbi_raw = 0.158408 - 0.204762 = -0.046354
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 5/9
- [RETRY] Attempt 3/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 4/15
- ================================================================================
- [INITIAL SCORE] API started at score=5, initializing levels_completed=5
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=5.00)
- [CRYSTALLIZATION] Best: 5.0/9.0, Strength=0.56, Dampening=0.56
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.2006', '0.1348', '0.2048', '0.1584', '0.1888']
- [dBI DEBUG] dbi_raw = 0.188766 - 0.158408 = 0.030358
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 5/9
- [RETRY] Attempt 4/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 5/15
- ================================================================================
- [INITIAL SCORE] API started at score=5, initializing levels_completed=5
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=5.00)
- [CRYSTALLIZATION] Best: 5.0/9.0, Strength=0.56, Dampening=0.56
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.1348', '0.2048', '0.1584', '0.1888', '0.2095']
- [dBI DEBUG] dbi_raw = 0.209511 - 0.188766 = 0.020745
- [Training from Abstract Experiences...]
- Abstract Value Loss: 1427.1710
- Abstract Policy Loss: 1.8204
- [RETRY] No new levels beaten. Best remains: 5/9
- [RETRY] Attempt 5/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 6/15
- ================================================================================
- [INITIAL SCORE] API started at score=5, initializing levels_completed=5
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [API REWARD] score: 5 -> 6, reward: 1.00
- [LEVEL 6 COMPLETE @ step 15] Episode reward: 6.0
- [CRYSTALLIZATION] Game as66-821a4dcad9c2: NEW BEST 6.0/9.0 (strength: 0.67)
- [CRYSTALLIZATION] Best: 6.0/9.0, Strength=0.67, Dampening=0.47
- [SAVED] crystallization.pt updated (new best: 6.0)
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.2048', '0.1584', '0.1888', '0.2095', '0.1642']
- [dBI DEBUG] dbi_raw = 0.164205 - 0.209511 = -0.045306
- [Training from Abstract Experiences...]
- [RETRY] NEW PROGRESS! Reached 1 NEW levels. Best: 5 -> 6/9
- [RETRY] Resetting to Attempt 1/15 due to progress. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 1/15
- ================================================================================
- [INITIAL SCORE] API started at score=6, initializing levels_completed=6
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=6.00)
- [CRYSTALLIZATION] Best: 6.0/9.0, Strength=0.67, Dampening=0.47
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.1584', '0.1888', '0.2095', '0.1642', '0.2483']
- [dBI DEBUG] dbi_raw = 0.248301 - 0.164205 = 0.084096
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 6/9
- [RETRY] Attempt 1/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 2/15
- ================================================================================
- [INITIAL SCORE] API started at score=6, initializing levels_completed=6
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [API REWARD] score: 6 -> 7, reward: 1.00
- [LEVEL 7 COMPLETE @ step 78] Episode reward: 7.0
- [CRYSTALLIZATION] Game as66-821a4dcad9c2: NEW BEST 7.0/9.0 (strength: 0.78)
- [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38
- [SAVED] crystallization.pt updated (new best: 7.0)
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.1888', '0.2095', '0.1642', '0.2483', '0.1501']
- [dBI DEBUG] dbi_raw = 0.150085 - 0.248301 = -0.098216
- [Training from Abstract Experiences...]
- [RETRY] NEW PROGRESS! Reached 1 NEW levels. Best: 6 -> 7/9
- [RETRY] Resetting to Attempt 1/15 due to progress. Retrying...
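Note on the CRYSTALLIZATION lines: the four Strength values logged so far (0.44, 0.56, 0.67, 0.78) match best/9 rounded to two decimals, and the paired Dampening values (0.64, 0.56, 0.47, 0.38) match 1 - 0.8 * strength. Both relations are inferred from the printed numbers only. The function names and the 0.8 slope below are assumptions, and the real implementation may differ.

```python
def crystallization_strength(best_levels: float, total_levels: float) -> float:
    """Fraction of levels completed (inferred: 4/9 -> 0.44, 7/9 -> 0.78)."""
    return best_levels / total_levels


def dampening(strength: float, slope: float = 0.8) -> float:
    """Assumed linear relation fitted to the logged pairs:
    dampening = 1 - slope * strength. The slope of 0.8 is a guess
    that happens to reproduce all four logged values."""
    return 1.0 - slope * strength


for best in (4, 5, 6, 7):
    s = crystallization_strength(best, 9)
    print(f"Best: {best}.0/9.0, Strength={s:.2f}, Dampening={dampening(s):.2f}")
# prints:
# Best: 4.0/9.0, Strength=0.44, Dampening=0.64
# Best: 5.0/9.0, Strength=0.56, Dampening=0.56
# Best: 6.0/9.0, Strength=0.67, Dampening=0.47
# Best: 7.0/9.0, Strength=0.78, Dampening=0.38
```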
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 1/15
- ================================================================================
- [INITIAL SCORE] API started at score=7, initializing levels_completed=7
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=7.00)
- [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.2095', '0.1642', '0.2483', '0.1501', '0.2287']
- [dBI DEBUG] dbi_raw = 0.228709 - 0.150085 = 0.078624
- [Training from Abstract Experiences...]
- Abstract Value Loss: 269.2734
- Abstract Policy Loss: 1.8474
- [RETRY] No new levels beaten. Best remains: 7/9
- [RETRY] Attempt 1/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 2/15
- ================================================================================
- [INITIAL SCORE] API started at score=7, initializing levels_completed=7
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=7.00)
- [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.1642', '0.2483', '0.1501', '0.2287', '0.1417']
- [dBI DEBUG] dbi_raw = 0.141687 - 0.228709 = -0.087023
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 7/9
- [RETRY] Attempt 2/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 3/15
- ================================================================================
- [INITIAL SCORE] API started at score=7, initializing levels_completed=7
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=7.00)
- [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.2483', '0.1501', '0.2287', '0.1417', '0.1514']
- [dBI DEBUG] dbi_raw = 0.151352 - 0.141687 = 0.009665
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 7/9
- [RETRY] Attempt 3/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 4/15
- ================================================================================
- [INITIAL SCORE] API started at score=7, initializing levels_completed=7
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=7.00)
- [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.1501', '0.2287', '0.1417', '0.1514', '0.1695']
- [dBI DEBUG] dbi_raw = 0.169455 - 0.151352 = 0.018103
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 7/9
- [RETRY] Attempt 4/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 5/15
- ================================================================================
- [INITIAL SCORE] API started at score=7, initializing levels_completed=7
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=7.00)
- [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.2287', '0.1417', '0.1514', '0.1695', '0.1718']
- [dBI DEBUG] dbi_raw = 0.171847 - 0.169455 = 0.002392
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 7/9
- [RETRY] Attempt 5/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 6/15
- ================================================================================
- [INITIAL SCORE] API started at score=7, initializing levels_completed=7
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=7.00)
- [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.1417', '0.1514', '0.1695', '0.1718', '0.1584']
- [dBI DEBUG] dbi_raw = 0.158435 - 0.171847 = -0.013411
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 7/9
- [RETRY] Attempt 6/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 7/15
- ================================================================================
- [INITIAL SCORE] API started at score=7, initializing levels_completed=7
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=7.00)
- [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.1514', '0.1695', '0.1718', '0.1584', '0.1595']
- [dBI DEBUG] dbi_raw = 0.159509 - 0.158435 = 0.001073
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 7/9
- [RETRY] Attempt 7/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 8/15
- ================================================================================
- [INITIAL SCORE] API started at score=7, initializing levels_completed=7
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=7.00)
- [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.1695', '0.1718', '0.1584', '0.1595', '0.2448']
- [dBI DEBUG] dbi_raw = 0.244764 - 0.159509 = 0.085255
- [Training from Abstract Experiences...]
- Abstract Value Loss: 503.8107
- Abstract Policy Loss: 1.8477
- [RETRY] No new levels beaten. Best remains: 7/9
- [RETRY] Attempt 8/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 9/15
- ================================================================================
- [INITIAL SCORE] API started at score=7, initializing levels_completed=7
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=7.00)
- [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.1718', '0.1584', '0.1595', '0.2448', '0.1383']
- [dBI DEBUG] dbi_raw = 0.138342 - 0.244764 = -0.106421
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 7/9
- [RETRY] Attempt 9/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 10/15
- ================================================================================
- [INITIAL SCORE] API started at score=7, initializing levels_completed=7
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=7.00)
- [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.1584', '0.1595', '0.2448', '0.1383', '0.1833']
- [dBI DEBUG] dbi_raw = 0.183313 - 0.138342 = 0.044971
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 7/9
- [RETRY] Attempt 10/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 11/15
- ================================================================================
- [INITIAL SCORE] API started at score=7, initializing levels_completed=7
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=7.00)
- [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.1595', '0.2448', '0.1383', '0.1833', '0.2034']
- [dBI DEBUG] dbi_raw = 0.203363 - 0.183313 = 0.020050
- [Training from Abstract Experiences...]
- Abstract Value Loss: 302.0228
- Abstract Policy Loss: 1.8496
- [RETRY] No new levels beaten. Best remains: 7/9
- [RETRY] Attempt 11/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 12/15
- ================================================================================
- [INITIAL SCORE] API started at score=7, initializing levels_completed=7
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=7.00)
- [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.2448', '0.1383', '0.1833', '0.2034', '0.1798']
- [dBI DEBUG] dbi_raw = 0.179812 - 0.203363 = -0.023550
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 7/9
- [RETRY] Attempt 12/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 13/15
- ================================================================================
- [INITIAL SCORE] API started at score=7, initializing levels_completed=7
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=7.00)
- [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.1383', '0.1833', '0.2034', '0.1798', '0.1436']
- [dBI DEBUG] dbi_raw = 0.143637 - 0.179812 = -0.036175
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 7/9
- [RETRY] Attempt 13/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 14/15
- ================================================================================
- [INITIAL SCORE] API started at score=7, initializing levels_completed=7
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=7.00)
- [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.1833', '0.2034', '0.1798', '0.1436', '0.1915']
- [dBI DEBUG] dbi_raw = 0.191460 - 0.143637 = 0.047822
- [Training from Abstract Experiences...]
- Abstract Value Loss: 619.0808
- Abstract Policy Loss: 1.8507
- [RETRY] No new levels beaten. Best remains: 7/9
- [RETRY] Attempt 14/15. Retrying...
- ================================================================================
- [FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 15/15
- ================================================================================
- [INITIAL SCORE] API started at score=7, initializing levels_completed=7
- [Ep0] EXPLORATION: No memory replay
- [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
- [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
- [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
- [Ep0] NOT STORED: No level progress (reward=7.00)
- [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38
- [Running Abstractive Planning...]
- Generated 32 abstract experiences (dreams).
- [Training Models...]
- [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
- [dBI DEBUG] Last 5 BIs: ['0.2034', '0.1798', '0.1436', '0.1915', '0.1753']
- [dBI DEBUG] dbi_raw = 0.175284 - 0.191460 = -0.016175
- [Training from Abstract Experiences...]
- [RETRY] No new levels beaten. Best remains: 7/9
- [RETRY] Max retries (15) reached without improvement. Moving to next game.
- [SCORECARD CLOSED] View replay at: https://three.arcprize.org/scorecards/ff0e097a-e654-4961-bc1c-997d013fd5a9
- Episode Results:
- Reward: 7.000
- Steps: 6
- Mean Confidence: 0.547
- Mean Uncertainty: 0.629
- Mode Counts: {'explore': 6}
- Model Metrics:
- Prediction Error: 0.0038
- Inverse Loss: 0.9070
- Policy Loss: 0.0000
- Info Gain Loss: 0.0000
- Proto-AGI Module Losses:
- Subgoal Value: 0.0052
- Subgoal Proposer: -0.1222
- Subgoal Achievement: 0.0105
- Rule Encoder: -0.4151
- Rule Confidence: 0.6881
- Empowerment Forward: 0.0131
- Empowerment Inverse: 0.9211
- Counterfactual: 0.0011
- MetaLearner Context: 0.0000
- Per-Step Diagnostics (v8.0):
- Learned Value: 0.5530
- H_op (Operator Entropy): 0.9984
- H_attn (Attention Entropy): 0.6308
- MoE Gate Sparsity: 0.0013
- Empowerment Bonus: 0.7371
- GNN Entropy Reg: 0.0000
- MuZero Boost (Planner→Policy Distillation):
- Value Distillation Loss: 0.0094
- Policy Distillation Loss: 1.7919
- Broad Intelligence:
- A_sub: 0.9986
- A_con: 0.4337
- I_alg: 0.4048 (op_H=0.998, attn_H=0.631)
- BI: 0.1753
- CP: 0.1751 (Choice Potential = BI / (1+C))
- dBI: -0.0005 | dBI-EMA: -0.0005 | dBI-norm: -0.191
- Entropy_penalty: 0.0548 (target=0.10, pushes attn_H)
- Execution Blend Controller:
- Execution: DELIBERATE (w=0.429)
- Delta_intuitive: 0.0597
- Delta_deliberate: -0.0597
- Motion CAD: -0.0085
- Intent Ratio: 100.0% LEARN
- Game Statistics:
- as66-821a4dcad9c2: 51 attempts, 98.0% win, 4.37 avg
- ft09-b7249182: 0 attempts, 0.0% win, 0.00 avg
- lp85-e179de251: 0 attempts, 0.0% win, 0.00 avg
- ls20-e98196ab: 0 attempts, 0.0% win, 0.00 avg
- sp80-445eab21: 0 attempts, 0.0% win, 0.00 avg
- vc33-cbded52d: 0 attempts, 0.0% win, 0.00 avg
- ================================================================================
- Cognitive Mode Analysis (Episode 0):
- ================================================================================
- REDACTED ------
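
The `[dBI DEBUG]` lines in this log suggest that `dbi_raw` is the latest BI value minus the previous one (for attempt 13: 0.143637 - 0.179812 = -0.036175). The sketch below reproduces that arithmetic under that assumption. The function names and the `ROLLBACK_THRESHOLD` constant are hypothetical; the threshold value comes from the header line "Rollback Threshold: dBI < -0.01", and the log does not show whether the agent compares the raw, EMA-smoothed, or normalized dBI against it.

```python
from typing import Sequence

# Value taken from the log header; which dBI variant it applies to is an assumption.
ROLLBACK_THRESHOLD = -0.01


def dbi_raw(bi_history: Sequence[float]) -> float:
    """Return the latest BI minus the previous BI, as the debug lines appear to do."""
    if len(bi_history) < 2:
        # Assumption: with fewer than two samples there is no delta to report.
        return 0.0
    return bi_history[-1] - bi_history[-2]


def below_rollback_threshold(delta: float,
                             threshold: float = ROLLBACK_THRESHOLD) -> bool:
    """True when the delta falls strictly below the rollback threshold."""
    return delta < threshold


# The two most recent BI values logged for attempt 13.
delta = dbi_raw([0.179812, 0.143637])
print(f"dbi_raw = {delta:.6f}, below threshold: {below_rollback_threshold(delta)}")
```

Feeding the rounded BI values printed in the log can differ from the logged `dbi_raw` in the last digit (attempts 14 and 15 do), presumably because the agent subtracts unrounded values.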