AS66 Zero shot Zero pre-training

================================================================================
FinalGift Hybrid Scientist Agent - Dual-Process Intelligence

INTENT MODES:
  LEARN: Exploring new strategies, hypothesis testing
  EXPLOIT: Executing known patterns, using learned knowledge

EXECUTION STYLES:
  INTUITIVE: Fast policy network (reflexive, System 1)
  DELIBERATE: Slow world-model planning (analytical, System 2)

Execution blend controlled by capability growth (dBI)

TRAINING MODE: GENERALIST
  Games: 6 (as66-821a4dcad9c2, ft09-b7249182, lp85-e179de251, ls20-e98196ab, sp80-445eab21, vc33-cbded52d)
  Episodes: Unlimited
  Checkpoint: checkpoint_latest.pt
  WEIGHTED SAMPLING: 3x more attempts for unsolved games (<50% win rate)
================================================================================

================================================================================
SYSTEM-3 REFLECTIVE META-CONTROLLER: ENABLED
  Reflective Cycle Frequency: Every 25 episodes
  Safe Reconfiguration Actions: 14
    - zeno_alpha (mode sensitivity)
    - reward_weight (RL vs self-supervised)
    - empowerment_weight (curiosity)
    - zeno_beta (attractor smoothing)
    - learning_rate (adaptation speed)
  Rollback Threshold: dBI < -0.01
================================================================================


Checking for checkpoint: checkpoint_latest.pt
  File exists: False

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 1/15
================================================================================
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [HER] Applied hindsight relabeling to failed episode (reward=0.00, length=49)
  [Ep0] NOT STORED: No level progress (reward=0.00)

[Running Abstractive Planning...]
[HYBRID RULES] Allocated rule bank for as66-821a4dcad9c2 (1 games total)
  Generated 32 abstract experiences (dreams).

[Training Models...]

[Training from Abstract Experiences...]
  Abstract Value Loss: 11.9153
  Abstract Policy Loss: 1.7911
  [RETRY] Episode ended with no reward. Failed attempt.
  [RETRY] Attempt 1/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 2/15
================================================================================
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [API REWARD] score: 0 -> 1, reward: 1.00
  [LEVEL 1 COMPLETE @ step 14] Episode reward: 1.0
  [MEMORY] NEW level 1 beaten in 14 steps
  [TRUNCATED] Storing 15/77 actions (up to and including reward step)
  [Ep0] STORED MEMORY: score=1.00, length=15, levels=1 (with transfer data)
  [SAVED] episodic_memory.pt updated immediately

[CRYSTALLIZATION ACTIVE] Game as66-821a4dcad9c2: Reached score 1.0/9.0
[CRYSTALLIZATION] Progress: 11%, strength: 0.11
  [CRYSTALLIZATION] Best: 1.0/9.0, Strength=0.11, Dampening=0.91
  [SAVED] crystallization.pt updated (new best: 1.0)

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 2, Valid BIs: 2
    [dBI DEBUG] Last 5 BIs: ['0.0252', '0.0354']
    [dBI DEBUG] dbi_raw = 0.035355 - 0.025163 = 0.010193

[Training from Abstract Experiences...]
  Abstract Value Loss: 15.8189
  Abstract Policy Loss: 1.8064
  [RETRY] NEW PROGRESS! Reached 1 NEW levels. Best: 0 -> 1/9
  [RETRY] Resetting to Attempt 1/15 due to progress. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 1/15
================================================================================
  [INITIAL SCORE] API started at score=1, initializing levels_completed=1
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=1.00)
  [CRYSTALLIZATION] Best: 1.0/9.0, Strength=0.11, Dampening=0.91

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 3, Valid BIs: 3
    [dBI DEBUG] Last 5 BIs: ['0.0252', '0.0354', '0.0828']
    [dBI DEBUG] dbi_raw = 0.082848 - 0.035355 = 0.047493

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 1/9
  [RETRY] Attempt 1/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 2/15
================================================================================
  [INITIAL SCORE] API started at score=1, initializing levels_completed=1
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=1.00)
  [CRYSTALLIZATION] Best: 1.0/9.0, Strength=0.11, Dampening=0.91

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 4, Valid BIs: 4
    [dBI DEBUG] Last 5 BIs: ['0.0252', '0.0354', '0.0828', '0.1452']
    [dBI DEBUG] dbi_raw = 0.145243 - 0.082848 = 0.062395

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 1/9
  [RETRY] Attempt 2/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 3/15
================================================================================
  [INITIAL SCORE] API started at score=1, initializing levels_completed=1
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=1.00)
  [CRYSTALLIZATION] Best: 1.0/9.0, Strength=0.11, Dampening=0.91

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 5, Valid BIs: 5
    [dBI DEBUG] Last 5 BIs: ['0.0252', '0.0354', '0.0828', '0.1452', '0.1238']
    [dBI DEBUG] dbi_raw = 0.123755 - 0.145243 = -0.021488

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 1/9
  [RETRY] Attempt 3/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 4/15
================================================================================
  [INITIAL SCORE] API started at score=1, initializing levels_completed=1
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=1.00)
  [CRYSTALLIZATION] Best: 1.0/9.0, Strength=0.11, Dampening=0.91

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 6, Valid BIs: 6
    [dBI DEBUG] Last 5 BIs: ['0.0354', '0.0828', '0.1452', '0.1238', '0.1065']
    [dBI DEBUG] dbi_raw = 0.106548 - 0.123755 = -0.017207

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 1/9
  [RETRY] Attempt 4/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 5/15
================================================================================
  [INITIAL SCORE] API started at score=1, initializing levels_completed=1
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=1.00)
  [CRYSTALLIZATION] Best: 1.0/9.0, Strength=0.11, Dampening=0.91

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 7, Valid BIs: 7
    [dBI DEBUG] Last 5 BIs: ['0.0828', '0.1452', '0.1238', '0.1065', '0.1090']
    [dBI DEBUG] dbi_raw = 0.109029 - 0.106548 = 0.002480

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 1/9
  [RETRY] Attempt 5/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 6/15
================================================================================
  [INITIAL SCORE] API started at score=1, initializing levels_completed=1
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [API REWARD] score: 1 -> 2, reward: 1.00
  [LEVEL 2 COMPLETE @ step 10] Episode reward: 2.0
[CRYSTALLIZATION] Game as66-821a4dcad9c2: NEW BEST 2.0/9.0 (strength: 0.22)
  [CRYSTALLIZATION] Best: 2.0/9.0, Strength=0.22, Dampening=0.82
  [SAVED] crystallization.pt updated (new best: 2.0)

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 8, Valid BIs: 8
    [dBI DEBUG] Last 5 BIs: ['0.1452', '0.1238', '0.1065', '0.1090', '0.0846']
    [dBI DEBUG] dbi_raw = 0.084611 - 0.109029 = -0.024417

[Training from Abstract Experiences...]
  [RETRY] NEW PROGRESS! Reached 1 NEW levels. Best: 1 -> 2/9
  [RETRY] Resetting to Attempt 1/15 due to progress. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 1/15
================================================================================
  [INITIAL SCORE] API started at score=2, initializing levels_completed=2
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=2.00)
  [CRYSTALLIZATION] Best: 2.0/9.0, Strength=0.22, Dampening=0.82

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 9, Valid BIs: 9
    [dBI DEBUG] Last 5 BIs: ['0.1238', '0.1065', '0.1090', '0.0846', '0.1647']
    [dBI DEBUG] dbi_raw = 0.164687 - 0.084611 = 0.080075

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 2/9
  [RETRY] Attempt 1/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 2/15
================================================================================
  [INITIAL SCORE] API started at score=2, initializing levels_completed=2
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=2.00)
  [CRYSTALLIZATION] Best: 2.0/9.0, Strength=0.22, Dampening=0.82

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 10, Valid BIs: 10
    [dBI DEBUG] Last 5 BIs: ['0.1065', '0.1090', '0.0846', '0.1647', '0.1388']
    [dBI DEBUG] dbi_raw = 0.138787 - 0.164687 = -0.025900

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 2/9
  [RETRY] Attempt 2/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 3/15
================================================================================
  [INITIAL SCORE] API started at score=2, initializing levels_completed=2
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=2.00)
  [CRYSTALLIZATION] Best: 2.0/9.0, Strength=0.22, Dampening=0.82

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 11, Valid BIs: 11
    [dBI DEBUG] Last 5 BIs: ['0.1090', '0.0846', '0.1647', '0.1388', '0.0920']
    [dBI DEBUG] dbi_raw = 0.091963 - 0.138787 = -0.046824

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 2/9
  [RETRY] Attempt 3/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 4/15
================================================================================
  [INITIAL SCORE] API started at score=2, initializing levels_completed=2
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=2.00)
  [CRYSTALLIZATION] Best: 2.0/9.0, Strength=0.22, Dampening=0.82

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 12, Valid BIs: 12
    [dBI DEBUG] Last 5 BIs: ['0.0846', '0.1647', '0.1388', '0.0920', '0.0769']
    [dBI DEBUG] dbi_raw = 0.076917 - 0.091963 = -0.015046

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 2/9
  [RETRY] Attempt 4/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 5/15
================================================================================
  [INITIAL SCORE] API started at score=2, initializing levels_completed=2
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=2.00)
  [CRYSTALLIZATION] Best: 2.0/9.0, Strength=0.22, Dampening=0.82

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 13, Valid BIs: 13
    [dBI DEBUG] Last 5 BIs: ['0.1647', '0.1388', '0.0920', '0.0769', '0.1244']
    [dBI DEBUG] dbi_raw = 0.124397 - 0.076917 = 0.047480

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 2/9
  [RETRY] Attempt 5/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 6/15
================================================================================
  [INITIAL SCORE] API started at score=2, initializing levels_completed=2
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=2.00)
  [CRYSTALLIZATION] Best: 2.0/9.0, Strength=0.22, Dampening=0.82

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 14, Valid BIs: 14
    [dBI DEBUG] Last 5 BIs: ['0.1388', '0.0920', '0.0769', '0.1244', '0.1167']
    [dBI DEBUG] dbi_raw = 0.116657 - 0.124397 = -0.007740

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 2/9
  [RETRY] Attempt 6/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 7/15
================================================================================
  [INITIAL SCORE] API started at score=2, initializing levels_completed=2
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=2.00)
  [CRYSTALLIZATION] Best: 2.0/9.0, Strength=0.22, Dampening=0.82

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 15, Valid BIs: 15
    [dBI DEBUG] Last 5 BIs: ['0.0920', '0.0769', '0.1244', '0.1167', '0.1243']
    [dBI DEBUG] dbi_raw = 0.124306 - 0.116657 = 0.007648

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 2/9
  [RETRY] Attempt 7/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 8/15
================================================================================
  [INITIAL SCORE] API started at score=2, initializing levels_completed=2
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [API REWARD] score: 2 -> 3, reward: 1.00
  [LEVEL 3 COMPLETE @ step 20] Episode reward: 3.0
[CRYSTALLIZATION] Game as66-821a4dcad9c2: NEW BEST 3.0/9.0 (strength: 0.33)
  [CRYSTALLIZATION] Best: 3.0/9.0, Strength=0.33, Dampening=0.73
  [SAVED] crystallization.pt updated (new best: 3.0)

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 16, Valid BIs: 16
    [dBI DEBUG] Last 5 BIs: ['0.0769', '0.1244', '0.1167', '0.1243', '0.1961']
    [dBI DEBUG] dbi_raw = 0.196069 - 0.124306 = 0.071763

[Training from Abstract Experiences...]
  [RETRY] NEW PROGRESS! Reached 1 NEW levels. Best: 2 -> 3/9
  [RETRY] Resetting to Attempt 1/15 due to progress. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 1/15
================================================================================
  [INITIAL SCORE] API started at score=3, initializing levels_completed=3
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [API REWARD] score: 3 -> 4, reward: 1.00
  [LEVEL 4 COMPLETE @ step 26] Episode reward: 4.0
[CRYSTALLIZATION] Game as66-821a4dcad9c2: NEW BEST 4.0/9.0 (strength: 0.44)
  [CRYSTALLIZATION] Best: 4.0/9.0, Strength=0.44, Dampening=0.64
  [SAVED] crystallization.pt updated (new best: 4.0)

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 17, Valid BIs: 17
    [dBI DEBUG] Last 5 BIs: ['0.1244', '0.1167', '0.1243', '0.1961', '0.1432']
    [dBI DEBUG] dbi_raw = 0.143177 - 0.196069 = -0.052892

[Training from Abstract Experiences...]
  [RETRY] NEW PROGRESS! Reached 1 NEW levels. Best: 3 -> 4/9
  [RETRY] Resetting to Attempt 1/15 due to progress. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 1/15
================================================================================
  [INITIAL SCORE] API started at score=4, initializing levels_completed=4
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=4.00)
  [CRYSTALLIZATION] Best: 4.0/9.0, Strength=0.44, Dampening=0.64

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 18, Valid BIs: 18
    [dBI DEBUG] Last 5 BIs: ['0.1167', '0.1243', '0.1961', '0.1432', '0.1530']
    [dBI DEBUG] dbi_raw = 0.153047 - 0.143177 = 0.009871

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 4/9
  [RETRY] Attempt 1/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 2/15
================================================================================
  [INITIAL SCORE] API started at score=4, initializing levels_completed=4
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=4.00)
  [CRYSTALLIZATION] Best: 4.0/9.0, Strength=0.44, Dampening=0.64

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 19, Valid BIs: 19
    [dBI DEBUG] Last 5 BIs: ['0.1243', '0.1961', '0.1432', '0.1530', '0.1501']
    [dBI DEBUG] dbi_raw = 0.150057 - 0.153047 = -0.002991

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 4/9
  [RETRY] Attempt 2/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 3/15
================================================================================
  [INITIAL SCORE] API started at score=4, initializing levels_completed=4
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=4.00)
  [CRYSTALLIZATION] Best: 4.0/9.0, Strength=0.44, Dampening=0.64

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.1961', '0.1432', '0.1530', '0.1501', '0.1669']
    [dBI DEBUG] dbi_raw = 0.166927 - 0.150057 = 0.016870

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 4/9
  [RETRY] Attempt 3/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 4/15
================================================================================
  [INITIAL SCORE] API started at score=4, initializing levels_completed=4
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=4.00)
  [CRYSTALLIZATION] Best: 4.0/9.0, Strength=0.44, Dampening=0.64

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.1432', '0.1530', '0.1501', '0.1669', '0.1649']
    [dBI DEBUG] dbi_raw = 0.164945 - 0.166927 = -0.001982

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 4/9
  [RETRY] Attempt 4/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 5/15
================================================================================
  [INITIAL SCORE] API started at score=4, initializing levels_completed=4
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=4.00)
  [CRYSTALLIZATION] Best: 4.0/9.0, Strength=0.44, Dampening=0.64

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.1530', '0.1501', '0.1669', '0.1649', '0.1431']
    [dBI DEBUG] dbi_raw = 0.143101 - 0.164945 = -0.021845

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 4/9
  [RETRY] Attempt 5/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 6/15
================================================================================
  [INITIAL SCORE] API started at score=4, initializing levels_completed=4
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=4.00)
  [CRYSTALLIZATION] Best: 4.0/9.0, Strength=0.44, Dampening=0.64

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.1501', '0.1669', '0.1649', '0.1431', '0.2021']
    [dBI DEBUG] dbi_raw = 0.202140 - 0.143101 = 0.059039

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 4/9
  [RETRY] Attempt 6/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 7/15
================================================================================
  [INITIAL SCORE] API started at score=4, initializing levels_completed=4
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=4.00)
  [CRYSTALLIZATION] Best: 4.0/9.0, Strength=0.44, Dampening=0.64

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.1669', '0.1649', '0.1431', '0.2021', '0.1598']
    [dBI DEBUG] dbi_raw = 0.159752 - 0.202140 = -0.042388

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 4/9
  [RETRY] Attempt 7/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 8/15
================================================================================
  [INITIAL SCORE] API started at score=4, initializing levels_completed=4
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=4.00)
  [CRYSTALLIZATION] Best: 4.0/9.0, Strength=0.44, Dampening=0.64

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.1649', '0.1431', '0.2021', '0.1598', '0.1673']
    [dBI DEBUG] dbi_raw = 0.167327 - 0.159752 = 0.007575

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 4/9
  [RETRY] Attempt 8/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 9/15
================================================================================
  [INITIAL SCORE] API started at score=4, initializing levels_completed=4
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=4.00)
  [CRYSTALLIZATION] Best: 4.0/9.0, Strength=0.44, Dampening=0.64

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.1431', '0.2021', '0.1598', '0.1673', '0.1786']
    [dBI DEBUG] dbi_raw = 0.178576 - 0.167327 = 0.011249

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 4/9
  [RETRY] Attempt 9/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 10/15
================================================================================
  [INITIAL SCORE] API started at score=4, initializing levels_completed=4
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=4.00)
  [CRYSTALLIZATION] Best: 4.0/9.0, Strength=0.44, Dampening=0.64

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.2021', '0.1598', '0.1673', '0.1786', '0.1760']
    [dBI DEBUG] dbi_raw = 0.175956 - 0.178576 = -0.002620

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 4/9
  [RETRY] Attempt 10/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 11/15
================================================================================
  [INITIAL SCORE] API started at score=4, initializing levels_completed=4
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [API REWARD] score: 4 -> 5, reward: 1.00
  [LEVEL 5 COMPLETE @ step 69] Episode reward: 5.0
[CRYSTALLIZATION] Game as66-821a4dcad9c2: NEW BEST 5.0/9.0 (strength: 0.56)
  [CRYSTALLIZATION] Best: 5.0/9.0, Strength=0.56, Dampening=0.56
  [SAVED] crystallization.pt updated (new best: 5.0)

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.1598', '0.1673', '0.1786', '0.1760', '0.2006']
    [dBI DEBUG] dbi_raw = 0.200621 - 0.175956 = 0.024664

[Training from Abstract Experiences...]
  [RETRY] NEW PROGRESS! Reached 1 NEW levels. Best: 4 -> 5/9
  [RETRY] Resetting to Attempt 1/15 due to progress. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 1/15
================================================================================
  [INITIAL SCORE] API started at score=5, initializing levels_completed=5
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=5.00)
  [CRYSTALLIZATION] Best: 5.0/9.0, Strength=0.56, Dampening=0.56

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.1673', '0.1786', '0.1760', '0.2006', '0.1348']
    [dBI DEBUG] dbi_raw = 0.134790 - 0.200621 = -0.065831

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 5/9
  [RETRY] Attempt 1/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 2/15
================================================================================
  [INITIAL SCORE] API started at score=5, initializing levels_completed=5
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=5.00)
  [CRYSTALLIZATION] Best: 5.0/9.0, Strength=0.56, Dampening=0.56

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.1786', '0.1760', '0.2006', '0.1348', '0.2048']
    [dBI DEBUG] dbi_raw = 0.204762 - 0.134790 = 0.069971

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 5/9
  [RETRY] Attempt 2/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 3/15
================================================================================
  [INITIAL SCORE] API started at score=5, initializing levels_completed=5
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=5.00)
  [CRYSTALLIZATION] Best: 5.0/9.0, Strength=0.56, Dampening=0.56

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.1760', '0.2006', '0.1348', '0.2048', '0.1584']
    [dBI DEBUG] dbi_raw = 0.158408 - 0.204762 = -0.046354

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 5/9
  [RETRY] Attempt 3/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 4/15
================================================================================
  [INITIAL SCORE] API started at score=5, initializing levels_completed=5
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=5.00)
  [CRYSTALLIZATION] Best: 5.0/9.0, Strength=0.56, Dampening=0.56

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.2006', '0.1348', '0.2048', '0.1584', '0.1888']
    [dBI DEBUG] dbi_raw = 0.188766 - 0.158408 = 0.030358

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 5/9
  [RETRY] Attempt 4/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 5/15
================================================================================
  [INITIAL SCORE] API started at score=5, initializing levels_completed=5
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=5.00)
  [CRYSTALLIZATION] Best: 5.0/9.0, Strength=0.56, Dampening=0.56

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.1348', '0.2048', '0.1584', '0.1888', '0.2095']
    [dBI DEBUG] dbi_raw = 0.209511 - 0.188766 = 0.020745

[Training from Abstract Experiences...]
  Abstract Value Loss: 1427.1710
  Abstract Policy Loss: 1.8204
  [RETRY] No new levels beaten. Best remains: 5/9
  [RETRY] Attempt 5/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 6/15
================================================================================
  [INITIAL SCORE] API started at score=5, initializing levels_completed=5
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [API REWARD] score: 5 -> 6, reward: 1.00
  [LEVEL 6 COMPLETE @ step 15] Episode reward: 6.0
[CRYSTALLIZATION] Game as66-821a4dcad9c2: NEW BEST 6.0/9.0 (strength: 0.67)
  [CRYSTALLIZATION] Best: 6.0/9.0, Strength=0.67, Dampening=0.47
  [SAVED] crystallization.pt updated (new best: 6.0)

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.2048', '0.1584', '0.1888', '0.2095', '0.1642']
    [dBI DEBUG] dbi_raw = 0.164205 - 0.209511 = -0.045306

[Training from Abstract Experiences...]
  [RETRY] NEW PROGRESS! Reached 1 NEW levels. Best: 5 -> 6/9
  [RETRY] Resetting to Attempt 1/15 due to progress. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 1/15
================================================================================
  [INITIAL SCORE] API started at score=6, initializing levels_completed=6
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=6.00)
  [CRYSTALLIZATION] Best: 6.0/9.0, Strength=0.67, Dampening=0.47

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.1584', '0.1888', '0.2095', '0.1642', '0.2483']
    [dBI DEBUG] dbi_raw = 0.248301 - 0.164205 = 0.084096

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 6/9
  [RETRY] Attempt 1/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 2/15
================================================================================
  [INITIAL SCORE] API started at score=6, initializing levels_completed=6
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [API REWARD] score: 6 -> 7, reward: 1.00
  [LEVEL 7 COMPLETE @ step 78] Episode reward: 7.0
[CRYSTALLIZATION] Game as66-821a4dcad9c2: NEW BEST 7.0/9.0 (strength: 0.78)
  [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38
  [SAVED] crystallization.pt updated (new best: 7.0)

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.1888', '0.2095', '0.1642', '0.2483', '0.1501']
    [dBI DEBUG] dbi_raw = 0.150085 - 0.248301 = -0.098216

[Training from Abstract Experiences...]
  [RETRY] NEW PROGRESS! Reached 1 NEW levels. Best: 6 -> 7/9
  [RETRY] Resetting to Attempt 1/15 due to progress. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 1/15
================================================================================
  [INITIAL SCORE] API started at score=7, initializing levels_completed=7
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=7.00)
  [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.2095', '0.1642', '0.2483', '0.1501', '0.2287']
    [dBI DEBUG] dbi_raw = 0.228709 - 0.150085 = 0.078624

[Training from Abstract Experiences...]
  Abstract Value Loss: 269.2734
  Abstract Policy Loss: 1.8474
  [RETRY] No new levels beaten. Best remains: 7/9
  [RETRY] Attempt 1/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 2/15
================================================================================
  [INITIAL SCORE] API started at score=7, initializing levels_completed=7
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=7.00)
  [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.1642', '0.2483', '0.1501', '0.2287', '0.1417']
    [dBI DEBUG] dbi_raw = 0.141687 - 0.228709 = -0.087023

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 7/9
  [RETRY] Attempt 2/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 3/15
================================================================================
  [INITIAL SCORE] API started at score=7, initializing levels_completed=7
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=7.00)
  [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.2483', '0.1501', '0.2287', '0.1417', '0.1514']
    [dBI DEBUG] dbi_raw = 0.151352 - 0.141687 = 0.009665

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 7/9
  [RETRY] Attempt 3/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 4/15
================================================================================
  [INITIAL SCORE] API started at score=7, initializing levels_completed=7
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=7.00)
  [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.1501', '0.2287', '0.1417', '0.1514', '0.1695']
    [dBI DEBUG] dbi_raw = 0.169455 - 0.151352 = 0.018103

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 7/9
  [RETRY] Attempt 4/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 5/15
================================================================================
  [INITIAL SCORE] API started at score=7, initializing levels_completed=7
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=7.00)
  [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.2287', '0.1417', '0.1514', '0.1695', '0.1718']
    [dBI DEBUG] dbi_raw = 0.171847 - 0.169455 = 0.002392

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 7/9
  [RETRY] Attempt 5/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 6/15
================================================================================
  [INITIAL SCORE] API started at score=7, initializing levels_completed=7
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=7.00)
  [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.1417', '0.1514', '0.1695', '0.1718', '0.1584']
    [dBI DEBUG] dbi_raw = 0.158435 - 0.171847 = -0.013411

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 7/9
  [RETRY] Attempt 6/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 7/15
================================================================================
  [INITIAL SCORE] API started at score=7, initializing levels_completed=7
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=7.00)
  [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.1514', '0.1695', '0.1718', '0.1584', '0.1595']
    [dBI DEBUG] dbi_raw = 0.159509 - 0.158435 = 0.001073

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 7/9
  [RETRY] Attempt 7/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 8/15
================================================================================
  [INITIAL SCORE] API started at score=7, initializing levels_completed=7
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=7.00)
  [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.1695', '0.1718', '0.1584', '0.1595', '0.2448']
    [dBI DEBUG] dbi_raw = 0.244764 - 0.159509 = 0.085255

[Training from Abstract Experiences...]
  Abstract Value Loss: 503.8107
  Abstract Policy Loss: 1.8477
  [RETRY] No new levels beaten. Best remains: 7/9
  [RETRY] Attempt 8/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 9/15
================================================================================
  [INITIAL SCORE] API started at score=7, initializing levels_completed=7
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=7.00)
  [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.1718', '0.1584', '0.1595', '0.2448', '0.1383']
    [dBI DEBUG] dbi_raw = 0.138342 - 0.244764 = -0.106421

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 7/9
  [RETRY] Attempt 9/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 10/15
================================================================================
  [INITIAL SCORE] API started at score=7, initializing levels_completed=7
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=7.00)
  [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.1584', '0.1595', '0.2448', '0.1383', '0.1833']
    [dBI DEBUG] dbi_raw = 0.183313 - 0.138342 = 0.044971

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 7/9
  [RETRY] Attempt 10/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 11/15
================================================================================
  [INITIAL SCORE] API started at score=7, initializing levels_completed=7
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=7.00)
  [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.1595', '0.2448', '0.1383', '0.1833', '0.2034']
    [dBI DEBUG] dbi_raw = 0.203363 - 0.183313 = 0.020050

[Training from Abstract Experiences...]
  Abstract Value Loss: 302.0228
  Abstract Policy Loss: 1.8496
  [RETRY] No new levels beaten. Best remains: 7/9
  [RETRY] Attempt 11/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 12/15
================================================================================
  [INITIAL SCORE] API started at score=7, initializing levels_completed=7
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=7.00)
  [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.2448', '0.1383', '0.1833', '0.2034', '0.1798']
    [dBI DEBUG] dbi_raw = 0.179812 - 0.203363 = -0.023550

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 7/9
  [RETRY] Attempt 12/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 13/15
================================================================================
  [INITIAL SCORE] API started at score=7, initializing levels_completed=7
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=7.00)
  [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.1383', '0.1833', '0.2034', '0.1798', '0.1436']
    [dBI DEBUG] dbi_raw = 0.143637 - 0.179812 = -0.036175

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 7/9
  [RETRY] Attempt 13/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 14/15
================================================================================
  [INITIAL SCORE] API started at score=7, initializing levels_completed=7
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=7.00)
  [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.1833', '0.2034', '0.1798', '0.1436', '0.1915']
    [dBI DEBUG] dbi_raw = 0.191460 - 0.143637 = 0.047822

[Training from Abstract Experiences...]
  Abstract Value Loss: 619.0808
  Abstract Policy Loss: 1.8507
  [RETRY] No new levels beaten. Best remains: 7/9
  [RETRY] Attempt 14/15. Retrying...

================================================================================
[FinalGift 8.0] Episode 0 | Game: as66-821a4dcad9c2 | Attempt 15/15
================================================================================
  [INITIAL SCORE] API started at score=7, initializing levels_completed=7
  [Ep0] EXPLORATION: No memory replay
  [Game: as66-821a4dcad9c2] API available_actions: [1, 2, 3, 4, 6]
  [Game: as66-821a4dcad9c2] Action mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [DEBUG] available_actions from API: [1, 2, 3, 4, 6]
  [DEBUG] action_mask: {0: True, 1: True, 2: True, 3: True, 'click': True, 4: False, 5: False}
  [Ep0] NOT STORED: No level progress (reward=7.00)
  [CRYSTALLIZATION] Best: 7.0/9.0, Strength=0.78, Dampening=0.38

[Running Abstractive Planning...]
  Generated 32 abstract experiences (dreams).

[Training Models...]
    [dBI DEBUG as66-821a4dcad9c2] Total history: 20, Valid BIs: 20
    [dBI DEBUG] Last 5 BIs: ['0.2034', '0.1798', '0.1436', '0.1915', '0.1753']
    [dBI DEBUG] dbi_raw = 0.175284 - 0.191460 = -0.016175

[Training from Abstract Experiences...]
  [RETRY] No new levels beaten. Best remains: 7/9
  [RETRY] Max retries (15) reached without improvement. Moving to next game.
  [SCORECARD CLOSED] View replay at: https://three.arcprize.org/scorecards/ff0e097a-e654-4961-bc1c-997d013fd5a9

Episode Results:
  Reward: 7.000
  Steps: 6
  Mean Confidence: 0.547
  Mean Uncertainty: 0.629
  Mode Counts: {'explore': 6}

Model Metrics:
  Prediction Error: 0.0038
  Inverse Loss: 0.9070
  Policy Loss: 0.0000
  Info Gain Loss: 0.0000

Proto-AGI Module Losses:
  Subgoal Value: 0.0052
  Subgoal Proposer: -0.1222
  Subgoal Achievement: 0.0105
  Rule Encoder: -0.4151
  Rule Confidence: 0.6881
  Empowerment Forward: 0.0131
  Empowerment Inverse: 0.9211
  Counterfactual: 0.0011
  MetaLearner Context: 0.0000

Per-Step Diagnostics (v8.0):
  Learned Value: 0.5530
  H_op (Operator Entropy): 0.9984
  H_attn (Attention Entropy): 0.6308
  MoE Gate Sparsity: 0.0013
  Empowerment Bonus: 0.7371
  GNN Entropy Reg: 0.0000

MuZero Boost (Planner→Policy Distillation):
  Value Distillation Loss: 0.0094
  Policy Distillation Loss: 1.7919

Broad Intelligence:
  A_sub: 0.9986
  A_con: 0.4337
  I_alg: 0.4048 (op_H=0.998, attn_H=0.631)
  BI: 0.1753
  CP: 0.1751 (Choice Potential = BI / 1+C)
  dBI: -0.0005  |  dBI-EMA: -0.0005  |  dBI-norm: -0.191
  Entropy_penalty: 0.0548 (target=0.10, pushes attn_H)

Execution Blend Controller:
  Execution: DELIBERATE (w=0.429)
  Delta_intuitive: 0.0597
  Delta_deliberate: -0.0597
  Motion CAD: -0.0085
  Intent Ratio: 100.0% LEARN

Game Statistics:
  as66-821a4dcad9c2: 51 attempts, 98.0% win, 4.37 avg
  ft09-b7249182: 0 attempts, 0.0% win, 0.00 avg
  lp85-e179de251: 0 attempts, 0.0% win, 0.00 avg
  ls20-e98196ab: 0 attempts, 0.0% win, 0.00 avg
  sp80-445eab21: 0 attempts, 0.0% win, 0.00 avg
  vc33-cbded52d: 0 attempts, 0.0% win, 0.00 avg

================================================================================
Cognitive Mode Analysis (Episode 0):
================================================================================
REDACTED ------