Commit graph

31 commits

Author SHA1 Message Date
Henri Bourcereau 17d29b8633 runcli with bot dqn burn-rl 2025-08-08 21:31:56 +02:00
Henri Bourcereau a19c5d8596 refact dqn simple 2025-08-08 18:58:21 +02:00
Henri Bourcereau 1b58ca4ccc refact dqn burn demo 2025-08-08 17:07:34 +02:00
Henri Bourcereau bf820ecc4e feat: bot random strategy 2025-08-08 16:24:40 +02:00
Henri Bourcereau b02ce8d185 fix dqn strategy color 2025-08-07 21:03:53 +02:00
Henri Bourcereau 12004ec4f3 wip bot mirror 2025-08-07 21:03:53 +02:00
Henri Bourcereau c0d42a0c45 tuning train bot dqn burnrl 2025-08-03 16:11:45 +02:00
Henri Bourcereau 2e0a874879 refacto 2025-08-01 20:45:57 +02:00
Henri Bourcereau 3e1775428d action mask 2025-07-27 09:35:30 +02:00
Henri Bourcereau 354dcfd341 wip burn-rl dqn example 2025-07-08 21:58:15 +02:00
Henri Bourcereau b98a135749 fix: tensor dimensions; fix execution error 2025-06-29 11:30:34 +02:00
Henri Bourcereau 6a7b1cbebc fix by gemini 2025-06-28 22:18:39 +02:00
Henri Bourcereau f05094b2d4 wip 2025-06-28 21:34:44 +02:00
Henri Bourcereau cf93255f03 claude not tested 2025-06-23 22:17:24 +02:00
Henri Bourcereau a06b47628e burn dqn trainer 2025-06-22 21:25:45 +02:00
Henri Bourcereau cf1175e497 fix burn environment 2025-06-22 18:34:36 +02:00
Henri Bourcereau 5b133cfe0a claude (compile fails) 2025-06-22 15:42:55 +02:00
Henri Bourcereau dc197fbc6f dqn trainer 2025-06-22 12:32:59 +02:00
Henri Bourcereau 7507ea5d78 fix workflow 2025-06-22 12:32:59 +02:00
Henri Bourcereau bae0632f82 use game state context to reduce actions space 2025-06-03 21:41:07 +02:00
Henri Bourcereau ebe98ca229 debug 2025-06-01 21:07:01 +02:00
Henri Bourcereau f7eea0ed02 extend actions space 2025-06-01 20:00:15 +02:00
Henri Bourcereau a2e54bc449 wip fix train 2025-06-01 17:00:36 +02:00
Henri Bourcereau ab959fa27b train command 2025-05-26 20:44:35 +02:00
Henri Bourcereau 480b2ff427 remove python stuff & simple DQN implementation 2025-05-24 22:41:44 +02:00
Henri Bourcereau 27fc08c47d bot : erroneous strategy 2025-03-18 21:19:57 +01:00
Henri Bourcereau ab770f3a34 feat: ai strategy (wip) 2025-03-02 15:42:36 +01:00
Henri Bourcereau 6478f5043d fix: allowed moves infinite loop 2025-01-29 20:07:30 +01:00
Henri Bourcereau ff5ff74282 wip 2025-01-09 21:27:24 +01:00
Henri Bourcereau a3bcdb8912 fix: 2 bots play 2025-01-08 20:44:23 +01:00
Henri Bourcereau e9f4940c40 refact: bot strategies 2024-11-07 16:51:33 +01:00