Commit graph

180 commits

Author SHA1 Message Date
Henri Bourcereau 1fb04209f5 doc params train bot 2025-08-10 17:46:09 +02:00
Henri Bourcereau 778ac1817b script train bots 2025-08-10 15:35:12 +02:00
Henri Bourcereau e4b3092018 train burn-rl with integers 2025-08-10 08:39:31 +02:00
Henri Bourcereau 5b02293221 Merge branch 'feature/botBlackStrategy' into develop 2025-08-08 21:32:09 +02:00
Henri Bourcereau 17d29b8633 runcli with bot dqn burn-rl 2025-08-08 21:31:56 +02:00
Henri Bourcereau a19c5d8596 refact dqn simple 2025-08-08 18:58:21 +02:00
Henri Bourcereau 1b58ca4ccc refact dqn burn demo 2025-08-08 17:07:34 +02:00
Henri Bourcereau bf820ecc4e feat: bot random strategy 2025-08-08 16:24:40 +02:00
Henri Bourcereau b02ce8d185 fix dqn strategy color 2025-08-07 21:03:53 +02:00
Henri Bourcereau dc80243a1a fix black moves 2025-08-07 21:03:53 +02:00
Henri Bourcereau 12004ec4f3 wip bot mirror 2025-08-07 21:03:53 +02:00
Henri Bourcereau fa9c02084a doc uml diagrams 2025-08-04 12:02:12 +02:00
Henri Bourcereau fc9733b729 doc train bot results 2025-08-03 22:16:28 +02:00
Henri Bourcereau 744a70cf1d bot train graph 2025-08-03 20:32:06 +02:00
Henri Bourcereau c0d42a0c45 réglages train bot dqn burnrl 2025-08-03 16:11:45 +02:00
Henri Bourcereau 28c2aa836f fix: train bot dqn burnrl : extract config 2025-08-02 12:42:32 +02:00
Henri Bourcereau ad5ae17168 fix: check moves possibles : prevent the move of the same checker twice 2025-08-02 12:41:52 +02:00
Henri Bourcereau 2e0a874879 refacto 2025-08-01 20:45:57 +02:00
Henri Bourcereau ad58c0ec60 fix build trainbot 2025-08-01 14:21:48 +02:00
Henri Bourcereau fd269b491d wip stackoverflow debug 2025-07-28 14:03:14 +02:00
Henri Bourcereau 3e1775428d action mask 2025-07-27 09:35:30 +02:00
Henri Bourcereau cb30fd3229 fix: overflow when incrementing dice_roll_count 2025-07-25 17:41:48 +02:00
Henri Bourcereau b92c9eb7ff fix: convert_action from_action_index 2025-07-25 17:26:02 +02:00
Henri Bourcereau 1e18b784d1 load inference model 2025-07-23 21:52:32 +02:00
Henri Bourcereau f3fc053dbd save inference model 2025-07-23 21:28:29 +02:00
Henri Bourcereau 6fa8a31cc7 refact : save model 2025-07-23 21:16:28 +02:00
Henri Bourcereau c6d33555ec wip 2025-07-23 17:25:05 +02:00
Henri Bourcereau 354dcfd341 wip burn-rl dqn example 2025-07-08 21:58:15 +02:00
Henri Bourcereau b98a135749 fix: tensor dimensions
fix execution error
2025-06-29 11:30:34 +02:00
Henri Bourcereau 6a7b1cbebc fix by gemini 2025-06-28 22:18:39 +02:00
Henri Bourcereau f05094b2d4 wip 2025-06-28 21:34:44 +02:00
Henri Bourcereau cf93255f03 claude not tested 2025-06-23 22:17:24 +02:00
Henri Bourcereau a06b47628e burn dqn trainer 2025-06-22 21:25:45 +02:00
Henri Bourcereau cf1175e497 fix burn environment 2025-06-22 18:34:36 +02:00
Henri Bourcereau dcd97d1df1 fix sdl2-sys compilation 2025-06-22 16:54:10 +02:00
Henri Bourcereau 5b133cfe0a claude (compile fails) 2025-06-22 15:42:55 +02:00
Henri Bourcereau dc197fbc6f dqn trainer 2025-06-22 12:32:59 +02:00
Henri Bourcereau 7507ea5d78 fix workflow 2025-06-22 12:32:59 +02:00
Henri Bourcereau bae0632f82 use game state context to reduce actions space 2025-06-03 21:41:07 +02:00
Henri Bourcereau ebe98ca229 debug 2025-06-01 21:07:01 +02:00
Henri Bourcereau f7eea0ed02 extend actions space 2025-06-01 20:00:15 +02:00
Henri Bourcereau a2e54bc449 wip fix train 2025-06-01 17:00:36 +02:00
Henri Bourcereau ab959fa27b train command 2025-05-26 20:44:35 +02:00
Henri Bourcereau 480b2ff427 remove python stuff & simple DQN implementation 2025-05-24 22:41:44 +02:00
Henri Bourcereau 3d01e8fe06 fix: handle bot errors 2025-05-13 17:46:06 +02:00
Henri Bourcereau 4fd1f00af0 fix: use default maturin python lib name 2025-05-13 16:04:44 +02:00
Henri Bourcereau 27fc08c47d bot : erroneous strategy 2025-03-18 21:19:57 +01:00
Henri Bourcereau ab770f3a34 feat: ai strategy (wip) 2025-03-02 15:42:36 +01:00
Henri Bourcereau 899a690869 fix(devenv): maj devenv ; move pip to venv 2025-03-02 11:50:20 +01:00
Henri Bourcereau 8368b0d837 wip Gym : Claude AI suggestion 2025-03-02 11:01:24 +01:00