Day 1
I set out to build an AI for Fireboy & Watergirl. Since it was a Flash game, I was able to find the game's .swf files and get it running through Ruffle. For the model, I used Stable-Baselines3 with PPO on top of Gymnasium (the maintained fork of OpenAI Gym).
I spent the rest of the day setting up the automation pipeline and the model itself. The initial rewards were based on gem proximity, gem collection, door proximity, and level completion, while the penalties were based on time spent and death.
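To make the reward setup concrete, here is a rough sketch of how those terms could be combined into a single per-step reward. It is not the actual code from the project: the `StepInfo` fields, the weights, and the choice to reward the change in distance (rather than raw distance) are all assumptions for illustration.

```python
import math
from dataclasses import dataclass
from typing import Optional, Tuple

Point = Tuple[float, float]


@dataclass
class StepInfo:
    """Hypothetical snapshot of the game state for one character at one step."""
    player_pos: Point
    nearest_gem_pos: Optional[Point]  # None when no gems are left
    door_pos: Point
    gems_collected: int = 0           # gems picked up during this step
    level_complete: bool = False
    died: bool = False


# Placeholder weights, not tuned values from the project.
GEM_PROXIMITY_W = 0.1
DOOR_PROXIMITY_W = 0.1
GEM_COLLECT_REWARD = 10.0
LEVEL_COMPLETE_REWARD = 100.0
TIME_PENALTY = 0.01
DEATH_PENALTY = 50.0


def shaped_reward(prev: StepInfo, cur: StepInfo) -> float:
    """Combine proximity, collection, completion, time, and death terms."""
    reward = -TIME_PENALTY  # small cost for every step taken

    # Gem proximity: reward progress toward the gem targeted on the previous step.
    if prev.nearest_gem_pos is not None:
        before = math.dist(prev.player_pos, prev.nearest_gem_pos)
        after = math.dist(cur.player_pos, prev.nearest_gem_pos)
        reward += GEM_PROXIMITY_W * (before - after)

    # Door proximity: reward progress toward the exit door.
    door_before = math.dist(prev.player_pos, cur.door_pos)
    door_after = math.dist(cur.player_pos, cur.door_pos)
    reward += DOOR_PROXIMITY_W * (door_before - door_after)

    # Sparse events: gem pickup, level completion, death.
    reward += GEM_COLLECT_REWARD * cur.gems_collected
    if cur.level_complete:
        reward += LEVEL_COMPLETE_REWARD
    if cur.died:
        reward -= DEATH_PENALTY

    return reward
```

In a Gymnasium environment, a function like this would typically be called inside `step()` with the previous and current state, and its result returned as the step reward.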
The agent was able to collect gems in under 50 episodes. However, it only seemed to collect the red gem consistently, not the blue one.