I wasn't able to find out about that, from a quick scout around - could you share a link to it?
(I did find BotHack[1] and a video of its' first ascension[2], both of which are cool)
not as consistent as the best humans yet, but they have won https://github.com/krajj7/BotHack
There was a bot playing NetHack in 2015
https://github.com/krajj7/BotHack
Assume it was playing 3.4.3. It was using pudding farming to get legitimate wins, which is glitchy.
I’m way too old to know this stuff. But I played roguelikes in their day on a UNIX PC, so maybe it’s OK.
That's a shame. For a human, each game played helps to make the person better at all games. Play enough adventure games, for example, and one begins to recognize so many of the patterns and tropes at work that the solutions to puzzles jump out obviously and immediately in a single play-through.
Moreover, we already have machines that are extremely proficient at solving very complex games when given enough context: expert systems. One very notable example is the bot which successfully completed the game NetHack [0]. Would DeepMind's novelty-based reward technique work for NetHack?