It has nothing to do with AI, it’s just plain old automation, but they solve most of the issues you get with making automated tests in non-discrete 3D playspace and they do that in a pretty solid way. It’s definitely something I’d love to have implemented in the games I’m working on, as someone who worked in QA and now works in development. Being able to have mostly reliable way how to smoke-test levels for basic gameplay without having to torture QA to run the test-case again is good, and allows QA to focus on something else - but the tools also need oversight, so it’s not really a job lost. In summary - I think the talk is cool tech and worth the watch.
However, I don’t think AI will help in this regard, and something as unreliable and random as AI models are not a good fit for this job. You want to have deterministic testcases that you can quanitfy, and if something doesn’t match have an actual human to look at why. AI also probably won’t be able to find clever corner-cases and bugs that need human ingenuity.
Fuck AI, I kind of hope this is just a marketing talk and they are actually just improving the (deterministic) tools they already have (which actually are AI by definition, since they also do level exploration on top of recorded inputs), and they are calling it an “AI” to satisfy investors/management without actually slapping a glorified chat-bot into the tech for no reason.
Frankly, this is good news. Whoever buys the rights to kingdom hearts in 3 years when the company falls apart might manage to create an intelligible storyline.
Kingdom hearts 1 was a coming of age story with some fantastic elements tossed in
Kingdom hearts 2 was about antipathy and how it destroys the world, with some, uhh, who was that guy? And why’s the bad guy on my side?
Kingdom hearts 3 was, wait, why was he cloned? When was he cloned? When was she, and him, and him again? And his third clone was a girl? And whose heart was imbued into what? What war? What? Who? What???
I get that getting all the games as they released was hard, because the series is on so many platforms. But I really don’t get the “KH is hard to understand” argument today, because you can easily find hundreds of letsplays for every game, cutscenes complications, play/watch every game on the PS4 remix disk, and even watch a fandub of the mobile games (Dark Road is a WIP) if you don’t like the KHUX Back Cover recap.
So like, what’s so hard? If you skip games and only read a wiki (the worst possible way to consume any sort of media, mind you), of course you’re not gonna know the story and characters, and of course it’ll sound confusing.
Dude, it’s multi-author-comic level bad. I’ve skipped entire sagas in several book series due to a lack of translations and ended up less confused. It’s green arrow levels of clone shenanigans.
To be clear, I’ve played most of the games and they’re still ridiculously difficult to keep track of. All besides the mobile, early non-Ventus card mechanic arpg, and the disappearing girl clone sora game.
They’d be easier to follow if they stuck to the rules of their own universe. Body and heart separate and the body persists not once but twice? What?
If you hate Ubisoft because they put a black person in their game, and not because they are greedy bastards who make utterly vapid slop, then you are an idiot.
Well, it’s not game development, but bugfixes and quality testing.
I dont know, but it does makes sense, when there’s still 30% work being done by human eyes. There will still be people checking everything through.
Even if they hit 50-50, they could put more money into the development.
The argument that they will just save the money only works as long as another company doesnt use it for game devs. Otherwise you naturally fall behind.
It also only works as long as the AI can actually competently do the QA work. This is what an AI thinks a video game is. To do QA, it will have to know that something is wrong, flag it, and be able to tell when it’s fixed. The most likely situation I can foresee is that it creates even more work for the remaining humans to do when they’re already operating at a deficit.
To be fair, that’s what an AI video generator thinks an FPS is. That’s not the same thing as AI-assisted coding. Though it’s still hilarious! “Press F to pay respects” 🤣
For reference, using AI to automate your QA isn’t a bad idea. There’s a bunch of ways to handle such things but one of the more interesting ones is to pit AIs against each other. Not in the game, but in their reports… You tell AI to perform some action and generate a report about it while telling another AI to be extremely skeptical about the first AI’s reports and to reject anything that doesn’t meet some minimum standard.
That’s what they’re doing over at Anthropic (internally) with Claude Code QA tasks and it’s super fascinating! Heard them talk about that setup on a podcast recently and it kinda blew my mind… They have more than just two “Claudes” pitted against each other too: In the example they talked about, they had four: One generating PRs, another reviewing/running tests, another one checking the work of the testing Claude, and finally a Claude setup to perform critical security reviews of the final PRs.
I don’t know what they were testing, but if your output is text, it will be a lot easier for the AI to know it’s correct than any of the plethora of ways that video games can go subtly wrong, and that’s where my lack of faith comes from. Even scraping text from the internet, my experience is more often that AI is confident in its wrong answer than it is helpful.
I hope they put out the last FF VII remake part before that, so i can finally start playing them all! I don’t care what they want to waste their money on afterwards lol
I wouldn’t be shy about getting into Remake or Rebirth now. They both stand up as their own games (concise start/ending, somewhat distinct mechanics, each one is easily 40+ hours of gameplay). And with Part 3 targeted for 2027 release, I suspect this kind of overhaul would be outside their dev cycle to implement.
Part 2 is already using the engine from Part 1 with minor adjustments. I suspect most of Part 3 development is cinematics and world building.
I’m cautious but a little curious about this one, because QA could actually be a very good target for AIs to work with.
It might not kill jobs. Right now, engineers finish a task and the limited number of QA engineers can’t possibly test it enough before release. That game-breaking bug you found in a game? I’m sure some QA had it in their plan to test every level for those bugs, and yet they just didn’t have enough time - and the studio couldn’t justify hiring 20 more QA squads. Even if they do upscale AI testing, they’ll need knowledgable QA workers to guide them.
This is often extremely rote, repetitive work. It’s exactly the type of work The Oatmeal said is great for AIs. One person is tuning the balance on the Ether Drive attack, and gives it an extra 40% blarf damage. He tries it, sees it works fine, and eagerly skips past the part of the test plan to verify that all cutscenes are working and unaffected to push it in. An AI will try it out, and find: Actually, since an NPC uses an Ether Drive in a late-game cutscene, this breaks the whole game!
Even going past existing plans, QA can likely find MORE work for AIs to do that they normally wouldn’t bother with. Think about the current complexity of game dev that leads to the current trope of releasing games half-finished to eventually get patched. It won’t help patch games, but it’ll at least help give devs an up-to-date list of issues.
That said, those talking about human creativity and player expectations are still correct. An AI can report a problem with feedback that a human can say “No, that looks fine. Override that report.” It will also be good to do occasional manual tests, and lament “How did the AI think this was okay??”
Hmm. While I don’t know what their QA workflow is, my own experience is that working with QA people to design a QA procedure for a given feature tends to require familiarity with the feature in the context of real-world knowledge and possible problems, and that human-validating a feature isn’t usually something done at massive scale, where you’d get a lot of benefit from heavy automation.
It’s possible that one might be able to use LLMs to help write test code — reliability and security considerations there are normally less-critical than in front-line code. Worst case is getting a false positive, and if you can get more test cases covered, I imagine that might pay off.
Square does an MMO, among their other stuff. If they can train a model to produce AI-driven characters that act sufficiently like human players, where they can theoretically log training data from human players, that might be sufficient to populate an MMO “experimental” deployment so that they can see if anything breaks prior to moving code to production.
“Because I would love to be able to start up 10,000 instances of a game in the cloud, so there’s 10,000 copies of the game running, deploy an AI bot to spend all night testing that game, then in the morning we get a report. Because that would be transformational.”
I think that the problem is that you’re likely going to need more-advanced AI than an LLM, if you want them to just explore and try out new features.
One former Respawn employee who worked in a senior QA role told Business Insider that he believes one of the reasons he was among 100 colleagues laid off this past spring is because AI was reviewing and summarising feedback from play testers, a job he usually did.
We can do a reasonable job of summarizing human language with LLMs today. I think that that might be a viable application.
Worst case is getting a false positive, and if you can get more test cases covered, I imagine that might pay off.
False positives during testing are a huge time sink. QA has to replicate and explain away each false report and the faster AI 'completes' tasks the faster the flood of false reports come in.
There is plenty of non-AI automation that can be used intentionally to do tedious repetitive tasks already where they only increase work if they aren't set up right.
Some AI or central computer going haywire and destroying everything is, like, the third or fourth stock RPG trope just behind the Dark Lord burning down the protagonist’s village in the first act or the mysterious waif girl actually turning out to be a princess.
videogameschronicle.com
Aktywne