AI-generated test cases are fast, plentiful, and frequently wrong about what actually matters to the business, which is why Gherkin, the plain-language scenario format of BDD, has become the load-bearing spec language of the AI testing era.