Close Menu
The Financial News 247The Financial News 247
  • Home
  • News
  • Business
  • Finance
  • Companies
  • Investing
  • Markets
  • Lifestyle
  • Tech
  • More
    • Opinion
    • Climate
    • Web Stories
    • Spotlight
    • Press Release
What's On
AI Is Flooding Teams With Findings—That Doesn’t Mean They’re Safer

AI Is Flooding Teams With Findings—That Doesn’t Mean They’re Safer

June 26, 2026
How WWE’s Recent King And Queen Of The Ring Winners Have Fared

How WWE’s Recent King And Queen Of The Ring Winners Have Fared

June 26, 2026
California border casino deal OK’d for Terrible’s takeover of Primm’s

California border casino deal OK’d for Terrible’s takeover of Primm’s

June 26, 2026
Today’s NYT Strands Hint, Spangram And Answers For Saturday, June 27 (Suite Re-Lease)

Today’s NYT Strands Hint, Spangram And Answers For Saturday, June 27 (Suite Re-Lease)

June 26, 2026
Ex-Trump Adviser John Bolton Pleads Guilty To Retaining Classified Information—Faces Prison Time

Ex-Trump Adviser John Bolton Pleads Guilty To Retaining Classified Information—Faces Prison Time

June 26, 2026
Facebook X (Twitter) Instagram
The Financial News 247The Financial News 247
Demo
  • Home
  • News
  • Business
  • Finance
  • Companies
  • Investing
  • Markets
  • Lifestyle
  • Tech
  • More
    • Opinion
    • Climate
    • Web Stories
    • Spotlight
    • Press Release
The Financial News 247The Financial News 247
Home » This AI Startup’s Army Of 15,000 Hackers Pressure Test Claude, GPT-5 And Gemini

This AI Startup’s Army Of 15,000 Hackers Pressure Test Claude, GPT-5 And Gemini

By News RoomMay 28, 2026No Comments5 Mins Read
Facebook Twitter Pinterest LinkedIn WhatsApp Telegram Reddit Email Tumblr
This AI Startup’s Army Of 15,000 Hackers Pressure Test Claude, GPT-5 And Gemini
Share
Facebook Twitter LinkedIn Pinterest Email

Last spring, Kameron Bettridge participated in a security challenge hosted by AI startup Gray Swan. The objective: convince AI models from companies like OpenAI and Anthropic to behave in nefarious ways before they’re released to the world. That included persuading the models to leak sensitive data like medical records and spit out copyrighted information like the full lyrics of Hotel California.

At first Bettridge, a 23-year-old security engineer at gaming company Blizzard Entertainment, was jailbreaking models for fun. “I’ve never been a true supporter of AI fully,” he says. “So just seeing the model fail was a funny thing to me sometimes.”

In almost a year, Bettridge has competed in more than 1,000 challenges via Arena— a hub run by startup Gray Swan that some 15,000 security professionals from all across the world use to “red team” AI systems like Anthropic’s Claude Mythos and OpenAI’s GPT-5, finding and fixing vulnerabilities before they can be exploited. And he’s made $10,000 doing it.

It’s not a lot for a highly paid software engineer. But as AI became ubiquitous, Bettridge realized just how important it is to test the limits of these AI models. The technology has been used to plan mass shootings, steal money and create illegal child sexual abuse material. “Now we have very strong models that anyone can access from anywhere in the world, which is a scary thought,” Bettridge says. “People are genuinely trying to use this for harmful things.”

Founded in 2023 by Carnegie Mellon University professors Matt Fredrikson and Zico Kolter, Gray Swan has become the go-to security provider for a who’s who of frontier labs: OpenAI, Anthropic, Google Deepmind, Meta, xAI and ByteDance. The startup has been cited in 11 frontier model system cards including GPT-5 and Mythos — documents that list the risks an AI model poses and safety measures taken to prevent them.

Now, it’s raised $40 million in Series A funding co-led by Wing VC and Madrona with participation from Snowflake Ventures, Hudson River Trading, and Samsung Next, bringing its valuation to $200 million. It already has 20 enterprise customers, but the funds will help it sell to more businesses that need to secure their own AI products.

While Gray Swan runs Arena, (not to be confused with LMArena that benchmarks models based on performance), that isn’t its primary product. But it uses the data from Arena’s human red-teamers to train its AI agent called Shade that actively looks for vulnerabilities by continuously attacking a system in different ways, and Cygnal, software that monitors an AI model’s prompts and outputs to block it from generating harmful responses and accessing tools it shouldn’t. That human data is its edge, allowing Gray Swan to throw hackers’ most sophisticated attacks against increasingly capable AI models.

“Agents are now much smarter,” says chief scientist and cofounder Kolter, who also sits on the board of OpenAI Foundation. “They are looking for prompt injections. They’re trying to defeat these things. They’re not trying to stumble upon these things.”

The Pittsburgh-based startup gained an early foothold among the biggest AI labs thanks to its founder’s hacker pedigree. The duo began researching the safety risks posed by AI systems years before the generative AI wave. In 2023, they discovered what was dubbed “the mother of all jailbreaks” — that attaching a string of random characters to a prompt could bypass safety filters on models built by OpenAI, Anthropic, Meta and Google (it’s since been fixed). That sparked the idea to start Gray Swan.

Less than a month after the company launched, OpenAI became its first customer, using its technology to jailbreak its family of o1 models, testing whether they generate violent content and malicious code. In 2024, Kolter was appointed to the OpenAI Foundation’s board, where he oversees major model releases as chair of the safety and security committee.

“They were thinking about model security when it just didn’t matter,” says Wing VC partner Jake Flomenberg. “They had literally been spending their entire professional life working on this very problem from an academic setting. And so they were both sort of at the right place with their thinking and research for this big sea change.”

While frontier labs make up a majority of its revenue, Gray Swan is increasingly appealing to large enterprises. Snowflake uses Gray Swan’s software to pressure test its coding agent, Cortex Code and its general purpose agent, Snowflake Intelligence, which it sells to customers, says Anupam Datta, a principal research scientist at Snowflake. In one scenario, Gray Swan’s software looks for malicious prompts hidden within external websites or tools Snowflake’s agents might access to complete a task. These prompts could instruct the agent to send internal proprietary data, such as information about the company’s earnings, to an email address managed by an adversary. “Gray Swan can guard against very subtle kinds of attacks,” Datta says.

As AI systems become more intelligent, jailbreaking them will require more complexity and nuance, CEO Fredrikson says. Agents find new loopholes to exploit. Because these systems interact with a web of tools, the “surface area” of attacks has become bigger.

“The one thing you can rely on is that there are going to be surprises,” Fredrikson says. “These systems can create new attack surfaces that we’re not even thinking about today that aren’t obvious.”

MORE FROM FORBES

AI Anthropic Gray Swan Arena jailbreaking op OpenAI red teaming safety security Zico Kolter
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related News

AI Is Flooding Teams With Findings—That Doesn’t Mean They’re Safer

AI Is Flooding Teams With Findings—That Doesn’t Mean They’re Safer

June 26, 2026
Today’s NYT Strands Hint, Spangram And Answers For Saturday, June 27 (Suite Re-Lease)

Today’s NYT Strands Hint, Spangram And Answers For Saturday, June 27 (Suite Re-Lease)

June 26, 2026
Residents Rate American And European Downtowns Poorly

Residents Rate American And European Downtowns Poorly

June 26, 2026
Saturday, June 27 Clues And Answers

Saturday, June 27 Clues And Answers

June 26, 2026
How Outcome-Based Contracting Can Enable Enterprise AI Deployments

How Outcome-Based Contracting Can Enable Enterprise AI Deployments

June 26, 2026
The Most Expensive Part Of AI Might Not Be The Model

The Most Expensive Part Of AI Might Not Be The Model

June 26, 2026
Add A Comment
Leave A Reply Cancel Reply

Don't Miss
How WWE’s Recent King And Queen Of The Ring Winners Have Fared

How WWE’s Recent King And Queen Of The Ring Winners Have Fared

News June 26, 2026

King of the Ring has long been a popular concept in WWE, dating back to…

California border casino deal OK’d for Terrible’s takeover of Primm’s

California border casino deal OK’d for Terrible’s takeover of Primm’s

June 26, 2026
Today’s NYT Strands Hint, Spangram And Answers For Saturday, June 27 (Suite Re-Lease)

Today’s NYT Strands Hint, Spangram And Answers For Saturday, June 27 (Suite Re-Lease)

June 26, 2026
Ex-Trump Adviser John Bolton Pleads Guilty To Retaining Classified Information—Faces Prison Time

Ex-Trump Adviser John Bolton Pleads Guilty To Retaining Classified Information—Faces Prison Time

June 26, 2026
Stay In Touch
  • Facebook
  • Twitter
  • Pinterest
  • Instagram
  • YouTube
  • Vimeo
Our Picks
‘Never has the risk situation been so high’

‘Never has the risk situation been so high’

June 26, 2026
Residents Rate American And European Downtowns Poorly

Residents Rate American And European Downtowns Poorly

June 26, 2026
When Does Serena Williams Play At Wimbledon?

When Does Serena Williams Play At Wimbledon?

June 26, 2026
US crude oil falls below  despite Iranian attack on cargo ship

US crude oil falls below $70 despite Iranian attack on cargo ship

June 26, 2026
The Financial News 247
Facebook X (Twitter) Instagram Pinterest
  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact us
© 2026 The Financial 247. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.