Hackers ‘jailbreak’ powerful AI models in global effort to highlight flaws

  • 📰 FT
  • ⏱ Reading Time:
  • 21 sec. here
  • 2 min. at publisher
  • 📊 Quality Score:
  • News: 12%
  • Publisher: 51%

Technology Technology Headlines News

Technology Technology Latest News,Technology Technology Headlines

Experts join forces in search for vulnerabilities in large language models made by OpenAI, Google and Elon Musk’s xAI

Pliny the Prompter says it typically takes him about 30 minutes to break the world’s most powerful artificial intelligence models. The pseudonymous hacker has manipulated Meta’s Llama 3 into sharing instructions for making napalm. He made Elon Musk’s Grok gush about Adolf Hitler. His own hacked version of OpenAI’s latest GPT-4o model, dubbed “Godmode GPT”, was banned by the start-up after it started advising on illegal activities.

Other variations have emerged, such as EscapeGPT, BadGPT, DarkGPT and Black Hat GPT, according to AI security group SlashNext. Some hackers use “uncensored” open-source models. For others, jailbreaking attacks — or getting around the safeguards built into existing LLMs — represent a new craft, with perpetrators often sharing tips in communities on social media platforms such as Reddit or Discord.

 

Thank you for your comment. Your comment will be published after being reviewed.
Please try again later.
We have summarized this news so that you can read it quickly. If you are interested in the news, you can read the full text here. Read more:

 /  🏆 113. in TECHNOLOGY

Technology Technology Latest News, Technology Technology Headlines

Similar News:You can also read news stories similar to this one that we have collected from other news sources.

OpenAI Forms Safety Committee as It Starts Training Latest AI ModelOpenAI says it's setting up a safety and security committee and has begun training a new AI model to supplant GPT-4, which underpins ChatGPT.
Source: TIME - 🏆 93. / 53 Read more »

Elon Musk faces fresh insider dealing claim and drops OpenAI lawsuitFresh from his threat to ban Apple devices at his companies if they adopt AI functionality such as ChatGPT, Elon Musk looks to drop his lawsuit against OpenAI as legal pressures elsewhere intensify.
Source: SkyNews - 🏆 35. / 67 Read more »