The OpenAI logo is seen on a mobile phone in front of a computer screen displaying output from ChatGPT, Tuesday, March 21, 2023, in Boston. President Joe Biden’s administration wants stronger measures to test the safety of artificial intelligence tools …
“I’ve actually funded some work under one of my programs at DARPA where we could completely bypass the safety guardrails of these LLMs, and we actually got ChatGPT to tell us how to make a bomb, and we got it to tell us all kinds of unsavory things that it shouldn’t be telling us, and we did it in a mathematically principled way,” he said.
The popularity of generative AI tools, which produce text that reads as if a human wrote it, has grown rapidly in the past year as products such as ChatGPT solve problems and generate content on request.
“Most commercially available systems enabled by large language models aren’t yet technically mature enough to comply with our ethical AI principles, which is required for responsible operational use,” she said. “But we have found over 180 instances where such generative AI tools could add value for us with oversight, like helping to debug and develop software faster, speeding analysis of battle damage assessments, and verifiably summarizing texts from both open source and classified data sets.”