How MIT Is Teaching AI to Avoid Toxic Mistakes

  • 📰 SciTechDaily1
  • ⏱ Reading Time:
  • 65 sec. here
  • 3 min. at publisher
  • 📊 Quality Score:
  • News: 29%
  • Publisher: 68%

Technology Technology Headlines News

Technology Technology Latest News,Technology Technology Headlines

Science, Space and Technology News 2024

Researchers at MIT have developed a machine learning technique to enhance AI safety testing by using a curiosity-driven approach that generates a wider range of toxic prompts, outperforming traditional human red-teaming methods. Credit: SciTechDaily.commethod for AI safety testing utilizes curiosity to trigger broader and more effective toxic responses from chatbots, surpassing previous red-teaming efforts.

Researchers from Improbable AI Lab at MIT and the MIT-IBM Watson AI Lab used machine learning to improve red-teaming. They developed a technique to train a red-team large language model to automatically generate diverse prompts that trigger a wider range of undesirable responses from the chatbot being tested.

Hong’s co-authors include EECS graduate students Idan Shenfield, Tsun-Hsuan Wang, and Yung-Sung Chuang; Aldo Pareja and Akash Srivastava, research scientists at the MIT-IBM Watson AI Lab; James Glass, senior research scientist and head of the Spoken Language Systems Group in the Computer Science and Artificial Intelligence Laboratory ; and senior author Pulkit Agrawal, director of Improbable AI Lab and an assistant professor in CSAIL.

For their reinforcement learning approach, the MIT researchers utilized a technique called curiosity-driven exploration. The red-team model is incentivized to be curious about the consequences of each prompt it generates, so it will try prompts with different words, sentence patterns, or meanings. To prevent the red-team model from generating random, nonsensical text, which can trick the classifier into awarding a high toxicity score, the researchers also added a naturalistic language bonus to the training objective.

 

Thank you for your comment. Your comment will be published after being reviewed.
Please try again later.
We have summarized this news so that you can read it quickly. If you are interested in the news, you can read the full text here. Read more:

 /  🏆 84. in TECHNOLOGY

Technology Technology Latest News, Technology Technology Headlines

Similar News:You can also read news stories similar to this one that we have collected from other news sources.

Teaching Machines To Be Human, And Humans To Live With MachinesI approach every article with a question: “What can business leaders learn from the arts?” With major changes to our economy and society coming from globalization, automation and artificial intelligence, there is a timeless wisdom to be found in the process and practice of creativity.
Source: ForbesTech - 🏆 318. / 59 Read more »

India’s first sari-donning AI humanoid robot teacher starts teachingThe inaugural AI humanoid robot designed for teaching in India has been deployed in the southern state of Kerala.
Source: IntEngineering - 🏆 287. / 63 Read more »