Artificial intelligence, like other technologies such as gene editing and nuclear energy, can be used for both good and bad purposes. With so much money and effort being poured into developing AI at such a fast rate, there are concerns about large language models being used for harmful purposes such as developing weapons.
This offers not only a method to check whether an AI model holds dangerous information but also a way to remove it while keeping the rest of the model mostly unchanged. The researchers started by consulting experts in biosecurity, chemical weapons, and cybersecurity. These experts listed all the possible ways harm could happen in their fields.
The team has also introduced a new unlearning method called CUT, which, as the name suggests, removes hazardous knowledge from LLMs while still maintaining their overall capabilities in other areas like biology and computer science.
Source: IntEngineering