How to make sense of AI neural network: A new study reveals

  • 📰 IntEngineering
  • ⏱ Reading Time:
  • 44 sec. here
  • 2 min. at publisher
  • 📊 Quality Score:
  • News: 21%
  • Publisher: 63%

Technology Technology Headlines News

Technology Technology Latest News,Technology Technology Headlines

Interesting Engineering is a cutting edge, leading community designed for all lovers of engineering, technology and science.

Get a daily digest of the latest news in tech, science, and technology, delivered right to your mailbox. Subscribe now.

Fortunately, artificial neural networks are more accessible to study than biological ones. We can measure the activity of every neuron in the network, manipulate them by turning them on or off, and see how the network responds to different inputs. The features also allow them to control the network's behavior more precisely. As shown below, by activating a feature artificially, they can make the network produce different outputs that match the feature's meaning.

This work results from Anthropic's investment in Mechanistic Interpretability – one of their longest-term research bets on AI safety. Until now, the fact that individual neurons were uninterpretable presented a severe roadblock to a mechanistic understanding of language models. Decomposing groups of neurons into interpretable features has the potential to move past that roadblock.

 

Thank you for your comment. Your comment will be published after being reviewed.
Please try again later.
We have summarized this news so that you can read it quickly. If you are interested in the news, you can read the full text here. Read more:

 /  🏆 287. in TECHNOLOGY

Technology Technology Latest News, Technology Technology Headlines

Similar News:You can also read news stories similar to this one that we have collected from other news sources.

US Navy’s Trident II D5 missile test-fired for 191st timeInteresting Engineering is a cutting edge, leading community designed for all lovers of engineering, technology and science.
Source: IntEngineering - 🏆 287. / 63 Read more »

This AI tongue can tell if a flavor is sweet or saltyInteresting Engineering is a cutting edge, leading community designed for all lovers of engineering, technology and science.
Source: IntEngineering - 🏆 287. / 63 Read more »

UCSD engineers' new device stores energy and supports loadInteresting Engineering is a cutting edge, leading community designed for all lovers of engineering, technology and science.
Source: IntEngineering - 🏆 287. / 63 Read more »

Japan kick-starts research to build next-gen reusable rocketInteresting Engineering is a cutting edge, leading community designed for all lovers of engineering, technology and science.
Source: IntEngineering - 🏆 287. / 63 Read more »

Joby Aviation just started testing its eVTol with a pilotInteresting Engineering is a cutting edge, leading community designed for all lovers of engineering, technology and science.
Source: IntEngineering - 🏆 287. / 63 Read more »

Study show defects passing thorough diamond faster than soundInteresting Engineering is a cutting edge, leading community designed for all lovers of engineering, technology and science.
Source: IntEngineering - 🏆 287. / 63 Read more »