Technology

AI Is a Black Box. Anthropic Figured Out a Way to Look Inside

, and Summarizely AI

May 22, 2024, 12:09 AM


What goes on inside artificial neural networks is largely a mystery, even to their creators. But researchers from Anthropic have caught a glimpse.

AI researcher Chris Olah has spent the past decade trying to understand artificial neural networks. His work centers on what goes on inside these systems, a question that matters all the more now that generative AI has become widespread. Olah and his team at AI startup Anthropic have made significant progress in understanding large language models (LLMs), identifying millions of features in these models. They have also been able to manipulate the neural net to make LLMs safer and reduce bias. However, they do not claim to have solved the black box problem, and they acknowledge that their approach has limitations. Other researchers are working on similar problems, and Anthropic's work is seen as a step forward in understanding LLMs.
