INDEX
Explanations
The neuron is primarily detecting actor (or character) proper names in the cast lists.
New Auto-Interp
Negative Logits
root
-0.07
Firmware
-0.06
ience
-0.06
artment
-0.06
'='
-0.06
&t
-0.06
cinema
-0.06
Passed
-0.06
Controller
-0.06
}()↵
-0.06
POSITIVE LOGITS
intéress
0.07
Woodward
0.07
noted
0.07
розроб
0.06
прим
0.06
прояв
0.06
عرض
0.06
بهترین
0.06
pk
0.06
preach
0.06
Activations Density 0.012%