INDEX
Explanations
The neuron activates on occurrences of the word “anxiety,” especially when it appears as a standalone term or heading.
New Auto-Interp
Negative Logits
obras
-0.08
813
-0.08
Wimbledon
-0.07
データ
-0.07
dikke
-0.07
レット
-0.07
Marc
-0.07
diplom
-0.07
builtin
-0.07
sections
-0.07
POSITIVE LOGITS
anxiety
0.14
Anxiety
0.12
anx
0.10
anxious
0.08
xiety
0.08
Fear
0.07
nez
0.07
không
0.06
Při
0.06
ansible
0.06
Activations Density 0.006%