Explanations
The neuron activates strongly for the sequence "ign" which appears in words like "Mignon," "vignette," "design," "undignified," and similar terms.
Phrases ending in "-ign" or "-ized."
Instances of the word "ign" or related variations, indicating a focus on issues related to neglect or disregard.
Negative Logits
Token         Logit
Mighty        -0.75
cham          -0.74
HAHAHAHA      -0.74
Springfield   -0.71
bian          -0.71
Span          -0.65
XL            -0.64
Akin          -0.61
RC            -0.61
PLA           -0.60
Positive Logits
Token         Logit
ments         1.21
ificant       1.17
antly         0.99
eous          0.99
atures        0.96
atories       0.96
mentation     0.95
entials       0.95
ame           0.94
ified         0.92
Activations Density: 0.017%
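
As a rough illustration of what a density figure like this measures, the sketch below assumes that activation density is the fraction of token positions on which the neuron's activation is above zero. The page itself does not state the definition, and the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of positions where the neuron fires
    at all. The source page does not confirm this definition.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, only 2 of which fire.
sample = [0.0] * 9998 + [3.2, 1.7]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a very small density means the neuron is silent on almost every token, which would fit a unit that responds to one specific character sequence.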