INDEX
Explanations
The neuron fires strongly on words and phrases expressing exasperation or complaint, especially phrases such as “had enough” or “sick of.”
Negative Logits
ANE           -0.07
sovereignty   -0.07
otr           -0.06
semester      -0.06
Flux          -0.06
ーダ           -0.06
錯             -0.06
兹             -0.06
Reach         -0.06
ifestyles     -0.06
Positive Logits
[dim          0.08
елефон        0.07
moire         0.07
ภาษ           0.06
-dropdown     0.06
joked         0.06
VERTISEMENT   0.06
treeNode      0.06
Xen           0.06
Entwicklung   0.06
Activations Density 0.036%