INDEX
Explanations
Negation
This neuron detects the presence of the negation “not,” as used in phrasing questions asking “which is not…”
New Auto-Interp
Negative Logits
дивиду
-0.08
ють
-0.07
Kr
-0.07
gearbox
-0.06
daher
-0.06
bytearray
-0.06
_SCR
-0.06
повин
-0.06
不同
-0.06
เกม
-0.06
POSITIVE LOGITS
rentals
0.07
диагности
0.06
PY
0.06
Steam
0.06
örnek
0.06
attendance
0.06
フェ
0.06
oller
0.06
bic
0.06
owo
0.06
Activations Density 0.007%