INDEX
Explanations
This neuron detects occurrences of the word “equivalent” (especially in the phrase “equivalent to”) in questions.
New Auto-Interp
Negative Logits
slideDown
-0.07
ados
-0.07
_out
-0.06
evitar
-0.06
excited
-0.06
overs
-0.06
_dicts
-0.06
punches
-0.06
soluble
-0.06
眼睛
-0.06
POSITIVE LOGITS
arithmetic
0.07
ARIANT
0.06
ADMIN
0.06
име
0.06
.cookie
0.06
-he
0.06
Christine
0.06
Nikola
0.06
GOODS
0.06
getResources
0.06
Activations Density 0.014%