INDEX
Explanations
This neuron responds to LaTeX package import statements (e.g. “\usepackage{…}”).
New Auto-Interp
Negative Logits
orus
-0.08
god
-0.07
fores
-0.07
chores
-0.06
Boehner
-0.06
ороз
-0.06
anas
-0.06
_unix
-0.06
naš
-0.06
-filter
-0.06
POSITIVE LOGITS
_UNIFORM
0.07
ActionListener
0.07
Sylv
0.06
kry
0.06
receive
0.06
ILogger
0.06
ionate
0.06
окумент
0.06
lık
0.05
malware
0.05
Activations Density 0.000%