Explanations
The neuron activates on mentions of professional sports team names (e.g. MLB franchises).
Negative Logits
Token       Logit
Arg         -0.07
maternal    -0.07
補          -0.06
ieve        -0.06
كام         -0.06
下载次数    -0.06
導          -0.06
oused       -0.06
sigue       -0.06
ARG         -0.06
Positive Logits
Token              Logit
.Other             0.07
impressed          0.06
ODE                0.06
اند                0.06
名稱               0.06
lille              0.06
functionalities    0.06
oti                0.06
ode                0.06
atd                0.06
Activations Density: 0.009%
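
As a rough illustration of how a density figure like the one above could be read, the sketch below assumes that activation density means the fraction of sampled tokens on which the neuron's activation exceeds zero. That definition, the function name activation_density, and the sample data are all assumptions made for illustration; they are not taken from the dashboard itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" is the share of sampled tokens on which the
    neuron fires at all; the dashboard may define it differently.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the neuron fires on 9 of 100,000 tokens.
sample = [0.0] * 99_991 + [1.5] * 9
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.009%
```

Under this reading, a density of 0.009% would correspond to roughly 9 active tokens per 100,000 sampled, which is in line with a neuron that fires only on a narrow class of tokens.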