INDEX
Explanations
effectiveness
The neuron fires on words or phrases signaling efficiency or effectiveness (e.g. “eficiente,” “eficaz,” “eficiência”).
New Auto-Interp
Negative Logits
buffered
-0.08
Balancer
-0.07
-0.07
-0.07
strncpy
-0.07
saver
-0.07
とした
-0.07
�
-0.07
Surg
-0.07
-0.06
POSITIVE LOGITS
thro
0.07
výraz
0.06
,)↵
0.06
VIDEO
0.06
езультат
0.06
symp
0.06
exagger
0.06
redeem
0.06
viet
0.06
426
0.06
Activations Density 0.024%