INDEX
Explanations
This neuron detects Spanish phrasing that expresses finding a balance or trade‐off (e.g. “equilibrio entre … y …”).
New Auto-Interp
Negative Logits
Metro
-0.08
Ruf
-0.07
rotate
-0.06
color
-0.06
เฉ
-0.06
Metro
-0.06
sample
-0.06
_Number
-0.06
supporter
-0.06
_sum
-0.06
POSITIVE LOGITS
delicate
0.07
}}">↵
0.07
vị
0.06
BI
0.06
Intermediate
0.06
Jun
0.06
ilogue
0.06
Tarif
0.06
ství
0.06
그래
0.06
Activations Density 0.027%