INDEX
Explanations
This neuron detects occurrences of the word "should" (as in conditional recommendations or hypothetical questions).
New Auto-Interp
Negative Logits
iele
-0.07
balcony
-0.07
گیرد
-0.07
уже
-0.06
}:
-0.06
image
-0.06
onion
-0.06
except
-0.06
Frame
-0.06
Banner
-0.06
POSITIVE LOGITS
직접
0.07
ูม
0.06
규
0.06
%%↵
0.06
oron
0.06
{text0.06
;$
0.06
대비
0.06
converse
0.06
(**
0.06
Activations Density 0.014%