INDEX
Explanations
the existence of something or the presence of a condition
New Auto-Interp
Negative Logits
åłĤ
-0.15
rs
-0.15
ayo
-0.14
issement
-0.14
612
-0.14
asy
-0.14
asto
-0.13
ÑģеÑĢ
-0.13
lsi
-0.13
amik
-0.13
POSITIVE LOGITS
no
0.25
nothing
0.20
nobody
0.20
none
0.20
geen
0.20
no
0.20
No
0.20
ninguna
0.19
ningún
0.18
-none
0.18
Activations Density 0.056%