INDEX
Explanations
phrases emphasizing negation or absence
"no" followed by a noun
no and absence of specific items
New Auto-Interp
Negative Logits
IST
-0.45
Unterneh
-0.43
ly
-0.42
charAt
-0.41
estudi
-0.40
Bergmann
-0.39
recherchez
-0.39
acuer
-0.39
pasada
-0.38
ukary
-0.38
POSITIVE LOGITS
no
1.14
No
1.08
No
1.04
NO
0.91
aucun
0.84
none
0.81
nessun
0.79
aucune
0.79
no
0.74
NO
0.74
Activations Density 0.154%