INDEX
Explanations
the word "No" in various contexts
instances of the word "no" indicating denial or refusal
Negative Logits
endish    -0.64
worn      -0.63
FML       -0.63
erala     -0.61
tyard     -0.60
tnc       -0.60
leaning   -0.59
ËĪ        -0.58
Roaming   -0.58
istani    -0.57
Positive Logits
oooooooo            1.09
oooo                1.07
oooooooooooooooo    1.04
worries             0.99
matter              0.96
ooo                 0.92
thanks              0.91
sir                 0.91
clue                0.90
kidding             0.90
Activations Density: 0.037%