INDEX
Explanations
phrases indicating the exclusion or negation of something
phrases that indicate the act of not excluding possibilities
Negative Logits
awar        -0.81
cue         -0.79
ombat       -0.78
artist      -0.75
eatured     -0.72
eworld      -0.71
omsky       -0.70
Eye         -0.70
oppy        -0.70
MAS         -0.69
Positive Logits
anything    0.78
posts       0.74
outright    0.70
enance      0.69
EVs         0.69
smoking     0.68
foul        0.67
any         0.66
extradition 0.66
hypot       0.66
Activations Density 0.027%