INDEX
Explanations
phrases indicating negation or contradiction
phrases that negate or dismiss certain ideas or concepts
Negative Logits
Reviewed     -0.71
Metatron     -0.67
whichever    -0.57
Polk         -0.57
GBT          -0.56
Orth         -0.55
Stew         -0.54
perse        -0.54
kefeller     -0.53
phrine       -0.53
Positive Logits
xious        0.90
uncertain    0.90
onday        0.89
xus          0.87
ct           0.78
conceivable  0.78
avail        0.74
particular   0.73
osphere      0.72
otrop        0.70
Activations Density: 0.028%