INDEX
Explanations
ambiguous statements implying uncertainty or variety in options
phrases that include the word "even," often suggesting an emphasis or consideration of additional possibilities
Negative Logits
othy        -0.78
aim         -0.77
rend        -0.73
cent        -0.69
plex        -0.66
_-          -0.66
idelines    -0.65
Nare        -0.64
oder        -0.64
Buff        -0.64
Positive Logits
remotely     0.92
romeda       0.69
speculate    0.68
uncond       0.67
moderately   0.65
outright     0.64
handedly     0.63
indirectly   0.62
fanc         0.62
osponsors    0.61
Activations Density 0.043%
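
To put the density figure in context, here is a minimal sketch that assumes "activation density" means the fraction of token positions on which the feature has a nonzero activation. That definition, the function name `activation_density`, and the sample data are illustrative assumptions, not taken from the dashboard itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumed definition: density = (# active tokens) / (# tokens seen).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 43 of 100,000 tokens.
sample = [1.5] * 43 + [0.0] * (100_000 - 43)
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.043%
```

Under this reading, a density of 0.043% corresponds to the feature firing on roughly 43 out of every 100,000 tokens.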