INDEX
Explanations
phrases indicating something is nearly absolute or close to being certain
the word "virtually" and its variations in different contexts
New Auto-Interp
Negative Logits
agate
-0.88
spr
-0.74
will
-0.68
actions
-0.67
ses
-0.67
intent
-0.65
eria
-0.64
ibal
-0.62
gang
-0.62
bucks
-0.62
POSITIVE LOGITS
etheless
0.89
ciating
0.79
nown
0.76
unchanged
0.76
inaccessible
0.73
indistinguishable
0.73
unemploy
0.72
unheard
0.72
illiter
0.71
rency
0.70
Activations Density 0.004%