INDEX
Explanations
sentence transitions or conjunctions indicating continuation or contrast
phrases related to probability and potential consequences
Negative Logits
everywhere    -0.55
racket        -0.55
apocalypse    -0.53
fever         -0.52
junk          -0.51
iasco         -0.51
????????      -0.50
forever       -0.50
chwitz        -0.49
yesterday     -0.49
Positive Logits
outwe            0.78
etheless         0.72
higher           0.70
disadvantages    0.68
better           0.67
oother           0.65
softer           0.64
preferable       0.64
smoother         0.63
advant           0.63
Activations Density: 1.003%