INDEX
Explanations
words with the prefix "dis-" indicating negation or reversal
words related to disappointment or unfavorable situations
New Auto-Interp
Negative Logits
glers
-0.70
Reviewer
-0.68
Damn
-0.66
Parm
-0.65
Kinnikuman
-0.65
scratch
-0.63
swick
-0.60
Franks
-0.60
Archdemon
-0.60
Beyond
-0.60
POSITIVE LOGITS
comfort
1.08
ruption
1.07
rup
1.04
cipl
1.04
appointed
1.02
abling
0.99
dis
0.97
licted
0.95
quiet
0.95
abled
0.94
Activations Density 0.005%