INDEX
Explanations
adjectives describing negative attributes or conditions
negative descriptors and concepts that indicate poor quality or undesirable outcomes
New Auto-Interp
Negative Logits
sonian
-0.78
ebus
-0.76
confidently
-0.73
uity
-0.73
phasis
-0.72
tesy
-0.72
olon
-0.71
verning
-0.70
aukee
-0.69
clus
-0.68
POSITIVE LOGITS
Worse
0.86
worse
0.85
Karma
0.80
imaginable
0.79
Worst
0.78
ovie
0.77
rotten
0.77
havoc
0.76
Despair
0.76
nightmares
0.76
Activations Density 0.486%