INDEX
Explanations
adjectives describing negative or unpleasant characteristics or situations
descriptors related to negative or severe situations
New Auto-Interp
Negative Logits
afort
-0.73
actionGroup
-0.71
————
-0.68
adesh
-0.67
enzyme
-0.65
Thumbnail
-0.65
congr
-0.63
ertodd
-0.62
granted
-0.62
streng
-0.60
POSITIVE LOGITS
oire
1.19
dark
1.18
ly
1.04
aces
0.97
ace
0.97
acing
0.92
lock
0.91
grim
0.90
prog
0.88
outlook
0.87
Activations Density 0.073%