INDEX
Explanations
words related to negative qualities or traits
Negative descriptors related to unpleasantness or harm
New Auto-Interp
Negative Logits
inet
-0.86
ingham
-0.85
inoa
-0.81
HCR
-0.79
produced
-0.76
inez
-0.75
particip
-0.74
issued
-0.73
ination
-0.73
Particip
-0.73
POSITIVE LOGITS
nasty
1.25
earthqu
0.97
adolesc
0.96
surprises
0.94
ugly
0.87
spoil
0.85
barb
0.83
poisonous
0.77
beasts
0.76
mud
0.76
Activations Density 0.007%