INDEX
Explanations
words that convey a sense of inadequacy or worthlessness
New Auto-Interp
Negative Logits
aeper
-0.76
MER
-0.70
udeb
-0.68
accur
-0.68
asonic
-0.67
ickr
-0.64
Lago
-0.64
omez
-0.62
hemor
-0.62
=-=-=-=-
-0.62
POSITIVE LOGITS
ness
1.67
ly
1.49
nesses
1.44
liness
1.12
NESS
1.08
iveness
0.96
soever
0.94
iate
0.93
ity
0.92
edly
0.92
Activations Density 0.010%