INDEX
Explanations
negative descriptors referring to lack of intelligence
instances of the word "dumb" and related expressions of foolishness or lack of intelligence
New Auto-Interp
Negative Logits
Lago
-0.82
Noir
-0.72
UAL
-0.71
Chronicles
-0.67
Flav
-0.67
TOR
-0.65
ATURES
-0.64
________________________________________________________________
-0.63
cially
-0.63
ATURE
-0.62
POSITIVE LOGITS
founded
1.35
arton
1.17
bell
1.15
found
1.14
stru
0.96
est
0.95
asses
0.91
holes
0.91
wallet
0.89
gest
0.87
Activations Density 0.008%