INDEX
Explanations
words related to personal values, integrity, and emotional expression
statements related to control, power, and influence
New Auto-Interp
Negative Logits
ģĸ
-0.87
»Ĵ
-0.83
geons
-0.69
ãĥīãĥ©
-0.69
emed
-0.69
iatus
-0.67
ulla
-0.67
iatrics
-0.65
odon
-0.65
quished
-0.63
POSITIVE LOGITS
creativity
1.25
honesty
1.20
bravery
1.18
sophistication
1.18
generosity
1.18
ingenuity
1.14
elegance
1.13
decency
1.12
professionalism
1.11
humility
1.09
Activations Density 0.249%