INDEX
Explanations
words related to personality traits
terms related to personality traits
New Auto-Interp
Negative Logits
ubuntu
-0.75
blackout
-0.73
kj
-0.72
gz
-0.70
postp
-0.70
cape
-0.67
kk
-0.67
................
-0.65
budget
-0.65
udget
-0.64
POSITIVE LOGITS
traits
3.56
trait
3.28
attributes
1.64
qualities
1.60
genes
1.45
characteristics
1.36
quirks
1.31
behaviors
1.29
behaviours
1.27
tropes
1.23
Activations Density 0.020%