INDEX
Explanations
references to personality traits and types
New Auto-Interp
Negative Logits
onte
-0.16
oldt
-0.15
/sources
-0.15
deen
-0.15
tut
-0.14
parator
-0.14
setBackgroundColor
-0.14
ydk
-0.14
ensem
-0.14
APH
-0.14
POSITIVE LOGITS
traits
0.23
personality
0.22
trait
0.21
Traits
0.21
Personality
0.19
scores
0.18
trait
0.17
Temper
0.17
character
0.17
Traits
0.17
Activations Density 0.113%