INDEX
Explanations
characteristics related to personality traits and moral values
New Auto-Interp
Negative Logits
gezocht
-0.14
ush
-0.14
uzzi
-0.14
own
-0.14
ince
-0.13
iska
-0.13
elles
-0.13
omat
-0.13
bo
-0.13
},{↵-0.13
POSITIVE LOGITS
entity
0.26
affair
0.20
creatures
0.20
creature
0.19
enough
0.18
Entity
0.18
entities
0.18
ENTITY
0.18
entity
0.17
institution
0.17
Activations Density 0.190%