INDEX
Explanations
patterns of behavior and social interactions that reflect on emotional responses
Negative Logits
zers       -0.14
ãĥ¬ãĥĵ     -0.13
ÅĻez       -0.13
pedo       -0.12
loor       -0.12
arem       -0.12
icast      -0.12
omik       -0.12
bild       -0.12
olest      -0.11
Positive Logits
people        0.56
individuals   0.47
people        0.41
someone       0.40
folks         0.34
ppl           0.33
persons       0.33
PEOPLE        0.33
anyone        0.30
Individuals   0.30
Activations Density 0.415%
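
The page does not say how the density figure is computed. A minimal sketch, assuming it is the fraction of sampled token positions on which the feature's activation is above zero; the function name, the threshold parameter, and the toy data are all hypothetical:

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means share of active positions; the dashboard
    itself does not define the term.
    """
    if not activations:
        raise ValueError("need at least one activation")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Toy data (not from the dashboard): 5 of 1000 positions are active.
toy = [0.0] * 995 + [1.2, 0.4, 3.1, 0.9, 2.2]
density = activation_density(toy)
print(f"{density:.3%}")  # prints 0.500%
```

Under that reading, a density of 0.415% would mean the feature fires on roughly 4 of every 1000 sampled tokens.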