INDEX
Explanations
concepts related to individualism and personal strengths within a communal context
New Auto-Interp
Negative Logits
ocity
-0.16
ugar
-0.16
rok
-0.16
ersistence
-0.15
roz
-0.15
jom
-0.15
oyo
-0.14
dete
-0.14
rounds
-0.14
365
-0.14
POSITIVE LOGITS
interests
0.34
preferences
0.31
likes
0.30
Preferences
0.26
likes
0.26
strengths
0.26
background
0.24
fears
0.24
personality
0.24
Likes
0.24
Activations Density 0.298%