INDEX
Explanations
references to the self or personal interactions
the pronoun "me" in various contexts, indicating a focus on personal identity and self-perception
Negative Logits
Token       Logit
ulton       -0.69
atlantic    -0.68
emic        -0.68
naissance   -0.65
etheus      -0.64
iens        -0.63
ories       -0.63
Atlantic    -0.63
-)          -0.63
icion       -0.63
Positive Logits
Token       Logit
adows       0.86
personally  0.83
imei        0.81
selves      0.76
adow        0.75
verbally    0.73
atic        0.73
self        0.72
uncond      0.72
zzo         0.72
Activations Density 0.183%
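
The density figure above is usually read as the fraction of sampled token positions on which the feature fires. The page does not state how it was computed, so the following is only a minimal sketch under that assumption. The function name, the threshold of zero, and the sample data are all hypothetical and chosen for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions where the
    feature is active; the source page does not define it.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 183 active positions out of 100,000 sampled tokens.
sample = [1.0] * 183 + [0.0] * (100_000 - 183)
density = activation_density(sample)
print(f"{density:.3%}")  # prints "0.183%"
```

Under this reading, a density of 0.183% means the feature is active on roughly 2 of every 1,000 tokens, which is sparse enough to fit the narrow self-reference explanations listed above.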