INDEX
Explanations
first-person pronouns combined with verbs indicating mental processing
references to personal experiences and relationships
Negative Logits
advertisement    -0.77
Coalition        -0.72
Siege            -0.69
reperc           -0.64
âĩ               -0.63
omission         -0.62
Pastebin         -0.60
itely            -0.59
Advertising      -0.58
Directors        -0.58
Positive Logits
atic             0.83
erk              0.76
ansk             0.76
hers             0.73
atically         0.73
atche            0.69
self             0.69
eps              0.68
Osw              0.68
selves           0.67
Activations Density 0.179%
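
As a rough illustration of what a density figure like the one above could mean, here is a minimal sketch that assumes density is the fraction of sampled tokens on which the feature's activation is above zero. The function name, the threshold parameter, and the sample data are hypothetical and are not taken from this page or from any particular tool.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly greater than threshold.

    Assumption: "density" means the share of tokens on which the
    feature fires at all. This is an illustrative definition only.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: the feature fires on 3 of 1000 tokens.
sample = [0.0] * 997 + [1.2, 0.4, 3.1]
print(f"{activation_density(sample) * 100:.3f}%")  # prints 0.300%
```

Under this assumed definition, a density of 0.179% would correspond to the feature being active on roughly 18 of every 10,000 tokens.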