INDEX
Explanations
personal pronouns followed by a statement of certainty or commitment
instances of the word "I" and variations of personal to indicate self-reference
New Auto-Interp
Negative Logits
buzz
-0.65
bard
-0.64
pitch
-0.64
krit
-0.62
giveaway
-0.62
foul
-0.61
Topic
-0.58
sidx
-0.58
Behavior
-0.58
nexus
-0.57
POSITIVE LOGITS
ortal
1.13
umbai
1.02
ighty
1.00
ichael
1.00
igr
0.96
useum
0.95
igration
0.92
asonry
0.91
uppet
0.91
selves
0.91
Activations Density 0.044%