INDEX
Explanations
occurrences of the word "we" with a high level of activation
instances of the pronoun "we."
Negative Logits
gratification    -0.80
quo              -0.71
anonymity        -0.68
citation         -0.67
pedoph           -0.65
citations        -0.64
contradictions   -0.63
LSD              -0.63
conflicts        -0.61
sucker           -0.61
Positive Logits
bsite            1.49
eping            1.18
aving            1.17
ldon             1.16
lder             1.14
igh              1.13
akening          1.11
aning            1.10
eks              1.06
ighed            1.05
Activations Density 0.098%