INDEX
Explanations
possessive pronouns followed by everyday activities or personal experiences
references to personal and collective identity
New Auto-Interp
Negative Logits
osponsors
-0.96
docs
-0.78
urers
-0.76
frog
-0.75
arians
-0.74
inks
-0.73
avers
-0.72
angers
-0.70
ubs
-0.70
owler
-0.70
POSITIVE LOGITS
repertoire
1.34
psyche
1.33
life
1.26
upbringing
1.26
arsenal
1.19
worldview
1.17
lives
1.17
workflow
1.13
existence
1.10
equation
1.10
Activations Density 0.246%