INDEX
Explanations
texts about self-improvement and resilience
expressions of vulnerability and resilience in the face of societal judgment
New Auto-Interp
Negative Logits
audi
-0.69
ItemTracker
-0.64
Deadline
-0.64
interstitial
-0.63
everal
-0.63
affe
-0.61
atform
-0.60
ullivan
-0.60
encer
-0.59
uscript
-0.59
POSITIVE LOGITS
oppress
1.26
injust
1.22
ensl
1.19
oppression
1.17
injustice
1.15
oppressed
1.13
THEIR
1.08
sinful
1.08
selfish
1.08
immoral
1.07
Activations Density 0.902%