INDEX
Explanations
links prompting readers to share a particular story
instances of the word "Share," indicating a focus on sharing actions or prompts
Negative Logits
desper      -0.79
teness      -0.76
rift        -0.74
destro      -0.74
wagen       -0.74
pora        -0.72
vernment    -0.72
aternal     -0.70
hesda       -0.69
blem        -0.69
Positive Logits
UTH                0.72
[no token shown]   0.72
Share              0.72
PsyNet             0.67
Tickets            0.64
[no token shown]   0.62
Events             0.62
ning               0.61
Spons              0.61
Quote              0.60
Activations Density 0.012%