INDEX
Explanations
website urls at the end of sentences prompting the reader to share or read a story
phrases indicating options or alternatives
New Auto-Interp
Negative Logits
hower
-0.83
steen
-0.73
hol
-0.67
ouls
-0.66
ngth
-0.64
ļé
-0.62
matter
-0.60
puter
-0.59
atari
-0.59
natureconservancy
-0.58
POSITIVE LOGITS
Share
0.68
Submit
0.66
Format
0.65
subscribe
0.65
Paste
0.64
ANGE
0.63
hear
0.63
Comment
0.63
yrics
0.63
leans
0.62
Activations Density 0.038%