INDEX
Explanations
phrases related to newsletters or emails being delivered to the inbox
mentions of newsletters and their delivery to an inbox
New Auto-Interp
Negative Logits
aft
-0.86
skinned
-0.71
hig
-0.68
arks
-0.67
Corpor
-0.65
Ath
-0.63
POR
-0.63
apesh
-0.61
Patriarch
-0.61
had
-0.61
POSITIVE LOGITS
inbox
0.98
Subscribe
0.90
allery
0.85
Dragonbound
0.80
ombat
0.77
mailbox
0.72
lain
0.71
esy
0.68
helle
0.65
0.64
Activations Density 0.017%