INDEX
Explanations
phrases related to subscribing to newsletters
instances of the word "we" and related phrases that signify collective action or communication
New Auto-Interp
Negative Logits
Kand
-0.56
Ukrain
-0.56
Emer
-0.53
Gentle
-0.51
Eleven
-0.50
Dunham
-0.50
Dull
-0.48
partName
-0.48
naire
-0.48
Lean
-0.48
POSITIVE LOGITS
've
0.61
Have
0.61
astics
0.59
're
0.59
ighed
0.56
atered
0.55
asel
0.54
Movie
0.53
Got
0.53
bsp
0.52
Activations Density 0.013%