INDEX
Explanations
phrases and words related to email promotions
instances of punctuation and formatting in writing
New Auto-Interp
Negative Logits
iter
-0.74
Foot
-0.68
rely
-0.67
ensical
-0.65
orable
-0.65
olas
-0.65
comprom
-0.65
paces
-0.65
imus
-0.62
uder
-0.62
POSITIVE LOGITS
,,,,,,,,
0.88
::::::::
0.84
,,,,
0.81
::::
0.75
taboola
0.72
tel
0.72
then
0.71
tra
0.69
;;;;;;;;;;;;
0.68
Lad
0.66
Activations Density 0.030%