INDEX
Explanations
promotional messages or offers
references to promotional content or marketing materials
New Auto-Interp
Negative Logits
GO
-0.87
acht
-0.76
Ness
-0.73
Apostles
-0.71
ña
-0.68
yer
-0.68
ighed
-0.68
gger
-0.66
ectar
-0.66
kos
-0.65
POSITIVE LOGITS
promotions
1.00
promotional
0.99
eatures
0.88
calendars
0.84
banners
0.82
andise
0.81
promotion
0.80
promo
0.79
wcs
0.79
broch
0.78
Activations Density 0.015%