INDEX
Explanations
instances of the word "free" and related terms indicating complimentary offers
New Auto-Interp
Negative Logits
pricey
-0.16
ibo
-0.15
460
-0.15
ÑĢава
-0.15
que
-0.14
rece
-0.14
xca
-0.14
ns
-0.14
pike
-0.14
wn
-0.14
POSITIVE LOGITS
bies
0.42
bie
0.40
zes
0.24
-standing
0.24
zers
0.22
-of
0.21
bsd
0.21
zing
0.21
trials
0.21
ze
0.20
Activations Density 0.035%