INDEX
Explanations
phrases suggesting subscribing to newsletters
inquiries about trustworthy news sources
Negative Logits
ioned      -0.72
orem       -0.66
geist      -0.65
xton       -0.64
yg         -0.63
backlog    -0.62
ortium     -0.61
endon      -0.61
pic        -0.61
xus        -0.60
Positive Logits
independence   0.67
Currency       0.66
UNHCR          0.65
taboola        0.63
search         0.62
iframe         0.62
Afee           0.62
Subscribe      0.61
sources        0.57
subscribing    0.57
Activations Density: 0.057%