INDEX
Explanations
references to social media platforms and pages
Negative Logits
Token         Logit
aldo          -0.16
stadt         -0.15
illes         -0.14
STA           -0.14
WebRequest    -0.14
alsex         -0.13
baru          -0.13
napshot       -0.13
roker         -0.13
/GL           -0.13
Positive Logits
Token         Logit
feed          0.22
site          0.20
page          0.18
account       0.18
Feed          0.16
ertil         0.16
channel       0.16
/feed         0.15
pages         0.15
.feed         0.15
Activations Density: 0.048%