INDEX
Explanations
words related to social media platforms and activism
New Auto-Interp
Negative Logits
nces
-0.85
heny
-0.69
cised
-0.69
lass
-0.63
rosc
-0.62
ved
-0.62
Croat
-0.62
inence
-0.62
ptin
-0.61
Äĩ
-0.61
POSITIVE LOGITS
Messenger
1.01
nect
0.93
username
0.90
Live
0.83
Pages
0.81
acebook
0.81
Username
0.77
0.73
Friends
0.73
Ads
0.72
Activations Density 0.379%