INDEX
Explanations
references to social media platforms, especially Facebook
mentions and references to the social media platform Facebook
New Auto-Interp
Negative Logits
nces
-0.87
ved
-0.73
lass
-0.69
ptin
-0.68
DERR
-0.68
externalToEVAOnly
-0.67
rer
-0.67
dry
-0.67
heny
-0.67
teen
-0.66
POSITIVE LOGITS
Messenger
0.97
nect
0.94
Live
0.84
acebook
0.78
icus
0.77
Pages
0.76
ical
0.76
imity
0.76
username
0.76
Caf
0.72
Activations Density 0.040%