INDEX
Explanations
references to Facebook and its related features, functions, or pages
New Auto-Interp
Negative Logits
weise
-0.15
ãĥ©ãĥĥãĤ¯
-0.15
weblog
-0.15
ONO
-0.15
одав
-0.14
äd
-0.14
Ñħи
-0.14
ephir
-0.14
venes
-0.14
Associ
-0.14
POSITIVE LOGITS
Messenger
0.30
s
0.28
messenger
0.24
0.23
/T
0.22
groups
0.21
istan
0.20
Messenger
0.20
Groups
0.20
page
0.20
Activations Density 0.016%