INDEX
Explanations
references to Facebook and its operations
New Auto-Interp
Negative Logits
İY
-0.16
oldem
-0.16
ιο
-0.15
uisse
-0.15
Disney
-0.15
owie
-0.15
ityEngine
-0.15
ocument
-0.14
oki
-0.14
Registry
-0.14
POSITIVE LOGITS
FB
0.29
.fb
0.28
FB
0.27
fb
0.27
0.27
Zuckerberg
0.26
0.25
0.25
(fb
0.24
0.23
Activations Density 0.047%