INDEX
Explanations
phrases and context related to authoritative statements or proclamations
New Auto-Interp
Negative Logits
Shiite
-0.25
Shepard
-0.22
fony
-0.22
LGBTQ
-0.21
whiskey
-0.20
Francois
-0.20
errick
-0.20
WiFi
-0.19
advisors
-0.19
Griffin
-0.18
POSITIVE LOGITS
Muhammed
0.29
Andersen
0.28
MacDonald
0.27
Nicholson
0.25
Henderson
0.25
477
0.24
Patterson
0.23
WIFI
0.23
ensen
0.22
Mohammed
0.22
Activations Density 0.181%