INDEX
Explanations
mentions of police reports or claims of violence
mentions of the letter "M."
New Auto-Interp
Negative Logits
ãĤ¡
-0.81
Eleven
-0.68
DragonMagazine
-0.68
gerald
-0.65
yours
-0.64
fashioned
-0.64
vacant
-0.63
tipped
-0.63
begg
-0.63
tips
-0.62
POSITIVE LOGITS
uppet
1.09
ixed
1.06
useum
1.02
iscal
1.02
uzzle
1.02
oses
1.01
ambo
1.01
ISSION
1.01
OST
1.00
aternal
0.99
Activations Density 0.064%