Explanations
social media references and updates on current events, particularly those related to law enforcement and political figures
Negative Logits
corrections    -0.77
ÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤ    -0.65
secretaries    -0.60
notebooks    -0.59
estab    -0.59
Levant    -0.58
ãĢij    -0.58
assistants    -0.58
*/(    -0.57
settles    -0.57
Positive Logits
zx    1.11
zn    1.08
bh    1.03
    1.01
qv    1.01
zl    0.99
zu    0.98
bj    0.97
xus    0.96
uo    0.94
Activations Density 0.366%