INDEX
Explanations
mentions of the Justice Department
repeated mentions of the "Justice Department."
New Auto-Interp
Negative Logits
iple
-0.83
hest
-0.77
ength
-0.76
vre
-0.74
livest
-0.73
uers
-0.72
ulum
-0.70
ÃŁ
-0.67
ppy
-0.67
ulators
-0.66
POSITIVE LOGITS
Dept
1.06
Department
1.05
League
0.88
Scalia
0.87
Assistance
0.83
Anton
0.82
Corps
0.78
Gins
0.77
Clarence
0.76
Minister
0.75
Activations Density 0.027%