Explanations
references to the Justice Department
Negative Logits
Token        Logit
hest         -0.78
iple         -0.78
livest       -0.75
uers         -0.75
eq           -0.71
hetically    -0.71
ength        -0.71
ulum         -0.71
ÃŁ           -0.70
ruly         -0.68
Positive Logits
Token        Logit
Department   1.12
Dept         1.08
League       0.94
Assistance   0.89
Corps        0.86
Scalia       0.83
Justice      0.82
Centers      0.81
Center       0.80
Bureau       0.79
Activations Density 0.012%