INDEX
Explanations
references to or mentions of specific districts
references to specific districts
New Auto-Interp
Negative Logits
issan
-0.83
pload
-0.82
ocaust
-0.82
tty
-0.76
odus
-0.76
conn
-0.74
ipedia
-0.74
etheus
-0.73
vous
-0.73
glers
-0.73
POSITIVE LOGITS
rict
0.96
ancest
0.81
wide
0.80
district
0.77
districts
0.77
boundaries
0.71
ricting
0.71
ributed
0.71
attorney
0.70
ribution
0.70
Activations Density 0.017%