INDEX
Explanations
mentions of Southern regions or organizations
the repeated mention of the term "Southern" in various contexts
Negative Logits
icular    -0.87
icles     -0.81
endi      -0.79
uers      -0.76
roma      -0.76
icit      -0.76
acl       -0.73
ifice     -0.72
/**       -0.71
%]        -0.71
Positive Logits
Hemisphere    1.14
hemisphere    0.98
Poverty       0.92
most          0.86
western       0.84
California    0.80
Belle         0.79
Shroud        0.77
Railway       0.77
States        0.76
Activations Density 0.009%