INDEX
Explanations
mentions of the city "Austin"
mentions of the city of Austin
Negative Logits
mor        -0.69
chrom      -0.68
thal       -0.68
chwitz     -0.68
np         -0.67
chrom      -0.67
np         -0.67
Prin       -0.66
paste      -0.65
paste      -0.65
Positive Logits
Austin      3.69
Austin      3.36
Dallas      1.78
Houston     1.74
Arlington   1.67
Texas       1.65
Houston     1.65
Dallas      1.60
Travis      1.60
SX          1.57
Activations Density 0.015%