INDEX
Explanations
states within the United States
statements regarding specific states and their attributes or actions
New Auto-Interp
Negative Logits
ongyang
-0.95
behaviours
-0.80
behaviour
-0.79
atos
-0.72
calib
-0.70
oyal
-0.68
fins
-0.68
analys
-0.67
aviour
-0.67
ariat
-0.66
POSITIVE LOGITS
Medicaid
0.85
aucuses
0.81
NAACP
0.78
edu
0.77
population
0.77
sylvania
0.76
gov
0.75
Interstate
0.69
District
0.68
bred
0.68
Activations Density 0.670%