INDEX
Explanations
references to political or governmental issues
New Auto-Interp
Negative Logits
Tamil
-0.18
Bahamas
-0.17
Jiang
-0.17
Bronx
-0.17
Bangladesh
-0.17
Southampton
-0.17
Syracuse
-0.17
ihan
-0.16
Jacksonville
-0.16
odega
-0.16
POSITIVE LOGITS
Colorado
0.91
Denver
0.84
Colorado
0.82
Denver
0.75
Boulder
0.68
Colo
0.60
Rockies
0.60
Broncos
0.58
Arap
0.51
Rocky
0.49
Activations Density 0.297%