INDEX
Explanations
Canadian-related terms, especially those related to legal or political issues
New Auto-Interp
Negative Logits
bush
-0.80
regress
-0.67
sterling
-0.67
Cheong
-0.60
Ruk
-0.57
Cantor
-0.57
fronts
-0.56
counter
-0.55
Emanuel
-0.55
quarters
-0.55
POSITIVE LOGITS
adian
0.79
ue
0.78
cially
0.77
illo
0.74
aukee
0.74
cial
0.73
ildo
0.73
rouse
0.71
obos
0.71
igue
0.70
Activations Density 0.048%