INDEX
Explanations
mentions of the word "Canadians"
references to Canadians
New Auto-Interp
Negative Logits
sk
-0.71
fecture
-0.70
/+
-0.68
sole
-0.68
planet
-0.68
gur
-0.67
ergy
-0.66
boss
-0.64
ctor
-0.64
Verb
-0.64
POSITIVE LOGITS
Canadians
1.10
ervatives
1.03
ervative
0.81
aurus
0.80
Australians
0.79
umers
0.78
Colomb
0.77
submar
0.72
Jagu
0.70
Americans
0.70
Activations Density 0.011%