INDEX
Explanations
words related to specific places or entities, specifically those with the pattern "Sou__" or "Sask__"
references to specific geographic locations and organizations
Negative Logits
orate          -0.79
urate          -0.73
hist           -0.69
ipolar         -0.67
uate           -0.65
oard           -0.65
jamin          -0.64
umen           -0.64
arding         -0.63
withholding    -0.63
Positive Logits
plings         0.87
atchewan       0.79
Rough          0.78
crow           0.77
ustain         0.77
atoon          0.76
stice          0.75
udo            0.75
pling          0.74
zhen           0.74
Activations Density: 0.085%
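
To put the density figure in concrete terms, the sketch below converts the percentage into an expected count of activating tokens. It assumes that "density" means the fraction of sampled tokens on which the feature has a nonzero activation; the page itself does not define the term. The sample size of 1,000,000 tokens is a hypothetical number chosen only for illustration.

```python
# Density as reported on this page, in percent.
density_percent = 0.085

# Convert the percentage to a fraction of tokens.
# Assumption: density = (tokens with nonzero activation) / (tokens sampled).
density_fraction = density_percent / 100

# Hypothetical sample size, not taken from the page.
sample_tokens = 1_000_000

# Expected number of tokens in the sample on which the feature fires.
expected_active = round(density_fraction * sample_tokens)

# Average number of tokens between activations, rounded to the nearest token.
tokens_per_activation = round(1 / density_fraction)

print(expected_active)        # 850
print(tokens_per_activation)  # 1176
```

Under that reading, the feature would fire on roughly 1 token in every 1,176, which is consistent with a narrow feature tied to a small set of place-name fragments.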