Explanations
references to Australian identity and culture
Negative Logits
للاسماء           -0.84
surla             -0.59
surate            -0.55
sizeCache         -0.54
zsche             -0.53
RegressionTest    -0.52
beginnetje        -0.50
tắt               -0.50
kasarigan         -0.50
Obrázky           -0.49
Positive Logits
domestically      0.69
nationalism       0.65
patriotic         0.64
domestic          0.63
national          0.63
rrggbb            0.63
homeland          0.62
nationaux         0.62
pride             0.62
weder             0.62
Activations Density 0.349%