INDEX
Explanations
references to German identity or culture
New Auto-Interp
Negative Logits
دانشنامهٔ
-0.77
aarrggbb
-0.77
"..\..\
-0.76
Turki
-0.76
iprot
-0.75
Kirs
-0.73
”?
-0.69
Flä
-0.68
appoint
-0.68
pointment
-0.68
POSITIVE LOGITS
Germany
1.13
Germany
1.05
Germans
1.02
GERMAN
1.01
German
1.00
germany
0.96
GERMANY
0.94
germany
0.93
Allemagne
0.93
German
0.84
Activations Density 0.013%