INDEX
Explanations
references to Germany and its political figures or actions
New Auto-Interp
Negative Logits
ág
-0.15
Nunes
-0.15
Ranch
-0.14
浩
-0.14
laden
-0.14
cxx
-0.14
íĻĺ
-0.14
.nih
-0.14
Bronx
-0.14
Verfüg
-0.14
POSITIVE LOGITS
CD
0.43
SPD
0.40
Merkel
0.34
Chancellor
0.34
CD
0.34
chancellor
0.33
Angela
0.31
Bund
0.30
CDs
0.29
spd
0.28
Activations Density 0.025%