INDEX
Explanations
references to German political institutions or entities
references to the word "Bund" or its variations
New Auto-Interp
Negative Logits
OPLE
-0.73
breath
-0.71
phia
-0.70
okin
-0.63
gyn
-0.62
MPH
-0.61
opy
-0.61
Wizards
-0.61
anto
-0.61
APE
-0.60
POSITIVE LOGITS
Bund
1.33
bund
1.09
nam
0.96
aroo
0.90
emonium
0.87
endor
0.83
liga
0.82
heit
0.76
alions
0.76
nodd
0.76
Activations Density 0.005%