INDEX
Explanations
references to governmental and organizational structures within the EU
New Auto-Interp
Negative Logits
ÏĢη
-0.18
oute
-0.16
æħİ
-0.15
ereg
-0.15
ladu
-0.15
unma
-0.15
죽
-0.15
ogg
-0.14
abler
-0.14
ram
-0.14
POSITIVE LOGITS
osate
0.17
Roths
0.15
ADO
0.14
478
0.14
æ§
0.14
owed
0.13
bou
0.13
ê´ij
0.13
brunch
0.13
.charAt
0.13
Activations Density 0.014%