Explanations
occurrences of the German article "de" and its variations
Negative Logits
ÙĤÙĩ        -0.16
arend       -0.16
aces        -0.15
eyJ         -0.15
ebo         -0.15
ofday       -0.14
ãģ£ãģı      -0.14
Fortress    -0.14
ammen       -0.14
ÃĨ          -0.14
Positive Logits
uts         0.48
utschen     0.38
utsch       0.34
utsche      0.30
utch        0.26
its         0.25
chant       0.24
ister       0.24
zent        0.23
pon         0.22
Activations Density 0.009%
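
The density figure above can be read as the share of token positions on which the feature is active. The snippet below is a minimal sketch under that assumption. The page does not state the exact definition, so the function name `activation_density`, the zero threshold, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions on which the
    feature fires. This definition is not confirmed by the source page.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: the feature fires on 9 of 100,000 token positions.
acts = [0.0] * 100_000
for i in range(9):
    acts[i * 1000] = 1.5

density = activation_density(acts)
print(f"{density:.3%}")
```

With these made-up numbers the feature is active on 9 of 100,000 positions, which is the same order of sparsity as the figure reported on the page.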