INDEX
Explanations
references to the term "du" and its variations in different contexts
New Auto-Interp
Negative Logits
rim
-0.17
ured
-0.16
gio
-0.14
üb
-0.14
omor
-0.14
neider
-0.14
rimp
-0.14
ukes
-0.14
missing
-0.14
layan
-0.14
POSITIVE LOGITS
ovny
0.17
Dud
0.17
amel
0.16
CSI
0.16
eldorf
0.15
chod
0.15
cis
0.15
_EXTERNAL
0.15
PLICATE
0.14
lex
0.14
Activations Density 0.093%