INDEX
Explanations
occurrences of the word "przez."
Negative Logits
ajar         -0.16
aje          -0.15
omm          -0.15
ulum         -0.15
ande         -0.14
ahat         -0.14
ouve         -0.14
fers         -0.14
cheid        -0.14
Convention   -0.14
Positive Logits
ιÏİν         0.15
otron        0.14
orth         0.14
orque        0.14
-gnu         0.14
daq          0.13
urally       0.13
ÛĮاÙĨ        0.13
sse          0.13
ersist       0.13
Activations Density 0.001%