INDEX
Explanations
occurrences of the dollar sign in various contexts
Negative Logits
joy      -0.18
ess      -0.16
elez     -0.16
cctor    -0.16
AYOUT    -0.15
iron     -0.15
ekler    -0.14
ница     -0.14
mus      -0.14
hs       -0.14
Positive Logits
nez      0.16
erli     0.15
imed     0.15
ilha     0.15
کر       0.14
632      0.14
Trem     0.14
lique    0.14
quila    0.14
elves    0.14
Activations Density 0.011%