Explanations
repeated or special characters, potentially indicating noise or formatting issues in text
Negative Logits
eka      -0.16
argent   -0.15
Bread    -0.15
inkel    -0.14
ure      -0.14
ibal     -0.14
izo      -0.14
bread    -0.14
lok      -0.14
iber     -0.13
Positive Logits
%B          0.15
(&(         0.15
à¤Ĩर       0.14
ordova      0.14
istically   0.14
Mattis      0.14
crypt       0.14
cü          0.14
į           0.14
ös          0.13
Activations Density 0.002%