INDEX
Explanations
special characters and non-English alphabets with a specific emphasis on the character 'ë' and specific combinations like 'ïķ'
specific characters or glyphs used in a non-standard encoding or script
New Auto-Interp
Negative Logits
wagen
-0.83
romeda
-0.72
theless
-0.71
ablishment
-0.71
bourg
-0.70
ambers
-0.69
lander
-0.67
Feet
-0.67
peed
-0.66
oleon
-0.65
POSITIVE LOGITS
¹
1.08
ª
0.96
º
0.95
ī
0.95
¨
0.93
Ĩ
0.93
ķ
0.91
¦
0.91
Į
0.87
Ģ
0.87
Activations Density 0.029%