INDEX
Explanations
patterns related to Hebrew characters and words
textual representations of non-English characters or scripts
New Auto-Interp
Negative Logits
lyak
-0.78
Cly
-0.77
upiter
-0.76
estern
-0.74
annis
-0.74
iko
-0.73
arching
-0.72
tten
-0.69
psey
-0.67
onge
-0.67
POSITIVE LOGITS
Ù
1.74
ÙĨ
1.66
Ùĩ
1.66
ا
1.62
د
1.57
Ùħ
1.55
اØ
1.54
Ø
1.52
ÙĬ
1.50
ت
1.49
Activations Density 0.004%