INDEX
Explanations
Hebrew words or characters
Hebrew letters or characters
New Auto-Interp
Negative Logits
hell
-0.78
aston
-0.77
ophon
-0.76
alien
-0.73
arding
-0.72
kamp
-0.70
alore
-0.70
atos
-0.69
amia
-0.68
oons
-0.68
POSITIVE LOGITS
ño
0.75
Ľ
0.74
BLIC
0.74
ãĥ¼ãĥĨ
0.72
partName
0.70
Äĩ
0.70
å§«
0.66
׾
0.66
lda
0.66
odcast
0.66
Activations Density 0.034%