INDEX
Explanations
proper nouns or names
and signal occurrences of the character "ľ"
New Auto-Interp
Negative Logits
condem
-0.79
apes
-0.69
Norn
-0.68
reflex
-0.67
raints
-0.66
purpose
-0.64
Patriarch
-0.63
disadvant
-0.63
gist
-0.63
deed
-0.62
POSITIVE LOGITS
ï¸ı
1.29
âĶĢâĶĢ
1.09
ternity
0.96
0.89
âĸł
0.88
°
0.86
jj
0.85
\-
0.84
conom
0.84
··
0.84
Activations Density 0.269%