INDEX
Explanations
mentions of the name "Ali."
occurrences of the substring "ali"
New Auto-Interp
Negative Logits
acters
-0.84
sylvania
-0.82
oven
-0.79
lies
-0.79
olicy
-0.79
IAL
-0.77
hered
-0.76
manship
-0.75
lov
-0.75
ly
-0.75
POSITIVE LOGITS
yah
1.05
Äĩ
0.89
ensis
0.89
ño
0.88
Lama
0.84
qi
0.83
WAYS
0.79
ña
0.79
ñ
0.75
ë
0.72
Activations Density 0.017%