INDEX
Explanations
words related to a specific name, "Ali"
the presence of the substring "ali" in various contexts
New Auto-Interp
Negative Logits
acters
-0.84
theless
-0.83
inately
-0.80
ried
-0.78
ilaterally
-0.74
stairs
-0.73
ancial
-0.73
ifiable
-0.71
mil
-0.71
meyer
-0.71
POSITIVE LOGITS
Äĩ
1.14
yah
1.05
ño
1.03
ñ
1.03
ya
0.97
ña
0.96
á¹
0.86
qt
0.84
versa
0.82
Äį
0.82
Activations Density 0.094%