INDEX
Explanations
names that contain the word "Ali"
mentions of the name "Ali."
New Auto-Interp
Negative Logits
namese
-0.87
lace
-0.83
ledged
-0.79
ly
-0.76
lying
-0.74
ledge
-0.73
neys
-0.72
eenth
-0.70
lain
-0.69
darn
-0.69
POSITIVE LOGITS
ases
0.95
osa
0.93
Jinn
0.93
Express
0.88
orescence
0.85
ased
0.84
otti
0.81
Kham
0.80
veyard
0.79
Ali
0.77
Activations Density 0.032%