INDEX
Explanations
mentions of the name "Ali"
repeated mentions of the name "Ali"
New Auto-Interp
Negative Logits
ly
-0.86
namese
-0.78
ledged
-0.77
lace
-0.73
lisher
-0.72
hire
-0.72
lain
-0.71
eers
-0.70
lator
-0.69
cies
-0.69
POSITIVE LOGITS
Jinn
1.11
ases
1.01
osa
0.99
ased
0.96
Express
0.90
Kham
0.88
Sina
0.85
orescence
0.84
otti
0.84
Ali
0.83
Activations Density 0.039%