Explanations
the name "Ali" in the text
mentions of the name "Ali."
Negative Logits
Token       Logit
ly          -0.89
namese      -0.82
sburgh      -0.79
lace        -0.79
ledged      -0.76
ledge       -0.76
olicy       -0.76
liness      -0.73
lisher      -0.71
lying       -0.70
Positive Logits
Token       Logit
Jinn        1.00
ases        0.95
ased        0.91
Express     0.90
Kham        0.88
osa         0.86
orescence   0.84
asing       0.81
otti        0.78
د           0.75
Activations Density: 0.038%