Explanations
the name "Malik" in various contexts
mentions of a specific individual named Malik
Negative Logits
raged      -0.79
racted     -0.70
urned      -0.69
OME        -0.69
hire       -0.69
nard       -0.69
atom       -0.68
inction    -0.67
perty      -0.62
obar       -0.62
Positive Logits
Hasan      1.12
Malik      1.05
awi        0.86
pedia      0.82
hawks      0.81
ova        0.75
anova      0.74
achu       0.73
ovic       0.72
tin        0.71
Activations Density: 0.012%