INDEX
Explanations
names related to the Arabic culture
mentions of specific individuals, particularly the name Malik
New Auto-Interp
Negative Logits
OME
-0.74
hire
-0.73
raged
-0.70
inction
-0.68
racted
-0.65
rep
-0.64
cess
-0.63
ĻĤ
-0.62
ocratic
-0.62
nard
-0.61
POSITIVE LOGITS
Malik
1.22
Hasan
1.21
pedia
0.86
istan
0.81
imir
0.80
Ahmad
0.79
awi
0.79
ovic
0.78
zai
0.77
ova
0.75
Activations Density 0.011%