Explanations
names and titles of political figures
names of individuals and references to their affiliations or roles
Negative Logits
ĸļ          -0.86
Grateful    -0.80
sylv        -0.79
Starcraft   -0.78
humans      -0.75
mammalian   -0.73
fec         -0.71
Californ    -0.70
cryst       -0.69
veterin     -0.69
Positive Logits
Ahmad       1.18
Hussein     1.10
uala        1.10
Shah        1.10
Hasan       1.10
Sharif      1.07
Ibrahim     1.06
Mohamed     1.05
Hassan      1.05
ji          1.05
Activations Density: 0.269%