INDEX
Explanations
Arabic names starting with "Ab"
words and names related to individuals and their affiliations or roles
New Auto-Interp
Negative Logits
bourg
-0.72
gears
-0.69
voy
-0.65
ppa
-0.64
ngth
-0.64
uania
-0.64
oshenko
-0.63
ppers
-0.62
ograp
-0.62
doom
-0.60
POSITIVE LOGITS
ÃĽ
0.86
SourceFile
0.78
afia
0.77
ILA
0.70
dash
0.70
uary
0.68
ailability
0.68
lie
0.68
ĪĴ
0.68
eki
0.67
Activations Density 0.081%