INDEX
Explanations
names, especially those of Eastern European origin
names of individuals, particularly those involved in creative or impactful roles
New Auto-Interp
Negative Logits
sonian
-0.86
TAIN
-0.79
éļ
-0.76
Failure
-0.74
OTAL
-0.71
lement
-0.70
Participation
-0.68
ACTED
-0.67
rollment
-0.67
ancial
-0.64
POSITIVE LOGITS
amaz
0.80
Kaz
0.77
iggurat
0.75
onis
0.74
orb
0.73
imir
0.73
oan
0.72
imoto
0.72
uki
0.72
hur
0.72
Activations Density 0.022%