INDEX
Explanations
information related to birth dates and birthplaces of individuals
Negative Logits
Token        Logit
phis         -0.82
abulary      -0.70
uko          -0.70
vernment     -0.69
erguson      -0.68
Flavoring    -0.65
dfx          -0.64
awaru        -0.64
phasis       -0.64
uyomi        -0.63
Positive Logits
Token        Logit
stellar      0.75
м            0.75
inet         0.74
emis         0.71
anew         0.67
л            0.67
prematurely  0.67
abroad       0.66
hating       0.65
iny          0.65
Activations Density: 0.027%