INDEX
Explanations
proper nouns, particularly names
New Auto-Interp
Negative Logits
Galen
-0.96
Galway
-0.83
Wolfe
-0.81
Banten
-0.81
Balth
-0.80
+#+
-0.80
tonsoft
-0.79
Ārējās
-0.79
Packers
-0.79
inkl
-0.77
POSITIVE LOGITS
Piero
0.70
смо
0.69
Jenkins
0.69
CRIP
0.66
Gar
0.65
Weaver
0.65
crip
0.65
Gar
0.64
Pem
0.64
Verg
0.63
Activations Density 1.490%