INDEX
Explanations
proper nouns related to places and historical sites
New Auto-Interp
Negative Logits
AnchorStyles
-0.72
Ader
-0.70
Lorenzo
-0.65
Neve
-0.64
موس
-0.63
MIK
-0.60
Fru
-0.60
Sz
-0.60
sanguí
-0.59
도
-0.59
POSITIVE LOGITS
Sheffield
0.83
Rochester
0.81
Oldham
0.80
Sheffield
0.79
Rochester
0.77
Doncaster
0.77
Vicksburg
0.77
Yorkshire
0.76
scalatest
0.75
Chesterfield
0.75
Activations Density 1.374%