INDEX
Negative Logits
wcs           -0.71
adesh         -0.69
deductions    -0.66
combust       -0.65
downs         -0.65
vectors       -0.65
blot          -0.64
helic         -0.64
coloring      -0.63
curtains      -0.63
Positive Logits
Sao           0.91
Ot            0.88
Rochester     0.86
Tokyo         0.84
California    0.83
Applied       0.81
Chicago       0.80
Notre         0.80
Southern      0.79
Warwick       0.79
Activations Density: 0.302%