Negative Logits
Token        Logit
Hoover       -0.74
WOOD         -0.73
Meadow       -0.70
ECO          -0.68
Ames         -0.67
Weir         -0.66
clearance    -0.66
Wast         -0.66
Kurd         -0.65
Wem          -0.64
Positive Logits
Token        Logit
piracy       1.57
ervatives    1.52
umers        1.46
ensus        1.40
istent       1.35
cientious    1.35
ensual       1.35
olid         1.32
idered       1.31
erv          1.30
Activations Density: 0.005%