INDEX
Negative Logits
Ukrain
-1.20
wcs
-0.90
constitu
-0.88
iren
-0.84
anamo
-0.80
EStream
-0.79
champagne
-0.77
Flavoring
-0.77
ylon
-0.75
Bubble
-0.72
POSITIVE LOGITS
ethic
1.70
flows
1.57
station
1.55
bench
1.43
horse
1.42
aday
1.36
manship
1.36
forces
1.18
tops
1.15
hops
1.14
Activations Density 5.333%