INDEX
Negative Logits
lished
-0.75
ARK
-0.67
Ü
-0.67
Ended
-0.65
ELL
-0.65
blance
-0.64
orthy
-0.60
Nanto
-0.60
Pound
-0.60
merce
-0.59
POSITIVE LOGITS
ibrary
1.14
umin
0.96
ustration
0.94
usive
0.92
vl
0.91
uminati
0.91
usions
0.89
ove
0.88
ust
0.88
ution
0.84
Activations Density 0.023%