NEGATIVE LOGITS
occurs        -0.81
osate         -0.78
inav          -0.77
ulence        -0.74
Highlights    -0.72
lasts         -0.71
legality      -0.70
ossom         -0.70
Ensure        -0.70
flourish      -0.69
POSITIVE LOGITS
aware         1.37
afraid        1.34
able          1.32
willing       1.27
unable        1.25
thankful      1.21
glad          1.20
unaware       1.16
obligated     1.16
fortunate     1.16
Activations Density: 3.432%