INDEX
Negative Logits
refinancing
-0.79
didSet
-0.74
mirroring
-0.73
e
-0.69
squatting
-0.68
animating
-0.68
pinching
-0.68
alimentaria
-0.67
kasarigan
-0.67
jamming
-0.67
POSITIVE LOGITS
ly
1.05
LY
0.65
nya
0.65
her
0.60
aber
0.60
the
0.57
beginnetje
0.57
fast
0.57
its
0.56
noble
0.56
Activations Density 0.063%