INDEX
Negative Logits
âķIJâķIJ
-0.68
uracy
-0.64
Nieto
-0.63
utes
-0.62
OPE
-0.62
Privacy
-0.60
Utt
-0.58
hovah
-0.58
AUT
-0.58
perfection
-0.58
POSITIVE LOGITS
tenance
1.65
stay
1.38
deck
1.11
stream
1.08
lander
1.02
frame
1.00
boards
0.96
mast
0.95
tan
0.93
lining
0.93
Activations Density 0.034%