INDEX
Negative Logits
ATA
-0.85
rollers
-0.83
ãĥİ
-0.82
der
-0.78
tsky
-0.75
ת
-0.75
roller
-0.73
trak
-0.71
stra
-0.71
×Ķ
-0.70
POSITIVE LOGITS
between
1.09
ials
1.03
between
1.00
yip
0.97
iveness
0.96
ially
0.96
iating
0.95
iculty
0.93
ensable
0.84
Between
0.83
Activations Density 8.820%