INDEX
Negative Logits
945
-0.17
con
-0.16
ials
-0.16
942
-0.15
constr
-0.14
olla
-0.14
asks
-0.14
ahr
-0.14
975
-0.14
auto
-0.14
POSITIVE LOGITS
eer
0.19
iversit
0.17
rodin
0.16
raised
0.16
ruk
0.15
abbix
0.15
å¸Ī
0.15
rotch
0.15
antee
0.15
SGlobal
0.15
Activations Density 0.008%