INDEX
NEGATIVE LOGITS
stead      -0.18
lv         -0.16
opt        -0.15
ilo        -0.15
INET       -0.15
swire      -0.15
acemark    -0.15
ella       -0.15
sey        -0.15
shield     -0.15
POSITIVE LOGITS
atatype    0.16
entifier   0.15
atre       0.15
ero        0.15
coli       0.14
วม         0.14
ož         0.14
CX         0.14
instein    0.13
belt       0.13
Activations Density: 0.081%