INDEX
Negative Logits
bel
-0.08
沈
-0.08
dura
-0.08
nang
-0.08
Royal
-0.08
.generic
-0.07
ocer
-0.07
Royal
-0.07
tut
-0.07
наж
-0.07
POSITIVE LOGITS
Consensus
0.09
Consensus
0.08
hallway
0.08
equival
0.07
attainment
0.07
Assertion
0.07
ellipse
0.07
genesis
0.07
prover
0.07
ذلك
0.07
Activations Density 0.022%