INDEX
Negative Logits
ulu
-0.15
ãģ¦
-0.14
Weber
-0.14
852
-0.14
vest
-0.14
ein
-0.13
coli
-0.13
.modules
-0.13
cess
-0.13
uster
-0.13
POSITIVE LOGITS
reform
0.17
Reform
0.16
-divider
0.15
cntl
0.14
bou
0.14
Ball
0.14
enticated
0.14
decay
0.14
towers
0.14
itical
0.13
Activations Density 0.030%