INDEX
NEGATIVE LOGITS
którego     -0.07
rejection   -0.07
.seek       -0.07
tected      -0.07
::$         -0.07
harmed      -0.07
ൃത          -0.07
impacted    -0.07
Reject      -0.07
canceled    -0.07
POSITIVE LOGITS
BFS         0.09
gir         0.09
Gir         0.09
quine       0.08
Dolls       0.08
bfs         0.08
successive  0.08
naging      0.08
滨          0.08
repeatedly  0.08
Activations Density: 0.037%