INDEX
Negative Logits
-0.08
creates
-0.07
huts
-0.07
(metadata
-0.07
Mahar
-0.07
shot
-0.07
adaptive
-0.07
tours
-0.07
matured
-0.07
their
-0.07
POSITIVE LOGITS
oppos
0.08
oppose
0.08
opos
0.08
сказ
0.08
Saying
0.08
contrap
0.08
NEC
0.08
Нап
0.08
_kwargs
0.08
Boc
0.08
Activations Density 0.001%