INDEX
Negative Logits
cors      0.69
стро      0.68
wis       0.66
adı       0.65
defining  0.64
defined   0.64
dbh       0.64
eval      0.63
(«        0.63
locally   0.62
Positive Logits
something  1.11
的是       1.10
either     0.94
the        0.94
一下       0.93
anything   0.92
nothing    0.92
bahwa      0.88
what       0.87
things     0.87
Activations Density 0.347%