INDEX
NEGATIVE LOGITS
Token       Logit
’dan        -0.06
83          -0.06
роме        -0.06
tap         -0.06
重大        -0.06
段          -0.06
ancora      -0.06
_DP         -0.06
전에        -0.06
firewall    -0.06
POSITIVE LOGITS
Token       Logit
CO          0.07
xs          0.07
Throughout  0.06
Types       0.06
Committees  0.06
renew       0.06
confined    0.06
said        0.06
(c          0.06
agine       0.06
Activations Density 0.097%