INDEX
Negative Logits
COD
0.80
ummer
0.74
Poss
0.73
Capt
0.71
Wear
0.71
sense
0.71
ität
0.71
Eigent
0.70
rates
0.70
녹
0.69
POSITIVE LOGITS
Lets
1.54
lets
1.50
ting
1.46
Let
1.45
Lets
1.41
Let
1.38
let
1.31
让我们
1.24
讓我們
1.21
Давайте
1.21
Activations Density 0.439%