INDEX
Negative Logits
би
0.62
ки
0.61
리
0.55
2
0.55
но
0.54
gegen
0.53
،
0.52
يز
0.52
ня
0.50
wärts
0.50
POSITIVE LOGITS
lost
0.53
it
0.47
the
0.46
Medicaid
0.45
DARK
0.44
CH
0.42
us
0.42
n
0.41
pea
0.41
る
0.40
Activations Density 0.010%
би
ки
리
2
но
gegen
،
يز
ня
wärts
lost
it
the
Medicaid
DARK
CH
us
n
pea
る