INDEX
Negative Logits
abbia
0.50
Не
0.47
vious
0.45
憨
0.43
紹介
0.43
Lovers
0.42
голу
0.42
カ
0.42
모
0.42
Ка
0.42
POSITIVE LOGITS
CRIPT
0.47
ुलर
0.45
órgãos
0.45
రాజకీయ
0.45
conseil
0.44
sociological
0.43
IR
0.43
bureaucratic
0.43
громадян
0.43
citizenship
0.42
Activations Density 0.002%