INDEX
Negative Logits
되
0.39
maiores
0.38
碾
0.38
㧕
0.37
핏
0.37
庄
0.37
🍃
0.36
烈
0.36
ポール
0.36
Shutdown
0.36
POSITIVE LOGITS
[,,"
0.42
посте
0.42
jee
0.41
общества
0.40
commenting
0.39
capitalism
0.39
συνέχ
0.39
ironing
0.39
consenting
0.38
prostitution
0.37
Activations Density 0.000%