INDEX
Negative Logits
ва
0.76
INA
0.74
Caroline
0.70
तरीय
0.70
ავს
0.67
decreed
0.67
зад
0.66
Councillor
0.66
тро
0.65
ERT
0.64
POSITIVE LOGITS
翻訳
0.85
essas
0.84
grafico
0.82
இவற்றை
0.81
の開発
0.81
dessas
0.80
仕事を
0.79
يعرف
0.79
vvvert
0.79
znan
0.78
Activations Density 0.003%