INDEX
Negative Logits
oppression
0.78
혹은
0.77
Something
0.74
Something
0.72
কিংবা
0.70
something
0.69
persecution
0.69
似的
0.69
をも
0.68
যারা
0.67
POSITIVE LOGITS
reportedly
0.94
found
0.88
encontrado
0.85
único
0.84
samarbe
0.83
unico
0.82
sipping
0.81
encontrada
0.80
💄
0.80
hanya
0.79
Activations Density 0.008%