INDEX
Negative Logits
ख्
0.50
operat
0.47
comprenant
0.45
instrucciones
0.43
comprensión
0.40
pouvant
0.39
Diaspora
0.38
których
0.38
bandes
0.37
Trem
0.37
POSITIVE LOGITS
lowest
0.72
lowest
0.68
selected
0.68
peak
0.66
સૌથી
0.66
highest
0.62
選
0.62
峰
0.62
dipilih
0.61
chosen
0.61
Activations Density 0.839%