INDEX
Negative Logits
pergunta
0.52
viste
0.48
pesc
0.47
basé
0.47
recipiente
0.47
tiga
0.46
rapping
0.46
parece
0.46
também
0.45
vue
0.45
POSITIVE LOGITS
נות
0.49
ִי
0.49
ות
0.48
ַ
0.46
ְ
0.44
אמ
0.42
Wie
0.42
ים
0.41
נם
0.41
PH
0.40
Activations Density 0.003%