INDEX
Explanations
multilingual texts
Filipino, Portuguese, Spanish, Italian, Romance, Slavic
New Auto-Interp
Negative Logits
K
0.82
G
0.82
T
0.80
e
0.79
Y
0.78
the
0.75
V
0.75
W
0.74
i
0.73
U
0.73
POSITIVE LOGITS
vaše
0.75
você
0.73
vás
0.70
când
0.69
tendrás
0.67
když
0.67
vám
0.66
szolgált
0.66
számos
0.66
át
0.65
Activations Density 0.000%