INDEX
Explanations
comparisons of size or quality
Negative Logits
خوب    0.66
schönen    0.65
خوبی    0.63
maravilh    0.62
કોઈ    0.62
気に入    0.57
niets    0.57
ateful    0.56
schöne    0.56
maravilloso    0.56
Positive Logits
faster    1.74
fewer    1.71
stronger    1.67
quicker    1.66
greater    1.66
denser    1.64
보다    1.63
higher    1.63
更高的    1.63
richer    1.62
Activations Density: 1.219%