INDEX
Explanations
phrases emphasizing the concept of "few."
New Auto-Interp
Negative Logits
GenerationType
-0.51
araman
-0.44
оригіналу
-0.44
Hartman
-0.43
Dise
-0.43
Hartmann
-0.43
OPT
-0.41
is
-0.41
Дата
-0.40
abestanden
-0.40
POSITIVE LOGITS
few
1.16
Few
1.12
few
1.01
FEW
1.01
Few
0.98
dozen
0.91
wenige
0.88
wenigen
0.79
poucos
0.79
几
0.77
Activations Density 0.088%