Explanations
references to quantities of people or items, particularly emphasizing small groups or minorities
Negative Logits
ーダ        -0.15
собою      -0.14
asca       -0.14
.subplots  -0.13
χεία       -0.13
دار        -0.13
BOTH       -0.13
amble      -0.13
utas       -0.13
omo        -0.13
Positive Logits
few       1.02
few       0.88
Few       0.83
Few       0.79
handful   0.71
quelques  0.63
fewer     0.56
少        0.56
select    0.50
vài       0.49
Activations Density 0.341%