INDEX
Explanations
references to small or limited quantities
New Auto-Interp
Negative Logits
umab
-0.52
Thunk
-0.48
äne
-0.46
+#+
-0.45
var
-0.43
Orozco
-0.43
icali
-0.42
ñata
-0.42
gefähr
-0.41
ش
-0.41
POSITIVE LOGITS
small
1.18
small
1.12
Small
1.07
kecil
1.07
Small
1.00
pequeño
0.99
SMALL
0.99
tiny
0.99
pequeña
0.98
کوچک
0.95
Activations Density 0.610%