INDEX
    Explanations

    references to small or limited quantities

    New Auto-Interp
    Negative Logits
    umab
    -0.52
    Thunk
    -0.48
    äne
    -0.46
    +#+
    -0.45
     var
    -0.43
     Orozco
    -0.43
    icali
    -0.42
    ñata
    -0.42
     gefähr
    -0.41
    ‌ش
    -0.41
    POSITIVE LOGITS
     small
    1.18
    small
    1.12
    Small
    1.07
     kecil
    1.07
     Small
    1.00
     pequeño
    0.99
     SMALL
    0.99
     tiny
    0.99
     pequeña
    0.98
     کوچک
    0.95
    Act Density 0.610%

    No Known Activations