INDEX
    Explanations

    adjectives describing size or quantity

    phrases indicating the presence of quantities or sizes, particularly focusing on the word "a" to signal limited or small amounts

    New Auto-Interp
    Negative Logits
    lees
    -0.73
    asks
    -0.72
    anto
    -0.68
    boards
    -0.67
    angelo
    -0.63
    also
    -0.63
    mos
    -0.62
    anti
    -0.62
    anism
    -0.60
    grounds
    -0.58
    POSITIVE LOGITS
     handful
    1.66
     few
    1.53
     fraction
    1.34
     couple
    1.29
     subset
    1.20
    few
    1.18
     tiny
    1.12
     single
    1.11
     small
    1.10
     curs
    1.09
    Act Density 0.146%

    No Known Activations