    Explanations

    references to the concept of "more" or increases in quantity

    Negative Logits
    Token            Logit
    " SNL"           -0.73
    "(&:"            -0.73
    "bufio"          -0.71
    " Caballero"     -0.70
    (blank)          -0.70
    (blank)          -0.69
    " Cluj"          -0.69
    "__()"           -0.68
    " africana"      -0.66
    "Discografia"    -0.66

    Positive Logits
    Token            Logit
    " more"          1.77
    " MORE"          1.60
    "more"           1.53
    " More"          1.41
    "More"           1.41
    "MORE"           1.41
    " Moreno"        1.22
    " Moreira"       1.13
    " Moreau"        1.12
    "emore"          1.12

    Activation Density: 0.147%

    No Known Activations