INDEX
    Explanations

    phrases indicating a quantity or group of items

    quantities and variations, particularly focusing on the words "few" and "numerous."

    New Auto-Interp
    Negative Logits
    istan
    -0.85
    ESE
    -0.83
    anwhile
    -0.77
    SPONSORED
    -0.75
    said
    -0.73
    ared
    -0.72
    Constructed
    -0.71
    ale
    -0.70
    ARS
    -0.70
    ARY
    -0.70
    POSITIVE LOGITS
     interesting
    1.27
     surprises
    1.18
     ways
    1.05
     advantages
    1.04
     modifications
    1.04
     exciting
    1.03
     useful
    1.03
     variations
    1.00
     notable
    0.99
     noteworthy
    0.99
    Act Density 0.197%

    No Known Activations