INDEX
    Explanations

    the phrase "more or less."

    phrases indicating approximate quantities or similarities

    New Auto-Interp
    Negative Logits
     EDITION
    -0.59
    wagen
    -0.58
     rapist
    -0.56
    Monitor
    -0.56
    ATT
    -0.55
    iasco
    -0.55
    ouse
    -0.55
     horizont
    -0.55
    CHR
    -0.53
     racket
    -0.52
    POSITIVE LOGITS
     less
    1.13
    nery
    1.02
    leans
    0.97
     fewer
    0.94
     Less
    0.90
     least
    0.87
    Less
    0.83
     lesser
    0.82
    gin
    0.77
    phans
    0.76
    Act Density 0.032%

    No Known Activations