INDEX
    Explanations

    phrases indicating methods or means of doing something

    New Auto-Interp
    Negative Logits
     nutshell
    -0.69
    ever
    -0.63
     STATS
    -0.63
     rated
    -0.62
    sha
    -0.61
     understatement
    -0.60
     coupon
    -0.60
     Birthday
    -0.60
     triv
    -0.60
     Cind
    -0.59
    POSITIVE LOGITS
    arthy
    0.73
    uilding
    0.67
    phabet
    0.67
    burse
    0.67
    istani
    0.66
    orthern
    0.64
    orthy
    0.64
    BACK
    0.63
    boats
    0.63
     safer
    0.63
    Act Density 0.011%

    No Known Activations