INDEX
    Explanations

    adverbs that indicate performance or success

    phrases that express effectiveness or quality in context

    New Auto-Interp
    Negative Logits
    sing
    -0.77
    sin
    -0.72
    psy
    -0.67
    isms
    -0.66
    rian
    -0.65
    shaw
    -0.64
    clerosis
    -0.63
    rop
    -0.63
    weak
    -0.62
    cipl
    -0.62
    POSITIVE LOGITS
    IER
    0.78
     suited
    0.78
    ior
    0.77
    awaru
    0.68
    nesota
    0.67
     synergy
    0.67
     [+]
    0.65
     outwe
    0.65
     depends
    0.65
    itsu
    0.63
    Act Density 0.184%

    No Known Activations