INDEX
    Explanations

    adverbs that express certainty or frequency

    New Auto-Interp
    Negative Logits
    nt
    -0.14
    untime
    -0.14
    iface
    -0.14
    _tF
    -0.14
    'll
    -0.14
    /w
    -0.13
    igan
    -0.13
    entials
    -0.13
    ä»¶
    -0.13
    çħ§
    -0.13
    POSITIVE LOGITS
     been
    0.20
    most
    0.20
     be
    0.20
    ly
    0.18
    LY
    0.17
    yyy
    0.16
     (?)
    0.16
    ifi
    0.15
    wise
    0.15
    JsonValue
    0.15
    Act Density 0.240%

    No Known Activations