INDEX
    Explanations

    conditional phrases that express hypothetical situations or possibilities

    New Auto-Interp
    Negative Logits
     indispens
    -0.18
    (always
    -0.15
    nga
    -0.15
     underrated
    -0.15
     annoy
    -0.14
    AsyncResult
    -0.14
    urf
    -0.14
     addictive
    -0.14
    ernet
    -0.14
    CLAIM
    -0.14
    POSITIVE LOGITS
     rem
    0.35
     nice
    0.27
    nice
    0.23
     would
    0.21
     wise
    0.21
     Nice
    0.20
     rash
    0.20
    Nice
    0.20
     Rem
    0.19
    Would
    0.19
    Act Density 0.075%

    No Known Activations