INDEX
    Explanations

    terms or statements that suggest implications or underlying meanings

    phrases that convey suggestion or inference

    New Auto-Interp
    Negative Logits
     Steps
    -0.75
    Topics
    -0.72
    ulic
    -0.68
     Bram
    -0.65
     Pal
    -0.62
     Ready
    -0.60
    PG
    -0.59
     Known
    -0.59
     batt
    -0.58
     jam
    -0.58
    POSITIVE LOGITS
     implied
    2.85
     imply
    2.81
     implies
    2.68
     implying
    2.35
     implication
    1.90
     implicitly
    1.68
     insin
    1.52
     inferred
    1.40
     presupp
    1.31
     entails
    1.30
    Act Density 0.020%

    No Known Activations