INDEX
    Explanations

    phrases expressing inability or impossibility

    phrases expressing inability or impossibility

    New Auto-Interp
    Negative Logits
     preferring
    -0.74
     compuls
    -0.65
     watches
    -0.65
    soDeliveryDate
    -0.64
     inspecting
    -0.64
     thanking
    -0.62
     prefers
    -0.62
     preferred
    -0.60
     pressing
    -0.59
     likes
    -0.58
    POSITIVE LOGITS
     withstand
    1.01
     plaus
    1.01
     be
    1.00
     possibly
    0.96
     occur
    0.95
     feas
    0.92
     exist
    0.91
     happen
    0.89
    't
    0.88
    berra
    0.85
    Act Density 0.124%

    No Known Activations