INDEX
    Explanations

    numbers, quantities, and comparisons

    negations or phrases indicating limitations or specificity

    New Auto-Interp
    Negative Logits
    ussions
    -0.82
    lb
    -0.62
     unsuccessfully
    -0.62
    acas
    -0.61
    rams
    -0.61
    ounces
    -0.60
    osures
    -0.59
     Remain
    -0.59
    Disk
    -0.59
     Alam
    -0.57
    POSITIVE LOGITS
    soDeliveryDate
    0.85
     gonna
    0.79
    iour
    0.75
     funny
    0.72
    TY
    0.70
     eyebrow
    0.70
     kinda
    0.69
     gotta
    0.69
     ðŁij
    0.69
    ðŁ
    0.69
    Act Density 0.319%

    No Known Activations