INDEX
    Explanations

    instances of the phrase "don't" in various contexts

    Negative Logits
    " Samar"       -0.64
    " Tasman"      -0.62
    " Axel"        -0.62
    " BMC"         -0.61
    " partName"    -0.60
    " Samson"      -0.59
    " NH"          -0.58
    " RAF"         -0.58
    "Override"     -0.58
    " Lowell"      -0.57
    Positive Logits
    "ember"        0.91
    "ufact"        0.79
    "iversary"     0.77
    "udes"         0.77
    "addafi"       0.76
    "tarian"       0.75
    "aughter"      0.74
    "resent"       0.73
    "imately"      0.73
    "tal"          0.73
    Activation Density: 0.248%

    No Known Activations