INDEX
    Explanations

    phrases indicating future events that will not happen

    the negative contraction "won't."

    New Auto-Interp
    Negative Logits
     newcom
    -0.72
     beginnings
    -0.71
    anwhile
    -0.69
     background
    -0.64
     Background
    -0.61
     populated
    -0.58
     backdrop
    -0.57
     compan
    -0.56
     advisors
    -0.56
     amb
    -0.56
    POSITIVE LOGITS
    't
    2.11
    kish
    1.25
    cest
    1.17
    ky
    1.13
    ´
    1.08
    ks
    1.01
    now
    0.99
    kies
    0.96
    etsk
    0.93
    cheon
    0.93
    Act Density 0.034%

    No Known Activations