INDEX
    Explanations

    positive phrases or expressions

    Negative Logits
    Token               Logit
    "olicy"             -0.66
    "cum"               -0.62
    "ural"              -0.59
    "aults"             -0.57
    "atum"              -0.56
    "soDeliveryDate"    -0.55
    "umed"              -0.54
    " Tsukuyomi"        -0.53
    "mage"              -0.53
    "é¾"                -0.52

    Positive Logits
    Token               Logit
    "bye"               0.72
    " tid"              0.63
    " congr"            0.62
    "noon"              0.57
    " remind"           0.56
    " reassure"         0.55
    " 🙂"                0.54
    " ya"               0.54
    "elight"            0.54
    " Torrent"          0.54
    Act Density 10.397%
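
For readers unfamiliar with the "Act Density" figure, here is a minimal sketch of how such a number is commonly computed, assuming it means the fraction of sampled token positions on which the feature's activation is above zero. The function name `activation_density`, the `threshold` parameter, and the sample values are hypothetical and are not taken from this dashboard or from any particular library.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means (active positions) / (all sampled positions).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical per-token activations for illustration only.
sample = [0.0, 0.0, 1.3, 0.0, 0.2, 0.0, 0.0, 0.0, 0.0, 0.0]
density = activation_density(sample)

# Format as a percentage with three decimals, matching the dashboard's style.
print(f"Act Density {density:.3%}")
```

Under this reading, a value of 10.397% would mean the feature fires on roughly one in ten sampled tokens.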

    No Known Activations