INDEX
    Explanations

    mentions of love and relationships

    love or strong feelings

    New Auto-Interp
    Negative Logits
     Love
    -2.66
     love
    -2.61
    Love
    -2.55
     LOVE
    -2.52
    love
    -2.47
    LOVE
    -2.41
     loved
    -2.36
     loving
    -2.30
     loves
    -2.17
    loved
    -2.09
    POSITIVE LOGITS
     mergeFrom
    0.60
     preprint
    0.47
    BufferException
    0.46
    المكان
    0.45
     Schell
    0.43
    ору
    0.43
     MPP
    0.43
     inal
    0.43
    клопе
    0.42
     Bundy
    0.42
    Act Density 1.389%

    No Known Activations