INDEX
    Explanations

    the word "love" in various contexts

    New Auto-Interp
    Negative Logits
     Rating
    -0.65
     questions
    -0.63
     nightmares
    -0.63
     HB
    -0.62
     bars
    -0.62
     Barth
    -0.62
     Jian
    -0.62
     Ank
    -0.61
     Tec
    -0.61
     gangs
    -0.61
    POSITIVE LOGITS
    ove
    4.80
    oves
    2.24
    oved
    1.66
    ovie
    1.62
    ovy
    1.55
    oving
    1.44
    ov
    1.41
    OV
    1.35
    ovo
    1.33
    ovi
    1.32
    Act Density 0.007%

    No Known Activations