INDEX
    Explanations

    themes related to love and relationships

    New Auto-Interp
    Negative Logits
     unintention
    -0.15
    iliz
    -0.14
    urator
    -0.14
     inherited
    -0.14
    alaxy
    -0.13
    rais
    -0.13
    è£ķ
    -0.13
    474
    -0.13
    outine
    -0.13
    urrent
    -0.13
    POSITIVE LOGITS
    eros
    0.20
     Love
    0.18
     love
    0.17
     Cup
    0.17
    Love
    0.17
     æĦĽ
    0.16
     recip
    0.16
     Romeo
    0.15
    phy
    0.15
     cupid
    0.15
    Act Density 0.086%

    No Known Activations