INDEX
    Explanations

    mentions of romantic or intimate actions like kissing

    references to kissing and related actions

    New Auto-Interp
    Negative Logits
    é¾
    -1.01
    ifted
    -0.74
    quickShipAvailable
    -0.73
    ulhu
    -0.70
    izoph
    -0.70
    æ©Ł
    -0.67
    IJ
    -0.67
     Administ
    -0.67
    ENCY
    -0.67
     DISTR
    -0.66
    POSITIVE LOGITS
     goodbye
    1.28
     kiss
    1.14
     Kiss
    1.07
     kisses
    1.06
     kissing
    0.99
    kiss
    0.99
    creen
    0.98
     kissed
    0.95
     passionately
    0.91
     Goodbye
    0.83
    Act Density 0.026%

    No Known Activations