INDEX
    Explanations

    phrases related to relationships, attraction, and sexuality

    content related to dating and relationships

    New Auto-Interp
    Negative Logits
     Greenwood
    -0.79
     mathemat
    -0.78
    Stack
    -0.78
     forestry
    -0.78
     NTS
    -0.77
    Module
    -0.77
     Chomsky
    -0.76
    veland
    -0.75
     loader
    -0.75
     scrut
    -0.74
    POSITIVE LOGITS
     sexually
    1.51
     monog
    1.51
     boyfriend
    1.48
     romantic
    1.44
     flirt
    1.42
     sexual
    1.39
     erotic
    1.39
     girlfriends
    1.38
    Sexual
    1.38
    girlfriend
    1.37
    Act Density 0.974%

    No Known Activations