INDEX
    Explanations

    relationships and personal connections, particularly in the context of romantic partnerships and infidelity

    New Auto-Interp
    Negative Logits
    xin
    -0.18
     marriages
    -0.17
     Blond
    -0.15
     granddaughter
    -0.15
    ζε
    -0.15
     widow
    -0.14
    bat
    -0.14
    pread
    -0.14
    Äı
    -0.14
    daughter
    -0.14
    POSITIVE LOGITS
     significant
    0.44
     Significant
    0.40
     partner
    0.37
     boyfriend
    0.35
    significant
    0.35
     BF
    0.34
     beau
    0.30
     bf
    0.29
     lover
    0.29
     param
    0.29
    Act Density 0.213%

    No Known Activations