INDEX
    Explanations

    themes related to obligation and consequences in relationships

    New Auto-Interp
    Negative Logits
    shal
    -0.14
    erus
    -0.14
    Cog
    -0.14
     arb
    -0.14
    rego
    -0.14
    ethoven
    -0.14
    ARB
    -0.13
    ì¹´ëĿ¼
    -0.13
    à¹Ĥรà¸ĩà¹ģรม
    -0.13
     reve
    -0.13
    POSITIVE LOGITS
    usz
    0.18
     offer
    0.17
    ç»Ļ
    0.17
    給
    0.17
    Provid
    0.17
     provide
    0.17
     ìłľê³µ
    0.16
     пÑĢедоÑģÑĤав
    0.16
    batis
    0.16
    erman
    0.16
    Act Density 0.116%

    No Known Activations