INDEX
    Explanations

    actions and decisions related to personal relationships and significant life events

    New Auto-Interp
    Negative Logits
    lation
    -0.15
    criptions
    -0.14
     Tone
    -0.14
    _anchor
    -0.14
    riad
    -0.14
    la
    -0.13
    оваÑĢ
    -0.13
     PERMISSION
    -0.13
    afc
    -0.13
    Permission
    -0.13
    POSITIVE LOGITS
    alled
    0.14
    iline
    0.14
    wise
    0.14
    öff
    0.14
    ogh
    0.14
    inalg
    0.13
    366
    0.13
    Å¡nÃŃ
    0.13
    ureka
    0.13
    ukan
    0.13
    Act Density 0.192%

    No Known Activations