INDEX
    Explanations

    pronouns and references to personal involvement or relationships

    New Auto-Interp
    Negative Logits
     byt
    -0.15
    uae
    -0.14
    mask
    -0.13
    _locals
    -0.13
    uin
    -0.13
     Marino
    -0.13
    cop
    -0.13
    ligt
    -0.13
    vero
    -0.13
     sleeves
    -0.13
    POSITIVE LOGITS
    oti
    0.17
    redential
    0.16
    lox
    0.15
    501
    0.15
    CLU
    0.15
    ARING
    0.14
    ÅĽcie
    0.14
    ваÑĢ
    0.14
     Abed
    0.14
    obuf
    0.14
    Act Density 0.142%

    No Known Activations