INDEX
    Explanations

    instances of denial or rejection of responsibility

    New Auto-Interp
    Negative Logits
    ozÃŃ
    -0.14
    rud
    -0.14
    Biz
    -0.14
    Sac
    -0.13
    avier
    -0.13
     sac
    -0.13
    /gui
    -0.13
     اÙĦجÙĨ
    -0.13
    iov
    -0.13
    _fre
    -0.13
    POSITIVE LOGITS
     EA
    0.30
    EA
    0.23
     Cause
    0.21
    eah
    0.20
     cause
    0.20
     Holden
    0.19
     Bay
    0.19
     Give
    0.18
     AI
    0.18
     charity
    0.18
    Act Density 0.005%

    No Known Activations