INDEX
    Explanations

    phrases indicating accountability or responsibility in a context

    New Auto-Interp
    Negative Logits
    zet
    -0.15
     dán
    -0.15
    adb
    -0.14
    Sock
    -0.14
    ADX
    -0.14
    reh
    -0.13
    Ïĩη
    -0.13
    ilian
    -0.13
    onth
    -0.13
    LOCKS
    -0.13
    POSITIVE LOGITS
    Caption
    0.14
    lıģının
    0.13
    -inline
    0.13
    \-
    0.12
     ήÏĦαν
    0.12
     пог
    0.12
    sw
    0.12
     Hyde
    0.12
    ws
    0.12
    استاÙĨ
    0.12
    Act Density 0.000%

    No Known Activations