INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Negative Logits
     gost
    -0.07
    _tot
    -0.07
    (head
    -0.07
    -0.07
    -0.07
    حم
    -0.06
     Lind
    -0.06
    -0.06
    Right
    -0.06
    ريب
    -0.06
    POSITIVE LOGITS
    0.08
    -social
    0.07
     plaintiffs
    0.07
    ")));↵↵
    0.07
    ]initWith
    0.07
    owntown
    0.07
    0.07
    -focused
    0.07
     التنفيذي
    0.07
    ';";↵
    0.07
    Act Density 0.010%

    No Known Activations