    NEGATIVE LOGITS
    ومی           -0.07
                  -0.07
    ίνα           -0.07
     sheriff      -0.06
     scratches    -0.06
    (machine      -0.06
     Mountain     -0.06
    üns           -0.06
    UGINS         -0.06
     gói          -0.06
    POSITIVE LOGITS
    _DOM          0.06
     sv           0.06
    _OPER         0.06
    178           0.06
    :@"%@         0.06
    ystick        0.06
    });↵↵         0.06
    )",           0.06
    =g            0.06
    .way          0.06
    Activation Density: 0.031%

    No Known Activations