INDEX
    Explanations
    No Explanations Found
    Negative Logits
    "testng"           -0.50
    "sho"              -0.48
    " Poc"             -0.47
    " Polsek"          -0.46
    "farwyddwr"        -0.46
    " surla"           -0.46
    (token missing)    -0.46
    "PyObject"         -0.44
    "ρον"              -0.43
    "Kariera"          -0.43
    Positive Logits
    " архивлан"        0.64
    "aarrggbb"         0.62
    " himo"            0.60
    "исленность"       0.58
    "LookAnd"          0.58
    "fieldId"          0.57
    "HtmlAttribute"    0.57
    " المعيارى"        0.57
    "batsman"          0.56
    "زال"              0.56
    Activation Density: 0.012%

    No Known Activations