    Negative Logits
        Token            Logit
        "z"              -0.49
        "UnitTesting"    -0.47
        "oredCriteria"   -0.45
        " sing"          -0.44
        " excesses"      -0.43
        "TagHelper"      -0.43
        "pre"            -0.43
        "link"           -0.43
        "ss"             -0.41
        "twimg"          -0.41
    Positive Logits
        Token            Logit
        " catch"         0.79
        " AttributeSet"  0.77
        "CATCH"          0.71
        " catcher"       0.69
        " catching"      0.69
        " CATCH"         0.68
        "Cyfeiriadau"    0.66
        "ंदीखरीदारी"       0.65
        "уда"            0.64
        "catch"          0.63
    Activation Density: 0.181%

    No Known Activations