INDEX
    Explanations

    numerical values and mathematical expressions

    New Auto-Interp
    Negative Logits
    hower
    -0.19
    osas
    -0.16
    zin
    -0.16
    433
    -0.15
    venge
    -0.14
    CLK
    -0.14
    bower
    -0.14
    inspace
    -0.14
    elman
    -0.14
    825
    -0.14
    POSITIVE LOGITS
    .INSTANCE
    0.14
     aforementioned
    0.14
    /method
    0.14
    abi
    0.13
    ivi
    0.13
     ruk
    0.13
    abis
    0.13
    inde
    0.13
     eth
    0.13
     Chi
    0.13
    Act Density 0.014%

    No Known Activations