    Negative Logits
    Token             Logit
    (missing)         -0.06
    "-yard"           -0.06
    " všech"          -0.06
    "beiten"          -0.06
    "ToStr"           -0.06
    "Ground"          -0.06
    "=M"              -0.06
    " mpg"            -0.06
    "-mile"           -0.06
    " vacant"         -0.06

    Positive Logits
    Token             Logit
    " networking"     0.07
    " Networking"     0.07
    "บท"              0.07
    (missing)         0.07
    " PW"             0.07
    "_Version"        0.07
    " generalized"    0.07
    " Charset"        0.06
    " worries"        0.06
    "fos"             0.06

    Activation Density: 0.003%

    No Known Activations