INDEX
    Explanations
    Negative Logits
     Categories    -0.07
     Grass    -0.07
     bols    -0.07
     prá    -0.07
     Mathematics    -0.07
     haber    -0.06
     Central    -0.06
    חשיבות    -0.06
    _Vector    -0.06
    銀行    -0.06
    Positive Logits
    "display    0.07
    (outfile    0.07
     spawning    0.07
     hashes    0.07
    _leaf    0.07
    _quit    0.06
     simulate    0.06
    是个    0.06
    ocused    0.06
    [path    0.06
    Activation Density: 0.007%

    No Known Activations